BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 019329
(342 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225457646|ref|XP_002275328.1| PREDICTED: uncharacterized protein LOC100263332 [Vitis vinifera]
gi|297745600|emb|CBI40765.3| unnamed protein product [Vitis vinifera]
Length = 353
Score = 399 bits (1026), Expect = e-109, Method: Compositional matrix adjust.
Identities = 257/364 (70%), Positives = 279/364 (76%), Gaps = 33/364 (9%)
Query: 1 MSGSETAV-------------NVMTSQPPASIQSMRLAFSADGTAVYKPITATSPTYQPS 47
MSGSET + N + SQP IQ+MRLAFS DG AVYKP++ TSP YQ S
Sbjct: 1 MSGSETGIMTTREPFSMGLQKNAVPSQP--VIQNMRLAFSPDGAAVYKPVSGTSPPYQSS 58
Query: 48 GAGGD------GAIPQAQGLNVMNMGSGSEPMKRKRGRPRKYGPDGTMSLALVPSPSSVT 101
G G IP GLN MNMGS EP+KRKRGRPRKYGPDGTM+LAL P+PS V
Sbjct: 59 GGTGGDGSTGGAIIPH--GLN-MNMGS--EPLKRKRGRPRKYGPDGTMALALSPAPSGVN 113
Query: 102 TATGGTGSGLSSPGGGPLSPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKA 161
+ G G SP S+KK+RGRPPGS KK Q+EALGSAGVGFTPHVITVKA
Sbjct: 114 VSQSGGAFSSPPASAGSASPSSLKKARGRPPGSS--KKQQMEALGSAGVGFTPHVITVKA 171
Query: 162 GEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLL 221
GEDVSSKIMSFSQ+GPRAVCILSANGAISNVTLRQ ATSGGTVTYEGRFEILSLSGSFLL
Sbjct: 172 GEDVSSKIMSFSQHGPRAVCILSANGAISNVTLRQPATSGGTVTYEGRFEILSLSGSFLL 231
Query: 222 SESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRM 281
SE+ GQRSRTGGLSVSLSGPDGRVLGG VAGLLTAA+PVQVVVGSF+ADGRKESKS+ ++
Sbjct: 232 SENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFIADGRKESKSASQV 291
Query: 282 ESLPVPPKLAP---GGQPAGQCSPPSRGTLSESSGGPGSPLNHSTGACNNNHLPQGMATG 338
E PPK+AP GG G SPPSRGTLSESSGGPGSPLN STGACNN++ P GM T
Sbjct: 292 EPSSAPPKIAPVGGGGGVTGTSSPPSRGTLSESSGGPGSPLNQSTGACNNSN-PPGM-TS 349
Query: 339 IPWK 342
IPWK
Sbjct: 350 IPWK 353
>gi|147809818|emb|CAN64876.1| hypothetical protein VITISV_030792 [Vitis vinifera]
Length = 390
Score = 397 bits (1020), Expect = e-108, Method: Compositional matrix adjust.
Identities = 256/364 (70%), Positives = 279/364 (76%), Gaps = 33/364 (9%)
Query: 1 MSGSETAVNVMTSQPPAS-------------IQSMRLAFSADGTAVYKPITATSPTYQPS 47
MSGSET + MT++ P S IQ+MRLAFS DG AVYKP++ TSP YQ S
Sbjct: 1 MSGSETGI--MTTREPFSMGLQKNAVPSQPVIQNMRLAFSPDGAAVYKPVSGTSPPYQSS 58
Query: 48 GAGGD------GAIPQAQGLNVMNMGSGSEPMKRKRGRPRKYGPDGTMSLALVPSPSSVT 101
G G IP GLN MNMGS EP+KRKRGRPRKYGPDGTM+LAL P+PS V
Sbjct: 59 GGTGGDGSTGGAIIPH--GLN-MNMGS--EPLKRKRGRPRKYGPDGTMALALSPAPSGVN 113
Query: 102 TATGGTGSGLSSPGGGPLSPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKA 161
+ G G SP S+KK+RGRPPGS KK Q+EALGSAGVGFTPHVITVKA
Sbjct: 114 VSQSGGAFSSPPASAGSASPSSLKKARGRPPGSS--KKQQMEALGSAGVGFTPHVITVKA 171
Query: 162 GEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLL 221
GEDVSSKIMSFSQ+GPRAVCILSANGAISNVTLRQ ATSGGTVTYEGRFEILSLSGSFLL
Sbjct: 172 GEDVSSKIMSFSQHGPRAVCILSANGAISNVTLRQPATSGGTVTYEGRFEILSLSGSFLL 231
Query: 222 SESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRM 281
SE+ GQRSRTGGLSVSLSGPDGRVLGG VAGLLTAA+PVQVVVGSF+ADGRKESKS+ ++
Sbjct: 232 SENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFIADGRKESKSASQV 291
Query: 282 ESLPVPPKLAP---GGQPAGQCSPPSRGTLSESSGGPGSPLNHSTGACNNNHLPQGMATG 338
E PPK+AP GG G SPPSRGTLSESSGGPGSPLN STGACNN++ P GM T
Sbjct: 292 EPSSAPPKIAPVGGGGGVTGTSSPPSRGTLSESSGGPGSPLNQSTGACNNSN-PPGM-TS 349
Query: 339 IPWK 342
IPW
Sbjct: 350 IPWN 353
>gi|255539322|ref|XP_002510726.1| DNA binding protein, putative [Ricinus communis]
gi|223551427|gb|EEF52913.1| DNA binding protein, putative [Ricinus communis]
Length = 374
Score = 379 bits (974), Expect = e-103, Method: Compositional matrix adjust.
Identities = 242/379 (63%), Positives = 274/379 (72%), Gaps = 42/379 (11%)
Query: 1 MSGSETAVNVMTSQPPAS-------IQSMRLAFSADGTAVYKPITA--TSPTYQPS-GAG 50
MSGSET V T++ P IQ+MRLAF ADG++VYKP+T SP+YQPS A
Sbjct: 1 MSGSETGVMTSTTREPFGVVSPQPVIQNMRLAFGADGSSVYKPMTTATNSPSYQPSPSAA 60
Query: 51 GDGAIPQ--AQGLNVMNMGSGSEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTG 108
G + + G+NV NMGSG++ MKRKRGRPRKYGPDGTM+LALV +P SV G
Sbjct: 61 SPGGFVEGGSLGINV-NMGSGNDAMKRKRGRPRKYGPDGTMALALVSAPQSVGITQPAGG 119
Query: 109 SGLSSPGG------GP-------------------LSPDSIKKSRGRPPGSGSGKKHQLE 143
G S+P GP +SP IKK RGRPPGS KK QLE
Sbjct: 120 GGFSTPTSAAATSVGPSTTTIAANPSLPSGSGGGSVSPTGIKKGRGRPPGSN--KKQQLE 177
Query: 144 ALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGT 203
ALGSAG GFTPH+ITVKAGEDVSSKIMSFSQ+GPRAVCILSANGAISNVTLRQ ATSGG+
Sbjct: 178 ALGSAGFGFTPHIITVKAGEDVSSKIMSFSQHGPRAVCILSANGAISNVTLRQPATSGGS 237
Query: 204 VTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVV 263
VTYEGRFEILSLSGSFL SE+ GQRSRTGGLSVSLSGPDGRVLGG VAGLL AA+PVQVV
Sbjct: 238 VTYEGRFEILSLSGSFLPSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLLAASPVQVV 297
Query: 264 VGSFLADGRKESKSSHRMESLPVPPKLAPGGQPAGQCSPPSRGTLSESSGGPGSPLNHST 323
V SF++D RKE KS + +E L +L P G SPPSRGT SESSGGPGSPLN ST
Sbjct: 298 VASFISDDRKELKSPNHLEPLSAMNRLTPVMGTTGPSSPPSRGTFSESSGGPGSPLNQST 357
Query: 324 GACNNNHLPQGMATGIPWK 342
GACNN++L QG+++ +PWK
Sbjct: 358 GACNNSNL-QGISS-MPWK 374
>gi|449455639|ref|XP_004145559.1| PREDICTED: uncharacterized protein LOC101207513 [Cucumis sativus]
gi|449522960|ref|XP_004168493.1| PREDICTED: uncharacterized LOC101207513 [Cucumis sativus]
Length = 351
Score = 360 bits (924), Expect = 5e-97, Method: Compositional matrix adjust.
Identities = 237/368 (64%), Positives = 263/368 (71%), Gaps = 43/368 (11%)
Query: 1 MSGSETAV-------------NVMTSQPPASIQSMRLAFSADGTAVYKPITATSPTYQPS 47
MSGSET V N + SQ P +QSM L F ADG VYKP+ SPTYQ S
Sbjct: 1 MSGSETGVISSGEHFTIGLQKNSVPSQQPV-MQSMHLPFGADG--VYKPVATASPTYQSS 57
Query: 48 ------GAGGDGAIPQAQGLNVMNMGSGSEPMKRKRGRPRKYGPDGTMSLALVPSPSSVT 101
AG DG+ A +NM S SEP+KRKRGRPRKYGPDG+M++A P++ T
Sbjct: 58 SVGVAGNAGADGSARDA----FVNMNSQSEPVKRKRGRPRKYGPDGSMAVAPAVRPAAAT 113
Query: 102 TATGGTG-SGLSSPGGG-PLSPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITV 159
++GG S ++P G SP S+KK RGRPPGS S KKH L+ SAGVGFTPHVITV
Sbjct: 114 QSSGGFSPSPTAAPQSGRSASPTSLKKPRGRPPGS-STKKHHLDTSESAGVGFTPHVITV 172
Query: 160 KAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSF 219
KAGEDVSSKIMSFSQNGPRAVCIL+ANGAISNVTLRQ A SGGTVTYEGRFEILSLSGS+
Sbjct: 173 KAGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSY 232
Query: 220 LLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLAD-GRKESKSS 278
LLSE+ GQRSRTGGLSVSLSGPDGRVLGG VAGLLTAA+PVQVVVGSF+ D G KE +
Sbjct: 233 LLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQV 292
Query: 279 HRMESLPV--PPKLAP--GGQPAGQCSPPSRGTLSESSGGPGSPLNHSTGACNNNHLPQG 334
+++E PV P KLAP G G SPPSRGTLSESSGGPGSP N S GACNNN
Sbjct: 293 NQIEQPPVSAPHKLAPIRAGM-TGASSPPSRGTLSESSGGPGSPFNQSAGACNNNT---- 347
Query: 335 MATGIPWK 342
IPWK
Sbjct: 348 ----IPWK 351
>gi|449522157|ref|XP_004168094.1| PREDICTED: uncharacterized LOC101211767 [Cucumis sativus]
Length = 364
Score = 351 bits (900), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 234/378 (61%), Positives = 267/378 (70%), Gaps = 50/378 (13%)
Query: 1 MSGSETAVNVMTSQPPASI-------------QSMRLAFSADGTAVYKPIT-ATSPTYQP 46
MSGSET V +TS+ P + Q+MRLAF ADGT YKP+T +TSP+YQ
Sbjct: 1 MSGSETGV--ITSREPFGVGVQNSSLHSQSGTQNMRLAFGADGTG-YKPVTPSTSPSYQS 57
Query: 47 SGAGGDGAIPQA--------------QGLNVMNMGSGSEPMKRKRGRPRKYGPDGTMSLA 92
S AG G G N+ ++GS E +KRKRGRPRKYGPDG+M+LA
Sbjct: 58 SMAGVSGNAGIEGSAGGGGGGGSMLPHGFNINSVGS--EQIKRKRGRPRKYGPDGSMALA 115
Query: 93 LVPSPSSVTTATGGTG----SGLSSPGGGPL-SPDSIKKSRGRPPGSGSGKKHQLEALGS 147
L P S GTG S +++ L SP+S KK++GRP GS KK QLEALGS
Sbjct: 116 LGSGPPS------GTGCFPPSNMANSASEALGSPNSSKKTKGRP--LGSKKKQQLEALGS 167
Query: 148 AGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYE 207
AG+GFTPHVI VKAGEDVSSKIMSFSQNGPRA+CILSANG+ISNVTLRQ ATSGGTVTYE
Sbjct: 168 AGIGFTPHVIDVKAGEDVSSKIMSFSQNGPRAICILSANGSISNVTLRQPATSGGTVTYE 227
Query: 208 GRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
GRFEILSLSGSFLLSE+ GQRSRTGGLSVSLSGPDGRVLGGSVAGLLTA +PVQVVVGSF
Sbjct: 228 GRFEILSLSGSFLLSENGGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTALSPVQVVVGSF 287
Query: 268 LADGRKESKSSHRMESLPVPPKL--APGGQPAGQCSPPSRGTLSESSGG-PGSPLNHSTG 324
+ADG KE K + + E P L A G G S PS GTLSESS G P SPLN+S+G
Sbjct: 288 IADGNKEPKPARQNELTTALPMLNTAGFGHLTGGASSPSHGTLSESSDGSPDSPLNNSSG 347
Query: 325 ACNNNHLPQGMATGIPWK 342
CNN++ PQGM +G+PWK
Sbjct: 348 GCNNSNHPQGM-SGMPWK 364
>gi|449458061|ref|XP_004146766.1| PREDICTED: uncharacterized protein LOC101211767 [Cucumis sativus]
Length = 364
Score = 349 bits (896), Expect = 9e-94, Method: Compositional matrix adjust.
Identities = 233/378 (61%), Positives = 267/378 (70%), Gaps = 50/378 (13%)
Query: 1 MSGSETAVNVMTSQPPASI-------------QSMRLAFSADGTAVYKPIT-ATSPTYQP 46
MSGSET V +TS+ P + Q+MRLAF ADGT YKP+T +TSP+YQ
Sbjct: 1 MSGSETGV--ITSREPFGVGVQNSSLHSQSGTQNMRLAFGADGTG-YKPVTPSTSPSYQS 57
Query: 47 SGAGGDGAIPQA--------------QGLNVMNMGSGSEPMKRKRGRPRKYGPDGTMSLA 92
S AG G G N+ ++GS E +KRKRGRPRKYGPDG+M+LA
Sbjct: 58 SMAGVSGNAGIEGSAGGGGGGGSMLPHGFNINSVGS--EQIKRKRGRPRKYGPDGSMALA 115
Query: 93 LVPSPSSVTTATGGTG----SGLSSPGGGPL-SPDSIKKSRGRPPGSGSGKKHQLEALGS 147
L P S GTG S +++ L SP+S KK++GRP GS KK QLEALGS
Sbjct: 116 LGSGPPS------GTGCFPPSNMANSASEALGSPNSSKKTKGRP--LGSKKKQQLEALGS 167
Query: 148 AGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYE 207
AG+GFTPHVI VKAGEDVSSKIMSFSQNGPRA+CILSANG+ISNVTLRQ ATSGGTVTYE
Sbjct: 168 AGIGFTPHVIDVKAGEDVSSKIMSFSQNGPRAICILSANGSISNVTLRQPATSGGTVTYE 227
Query: 208 GRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
GRF+ILSLSGSFLLSE+ GQRSRTGGLSVSLSGPDGRVLGGSVAGLLTA +PVQVVVGSF
Sbjct: 228 GRFQILSLSGSFLLSENGGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTALSPVQVVVGSF 287
Query: 268 LADGRKESKSSHRMESLPVPPKL--APGGQPAGQCSPPSRGTLSESSGG-PGSPLNHSTG 324
+ADG KE K + + E P L A G G S PS GTLSESS G P SPLN+S+G
Sbjct: 288 IADGNKEPKPARQNELTTALPMLNTAGFGHLTGGASSPSHGTLSESSDGSPDSPLNNSSG 347
Query: 325 ACNNNHLPQGMATGIPWK 342
CNN++ PQGM +G+PWK
Sbjct: 348 GCNNSNHPQGM-SGMPWK 364
>gi|448872670|gb|AGE46020.1| putative AT-hook DNA-binding protein [Elaeis guineensis]
Length = 362
Score = 337 bits (864), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 240/364 (65%), Positives = 268/364 (73%), Gaps = 33/364 (9%)
Query: 1 MSGSETAVNV------MTSQPPASIQSMRLAFSADGTAVYKPITATSPTYQPSGAGGDG- 53
MSG E+ NV + SQP S+QSMRLAF+ DGTA+YKPIT +SP P GG
Sbjct: 10 MSGRES-FNVGMQKSPVQSQP--SMQSMRLAFAPDGTAIYKPITTSSPPPPPYQGGGGAG 66
Query: 54 ----------AIPQAQGLNVMNMGSGSEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTA 103
A GLN+ N+G EP+KRKRGRPRKYGPDGTMSLAL + +
Sbjct: 67 STGGGDGPSPAAITPHGLNI-NVG---EPVKRKRGRPRKYGPDGTMSLALTTVSPTAAVS 122
Query: 104 TGGTGSGLSSPGGG----PLSPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITV 159
G G SS G G S +++KK+RGRPPGSG KK QL ALGSAG+GFTPHVITV
Sbjct: 123 PGSGGFSPSSAGAGNPASSASAEAMKKARGRPPGSG--KKQQLAALGSAGIGFTPHVITV 180
Query: 160 KAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSF 219
KAGEDVSSKIMSFSQ+GPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSF
Sbjct: 181 KAGEDVSSKIMSFSQHGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSF 240
Query: 220 LLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSH 279
LLSES GQRSRTGGLSVSL+GPDGRVLGG VAGLLTAA+PVQVVVGSF+ADG+KE K +
Sbjct: 241 LLSESGGQRSRTGGLSVSLAGPDGRVLGGGVAGLLTAASPVQVVVGSFIADGKKEPKHTA 300
Query: 280 RMESLPVPPKLAPGGQPAGQCSPPSRGTL-SESSGGPGSPLNHSTGACNNNHLPQGMATG 338
+ P KLA GG AG SPPSRGTL S GGPGSPLN STG CNN++ QG++
Sbjct: 301 PSDPTLAPGKLAAGGAAAGANSPPSRGTLSESSGGGPGSPLNQSTGTCNNSNQ-QGLSN- 358
Query: 339 IPWK 342
+PWK
Sbjct: 359 MPWK 362
>gi|223943393|gb|ACN25780.1| unknown [Zea mays]
gi|414869457|tpg|DAA48014.1| TPA: AT-hook protein 1 [Zea mays]
Length = 388
Score = 333 bits (855), Expect = 6e-89, Method: Compositional matrix adjust.
Identities = 208/354 (58%), Positives = 250/354 (70%), Gaps = 26/354 (7%)
Query: 12 TSQPPAS--IQSMRLAFSADGTAVYKPI--TATSPTYQPSGAGGDGAIPQAQ---GLNVM 64
S PPAS +QS+R+A+++DGTAV+ P+ +A +P+YQP GA ++ A G
Sbjct: 38 VSGPPASAAMQSVRMAYTSDGTAVFAPMRSSAATPSYQPQGAAHGASMSAATIIGGNGAA 97
Query: 65 NMGSGSEPM-KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGS-GLSSPGGGPL--- 119
S EP+ K+KRGRPRKYGPDG+M+LALVP ++ + T G GS G SP G L
Sbjct: 98 AAPSMGEPVPKKKRGRPRKYGPDGSMALALVPVSAATGSPTTGQGSSGPFSPAGSNLTNS 157
Query: 120 ----SPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQN 175
SPD KK RGRP GS K +++A GS+G GFTPHVITV+AGEDVSSKIMSFSQ+
Sbjct: 158 LLVASPDGFKK-RGRP--KGSTNKPRMDAAGSSGAGFTPHVITVQAGEDVSSKIMSFSQH 214
Query: 176 GPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLS 235
GPRAVC+LSANGAISNVTLRQ ATSGGTVTYEGRFEILSLSGSFLL E GQRSRTGGLS
Sbjct: 215 GPRAVCVLSANGAISNVTLRQTATSGGTVTYEGRFEILSLSGSFLLVEDGGQRSRTGGLS 274
Query: 236 VSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVPPKLAPGGQ 295
VSL+GPDGRVLGG VAGLL AA+PVQ+V+GSF + G+KE+K P P K+AP
Sbjct: 275 VSLAGPDGRVLGGGVAGLLVAASPVQIVLGSFNSGGKKEAKKHAPSGPTPAPLKVAPPTT 334
Query: 296 PA---GQCSPPSR-GTLSESSGGPGS---PLNHSTGACNNNHLPQGMATGIPWK 342
SPPSR GTLSESSGG GS PL+ A ++N P +++ +PWK
Sbjct: 335 TRMGPNSSSPPSRGGTLSESSGGAGSPPPPLHQGMAASSSNDQPPFLSSIMPWK 388
>gi|223947063|gb|ACN27615.1| unknown [Zea mays]
gi|223947407|gb|ACN27787.1| unknown [Zea mays]
gi|224029909|gb|ACN34030.1| unknown [Zea mays]
gi|414869452|tpg|DAA48009.1| TPA: AT-hook protein 1 isoform 1 [Zea mays]
gi|414869453|tpg|DAA48010.1| TPA: AT-hook protein 1 isoform 2 [Zea mays]
gi|414869454|tpg|DAA48011.1| TPA: AT-hook protein 1 isoform 3 [Zea mays]
gi|414869455|tpg|DAA48012.1| TPA: AT-hook protein 1 isoform 4 [Zea mays]
gi|414869456|tpg|DAA48013.1| TPA: AT-hook protein 1 isoform 5 [Zea mays]
Length = 376
Score = 333 bits (854), Expect = 6e-89, Method: Compositional matrix adjust.
Identities = 208/354 (58%), Positives = 250/354 (70%), Gaps = 26/354 (7%)
Query: 12 TSQPPAS--IQSMRLAFSADGTAVYKPI--TATSPTYQPSGAGGDGAIPQAQ---GLNVM 64
S PPAS +QS+R+A+++DGTAV+ P+ +A +P+YQP GA ++ A G
Sbjct: 26 VSGPPASAAMQSVRMAYTSDGTAVFAPMRSSAATPSYQPQGAAHGASMSAATIIGGNGAA 85
Query: 65 NMGSGSEPM-KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGS-GLSSPGGGPL--- 119
S EP+ K+KRGRPRKYGPDG+M+LALVP ++ + T G GS G SP G L
Sbjct: 86 AAPSMGEPVPKKKRGRPRKYGPDGSMALALVPVSAATGSPTTGQGSSGPFSPAGSNLTNS 145
Query: 120 ----SPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQN 175
SPD KK RGRP GS K +++A GS+G GFTPHVITV+AGEDVSSKIMSFSQ+
Sbjct: 146 LLVASPDGFKK-RGRP--KGSTNKPRMDAAGSSGAGFTPHVITVQAGEDVSSKIMSFSQH 202
Query: 176 GPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLS 235
GPRAVC+LSANGAISNVTLRQ ATSGGTVTYEGRFEILSLSGSFLL E GQRSRTGGLS
Sbjct: 203 GPRAVCVLSANGAISNVTLRQTATSGGTVTYEGRFEILSLSGSFLLVEDGGQRSRTGGLS 262
Query: 236 VSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVPPKLAPGGQ 295
VSL+GPDGRVLGG VAGLL AA+PVQ+V+GSF + G+KE+K P P K+AP
Sbjct: 263 VSLAGPDGRVLGGGVAGLLVAASPVQIVLGSFNSGGKKEAKKHAPSGPTPAPLKVAPPTT 322
Query: 296 PA---GQCSPPSR-GTLSESSGGPGS---PLNHSTGACNNNHLPQGMATGIPWK 342
SPPSR GTLSESSGG GS PL+ A ++N P +++ +PWK
Sbjct: 323 TRMGPNSSSPPSRGGTLSESSGGAGSPPPPLHQGMAASSSNDQPPFLSSIMPWK 376
>gi|224061839|ref|XP_002300624.1| predicted protein [Populus trichocarpa]
gi|222842350|gb|EEE79897.1| predicted protein [Populus trichocarpa]
Length = 277
Score = 332 bits (851), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 185/253 (73%), Positives = 205/253 (81%), Gaps = 14/253 (5%)
Query: 93 LVPSPSSVTTATGGTGSGLSSPG---GGPLSPDSIKKSRGRPPGSGSGKKHQLEALGSAG 149
LVP+PS G+ G++ P GG +SP +KK+RGRPPGS KK QL+ALGSAG
Sbjct: 36 LVPTPSP------GSDVGVAGPAVALGGSVSPTGVKKARGRPPGSS--KKQQLDALGSAG 87
Query: 150 VGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGR 209
+GFTPHVITVKAGEDVSSKIMSFSQ+GPRAVCILSANGAISNVTLRQ ATSGGTVTYEGR
Sbjct: 88 IGFTPHVITVKAGEDVSSKIMSFSQHGPRAVCILSANGAISNVTLRQQATSGGTVTYEGR 147
Query: 210 FEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLA 269
FEIL+LSGS+L SE+ GQRSR+GGLSV LSGPDGRVLGGSVAGLL AA PVQVVV SF+A
Sbjct: 148 FEILALSGSYLPSENGGQRSRSGGLSVCLSGPDGRVLGGSVAGLLMAAAPVQVVVSSFIA 207
Query: 270 DGRKESKSSHRMESLPVPPKLAPGGQPAGQCSPPSRGTLSESSGGPGSPLNHSTGACNNN 329
DGRK SKS++ ME KL P G G SPPSRGTLSESSGGPGSPLN STGACNNN
Sbjct: 208 DGRKVSKSANHMEPSSATSKLPPTGGSTGVSSPPSRGTLSESSGGPGSPLNQSTGACNNN 267
Query: 330 HLPQGMATGIPWK 342
PQG++ +PWK
Sbjct: 268 --PQGISN-MPWK 277
>gi|226532898|ref|NP_001149717.1| AT-hook protein 1 [Zea mays]
gi|195629724|gb|ACG36503.1| AT-hook protein 1 [Zea mays]
Length = 377
Score = 320 bits (819), Expect = 7e-85, Method: Compositional matrix adjust.
Identities = 203/355 (57%), Positives = 246/355 (69%), Gaps = 27/355 (7%)
Query: 12 TSQPPAS--IQSMRLAFSADGTAVYKPIT--ATSPTYQPSGAGGDGAIPQAQ---GLNVM 64
S PPAS +QS+R+A+++DGTAV+ P++ A +P+YQP GA ++ A G
Sbjct: 26 VSGPPASAAMQSVRMAYTSDGTAVFAPMSSSAATPSYQPQGAAHGASMSAATIIGGNGAA 85
Query: 65 NMGSGSEPM-KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGS-GLSSPGGGPL--- 119
S EP+ K+KRGRPRKYGPDG+M+LALVP ++ + T G GS G SP G L
Sbjct: 86 AAPSMGEPVPKKKRGRPRKYGPDGSMALALVPVSAATGSPTTGQGSSGPFSPAGSNLTNS 145
Query: 120 ----SPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQN 175
SPD KK RGRP GS K +++A GS+G GFTPHVITV+AGEDVSSKIMSFSQ+
Sbjct: 146 LLVASPDGFKK-RGRP--KGSTNKPRMDAAGSSGAGFTPHVITVQAGEDVSSKIMSFSQH 202
Query: 176 GPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLS 235
GPRAVC+LSANGAISNVTLRQ ATSGGTVTYEGRFEILSLSGSFLL E GQRSRTGGLS
Sbjct: 203 GPRAVCVLSANGAISNVTLRQTATSGGTVTYEGRFEILSLSGSFLLVEDGGQRSRTGGLS 262
Query: 236 VSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVPPKLAPGGQ 295
VSL+GPDGRVLGG VAGLL AA+PVQ+V+GSF + G+KE+K P P K+AP
Sbjct: 263 VSLAGPDGRVLGGGVAGLLVAASPVQIVLGSFNSGGKKEAKKHAPSAPTPAPLKVAPPTT 322
Query: 296 PA---GQCSPPSR-GTLSE----SSGGPGSPLNHSTGACNNNHLPQGMATGIPWK 342
SPPSR GTLSE + P PL+ A ++N P +++ +PWK
Sbjct: 323 TRMGPNSSSPPSRGGTLSESSGGAGSPPPPPLHQGMAASSSNDQPPFLSSIMPWK 377
>gi|2598227|emb|CAA10857.1| AT-hook protein 1 [Arabidopsis thaliana]
Length = 351
Score = 315 bits (808), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 195/333 (58%), Positives = 228/333 (68%), Gaps = 47/333 (14%)
Query: 29 DGTAVYK-PITATSP--TYQPSGAGGDGAIPQAQGLNVMNM-----------GSGSEPMK 74
DGTA+YK P+ + SP YQP+ AG + +V+NM G+GSEP+K
Sbjct: 47 DGTALYKQPMRSVSPPQQYQPNSAGEN---------SVLNMNLPGGESGGMTGTGSEPVK 97
Query: 75 RKRGRPRKYGPD-GTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPG 133
++RGRPRKYGPD G MSL L P S T + +G D +K RGRPPG
Sbjct: 98 KRRGRPRKYGPDSGEMSLGLNPGAPSFTVSQPSSGG------------DGGEKKRGRPPG 145
Query: 134 SGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVT 193
S S K+ +LEALGS G+GFTPHV+TV AGEDVSSKIM+ + NGPRAVC+LSANGAISNVT
Sbjct: 146 SSS-KRLKLEALGSTGIGFTPHVLTVLAGEDVSSKIMALTHNGPRAVCVLSANGAISNVT 204
Query: 194 LRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGL 253
LRQ+ATSGGTVTYEGRFEILSLSGSF L E++GQRSRTGGLSVSLS PDG VLGGSVAGL
Sbjct: 205 LRQSATSGGTVTYEGRFEILSLSGSFHLLENNGQRSRTGGLSVSLSSPDGNVLGGSVAGL 264
Query: 254 LTAATPVQVVVGSFLADGRKESKSSHRMESL--PVPPKLAPGGQPAGQCSPPSRGTLSES 311
L AA+PVQ+VVGSFL DG KE K L PV P++AP SP SRGT+SES
Sbjct: 265 LIAASPVQIVVGSFLPDGEKEPKQHVGQMGLSSPVLPRVAPTQVLMTPSSPQSRGTMSES 324
Query: 312 S--GGPGSPLNHSTGACNNNHLPQGMATGIPWK 342
S GG GSP++ STG NN + +PWK
Sbjct: 325 SCGGGHGSPIHQSTGGPYNNTI------NMPWK 351
>gi|18403332|ref|NP_565769.1| AT hook motif DNA-binding family protein [Arabidopsis thaliana]
gi|30685781|ref|NP_850215.1| AT hook motif DNA-binding family protein [Arabidopsis thaliana]
gi|42571033|ref|NP_973590.1| AT hook motif DNA-binding family protein [Arabidopsis thaliana]
gi|186505052|ref|NP_001118437.1| AT hook motif DNA-binding family protein [Arabidopsis thaliana]
gi|19548037|gb|AAL87382.1| At2g33620/F4P9.39 [Arabidopsis thaliana]
gi|20196849|gb|AAB80677.2| AT-hook DNA-binding protein (AHP1) [Arabidopsis thaliana]
gi|119657364|tpd|FAA00281.1| TPA: AT-hook motif nuclear localized protein 10 [Arabidopsis
thaliana]
gi|330253766|gb|AEC08860.1| AT hook motif DNA-binding family protein [Arabidopsis thaliana]
gi|330253767|gb|AEC08861.1| AT hook motif DNA-binding family protein [Arabidopsis thaliana]
gi|330253768|gb|AEC08862.1| AT hook motif DNA-binding family protein [Arabidopsis thaliana]
gi|330253769|gb|AEC08863.1| AT hook motif DNA-binding family protein [Arabidopsis thaliana]
Length = 351
Score = 314 bits (805), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 194/333 (58%), Positives = 228/333 (68%), Gaps = 47/333 (14%)
Query: 29 DGTAVYK-PITATSP--TYQPSGAGGDGAIPQAQGLNVMNM-----------GSGSEPMK 74
DGTA+YK P+ + SP YQP+ AG + +V+NM G+GSEP+K
Sbjct: 47 DGTALYKQPMRSVSPPQQYQPNSAGEN---------SVLNMNLPGGESGGMTGTGSEPVK 97
Query: 75 RKRGRPRKYGPD-GTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPG 133
++RGRPRKYGPD G MSL L P S T + +G D +K RGRPPG
Sbjct: 98 KRRGRPRKYGPDSGEMSLGLNPGAPSFTVSQPSSGG------------DGGEKKRGRPPG 145
Query: 134 SGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVT 193
S S K+ +L+ALGS G+GFTPHV+TV AGEDVSSKIM+ + NGPRAVC+LSANGAISNVT
Sbjct: 146 SSS-KRLKLQALGSTGIGFTPHVLTVLAGEDVSSKIMALTHNGPRAVCVLSANGAISNVT 204
Query: 194 LRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGL 253
LRQ+ATSGGTVTYEGRFEILSLSGSF L E++GQRSRTGGLSVSLS PDG VLGGSVAGL
Sbjct: 205 LRQSATSGGTVTYEGRFEILSLSGSFHLLENNGQRSRTGGLSVSLSSPDGNVLGGSVAGL 264
Query: 254 LTAATPVQVVVGSFLADGRKESKSSHRMESL--PVPPKLAPGGQPAGQCSPPSRGTLSES 311
L AA+PVQ+VVGSFL DG KE K L PV P++AP SP SRGT+SES
Sbjct: 265 LIAASPVQIVVGSFLPDGEKEPKQHVGQMGLSSPVLPRVAPTQVLMTPSSPQSRGTMSES 324
Query: 312 S--GGPGSPLNHSTGACNNNHLPQGMATGIPWK 342
S GG GSP++ STG NN + +PWK
Sbjct: 325 SCGGGHGSPIHQSTGGPYNNTI------NMPWK 351
>gi|14326504|gb|AAK60297.1|AF385705_1 At2g33620/F4P9.39 [Arabidopsis thaliana]
Length = 351
Score = 313 bits (801), Expect = 9e-83, Method: Compositional matrix adjust.
Identities = 194/333 (58%), Positives = 227/333 (68%), Gaps = 47/333 (14%)
Query: 29 DGTAVYK-PITATSP--TYQPSGAGGDGAIPQAQGLNVMNM-----------GSGSEPMK 74
DGTA+YK P+ + SP YQP+ AG + +V+NM G+GSEP+K
Sbjct: 47 DGTALYKQPMRSVSPPQQYQPNSAGEN---------SVLNMNLPGGESGGMTGTGSEPVK 97
Query: 75 RKRGRPRKYGPD-GTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPG 133
++RGRPRKYGPD G MSL L P S T + +G D +K RGRPPG
Sbjct: 98 KRRGRPRKYGPDSGEMSLGLNPGAPSFTVSQPSSGG------------DGGEKKRGRPPG 145
Query: 134 SGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVT 193
S S K+ +L+ALGS G+GFTPHV+TV AGEDVSSKIM+ + NGPRAVC+LSANGAISNVT
Sbjct: 146 SSS-KRLKLQALGSTGIGFTPHVLTVLAGEDVSSKIMALTHNGPRAVCVLSANGAISNVT 204
Query: 194 LRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGL 253
LRQ ATSGGTVTYEGRFEILSLSGSF L E++GQRSRTGGLSVSLS PDG VLGGSVAGL
Sbjct: 205 LRQPATSGGTVTYEGRFEILSLSGSFHLLENNGQRSRTGGLSVSLSSPDGNVLGGSVAGL 264
Query: 254 LTAATPVQVVVGSFLADGRKESKSSHRMESL--PVPPKLAPGGQPAGQCSPPSRGTLSES 311
L AA+PVQ+VVGSFL DG KE K L PV P++AP SP SRGT+SES
Sbjct: 265 LIAASPVQIVVGSFLPDGEKEPKQHVGQMGLSSPVLPRVAPTQVLMTPSSPQSRGTMSES 324
Query: 312 S--GGPGSPLNHSTGACNNNHLPQGMATGIPWK 342
S GG GSP++ STG NN + +PWK
Sbjct: 325 SCGGGHGSPIHQSTGGPYNNTI------NMPWK 351
>gi|224086106|ref|XP_002307818.1| predicted protein [Populus trichocarpa]
gi|222857267|gb|EEE94814.1| predicted protein [Populus trichocarpa]
Length = 272
Score = 308 bits (789), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 187/279 (67%), Positives = 212/279 (75%), Gaps = 32/279 (11%)
Query: 89 MSLALVPSPSSV------TTATGGTGSGLSSPGG-------------------GPLSPDS 123
M+LAL +P SV +TATGG G SSP G +SP
Sbjct: 1 MALALASAPQSVAVTQPTSTATGG---GFSSPPAQTHPLVSPPPPPPPPGSDIGSVSPTG 57
Query: 124 IKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCIL 183
+KK+RGRPPGS KK QL ALGSAG GFTPHVITVKAGED+SSK+MSFSQ+GPRAVCIL
Sbjct: 58 VKKARGRPPGSS--KKQQLNALGSAGFGFTPHVITVKAGEDISSKVMSFSQHGPRAVCIL 115
Query: 184 SANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDG 243
SANGAISNVTLRQ ATSGGTVTYEGRFEIL+LSGS+L SE+ GQRSR+GGLSV LSGPDG
Sbjct: 116 SANGAISNVTLRQQATSGGTVTYEGRFEILALSGSYLPSENGGQRSRSGGLSVCLSGPDG 175
Query: 244 RVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVPPKLAPGGQPAGQCSPP 303
RVLGG+VAGLL AA PVQVVVGSF+ADGRKESK+++ E +L P G G SPP
Sbjct: 176 RVLGGTVAGLLVAAAPVQVVVGSFIADGRKESKTANHTEPSSATSRLPPRGGSTGVSSPP 235
Query: 304 SRGTLSESSGGPGSPLNHSTGACNNNHLPQGMATGIPWK 342
SRGTLSESSGGPGSPLN STGACNN++ PQGM++ +PW+
Sbjct: 236 SRGTLSESSGGPGSPLNQSTGACNNSN-PQGMSS-MPWE 272
>gi|42408801|dbj|BAD10062.1| putative AT-hook DNA-binding protein [Oryza sativa Japonica Group]
gi|125562155|gb|EAZ07603.1| hypothetical protein OsI_29854 [Oryza sativa Indica Group]
Length = 354
Score = 292 bits (748), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 197/342 (57%), Positives = 232/342 (67%), Gaps = 37/342 (10%)
Query: 20 QSMRLAFSADGTAVYKPITAT---SPTYQPSGAGGDGAIPQAQGLNVMNMGSGSEPM-KR 75
QS+R+A+++DGT V+ P++A P YQP GA SG EP+ K+
Sbjct: 31 QSVRMAYTSDGTPVFAPVSAAVSAPPGYQPGGA-------AGGNGAAALADSGGEPVAKK 83
Query: 76 KRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSP-------------- 121
KRGRPRKYGPDG+MSL LV SP TA T PG P SP
Sbjct: 84 KRGRPRKYGPDGSMSLGLVTSP----TAAASTPVAQGVPG--PFSPTQPKPPASFLSSGW 137
Query: 122 -DSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAV 180
D +KK RGRP GS K +++A+GSAGVGFTPHVITV AGEDVS+KIMSF+Q+G RAV
Sbjct: 138 PDGVKK-RGRP--KGSTNKPRIDAVGSAGVGFTPHVITVLAGEDVSAKIMSFAQHGNRAV 194
Query: 181 CILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSG 240
C+LSANGAISNVTLRQ ATSGGTVTYEGRFEILSLSGSFLL++ GQRSRTGGLSVSL+G
Sbjct: 195 CVLSANGAISNVTLRQTATSGGTVTYEGRFEILSLSGSFLLTDHGGQRSRTGGLSVSLAG 254
Query: 241 PDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVPPKLAPGGQPAGQC 300
PDGR+LGG VAGLL AATPVQ+VVGSF ++G+KE K E P K P G
Sbjct: 255 PDGRLLGGGVAGLLIAATPVQIVVGSFNSEGKKEPKQHAHSEPASAPSKAVPTAG-MGPN 313
Query: 301 SPPSRGTLSESSGGPGSPLNHSTGACNNNHLPQGMATGIPWK 342
SPPSRGTLSESSGG GSPL+ ++N P +++ +PWK
Sbjct: 314 SPPSRGTLSESSGGAGSPLHPGIAPPSSNSQPPFLSS-MPWK 354
>gi|297727103|ref|NP_001175915.1| Os09g0491708 [Oryza sativa Japonica Group]
gi|119657406|tpd|FAA00302.1| TPA: AT-hook motif nuclear localized protein 2 [Oryza sativa
Japonica Group]
gi|215740581|dbj|BAG97237.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255679015|dbj|BAH94643.1| Os09g0491708 [Oryza sativa Japonica Group]
Length = 359
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 193/338 (57%), Positives = 232/338 (68%), Gaps = 33/338 (9%)
Query: 19 IQSMRLAFSADGTAVYKPITATSPTYQPSGA---GGDGAIPQAQGLNVMNMGSGSEPMKR 75
+QS+R+A++ADGT ++ P+ + + GG+GA + +G +K+
Sbjct: 41 MQSVRMAYTADGTPIFAPVNSAPAPAPAATYPPAGGNGA---------AALDAGEPVVKK 91
Query: 76 KRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIK-------KSR 128
KRGRPRKYGPDG+MSLALVP ++ A SG SP SPD++ K R
Sbjct: 92 KRGRPRKYGPDGSMSLALVPVSTAAVAA-----SGPFSPAAAAKSPDAVSSAPPPGAKKR 146
Query: 129 GRPPGSGSGKKHQ----LEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
GRP GS + KKH + +GSAG GFTPHVI VKAGEDVS+KIMSFSQ+G R VC+LS
Sbjct: 147 GRPKGS-TNKKHVPSFGIGDIGSAGAGFTPHVIFVKAGEDVSAKIMSFSQHGTRGVCVLS 205
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSE+ G RSRTGGLSVSL+GPDGR
Sbjct: 206 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSENGGHRSRTGGLSVSLAGPDGR 265
Query: 245 VLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVPPKLAPGGQPAGQCSPPS 304
VLGG VAGLLTAA+PVQ+VVGSF +G+K K + + P K+ P G SPPS
Sbjct: 266 VLGGGVAGLLTAASPVQIVVGSFNTEGKKGPKLHAPSDPMSAPLKMVPMSG-TGPSSPPS 324
Query: 305 RGTLSESSGGPGSPLNHSTGACNNNHLPQGMATGIPWK 342
RGTLSESSGGPGSPLN G +NH G+ + + WK
Sbjct: 325 RGTLSESSGGPGSPLNQ--GVTASNHGQPGLPS-LSWK 359
>gi|218202371|gb|EEC84798.1| hypothetical protein OsI_31862 [Oryza sativa Indica Group]
Length = 358
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 193/338 (57%), Positives = 232/338 (68%), Gaps = 33/338 (9%)
Query: 19 IQSMRLAFSADGTAVYKPITATSPTYQPSGA---GGDGAIPQAQGLNVMNMGSGSEPMKR 75
+QS+R+A++ADGT ++ P+ + + GG+GA + +G +K+
Sbjct: 40 MQSVRMAYTADGTPIFAPVNSAPAPAPAATYPPAGGNGA---------AALDAGEPVVKK 90
Query: 76 KRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSI-------KKSR 128
KRGRPRKYGPDG+MSLALVP ++ A SG SP SPD++ K R
Sbjct: 91 KRGRPRKYGPDGSMSLALVPVSTAAVAA-----SGPFSPAAAAKSPDAVLSAPPPGAKKR 145
Query: 129 GRPPGSGSGKKHQ----LEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
GRP GS + KKH + +GSAG GFTPHVI VKAGEDVS+KIMSFSQ+G R VC+LS
Sbjct: 146 GRPKGS-TNKKHVPSFGIGDIGSAGAGFTPHVIFVKAGEDVSAKIMSFSQHGTRGVCVLS 204
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSE+ G RSRTGGLSVSL+GPDGR
Sbjct: 205 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSENGGHRSRTGGLSVSLAGPDGR 264
Query: 245 VLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVPPKLAPGGQPAGQCSPPS 304
VLGG VAGLLTAA+PVQ+VVGSF +G+K K + + P K+ P G SPPS
Sbjct: 265 VLGGGVAGLLTAASPVQIVVGSFNTEGKKGPKLHAPSDPMSAPLKMVPMSG-TGPSSPPS 323
Query: 305 RGTLSESSGGPGSPLNHSTGACNNNHLPQGMATGIPWK 342
RGTLSESSGGPGSPLN G +NH G+ + + WK
Sbjct: 324 RGTLSESSGGPGSPLNQ--GVTASNHGQPGLPS-LSWK 358
>gi|226503075|ref|NP_001151163.1| LOC100284796 [Zea mays]
gi|195644722|gb|ACG41829.1| AT-hook protein 1 [Zea mays]
Length = 369
Score = 288 bits (736), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 188/351 (53%), Positives = 237/351 (67%), Gaps = 41/351 (11%)
Query: 19 IQSMRLAFSADGTAVYKPITAT--SPTYQPSGA------------GGDGAIPQAQGLNVM 64
+QS+R+A++ADGTAV+ P++++ +P+YQP GA GG+GA P A +
Sbjct: 33 MQSVRMAYTADGTAVFAPVSSSPATPSYQPQGAAHGASMSAATVVGGNGA-PAAPSMG-- 89
Query: 65 NMGSGSEPM-KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPL---- 119
EP+ K+KRGRPRKYGPDG+M+LA+VP+ S + + TG G S P P
Sbjct: 90 ------EPLAKKKRGRPRKYGPDGSMALAMVPA--SAASGSPATGQGFSGPFSPPALNPA 141
Query: 120 ------SPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFS 173
SPD KK RGRP GS + +++A GS+G GFTPHVITV+AGEDV+SKIMSFS
Sbjct: 142 SSLVVASPDGFKK-RGRP--KGSTNRPRVDAAGSSGAGFTPHVITVQAGEDVASKIMSFS 198
Query: 174 QNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGG 233
Q+G VC+LSANG+ISNVTLRQ ATSG TVTYEG+FEILSLSGSF L+E QRSR GG
Sbjct: 199 QHGTHGVCVLSANGSISNVTLRQTATSGRTVTYEGQFEILSLSGSFFLAEDGVQRSRNGG 258
Query: 234 LSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVPPKLAPG 293
LSVSL+GPDGR+LGG VAGLL AA+PVQ+V+GSF + G KE + E PP++AP
Sbjct: 259 LSVSLAGPDGRLLGGGVAGLLVAASPVQIVLGSFNSGGGKEPQKQAPSEPTSAPPRVAPT 318
Query: 294 GQPAGQCSPPSRGTLSESSGGPGS--PLNHSTGACNNNHLPQGMATGIPWK 342
G SP SRGTLSESSGG GS PL+ + A +N + +PW+
Sbjct: 319 AGMGGPSSPSSRGTLSESSGGAGSPPPLHRAMAASASNSNQPPFLSSMPWR 369
>gi|297823157|ref|XP_002879461.1| DNA-binding family protein [Arabidopsis lyrata subsp. lyrata]
gi|297325300|gb|EFH55720.1| DNA-binding family protein [Arabidopsis lyrata subsp. lyrata]
Length = 344
Score = 287 bits (735), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 183/328 (55%), Positives = 220/328 (67%), Gaps = 39/328 (11%)
Query: 16 PASIQSMRLAFSAD-GTAVYK-PITATSP--TYQPSGAGGDGAIPQAQGLNVMNMG---- 67
P Q+M+ +F D G +Y+ P+ + SP YQP+ AG + V+NM
Sbjct: 34 PQQSQNMQSSFGGDDGADLYRQPMRSASPPQQYQPNSAGEN---------PVLNMNMPGA 84
Query: 68 -----SGSEPMKRKRGRPRKYGPD-GTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSP 121
+GSEP+K++RGRPRKYGP+ G SL L S T + +G G GG
Sbjct: 85 EHGAVTGSEPVKKRRGRPRKYGPESGETSLGLFSGAPSFTVSQPVSGGG-----GGE--- 136
Query: 122 DSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVC 181
KK RGRPPGS S K+ +L+ALGS G+GFTPHV+TV GEDVSSKIM+ + NGPRAVC
Sbjct: 137 ---KKMRGRPPGSSS-KRLKLQALGSTGIGFTPHVLTVMTGEDVSSKIMALAHNGPRAVC 192
Query: 182 ILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGP 241
++SANGAISNVTLRQ+ TSGGTVTYEGRFEILSLSGSF L E+ GQRSRTGGLSVSLS P
Sbjct: 193 VMSANGAISNVTLRQSGTSGGTVTYEGRFEILSLSGSFHLLENDGQRSRTGGLSVSLSSP 252
Query: 242 DGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESL--PVPPKLAPGGQPAGQ 299
DG VLGGSVAGLL AA+PVQ+VVGSF+ DG KE K L P P++AP
Sbjct: 253 DGNVLGGSVAGLLIAASPVQIVVGSFIPDGEKEPKQHVGQMGLSSPTLPRVAPTQVLMTP 312
Query: 300 CSPPSRGTLSESS--GGPGSPLNHSTGA 325
SP SRGT+SESS GG GSP++ TG+
Sbjct: 313 GSPQSRGTMSESSCGGGHGSPIHQGTGS 340
>gi|194701430|gb|ACF84799.1| unknown [Zea mays]
gi|195646832|gb|ACG42884.1| AT-hook protein 1 [Zea mays]
gi|219886795|gb|ACL53772.1| unknown [Zea mays]
gi|223942375|gb|ACN25271.1| unknown [Zea mays]
gi|223947841|gb|ACN28004.1| unknown [Zea mays]
gi|223949081|gb|ACN28624.1| unknown [Zea mays]
gi|224028471|gb|ACN33311.1| unknown [Zea mays]
gi|238010744|gb|ACR36407.1| unknown [Zea mays]
gi|413925296|gb|AFW65228.1| AT-hook protein 1 isoform 1 [Zea mays]
gi|413925297|gb|AFW65229.1| AT-hook protein 1 isoform 2 [Zea mays]
gi|413925298|gb|AFW65230.1| AT-hook protein 1 isoform 3 [Zea mays]
gi|413925299|gb|AFW65231.1| AT-hook protein 1 isoform 4 [Zea mays]
Length = 369
Score = 286 bits (733), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 188/351 (53%), Positives = 236/351 (67%), Gaps = 41/351 (11%)
Query: 19 IQSMRLAFSADGTAVYKPITAT--SPTYQPSGA------------GGDGAIPQAQGLNVM 64
+QS+R+A++ADGTAV+ P++++ +P+YQP GA GG+GA P A +
Sbjct: 33 MQSVRMAYTADGTAVFAPVSSSPATPSYQPQGAAHGASMSAATVVGGNGA-PAAPSMG-- 89
Query: 65 NMGSGSEPM-KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPL---- 119
EP+ K+KRGRPRKYGPDG+M+LA+VP+ S + + TG G S P P
Sbjct: 90 ------EPLAKKKRGRPRKYGPDGSMALAMVPA--SAASGSPATGQGFSGPFSPPALNPA 141
Query: 120 ------SPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFS 173
SPD KK RGRP GS K +++A GS+G GFTPHVITV+AGEDV+SKIMSFS
Sbjct: 142 SSLVVASPDGFKK-RGRP--KGSTNKPRVDAAGSSGAGFTPHVITVQAGEDVASKIMSFS 198
Query: 174 QNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGG 233
Q+G VC+LSANG+ISNVTLRQ ATSG TVTYEG+FEILSLSGSF L+E QRSR G
Sbjct: 199 QHGTHGVCVLSANGSISNVTLRQTATSGRTVTYEGQFEILSLSGSFFLAEDGVQRSRNGS 258
Query: 234 LSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVPPKLAPG 293
LSVSL+GPDGR+LGG VAGLL AA+PVQ+V+GSF + G KE + E PP++AP
Sbjct: 259 LSVSLAGPDGRLLGGGVAGLLVAASPVQIVLGSFNSGGGKEPQKQAPSEPTSAPPRVAPT 318
Query: 294 GQPAGQCSPPSRGTLSESSGGPGS--PLNHSTGACNNNHLPQGMATGIPWK 342
G SP SRGTLSESSGG GS PL+ + A +N + +PW+
Sbjct: 319 AGMGGPSSPSSRGTLSESSGGAGSPPPLHRAMAASASNSNQPPFLSSMPWR 369
>gi|12643044|gb|AAK00433.1|AC060755_3 putative AT-Hook DNA-binding protein [Oryza sativa Japonica Group]
gi|110289621|gb|ABB48013.2| AT-hook protein 1, putative, expressed [Oryza sativa Japonica Group]
gi|110289622|gb|ABB48012.2| AT-hook protein 1, putative, expressed [Oryza sativa Japonica Group]
gi|125533038|gb|EAY79603.1| hypothetical protein OsI_34743 [Oryza sativa Indica Group]
Length = 405
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 167/272 (61%), Positives = 196/272 (72%), Gaps = 10/272 (3%)
Query: 62 NVMNMGSGSEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSP 121
NVM MG E M++KRGRPRKY PDG+M+LAL P S+ A P G +S
Sbjct: 100 NVMGMG---ELMRKKRGRPRKYAPDGSMALALAPISSASGGAAPPPPPPGHQPHGFSISS 156
Query: 122 ---DSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPR 178
D K RGRPPGSG KK Q EALGS G+ FTPH++TVKAGEDV+SKIM+FSQ GPR
Sbjct: 157 PASDPNAKRRGRPPGSG--KKKQFEALGSWGIAFTPHILTVKAGEDVASKIMAFSQQGPR 214
Query: 179 AVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSL 238
VCILSANGAISNVTLRQ ATSGG VTYEGRFEI+SLSGSFLL+E RSRTGGLSV+L
Sbjct: 215 TVCILSANGAISNVTLRQPATSGGLVTYEGRFEIISLSGSFLLAEDGDTRSRTGGLSVAL 274
Query: 239 SGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRK-ESKSSHRMESLPVPPKLAPGGQPA 297
+G DGRVLGG VAG+L AATPVQVVV SF+A+G+K + + ++E + PP++A PA
Sbjct: 275 AGSDGRVLGGCVAGMLMAATPVQVVVASFIAEGKKSKPVETRKVEPMSAPPQMA-TYVPA 333
Query: 298 GQCSPPSRGTLSESSGGPGSPLNHSTGACNNN 329
SPPS GT S SS GSP+NHS N++
Sbjct: 334 PVASPPSEGTSSGSSDDSGSPINHSGMPYNHS 365
>gi|225427270|ref|XP_002281340.1| PREDICTED: uncharacterized protein LOC100245362 [Vitis vinifera]
gi|297742130|emb|CBI33917.3| unnamed protein product [Vitis vinifera]
Length = 353
Score = 285 bits (730), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 173/336 (51%), Positives = 211/336 (62%), Gaps = 32/336 (9%)
Query: 10 VMTSQPPASIQSMRLAFSADGTAVYKPITATSPTYQPSGAGGDGAIPQAQGLNVMNMGSG 69
+M A +Q+ R +F++ + A+ P P G G + + G N+
Sbjct: 36 MMNPNSAAIMQNNRFSFTS--------MVASKPVDSPYGDGSSTGL-RPCGFNI------ 80
Query: 70 SEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRG 129
EP K+KRGRPRKY PDG ++L L P+P T A G G P S K++RG
Sbjct: 81 -EPAKKKRGRPRKYAPDGNIALGLAPTPIPSTAAHGDAT-------GTPSSEPPAKRNRG 132
Query: 130 RPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAI 189
RPPGSG K QL+ALG+AGVGFTPHVITV GED++SKIM+FSQ GPR VCILSANGAI
Sbjct: 133 RPPGSG---KKQLDALGAAGVGFTPHVITVNVGEDIASKIMAFSQQGPRTVCILSANGAI 189
Query: 190 SNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGS 249
NVTLRQ A SGGT++YEGRF+I+SLSGSFLLSE +G R RTGGLSVSL+G DGRVLGG
Sbjct: 190 CNVTLRQPAMSGGTISYEGRFDIISLSGSFLLSEDNGSRHRTGGLSVSLAGSDGRVLGGG 249
Query: 250 VAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVPPKLAPGGQPAGQCSPPSRGTLS 309
VAG+LTAATPVQVVVGSF+ADG+K + + S P P ++ G P SP G+
Sbjct: 250 VAGMLTAATPVQVVVGSFIADGKKTNTNQSGSSSAP-PAQMLNFGAPVVPASPSQGGSSE 308
Query: 310 ESSGGPGSPLNHSTGACNN-----NHLPQGMATGIP 340
S GSPLN NN + +P A G P
Sbjct: 309 SSDENGGSPLNRGPLPYNNVSQPIHQMPMYAAMGWP 344
>gi|414589837|tpg|DAA40408.1| TPA: DNA binding protein [Zea mays]
Length = 378
Score = 284 bits (727), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 193/323 (59%), Positives = 230/323 (71%), Gaps = 35/323 (10%)
Query: 13 SQPPAS--IQSMRLAFSADGTAVYKPITATSP---TYQP-----------SGAGGDGAIP 56
+ PP+S Q +R+A++ DGTA++ P+++ P TYQP +G GG+G P
Sbjct: 26 ATPPSSGGTQGLRMAYTTDGTAIFTPVSSVPPATATYQPVGGSAASASSLAGVGGNGGAP 85
Query: 57 QAQGLNVMNMGSGSEPMKRKRGRPRKYGPDGTMSLALVPSPSS---VTTATGGTG----S 109
V + G+G K+KRGRPRKYGPDG+MSLALVP+ + A G +G +
Sbjct: 86 ------VHSGGAGEPGTKKKRGRPRKYGPDGSMSLALVPASMAGEPAPAALGASGPFSPN 139
Query: 110 GLSSPGGGP-LSPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSK 168
G +P P SPD KK RGRP GS + KKH + ALG AG GFTPH+I VKAGEDVS+K
Sbjct: 140 GPKAPNTAPSASPDGAKK-RGRPKGS-TNKKH-VAALGPAGAGFTPHLIFVKAGEDVSAK 196
Query: 169 IMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQR 228
IMSFSQ+G RAVCILSANGAISNVTLRQ+ATSGGTVTYEGRFEILSLSGSFLLSE+ GQR
Sbjct: 197 IMSFSQHGTRAVCILSANGAISNVTLRQSATSGGTVTYEGRFEILSLSGSFLLSENGGQR 256
Query: 229 SRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADG--RKESKSSHRMESLPV 286
SRTGGLSVSL+GPDGRVLGG VAGLLTAA+PVQ+VVGSF A G + + + ++ P
Sbjct: 257 SRTGGLSVSLAGPDGRVLGGCVAGLLTAASPVQIVVGSFDAGGKKQPKQQQQQQLAPSPA 316
Query: 287 PPKLAPGGQPAGQCSPPSRGTLS 309
P LAP G AG SPPSRGTLS
Sbjct: 317 PLNLAPTGVAAGPSSPPSRGTLS 339
>gi|242049668|ref|XP_002462578.1| hypothetical protein SORBIDRAFT_02g028500 [Sorghum bicolor]
gi|241925955|gb|EER99099.1| hypothetical protein SORBIDRAFT_02g028500 [Sorghum bicolor]
Length = 381
Score = 282 bits (722), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 201/334 (60%), Positives = 233/334 (69%), Gaps = 39/334 (11%)
Query: 15 PPAS--IQSMRLAFSADGTAVYKPIT----ATSPTYQP------SGAGGDGAIPQAQGLN 62
PP+S QS+R+A++ DGTA++ P++ AT+ +QP +G GG+G P
Sbjct: 28 PPSSGVTQSLRMAYTTDGTAIFTPVSSAPPATATYHQPVAASSLAGVGGNGGAP------ 81
Query: 63 VMNMGSGSEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLS-----SPGG- 116
V + G+G K+KRGRPRKYGPDG+MSLALVP P+S+ A + SP G
Sbjct: 82 VHSGGAGEPVAKKKRGRPRKYGPDGSMSLALVPVPASIAAAPAPAPAAPGASGPFSPSGP 141
Query: 117 -----GP-LSPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIM 170
P SPD KK RGRP GS + KKH + ALG G GFTPH+I VKAGEDVS+KIM
Sbjct: 142 KALNTAPSASPDGAKK-RGRPKGS-TNKKH-VPALGPTGAGFTPHLIFVKAGEDVSAKIM 198
Query: 171 SFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSR 230
SFSQ+G RAVCILSANGAISNVTLRQ+ATSGGTVTYEGRFEILSLSGSFLLSE+ G RSR
Sbjct: 199 SFSQHGTRAVCILSANGAISNVTLRQSATSGGTVTYEGRFEILSLSGSFLLSENGGHRSR 258
Query: 231 TGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVPP-- 288
TGGLSVSL+GPDGRVLGGSVAGLLTAA+PVQ+VVGSF ADG+KE K S P
Sbjct: 259 TGGLSVSLAGPDGRVLGGSVAGLLTAASPVQIVVGSFDADGKKEPKRKKLAPSPSDPSPA 318
Query: 289 --KLAPG--GQPAGQCSPPSRGTLSESSGGPGSP 318
KLAP G AG SPPSRGTLS S G+P
Sbjct: 319 PLKLAPATTGVAAGPSSPPSRGTLSLSESSSGAP 352
>gi|326519160|dbj|BAJ96579.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 367
Score = 282 bits (722), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 190/339 (56%), Positives = 234/339 (69%), Gaps = 22/339 (6%)
Query: 19 IQSMRLAFSADGTAVYKPITATS--PTYQPSGAGGDGAI---PQAQGLNVMNM--GSGSE 71
+QS+R+A++ADGT V+ P+++ P +Q +GA G+ +A G N + G G
Sbjct: 36 MQSVRMAYTADGTPVFAPVSSAVAPPGFQTAGAPAHGSTMSAARAAGGNGVAAPPGMGEP 95
Query: 72 PMKRKRGRPRKYGPDGTMSLALVPSPSSVTTAT---GGTGSGLSSPGGGPL----SPDSI 124
K+KRGRPRKYGPD MSLALV P++ +A G +G S G SPD
Sbjct: 96 SAKKKRGRPRKYGPDAAMSLALVTVPTAAGSAAVTQGASGRPFSPTLPGNFVPSASPDGG 155
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
KK RGRP GS K +++ G AGVGFTPHV+TV+AGEDVSSKIMSFSQNG RAVC+LS
Sbjct: 156 KK-RGRP--KGSTNKPRVDGGGPAGVGFTPHVLTVQAGEDVSSKIMSFSQNGTRAVCVLS 212
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
ANG+ISNVTLRQ TSGGTVTYEGRFEILSLSGS ++++ GQR+RTGGLSVSL+GPDGR
Sbjct: 213 ANGSISNVTLRQTGTSGGTVTYEGRFEILSLSGSIFVTDNGGQRTRTGGLSVSLAGPDGR 272
Query: 245 VLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVPPKLAPG-GQPAGQCSPP 303
+LGG VAGLL AA+P+Q+VVGSF A G+KE K + S PVP K+ P G SPP
Sbjct: 273 LLGGGVAGLLIAASPIQIVVGSFNAGGKKEPKP--QAPSEPVPLKVVPSTGIGMAANSPP 330
Query: 304 SRGTLSESSGGPGSPLNHSTGACNNNHLPQGMATGIPWK 342
SRGTLSESSGG SP + + NNN P + + +PWK
Sbjct: 331 SRGTLSESSGGTASPRHQGFASTNNNQPP--ILSSMPWK 367
>gi|226506092|ref|NP_001149781.1| LOC100283408 [Zea mays]
gi|195634613|gb|ACG36775.1| DNA binding protein [Zea mays]
Length = 377
Score = 281 bits (719), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 190/317 (59%), Positives = 225/317 (70%), Gaps = 23/317 (7%)
Query: 13 SQPPAS--IQSMRLAFSADGTAVYKPITATSP---TYQPSGAGGDGA-----IPQAQGLN 62
+ PP+S Q +R+A++ DGTA++ P+++ P TYQP G A + G
Sbjct: 25 ATPPSSGGTQGLRMAYTTDGTAIFTPVSSVPPATATYQPVGGSAXXASSLAGVGXNGGAP 84
Query: 63 VMNMGSGSEPMKRKRGRPRKYGPDGTMSLALVPSPSS---VTTATGGTG----SGLSSPG 115
V + G+G KRGRPRKYGPDG+MSLALVP+ + A G +G +G +P
Sbjct: 85 VHSGGAGEPGTXXKRGRPRKYGPDGSMSLALVPASMAGEPAPAALGASGPFSPNGPKAPN 144
Query: 116 GGP-LSPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQ 174
P SPD KK RGRP GS + KKH + ALG AG GFTPH+I VKAGEDVS+KIMSFSQ
Sbjct: 145 TAPSASPDGAKK-RGRPKGS-TNKKH-VAALGPAGAGFTPHLIFVKAGEDVSAKIMSFSQ 201
Query: 175 NGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGL 234
+G RAVCILSANGAISNVTLRQ+ATSGGTVTYEGRFEILSLSGSFLLSE+ GQRSRTGGL
Sbjct: 202 HGTRAVCILSANGAISNVTLRQSATSGGTVTYEGRFEILSLSGSFLLSENGGQRSRTGGL 261
Query: 235 SVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADG--RKESKSSHRMESLPVPPKLAP 292
SVSL+GPDGRVLGG VAGLLTAA+PVQ+VVGSF A G + + + ++ P P LAP
Sbjct: 262 SVSLAGPDGRVLGGCVAGLLTAASPVQIVVGSFDAGGKKQPKQQQQQQLAPSPAPLNLAP 321
Query: 293 GGQPAGQCSPPSRGTLS 309
G AG SPPSRGTLS
Sbjct: 322 TGVAAGPSSPPSRGTLS 338
>gi|226502488|ref|NP_001148458.1| AT-hook protein 1 [Zea mays]
gi|194704752|gb|ACF86460.1| unknown [Zea mays]
gi|195619414|gb|ACG31537.1| AT-hook protein 1 [Zea mays]
gi|224030103|gb|ACN34127.1| unknown [Zea mays]
gi|224030137|gb|ACN34144.1| unknown [Zea mays]
gi|224033127|gb|ACN35639.1| unknown [Zea mays]
gi|414867873|tpg|DAA46430.1| TPA: AT-hook protein 1 isoform 1 [Zea mays]
gi|414867874|tpg|DAA46431.1| TPA: AT-hook protein 1 isoform 2 [Zea mays]
gi|414867875|tpg|DAA46432.1| TPA: AT-hook protein 1 isoform 3 [Zea mays]
Length = 417
Score = 280 bits (717), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 166/277 (59%), Positives = 198/277 (71%), Gaps = 19/277 (6%)
Query: 62 NVMNMGSGSEPMKRKRGRPRKYGPDGTMSLALVP-------SPSSVTTATGGTGSGLSSP 114
NV+ MG E M++KRGRPRKY PDG+M+LAL P ++ G G +SSP
Sbjct: 109 NVLGMG---ELMRKKRGRPRKYAPDGSMALALAPISSASAGGAAAPGQQQHGGGFSISSP 165
Query: 115 GGGPLSPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQ 174
P P++ K RGRPP GSGKK Q EALGS G+ FTPH++TVKAGEDV+SKIM+FSQ
Sbjct: 166 ---PSDPNA--KRRGRPP--GSGKKKQFEALGSWGIAFTPHILTVKAGEDVASKIMTFSQ 218
Query: 175 NGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGL 234
GPR VCILSANGAISNVTLRQ ATSGG VTYEGRFEI+SLSGSFLL+E RSRTGGL
Sbjct: 219 QGPRTVCILSANGAISNVTLRQPATSGGLVTYEGRFEIISLSGSFLLAEDGDTRSRTGGL 278
Query: 235 SVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHR-MESLPV-PPKLAP 292
SV+L+G DGRVLGG VAG+L AATPVQVVV SF+A+G+K + R +E + PP++A
Sbjct: 279 SVALAGSDGRVLGGCVAGMLMAATPVQVVVASFIAEGKKSKPAEARKVEPMAAPPPQMAT 338
Query: 293 GGQPAGQCSPPSRGTLSESSGGPGSPLNHSTGACNNN 329
P SPPS GT S SS GSP++HS +N+
Sbjct: 339 FVPPPLATSPPSEGTSSASSDDSGSPIHHSAMPFSNS 375
>gi|255583444|ref|XP_002532481.1| DNA binding protein, putative [Ricinus communis]
gi|223527806|gb|EEF29905.1| DNA binding protein, putative [Ricinus communis]
Length = 346
Score = 280 bits (716), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 197/336 (58%), Positives = 220/336 (65%), Gaps = 39/336 (11%)
Query: 24 LAFSADGTAVYKPITATSPT-------------YQPSGAGGDGAIPQAQGLNVMNMGSG- 69
LAF ADG+ VYKPI + +Q GG G GL+ +NM +
Sbjct: 33 LAFRADGS-VYKPIMPPTDDGNPFHHVPLHYNHHQDMMVGGGGGGGIGIGLDALNMNNNH 91
Query: 70 SEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRG 129
SEP+KRKRGRPRKY P ++ L P G SP +P KK+RG
Sbjct: 92 SEPIKRKRGRPRKYSPPPHGNIDLTSPPQHQLYQCG-----FQSPTPSSTAP---KKARG 143
Query: 130 RPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAI 189
RPPGS +K+ L LGS G GFTPHVI VKAGEDV KIMSFSQNGPR VCILSA G I
Sbjct: 144 RPPGSA--RKNHLPNLGSGGTGFTPHVIFVKAGEDVLLKIMSFSQNGPRGVCILSAYGTI 201
Query: 190 SNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGS 249
SNVTLRQA T GGTVTYEGRFEILSLSGSFLLSE+SGQRSRTGGLSV LSGPDGRVLGG
Sbjct: 202 SNVTLRQATTIGGTVTYEGRFEILSLSGSFLLSENSGQRSRTGGLSVLLSGPDGRVLGGG 261
Query: 250 VAGLLTAATPVQVVVGSFLADGRKESK---SSHRMESLPVPPKLAPGGQPAGQCSPPSRG 306
VAGLLTAA+ VQV+VGSF+++ K SK + H S APG AG SPPSRG
Sbjct: 262 VAGLLTAASSVQVIVGSFISEDSKGSKLWINQHETMS-------APGASVAG--SPPSRG 312
Query: 307 TLSESSGGPGSPLNHSTGACNNNHLPQGMATGIPWK 342
T SESSGGPGSP N STGACNN++ QGM + WK
Sbjct: 313 TFSESSGGPGSPPNQSTGACNNSNT-QGMPN-VAWK 346
>gi|148905791|gb|ABR16059.1| unknown [Picea sitchensis]
Length = 383
Score = 280 bits (715), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 172/289 (59%), Positives = 204/289 (70%), Gaps = 49/289 (16%)
Query: 64 MNMGSG-------SEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGG 116
+NMG G +EP+KRKRGRPRKYGPDG+M+LAL P SSV
Sbjct: 81 VNMGVGMAVSVARTEPLKRKRGRPRKYGPDGSMALALAPL-SSVQ--------------- 124
Query: 117 GPLSPDSIKKSRGRPPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMS 171
G LSP K+ RGRPPGSG +K QL ALG SAG+GFTPHVIT+ AGED ++KIMS
Sbjct: 125 GSLSPTQ-KRGRGRPPGSG--RKQQLAALGEWLAGSAGMGFTPHVITIAAGEDAATKIMS 181
Query: 172 FSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRT 231
FSQ GPRAVCILSANGAIS+VTLRQ ATSGGTVTYEGRFEILSLSGSFLL+E+ G RSRT
Sbjct: 182 FSQQGPRAVCILSANGAISHVTLRQPATSGGTVTYEGRFEILSLSGSFLLTENGGTRSRT 241
Query: 232 GGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVPPKLA 291
GGLSVSL+GPDGRV+GG VAG+L AA+PVQVVVGSF+++GRK ++ PV P+ +
Sbjct: 242 GGLSVSLAGPDGRVIGGGVAGMLMAASPVQVVVGSFISNGRKA-------QAKPVNPEPS 294
Query: 292 PGGQPAGQ------CSPPSRGTLSESSGGP----GSP-LNHSTGACNNN 329
AG P S+G +++SSGG G+P LN +TGA +N
Sbjct: 295 IAQSQAGYSGGPAVAIPISKGAVNDSSGGKTGGAGNPSLNQNTGASVSN 343
>gi|326504396|dbj|BAJ91030.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516592|dbj|BAJ92451.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326530486|dbj|BAJ97669.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 278 bits (710), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 170/278 (61%), Positives = 198/278 (71%), Gaps = 20/278 (7%)
Query: 60 GLNVMNMGSGSEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGT------GSGLSS 113
G N + MG + M++KRGRPRKY PDG+M+LAL P S+ A G +SS
Sbjct: 158 GSNALGMG---DLMRKKRGRPRKYAPDGSMALALAPLSSASGGAAPPPPPGQQHGFSISS 214
Query: 114 PGGGPLSPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFS 173
P P P++ K RGRPPGSG KK Q EALGS G+ FTPH+++VKAGEDV+SKIMSFS
Sbjct: 215 P---PSDPNA--KRRGRPPGSG--KKKQFEALGSWGISFTPHILSVKAGEDVASKIMSFS 267
Query: 174 QNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGG 233
Q GPR VCILSANGAISNVTLRQ ATSGG VTYEGRFEI+SLSGSFLL+E RSRTGG
Sbjct: 268 QQGPRTVCILSANGAISNVTLRQPATSGGLVTYEGRFEIISLSGSFLLAEDGDTRSRTGG 327
Query: 234 LSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHR-MESLPVPPKLAP 292
LSV+L+G DGRVLGG VAG LTAATPVQVVV SF+A+G+K + R +E + PP++A
Sbjct: 328 LSVALAGSDGRVLGGCVAGQLTAATPVQVVVASFIAEGKKSKPAEPRKVEPMSAPPQMA- 386
Query: 293 GGQPAGQCSPPSRGTLSESSGGPGSPLNHSTGACNNNH 330
PA SPPS GT S SS GSP+NH G NH
Sbjct: 387 TYVPAPVASPPSEGTSSASSDDSGSPINH--GGMPYNH 422
>gi|357148434|ref|XP_003574762.1| PREDICTED: uncharacterized protein LOC100825635 [Brachypodium distachyon]
Length = 368
Score = 278 bits (710), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 189/346 (54%), Positives = 226/346 (65%), Gaps = 34/346 (9%)
Query: 20 QSMRLAFSADGTAVYKPITAT--SPTYQPSGAGGDGAIPQAQGLNVMNMGSGSEPM---- 73
QS+R+A++ADGT V+ P+++ P YQP A + A G N + M
Sbjct: 34 QSVRMAYTADGTPVFAPVSSAVAPPGYQPVAAAPGSNMSTAAGAAGGNGVAALRDMGGPL 93
Query: 74 -KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGG-GPLSPDSIK------ 125
K+KRGRPRKYGPD +SLALV T G G + G GP SP +
Sbjct: 94 AKKKRGRPRKYGPDAAVSLALV------TVPPGAAGPTVVPQGASGPFSPTAPGSVVPSA 147
Query: 126 -----KSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAV 180
K RGRP GS K ++ G GVGFTPHVITV+AGEDVS+KIMSFSQ+G RAV
Sbjct: 148 SPEGGKKRGRP--KGSTNKPRVNVPGPVGVGFTPHVITVQAGEDVSAKIMSFSQHGTRAV 205
Query: 181 CILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSG 240
C+LSANGAISNVTLRQ ATSGGTVTYEGRFEILSLSGSFL++++ GQRS TGGLSVSL+G
Sbjct: 206 CVLSANGAISNVTLRQTATSGGTVTYEGRFEILSLSGSFLVTDNGGQRSLTGGLSVSLAG 265
Query: 241 PDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESK----SSHRMESLPVPPKLAPGGQP 296
PDGR+LGG VAGLL AA+P+Q+VVGSF +DGRKE K ++ S P P K+ P
Sbjct: 266 PDGRLLGGGVAGLLIAASPIQIVVGSFNSDGRKEQKPQVMPKLQVSSEPTPLKVVPATG- 324
Query: 297 AGQCSPPSRGTLSESSGGPGSPLNHSTGACNNNHLPQGMATGIPWK 342
G SPPSRGTLSESSGG SP + A NNN P + + +PWK
Sbjct: 325 MGPNSPPSRGTLSESSGGTASPRHQGYTATNNNQPP--ILSSMPWK 368
>gi|357147512|ref|XP_003574372.1| PREDICTED: uncharacterized protein LOC100833716 [Brachypodium distachyon]
Length = 433
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 166/271 (61%), Positives = 196/271 (72%), Gaps = 28/271 (10%)
Query: 74 KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGT-------------GSGLSSPGGGPLS 120
++KRGRPRKY PDG+M+LAL P +++A+GG+ G +SSP P
Sbjct: 133 RKKRGRPRKYAPDGSMALALAP----LSSASGGSPMQPGQQQQQQHGGFSISSP---PSD 185
Query: 121 PDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAV 180
P++ K RGRPPGSG KK Q EALGS G+ FTPH+++VKAGEDV+SKIMSFSQ GPR V
Sbjct: 186 PNA--KRRGRPPGSG--KKKQFEALGSWGISFTPHILSVKAGEDVASKIMSFSQQGPRTV 241
Query: 181 CILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSG 240
CILSANGAISNVTLRQ ATSGG VTYEGRFEI+SLSGSFLL+E RSRTGGLSV+L+G
Sbjct: 242 CILSANGAISNVTLRQPATSGGLVTYEGRFEIISLSGSFLLAEDGDTRSRTGGLSVALAG 301
Query: 241 PDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHR-MESLPVPPKLAPGGQPAGQ 299
DGRVLGG VAG LTAATPVQVVV SF+A+G+K + R +E + PP++A PA
Sbjct: 302 SDGRVLGGCVAGQLTAATPVQVVVASFIAEGKKSKLAEARKVEPMSAPPQMA-NYVPAPV 360
Query: 300 CSPPSRGTLSESSGGPGSPLNHSTGACNNNH 330
SPPS GT S SS GSP+NH G NH
Sbjct: 361 ASPPSEGTSSASSDDSGSPINH--GGMPYNH 389
>gi|226507246|ref|NP_001149978.1| AT-hook protein 1 [Zea mays]
gi|195635841|gb|ACG37389.1| AT-hook protein 1 [Zea mays]
gi|219885389|gb|ACL53069.1| unknown [Zea mays]
gi|413919174|gb|AFW59106.1| AT-hook protein 1 isoform 1 [Zea mays]
gi|413919175|gb|AFW59107.1| AT-hook protein 1 isoform 2 [Zea mays]
Length = 402
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 155/265 (58%), Positives = 187/265 (70%), Gaps = 9/265 (3%)
Query: 68 SGSEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKS 127
SG + +K+KRGRPRKYGPDG++ L L + + VT ATG + GGG +P+ K
Sbjct: 119 SGGDLVKKKRGRPRKYGPDGSIGLGLKTAAAGVTEATG------AQSGGGGSTPNPDGKR 172
Query: 128 RGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANG 187
RGRPPGSG KK QL+ALGS+G FTPH+ITVK EDV+SKIM+FSQ GPR CI+SANG
Sbjct: 173 RGRPPGSG--KKKQLDALGSSGTSFTPHIITVKPNEDVASKIMAFSQQGPRTTCIISANG 230
Query: 188 AISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLG 247
A+ TLRQ ATSGG VTYEG F+ILSLSGSFLL+E RSRTGGLSV+L+G DGR++G
Sbjct: 231 ALCTATLRQPATSGGIVTYEGHFDILSLSGSFLLAEDGDTRSRTGGLSVALAGSDGRIVG 290
Query: 248 GSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVPPKLAPGGQPAGQCSPPSRGT 307
G VAG+L AATPVQVVVGSF+A+ +K + + E VPP A G A SPPS GT
Sbjct: 291 GCVAGMLMAATPVQVVVGSFIAEAKKPKEEQPKREPTSVPPHTAVFGA-ASTASPPSDGT 349
Query: 308 LSESSGGPGSPLNHSTGACNNNHLP 332
SE S PGSP+ + A + LP
Sbjct: 350 SSEHSDDPGSPMGPNGSAFTSAGLP 374
>gi|224138096|ref|XP_002326517.1| predicted protein [Populus trichocarpa]
gi|222833839|gb|EEE72316.1| predicted protein [Populus trichocarpa]
Length = 286
Score = 273 bits (698), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 167/273 (61%), Positives = 196/273 (71%), Gaps = 27/273 (9%)
Query: 71 EPMKRKRGRPRKYGPDGTMSLALVPSP------SSVTTATGGTGSGLSSPGGGPLSPDSI 124
EP K+KRGRPRKY PDG ++L L P+P + ++GG GSG+ PD
Sbjct: 6 EPAKKKRGRPRKYTPDGNIALGLSPTPIHSGMSAGQADSSGGAGSGVM--------PDVA 57
Query: 125 -----KKSRGRPPGSGSGKKHQLEALG-SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPR 178
KK RGRPPGSG K QL+ALG + GVGFTPHVITVKAGED++SKIM+FSQ GPR
Sbjct: 58 SEHPSKKHRGRPPGSG---KKQLDALGGTGGVGFTPHVITVKAGEDIASKIMAFSQQGPR 114
Query: 179 AVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSL 238
VCILSANGAI NVTLRQ A SGG+VTYEGRFEI+SLSGSFLLSES+G RSRTGGLSVSL
Sbjct: 115 TVCILSANGAICNVTLRQPAMSGGSVTYEGRFEIISLSGSFLLSESNGSRSRTGGLSVSL 174
Query: 239 SGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRM--ESLPVPPKLAPGGQP 296
+G DGRVLGG VAG+LTAA+ VQV++GSF+ADG+K + S + S P PP++ G P
Sbjct: 175 AGSDGRVLGGGVAGMLTAASAVQVILGSFIADGKKSNSKSLKSGPSSTP-PPQMLNFGAP 233
Query: 297 AGQCSPPSRGTLSESSG-GPGSPLNHSTGACNN 328
SPPSRG SESS GSP+N + G N
Sbjct: 234 LTTASPPSRGGSSESSDENGGSPVNRTPGIYGN 266
>gi|357165690|ref|XP_003580463.1| PREDICTED: uncharacterized protein LOC100838752 [Brachypodium distachyon]
Length = 373
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 167/297 (56%), Positives = 197/297 (66%), Gaps = 25/297 (8%)
Query: 32 AVYKPITATSPTYQPSGAGGDGAIPQAQGLNVMNMGSGSEPMKRKRGRPRKYGPDGTMSL 91
A+Y+P +A Q SGAG AI V++ G E +K+KRGRPRKYGPDG S+
Sbjct: 70 AMYRPDSAPPGMQQTSGAG---AI-------VVSGSGGGELVKKKRGRPRKYGPDG--SI 117
Query: 92 ALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPGSGSGKKHQLEALGSAGVG 151
VP P + T+ G GS S+P G K RGRPPGSG KK QL ALGS+G
Sbjct: 118 GYVPKPVAGATSEAGAGSN-SNPDG---------KRRGRPPGSG--KKKQLAALGSSGTS 165
Query: 152 FTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFE 211
FTPH+ITVK EDV+SKIMSFSQ GPR CILSANGA+ TLRQ ATSGG VTYEG F+
Sbjct: 166 FTPHIITVKPNEDVASKIMSFSQQGPRTTCILSANGALCTATLRQPATSGGIVTYEGHFD 225
Query: 212 ILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADG 271
ILSLSGSFLL+E RSRTGGLSV+LSG DGR++GG VAG+L AATPVQVVVGSF+A+G
Sbjct: 226 ILSLSGSFLLAEDGDTRSRTGGLSVALSGSDGRIVGGCVAGMLMAATPVQVVVGSFIAEG 285
Query: 272 RKESKSSHRMESLPVPPKLAPGGQPAGQCSPPSRGTLSESSGGPGSPLNHSTGACNN 328
+K + + E P A G P+ SPPS GT S+ S PGSP+ + NN
Sbjct: 286 KKPKEEQQKREPSSAPMHTAGFGAPSA-ASPPSDGTSSDHSDDPGSPMGPNGSTFNN 341
>gi|255557601|ref|XP_002519830.1| DNA binding protein, putative [Ricinus communis]
gi|223540876|gb|EEF42434.1| DNA binding protein, putative [Ricinus communis]
Length = 376
Score = 271 bits (692), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 167/293 (56%), Positives = 201/293 (68%), Gaps = 19/293 (6%)
Query: 40 TSPTYQPSGAGGDGAIPQAQGLNVMNMGSGSEPMKRKRGRPRKYGPDGTMSLALVPSPSS 99
T P+ QPS GG + + M +P K+KRGRPRKY PDG ++L L P+P S
Sbjct: 62 TQPSKQPSSDGG--LFDGSSPPSSSGMRFSMDPAKKKRGRPRKYTPDGNIALGLSPTPIS 119
Query: 100 ---------VTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPGSGSGKKHQLEALGSAG- 149
V + G G G+ +P P K++RGRPPGSG K QL+ALG G
Sbjct: 120 SSATSLPPHVADSGSGVGVGIGTPAIASDPPS--KRNRGRPPGSG---KKQLDALGGVGG 174
Query: 150 VGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGR 209
VGFTPHVITVKAGED++SKIM+FSQ GPR VCILSANGAI NVTLRQ A SGGTVTYEGR
Sbjct: 175 VGFTPHVITVKAGEDIASKIMAFSQQGPRTVCILSANGAICNVTLRQPAMSGGTVTYEGR 234
Query: 210 FEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLA 269
+EI+SLSGSFLLSE++G RSR+GGLSVSL+G DGRVLGG VAG+L AA+PVQV+VGSF+A
Sbjct: 235 YEIISLSGSFLLSENNGNRSRSGGLSVSLAGSDGRVLGGGVAGMLMAASPVQVIVGSFIA 294
Query: 270 DGRKESKSSHRMESLPVPP-KLAPGGQPAGQCSPPSRGTLSESSGGPG-SPLN 320
DG+K + + H+ P ++ G P SPPS+G SESS G SPLN
Sbjct: 295 DGKKSNSNIHKSGPSSAPTSQMLNFGAPMTTSSPPSQGVSSESSDENGSSPLN 347
>gi|115477244|ref|NP_001062218.1| Os08g0512400 [Oryza sativa Japonica Group]
gi|113624187|dbj|BAF24132.1| Os08g0512400, partial [Oryza sativa Japonica Group]
Length = 292
Score = 270 bits (690), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 176/284 (61%), Positives = 202/284 (71%), Gaps = 26/284 (9%)
Query: 74 KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSP------------ 121
+ KRGRPRKYGPDG+MSL LV SP TA T PG P SP
Sbjct: 20 RNKRGRPRKYGPDGSMSLGLVTSP----TAAASTPVAQGVPG--PFSPTQPKPPASFLSS 73
Query: 122 ---DSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPR 178
D +KK RGRP GS K +++A+GSAGVGFTPHVITV AGEDVS+KIMSF+Q+G R
Sbjct: 74 GWPDGVKK-RGRP--KGSTNKPRIDAVGSAGVGFTPHVITVLAGEDVSAKIMSFAQHGNR 130
Query: 179 AVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSL 238
AVC+LSANGAISNVTLRQ ATSGGTVTYEGRFEILSLSGSFLL++ GQRSRTGGLSVSL
Sbjct: 131 AVCVLSANGAISNVTLRQTATSGGTVTYEGRFEILSLSGSFLLTDHGGQRSRTGGLSVSL 190
Query: 239 SGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVPPKLAPGGQPAG 298
+GPDGR+LGG VAGLL AATPVQ+VVGSF ++G+KE K E P K P G
Sbjct: 191 AGPDGRLLGGGVAGLLIAATPVQIVVGSFNSEGKKEPKQHAHSEPASAPSKAVPTAG-MG 249
Query: 299 QCSPPSRGTLSESSGGPGSPLNHSTGACNNNHLPQGMATGIPWK 342
SPPSRGTLSESSGG GSPL+ ++N P +++ +PWK
Sbjct: 250 PNSPPSRGTLSESSGGAGSPLHPGIAPPSSNSQPPFLSS-MPWK 292
>gi|224126489|ref|XP_002329567.1| predicted protein [Populus trichocarpa]
gi|222870276|gb|EEF07407.1| predicted protein [Populus trichocarpa]
Length = 375
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 143/207 (69%), Positives = 166/207 (80%), Gaps = 14/207 (6%)
Query: 71 EPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSI-----K 125
EP K+KRGRPRKY PDG ++L L P+P G +G + GGG ++ D+ K
Sbjct: 98 EPAKKKRGRPRKYTPDGNIALGLSPTP-----VPSGISAGHADSGGGGVTHDAASEHPSK 152
Query: 126 KSRGRPPGSGSGKKHQLEALGSAG-VGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
K+RGRPPGSG K QL+ALG G VGFTPHVITVKAGED++SKIM+FSQ GPR VCILS
Sbjct: 153 KNRGRPPGSG---KKQLDALGGVGGVGFTPHVITVKAGEDIASKIMAFSQQGPRTVCILS 209
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
ANGAI NVTLRQ A SGG+VTYEGRFEI+SLSGSFLLSES+G RSR+GGLSVSL+G DGR
Sbjct: 210 ANGAICNVTLRQPAMSGGSVTYEGRFEIISLSGSFLLSESNGSRSRSGGLSVSLAGSDGR 269
Query: 245 VLGGSVAGLLTAATPVQVVVGSFLADG 271
VLGG VAG+LTAA+PVQV+VGSF+ADG
Sbjct: 270 VLGGGVAGMLTAASPVQVIVGSFIADG 296
>gi|294461667|gb|ADE76393.1| unknown [Picea sitchensis]
Length = 302
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 168/268 (62%), Positives = 198/268 (73%), Gaps = 28/268 (10%)
Query: 64 MNMGSGSEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDS 123
M MG G++ MKRKRGRPRKYGPDG+M+LAL P +S G P SP
Sbjct: 11 MAMG-GTDSMKRKRGRPRKYGPDGSMALALAPLSASAP--------------GAPFSPLQ 55
Query: 124 IKKSRGRPPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPR 178
K+ RGRPPGSG KK +L ALG SAG+GFTPHVIT+ AGEDV+SKIMSFSQ GPR
Sbjct: 56 -KRGRGRPPGSG--KKQRLAALGEWVVGSAGIGFTPHVITIAAGEDVASKIMSFSQQGPR 112
Query: 179 AVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSL 238
AVCILSANGAISNVTLRQ ATSGGT+TYEGRFEILSLSGSF+L+E+ G RSRTGGLSVSL
Sbjct: 113 AVCILSANGAISNVTLRQPATSGGTLTYEGRFEILSLSGSFMLTENGGARSRTGGLSVSL 172
Query: 239 SGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRME-SLPVPPKLAPGGQPA 297
+ PDGRV+GG VAG+L AA+PVQVVVGSF+++G+K+ + E S+ + A GG A
Sbjct: 173 ASPDGRVVGGGVAGMLMAASPVQVVVGSFISNGQKDPPKPAKPEPSIGLAQAAASGGPVA 232
Query: 298 GQCSPPSRGTLSESSGGPGS-PLNHSTG 324
P SR L+++ GGPGS PLN +TG
Sbjct: 233 ---IPISRSPLNDTYGGPGSPPLNQNTG 257
>gi|242076972|ref|XP_002448422.1| hypothetical protein SORBIDRAFT_06g026920 [Sorghum bicolor]
gi|241939605|gb|EES12750.1| hypothetical protein SORBIDRAFT_06g026920 [Sorghum bicolor]
Length = 372
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 165/304 (54%), Positives = 196/304 (64%), Gaps = 20/304 (6%)
Query: 32 AVYKPITATSPTYQPS---GAGGDGAIPQAQGLNVMNMGSGSEPMKRKRGRPRKYGPDGT 88
A+Y+ + P Q G+GG GAI SG E +K+KRGRPRKYGPDG+
Sbjct: 58 AMYRADSDAPPGLQQQQHPGSGGGGAIVAV---------SGGELVKKKRGRPRKYGPDGS 108
Query: 89 MSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPGSGSGKKHQLEALGSA 148
+ L L S+ T G GG +PD K RGRPPGSG KK QL+ALGS+
Sbjct: 109 IGLGLK---SAAAAGTEAAGGQSGGGGGSSSNPDG--KRRGRPPGSG--KKKQLDALGSS 161
Query: 149 GVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEG 208
G FTPH+ITVK EDV+SKIM+FSQ GPR CI+SANGA+ TLRQ ATSGG VTYEG
Sbjct: 162 GTSFTPHIITVKPNEDVASKIMAFSQQGPRTTCIISANGALCTATLRQPATSGGIVTYEG 221
Query: 209 RFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFL 268
F+ILSLSGSFLL+E RSRTGGLSV+L+G DGR++GG VAG+L AATPVQVVVGSF+
Sbjct: 222 HFDILSLSGSFLLAEDGDTRSRTGGLSVALAGSDGRIVGGCVAGMLMAATPVQVVVGSFI 281
Query: 269 ADGRKESKSSHRMESLPVPPKLAPGGQPAGQCSPPSRGTLSESSGGPGSPLNHSTGACNN 328
A+G+K + + E VPP A G A SPPS GT SE S PGSP+ + N
Sbjct: 282 AEGKKPKEEQPKREPTSVPPHTA-GFGAASTASPPSDGTSSEHSDDPGSPMGPNGSTFTN 340
Query: 329 NHLP 332
LP
Sbjct: 341 AGLP 344
>gi|413955128|gb|AFW87777.1| hypothetical protein ZEAMMB73_819673 [Zea mays]
Length = 429
Score = 260 bits (665), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 170/299 (56%), Positives = 198/299 (66%), Gaps = 48/299 (16%)
Query: 46 PSGAGGDGAIPQAQGLNVMNMGSGSEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATG 105
P GAG A G +V+ MG E M++KRGRPRKY PDG+M+LAL P +++A+
Sbjct: 107 PQGAG-------APGGSVLGMG---ELMRKKRGRPRKYAPDGSMALALAP----ISSASA 152
Query: 106 GTGSGLSSPG----------GGPLSPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPH 155
G G G ++PG G P S D K RGRPPGSG KK Q EALGS G+ FTPH
Sbjct: 153 GGGGGAAAPGQQQQHGGFSIGSPPS-DPSAKRRGRPPGSG--KKKQFEALGSWGIAFTPH 209
Query: 156 VITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSL 215
++ VKAGEDV+SKIM+FSQ GPR VCILSANGAISNVTLRQ ATSGG VTYEGRFEI+SL
Sbjct: 210 ILAVKAGEDVASKIMTFSQQGPRTVCILSANGAISNVTLRQPATSGGLVTYEGRFEIISL 269
Query: 216 SGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKES 275
SGSFLL+E RSRTGGLSV+L+G DGRVLGG VAG+L AATPVQVVV SF+A+G+K
Sbjct: 270 SGSFLLAEDGDTRSRTGGLSVALAGSDGRVLGGCVAGMLMAATPVQVVVASFIAEGKKSK 329
Query: 276 KSSHRMESLP-------------VPPKLAPGGQPAGQCSPPSRGTLSESSGGPGSPLNH 321
+ R VP +A SPPS GT S SS GSP+NH
Sbjct: 330 PAEARKVEPMAAPPPPPPQMAAFVPAPVA--------TSPPSEGTSSASSDDSGSPINH 380
>gi|356517172|ref|XP_003527263.1| PREDICTED: uncharacterized protein LOC100806173 [Glycine max]
Length = 355
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 177/348 (50%), Positives = 216/348 (62%), Gaps = 50/348 (14%)
Query: 20 QSMRLAFSADGTAVYKPITATSPTYQPSGAGGDGA--------IPQAQGLNVMNMGSGSE 71
Q+MR+ ++ADGTAV+ P PT + GGD + +PQ Q + M + E
Sbjct: 33 QNMRMDYAADGTAVFAP-----PTVTVNINGGDSSPAVPPGLGLPQPQPM----MVNSPE 83
Query: 72 PMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGG----GPLSPDS---I 124
P+KRKRGRPRKYGPDG M+L + TT G G + GG GPLS +
Sbjct: 84 PIKRKRGRPRKYGPDGGMTLGALK-----TTTPPGGGVPVGQSGGAFPAGPLSDSASAGT 138
Query: 125 KKSRGRPPGSGSGKKHQLEALGSA----GVGFTPHVITVKAGEDVSSKIMSFSQNGPRAV 180
K RGRP GS + K + S G FTPHVITV AGED+S++IM+ SQ+ R +
Sbjct: 139 VKRRGRPRGSVNKNKKNDSSNSSKYSGPGSWFTPHVITVNAGEDLSARIMTISQSSSRNI 198
Query: 181 CILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSG 240
CIL+ANGAISNVTLRQ A+SGGTVTYEGRFEILSL GSF L+ + R GGLSVSLSG
Sbjct: 199 CILTANGAISNVTLRQPASSGGTVTYEGRFEILSLGGSFFLAGT----ERAGGLSVSLSG 254
Query: 241 PDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVPPKLAPGGQPAGQC 300
PDGRVLGG VAGLL AA+PVQ+V+ SF++D RK K + + E+ K++ G GQ
Sbjct: 255 PDGRVLGGGVAGLLIAASPVQIVLASFVSDVRKHLKRAKKTEN----EKVSTAG---GQS 307
Query: 301 SPPSRGTL--SESSGGPGSPLNHSTGACN----NNHLPQGMATGIPWK 342
S PSRGTL S G GSPLN STGACN N+ P G+PWK
Sbjct: 308 SSPSRGTLSESSGGVGSGSPLNQSTGACNNTIENSTTPTQSFQGMPWK 355
>gi|356509574|ref|XP_003523522.1| PREDICTED: uncharacterized protein LOC100808432 [Glycine max]
Length = 357
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 178/344 (51%), Positives = 213/344 (61%), Gaps = 40/344 (11%)
Query: 20 QSMRLAFSA-DGTAVYKPITATSPTYQPSGAGGDGAIPQAQGLNVMN------MGSGSEP 72
+MR+ ++A DGTAV+ P T T +G A+P GL M + SEP
Sbjct: 33 NNMRMDYAAADGTAVFAPPTVT---VNINGGESSPAVPPGLGLAQPQPQPQPMMVNSSEP 89
Query: 73 MKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGG----PLSPDS---IK 125
+KRKRGRPRKYGP G M+LAL + TT GG + GG PLS + I
Sbjct: 90 IKRKRGRPRKYGPHGGMALAL-----NTTTPPGGAAVPVGQSGGAFPPAPLSDSASAGIV 144
Query: 126 KSRGRPPGSGSGKKHQLEALGSA-GVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
K RGRP GS + K + S G FTPHVITVKAGED+S++IM+ SQ+ R +CIL+
Sbjct: 145 KRRGRPRGSVNKNKKNNSSKYSGPGSWFTPHVITVKAGEDLSARIMTISQSSSRNICILT 204
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
ANGAISNVTLRQ A+SGGTVTYEGRFEILSL GSF L+ + R GGLSVSLSGPDGR
Sbjct: 205 ANGAISNVTLRQPASSGGTVTYEGRFEILSLGGSFFLAGT----ERAGGLSVSLSGPDGR 260
Query: 245 VLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVPPKLAPGGQPAGQCSPPS 304
VLGG VAGLL AA+PVQ+V+ SF++D RK K + +M++ V AGQ S PS
Sbjct: 261 VLGGGVAGLLVAASPVQIVLASFVSDVRKHFKHAKQMQNAKV-------SIAAGQSSSPS 313
Query: 305 RGTL--SESSGGPGSPLNHSTGACNNNH----LPQGMATGIPWK 342
RGTL S G GSPLN STGACNN P G+PWK
Sbjct: 314 RGTLSESSGGVGSGSPLNQSTGACNNTMNNCTTPTQSFQGMPWK 357
>gi|326498333|dbj|BAJ98594.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 392
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 178/370 (48%), Positives = 216/370 (58%), Gaps = 58/370 (15%)
Query: 19 IQSMRLAFSADGTAVY-KPITATSPT--YQPSGAGG-------DGAIPQAQG--LNVMNM 66
+ SMR+ + DG A + KP +A P YQP G G D A + G +NM
Sbjct: 35 VPSMRMTYGEDGNAYFLKPGSAPPPAEAYQPVGGAGLDMPVGPDAAAGRGNGGPPFELNM 94
Query: 67 GSGSEPMKRKRGRPRKYGPDGTMSLALVPSP---SSVTTATGGTGSGLSSPGGG---PLS 120
E +KRGR K+G DG+ SLALVP P A G + P G +
Sbjct: 95 ---EEEAAKKRGRAMKFGDDGSTSLALVPVPVPGEPTAVAPGDFSQPAAKPAAGGVLAVP 151
Query: 121 PDSIKKSRGRPPGSGSGKKHQ------LEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQ 174
P +KK RGRP GS + K Q L +GSAG GFTPHVI V+AGEDV++KI+SF+Q
Sbjct: 152 PVGMKK-RGRPKGSTNKVKKQDKVMSALAFIGSAGAGFTPHVIAVQAGEDVAAKILSFAQ 210
Query: 175 NGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGL 234
NG RAV +LSANGAISNVTLRQ+ATSGGTVTYEGRFEILSLSGSF + ++ G RSRTGGL
Sbjct: 211 NGVRAVVVLSANGAISNVTLRQSATSGGTVTYEGRFEILSLSGSFTVQDTGGHRSRTGGL 270
Query: 235 SVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESL---------- 284
SVSL+ PDGRVLGG +AGLL A TP+QVVVG+F K+ H+ +
Sbjct: 271 SVSLASPDGRVLGGGIAGLLIACTPIQVVVGTFNTVAEKKKAPKHQAAAAHEPASAPPKM 330
Query: 285 -------PVPPKLAPGGQPAG--QCSPPSRGTLSESSGGPGSPLNHSTGAC---NNNHLP 332
P+ LAP G Q SPPSRGTLSESS GSP+N A +N+ L
Sbjct: 331 TPSFVPAPISVPLAPAPISVGMAQNSPPSRGTLSESS---GSPMNQGVPAAATPSNSGL- 386
Query: 333 QGMATGIPWK 342
+ +PWK
Sbjct: 387 ----SSMPWK 392
>gi|222641827|gb|EEE69959.1| hypothetical protein OsJ_29846 [Oryza sativa Japonica Group]
Length = 255
Score = 251 bits (641), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 166/265 (62%), Positives = 189/265 (71%), Gaps = 21/265 (7%)
Query: 89 MSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIK-------KSRGRPPGSGSGKKHQ 141
MSLALVP ++ A SG SP SPD++ K RGRP GS + KKH
Sbjct: 1 MSLALVPVSTAAVAA-----SGPFSPAAAAKSPDAVSSAPPPGAKKRGRPKGS-TNKKHV 54
Query: 142 ----LEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQA 197
+ +GSAG GFTPHVI VKAGEDVS+KIMSFSQ+G R VC+LSANGAISNVTLRQA
Sbjct: 55 PSFGIGDIGSAGAGFTPHVIFVKAGEDVSAKIMSFSQHGTRGVCVLSANGAISNVTLRQA 114
Query: 198 ATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAA 257
ATSGGTVTYEGRFEILSLSGSFLLSE+ G RSRTGGLSVSL+GPDGRVLGG VAGLLTAA
Sbjct: 115 ATSGGTVTYEGRFEILSLSGSFLLSENGGHRSRTGGLSVSLAGPDGRVLGGGVAGLLTAA 174
Query: 258 TPVQVVVGSFLADGRKESKSSHRMESLPVPPKLAPGGQPAGQCSPPSRGTLSESSGGPGS 317
+PVQ+VVGSF +G+K K + + P K+ P G SPPSRGTLSESSGGPGS
Sbjct: 175 SPVQIVVGSFNTEGKKGPKLHAPSDPMSAPLKMVPMSG-TGPSSPPSRGTLSESSGGPGS 233
Query: 318 PLNHSTGACNNNHLPQGMATGIPWK 342
PLN G +NH G+ + + WK
Sbjct: 234 PLNQ--GVTASNHGQPGLPS-LSWK 255
>gi|242095694|ref|XP_002438337.1| hypothetical protein SORBIDRAFT_10g012730 [Sorghum bicolor]
gi|241916560|gb|EER89704.1| hypothetical protein SORBIDRAFT_10g012730 [Sorghum bicolor]
Length = 361
Score = 250 bits (638), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 180/344 (52%), Positives = 216/344 (62%), Gaps = 42/344 (12%)
Query: 24 LAFSADGTAVY-KPITATSPTYQPSGAGGDGAIPQAQGLNVMNMGSGSEPMKRKRGRPRK 82
+ ++ DG AVY KP A P YQ S AG + +P A GL + + SEP KRKRGRPRK
Sbjct: 35 VFYTPDGIAVYAKP--AIPPFYQQS-AGSNAIVPAAPGL--AHSSATSEPFKRKRGRPRK 89
Query: 83 YGP-DGTMSLALVPSPSSVTTATGGT---------GSGLSSPGGGPLSPDSIK------- 125
YGP DG + LA+VP T A G S GGG +SP +
Sbjct: 90 YGPADGAVPLAIVPPSQPPTAAAPAASEASPTIPPGFAPSPQGGGVVSPQASPAPQPPAA 149
Query: 126 ------KSRGRPPGSGSGKKH-QLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPR 178
K RGRPPG S K+ Q A G G+ PH+ TV+AGEDV+S++MSFS NG
Sbjct: 150 SGAPAVKKRGRPPGPSSKKQQPQAAAPGPGWAGWKPHIFTVQAGEDVASRVMSFSGNG-W 208
Query: 179 AVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSL 238
AVCIL+ANGA+SNVTLRQ +SGGTVTYEGRFEILSL+GS+LLSES+G SRTGGLSVSL
Sbjct: 209 AVCILTANGAVSNVTLRQGESSGGTVTYEGRFEILSLAGSYLLSESAGMSSRTGGLSVSL 268
Query: 239 SGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVPPKLAPGGQPAG 298
+GPDGRVLGG+VAG LTAA+PVQVV+GSFLAD + E ++ P K A G P
Sbjct: 269 AGPDGRVLGGAVAGPLTAASPVQVVIGSFLADTKME------LDPGSAPEKHAFGRFPT- 321
Query: 299 QCSPPSRGTLSESSGGPGSPLNHSTGACNNNHLPQGMATGIPWK 342
S PSRGT ES GG SP N +TG+ + + P G + PWK
Sbjct: 322 -ASSPSRGT--ESLGGHASPPN-TTGSFSTSTQPPGFPSFPPWK 361
>gi|115483594|ref|NP_001065467.1| Os10g0572900 [Oryza sativa Japonica Group]
gi|113639999|dbj|BAF27304.1| Os10g0572900, partial [Oryza sativa Japonica Group]
Length = 251
Score = 250 bits (638), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 138/202 (68%), Positives = 159/202 (78%), Gaps = 4/202 (1%)
Query: 122 DSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVC 181
D K RGRPPGSG KK Q EALGS G+ FTPH++TVKAGEDV+SKIM+FSQ GPR VC
Sbjct: 6 DPNAKRRGRPPGSG--KKKQFEALGSWGIAFTPHILTVKAGEDVASKIMAFSQQGPRTVC 63
Query: 182 ILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGP 241
ILSANGAISNVTLRQ ATSGG VTYEGRFEI+SLSGSFLL+E RSRTGGLSV+L+G
Sbjct: 64 ILSANGAISNVTLRQPATSGGLVTYEGRFEIISLSGSFLLAEDGDTRSRTGGLSVALAGS 123
Query: 242 DGRVLGGSVAGLLTAATPVQVVVGSFLADGRK-ESKSSHRMESLPVPPKLAPGGQPAGQC 300
DGRVLGG VAG+L AATPVQVVV SF+A+G+K + + ++E + PP++A PA
Sbjct: 124 DGRVLGGCVAGMLMAATPVQVVVASFIAEGKKSKPVETRKVEPMSAPPQMA-TYVPAPVA 182
Query: 301 SPPSRGTLSESSGGPGSPLNHS 322
SPPS GT S SS GSP+NHS
Sbjct: 183 SPPSEGTSSGSSDDSGSPINHS 204
>gi|326530712|dbj|BAK01154.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 416
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 149/274 (54%), Positives = 172/274 (62%), Gaps = 39/274 (14%)
Query: 71 EPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGR 130
E +K+KRGRPRKYGPDGT+ A+ + G S+P G K RGR
Sbjct: 134 ELVKKKRGRPRKYGPDGTLGSAV----KAEAGGQSGGAGSNSNPDG---------KRRGR 180
Query: 131 PPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAIS 190
PPGSG KK QL+ALGSAG FTPH+ITVK EDV+SKIMSFSQ GPR CI+SANGA+
Sbjct: 181 PPGSG--KKKQLDALGSAGTSFTPHIITVKPNEDVASKIMSFSQQGPRTTCIISANGALC 238
Query: 191 NVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSV 250
TLRQ ATSGG VTYEG F+ILSLSGSFLL+E RSRTGGLSV+L+G DGRV+GG V
Sbjct: 239 TATLRQPATSGGIVTYEGHFDILSLSGSFLLAEDGDTRSRTGGLSVALAGSDGRVVGGCV 298
Query: 251 AGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVPPKLAPGGQPAGQCSP-------- 302
AG+L AATPVQVVVGSF+A+G K+ K PK P P P
Sbjct: 299 AGMLMAATPVQVVVGSFIAEGNKKPKEEQ--------PKREPTSVPMPTSMPMQTASGFG 350
Query: 303 --------PSRGTLSESSGGPGSPLNHSTGACNN 328
PS GT S+ S PGSP+ + A NN
Sbjct: 351 AAASAAATPSDGTSSDHSDDPGSPIGPNGSAFNN 384
>gi|414886041|tpg|DAA62055.1| TPA: hypothetical protein ZEAMMB73_462098 [Zea mays]
Length = 390
Score = 246 bits (629), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 187/337 (55%), Positives = 217/337 (64%), Gaps = 42/337 (12%)
Query: 18 SIQSMRLAFSADGTAVYKPITATS-----PTYQPSGAGGDGAIPQAQGLNVMNMGSGSEP 72
S QS+R+A++ DGTA++ P+ + S TYQP G A+ + V G EP
Sbjct: 32 STQSLRMAYTTDGTAIFTPVISVSVLPATATYQPVGG---SAVTASSLAGVGGNGGAGEP 88
Query: 73 M-KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGG------GP-LSPDSI 124
+ K+KRGRPRKYGPDG+MSLALVP+ + A G SG SP G P SPD
Sbjct: 89 VPKKKRGRPRKYGPDGSMSLALVPASMATAPAPPGV-SGAFSPNGPKATNAAPSASPDGA 147
Query: 125 KKSRGRPPGSGSGKKHQ--LEALGSAGVGFTPHVITVKAGE-------------DVSSKI 169
KK RGRP GS + KKH L+ +GV K GE DVS+KI
Sbjct: 148 KK-RGRPKGS-TNKKHVPGLDLDCKSGVSKLVLYNPFKKGEISSLVGVYKEMVSDVSAKI 205
Query: 170 MSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRS 229
MSF QNG RAVC+LSANG +SNVTLRQ+ATSGGTVT+EGRFEILSLSGSFLLSE G RS
Sbjct: 206 MSFPQNGTRAVCVLSANGIVSNVTLRQSATSGGTVTHEGRFEILSLSGSFLLSEDGGHRS 265
Query: 230 RTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVPP- 288
RTGGLSVSL+GPDGRVLGGSVAGLLTAA+PVQ+VVG+F ADG K+ K + + P
Sbjct: 266 RTGGLSVSLAGPDGRVLGGSVAGLLTAASPVQIVVGTFDADGEKKPKQQQQQLAPSPPDP 325
Query: 289 -----KLAPGGQPAGQCSPPSRGT--LSESSGGPGSP 318
KLAP G AG SPPSRGT LSESSGG SP
Sbjct: 326 SPAPLKLAPTGVAAGPSSPPSRGTLSLSESSGGAPSP 362
>gi|42408802|dbj|BAD10063.1| putative AT-hook DNA-binding protein [Oryza sativa Japonica Group]
Length = 258
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 150/222 (67%), Positives = 174/222 (78%), Gaps = 5/222 (2%)
Query: 121 PDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAV 180
PD +KK RGRP GS K +++A+GSAGVGFTPHVITV AGEDVS+KIMSF+Q+G RAV
Sbjct: 42 PDGVKK-RGRP--KGSTNKPRIDAVGSAGVGFTPHVITVLAGEDVSAKIMSFAQHGNRAV 98
Query: 181 CILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSG 240
C+LSANGAISNVTLRQ ATSGGTVTYEGRFEILSLSGSFLL++ GQRSRTGGLSVSL+G
Sbjct: 99 CVLSANGAISNVTLRQTATSGGTVTYEGRFEILSLSGSFLLTDHGGQRSRTGGLSVSLAG 158
Query: 241 PDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVPPKLAPGGQPAGQC 300
PDGR+LGG VAGLL AATPVQ+VVGSF ++G+KE K E P K P G
Sbjct: 159 PDGRLLGGGVAGLLIAATPVQIVVGSFNSEGKKEPKQHAHSEPASAPSKAVPTAG-MGPN 217
Query: 301 SPPSRGTLSESSGGPGSPLNHSTGACNNNHLPQGMATGIPWK 342
SPPSRGTLSESSGG GSPL+ ++N P +++ +PWK
Sbjct: 218 SPPSRGTLSESSGGAGSPLHPGIAPPSSNSQPPFLSS-MPWK 258
>gi|294461874|gb|ADE76494.1| unknown [Picea sitchensis]
Length = 302
Score = 243 bits (621), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 158/273 (57%), Positives = 188/273 (68%), Gaps = 34/273 (12%)
Query: 63 VMNMGSGSEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPD 122
V++ S MKRKRGRPRKYGPDG+M+LAL P + G TGS
Sbjct: 26 VIHGASNVNTMKRKRGRPRKYGPDGSMALALSP----FSALPGMTGS------------S 69
Query: 123 SIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCI 182
S K+ RGRPPG+G +K QL ALGSAGVGFTPHVIT+ AGEDV++KIMSFSQ GPRAVCI
Sbjct: 70 SQKRGRGRPPGTG--RKQQLAALGSAGVGFTPHVITIAAGEDVATKIMSFSQQGPRAVCI 127
Query: 183 LSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPD 242
LSANGAISNVT+RQ A SGGTVTYEGRF+I+SLSGSFLL E++G R RTGGLS+SL+GPD
Sbjct: 128 LSANGAISNVTVRQPAASGGTVTYEGRFDIVSLSGSFLLMENNGAR-RTGGLSISLAGPD 186
Query: 243 GRVLGGSVAGLLTAATPVQVVVGSFLADGRK-ESKSSHRMESLPVPPKLAPGGQPAGQCS 301
GRV+GG VAG+L AA+PVQV+ GSF+ D +K + K + + S +P A
Sbjct: 187 GRVVGGVVAGMLMAASPVQVIAGSFILDSKKGQGKPENPVSSSGLPHVAA---------- 236
Query: 302 PPSRGTLSESSGGPG-SPLNHSTGACNNNHLPQ 333
G L GGPG SP N S+GA N + Q
Sbjct: 237 ---SGHLGAKHGGPGSSPFNPSSGASAINSVGQ 266
>gi|449462009|ref|XP_004148734.1| PREDICTED: uncharacterized protein LOC101204243 [Cucumis sativus]
gi|449511145|ref|XP_004163876.1| PREDICTED: uncharacterized LOC101204243 [Cucumis sativus]
Length = 362
Score = 240 bits (612), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 142/297 (47%), Positives = 193/297 (64%), Gaps = 21/297 (7%)
Query: 38 TATSPTYQPSGAGGDGAIPQAQ--GLNVMNMGSGSEPMKRKRGRPRKYGPDGTMSLALVP 95
+++ P+ P+ A DG+ + + G N+ + K+KRGRPRKY PDG ++L L P
Sbjct: 69 SSSKPSESPNAASYDGSQSELRTGGFNI-------DSGKKKRGRPRKYSPDGNIALGLSP 121
Query: 96 SPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPH 155
+P + ++A +G+ SP P KK+RGRPPG+G K Q++ALG+ GVGFTPH
Sbjct: 122 TPIT-SSAVPADSAGMHSPDPRP------KKNRGRPPGTG---KRQMDALGTGGVGFTPH 171
Query: 156 VITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSL 215
VI VK GED++SK+M+FSQ GPR VCILSA+GA+ NVTL Q A S G+V+YEGR+EI+SL
Sbjct: 172 VILVKPGEDIASKVMAFSQQGPRTVCILSAHGAVCNVTL-QPALSSGSVSYEGRYEIISL 230
Query: 216 SGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKES 275
SGSFL+SE++G RSR+GGLSVSL+ DG+VLGG + +LTAA+ VQV+VGSFL DG+K
Sbjct: 231 SGSFLISENNGNRSRSGGLSVSLASADGQVLGG-ITNMLTAASTVQVIVGSFLVDGKKLG 289
Query: 276 KSSHRMESLPVPPKLAPGGQPAGQCSPPSRGTLSESSGGPGSPLNHSTGACNNNHLP 332
S + P + G P P + + S GSPL+ G N + P
Sbjct: 290 ASIQKSGPSSTSPNMLNFGTPVAAGCPSEGASNNSSDDNGGSPLSRGPGMYTNANQP 346
>gi|302771533|ref|XP_002969185.1| hypothetical protein SELMODRAFT_410086 [Selaginella moellendorffii]
gi|300163690|gb|EFJ30301.1| hypothetical protein SELMODRAFT_410086 [Selaginella moellendorffii]
Length = 343
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 151/212 (71%), Positives = 163/212 (76%), Gaps = 30/212 (14%)
Query: 71 EPMKRKRGRPRKYGPDG-----TMSLALVP----SPSSVTTATGGTGSGLSSPGGGPLSP 121
EP+KRKRGRPRKYG DG ++SLAL P SP S T T
Sbjct: 40 EPVKRKRGRPRKYG-DGASGSSSVSLALTPLSSVSPISSVTTT----------------- 81
Query: 122 DSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVC 181
+K RGRPPGSG KK QL ALGSAG GFTPHVIT+ AGEDV++KIMSFSQ GPRAVC
Sbjct: 82 -PTEKRRGRPPGSG--KKQQLAALGSAGQGFTPHVITIAAGEDVATKIMSFSQTGPRAVC 138
Query: 182 ILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGP 241
+LSANGAISNVTLRQ ATSGGTVTYEGRFEILSLSGSFLL+ES G RSRTGGLSVSL+GP
Sbjct: 139 VLSANGAISNVTLRQPATSGGTVTYEGRFEILSLSGSFLLTESGGTRSRTGGLSVSLAGP 198
Query: 242 DGRVLGGSVAGLLTAATPVQVVVGSFLADGRK 273
DGRV+GG VAGLL AATPVQVVVGSF+AD RK
Sbjct: 199 DGRVVGGGVAGLLMAATPVQVVVGSFIADTRK 230
>gi|302784214|ref|XP_002973879.1| hypothetical protein SELMODRAFT_442286 [Selaginella moellendorffii]
gi|300158211|gb|EFJ24834.1| hypothetical protein SELMODRAFT_442286 [Selaginella moellendorffii]
Length = 407
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 151/212 (71%), Positives = 163/212 (76%), Gaps = 30/212 (14%)
Query: 71 EPMKRKRGRPRKYGPDG-----TMSLALVP----SPSSVTTATGGTGSGLSSPGGGPLSP 121
EP+KRKRGRPRKYG DG ++SLAL P SP S T T
Sbjct: 102 EPVKRKRGRPRKYG-DGASGSSSVSLALTPLSSVSPISSVTTT----------------- 143
Query: 122 DSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVC 181
+K RGRPPGSG KK QL ALGSAG GFTPHVIT+ AGEDV++KIMSFSQ GPRAVC
Sbjct: 144 -PTEKRRGRPPGSG--KKQQLAALGSAGQGFTPHVITIAAGEDVATKIMSFSQTGPRAVC 200
Query: 182 ILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGP 241
+LSANGAISNVTLRQ ATSGGTVTYEGRFEILSLSGSFLL+ES G RSRTGGLSVSL+GP
Sbjct: 201 VLSANGAISNVTLRQPATSGGTVTYEGRFEILSLSGSFLLTESGGTRSRTGGLSVSLAGP 260
Query: 242 DGRVLGGSVAGLLTAATPVQVVVGSFLADGRK 273
DGRV+GG VAGLL AATPVQVVVGSF+AD RK
Sbjct: 261 DGRVVGGGVAGLLMAATPVQVVVGSFIADTRK 292
>gi|125591456|gb|EAZ31806.1| hypothetical protein OsJ_15962 [Oryza sativa Japonica Group]
Length = 379
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 135/221 (61%), Positives = 162/221 (73%), Gaps = 15/221 (6%)
Query: 71 EPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGL--SSPGGGPLSPDSIKKSR 128
E +K+KRGRPRKYGPDG + L L P+ ++ T A G +G S+P G K R
Sbjct: 98 ELVKKKRGRPRKYGPDGNIGLGLKPAAAAGTEAGGPSGGAGSNSNPDG---------KRR 148
Query: 129 GRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGA 188
GRPPGSG KK QL+ALGS+G FTPH+ITVK EDV+SKIM+FSQ GPR CI+SANGA
Sbjct: 149 GRPPGSG--KKKQLDALGSSGTSFTPHIITVKPNEDVASKIMAFSQQGPRTTCIISANGA 206
Query: 189 ISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGG 248
+ TLRQ ATSGG VTYEG F+ILSLSGSFLL+E RSRTGGLSV+L+G DGR++GG
Sbjct: 207 LCTATLRQPATSGGIVTYEGHFDILSLSGSFLLAEDGDTRSRTGGLSVALAGSDGRIVGG 266
Query: 249 SVAGLLTAATPVQVVVGSFLADGRKESKSSHRME--SLPVP 287
VAG+L AATPVQVVVGSF+A+G+K + + E S P P
Sbjct: 267 CVAGMLMAATPVQVVVGSFIAEGKKGKEEHLKREPTSAPTP 307
>gi|115460204|ref|NP_001053702.1| Os04g0589900 [Oryza sativa Japonica Group]
gi|38346715|emb|CAE04865.2| OSJNBa0086O06.13 [Oryza sativa Japonica Group]
gi|89572596|dbj|BAC78598.2| hypothetical protein [Oryza sativa Japonica Group]
gi|113565273|dbj|BAF15616.1| Os04g0589900 [Oryza sativa Japonica Group]
gi|215697767|dbj|BAG91960.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 379
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 135/221 (61%), Positives = 162/221 (73%), Gaps = 15/221 (6%)
Query: 71 EPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGL--SSPGGGPLSPDSIKKSR 128
E +K+KRGRPRKYGPDG + L L P+ ++ T A G +G S+P G K R
Sbjct: 98 ELVKKKRGRPRKYGPDGNIGLGLKPAAAAGTEAGGPSGGAGSNSNPDG---------KRR 148
Query: 129 GRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGA 188
GRPPGSG KK QL+ALGS+G FTPH+ITVK EDV+SKIM+FSQ GPR CI+SANGA
Sbjct: 149 GRPPGSG--KKKQLDALGSSGTSFTPHIITVKPNEDVASKIMAFSQQGPRTTCIISANGA 206
Query: 189 ISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGG 248
+ TLRQ ATSGG VTYEG F+ILSLSGSFLL+E RSRTGGLSV+L+G DGR++GG
Sbjct: 207 LCTATLRQPATSGGIVTYEGHFDILSLSGSFLLAEDGDTRSRTGGLSVALAGSDGRIVGG 266
Query: 249 SVAGLLTAATPVQVVVGSFLADGRKESKSSHRME--SLPVP 287
VAG+L AATPVQVVVGSF+A+G+K + + E S P P
Sbjct: 267 CVAGMLMAATPVQVVVGSFIAEGKKGKEEHLKREPTSAPTP 307
>gi|125549527|gb|EAY95349.1| hypothetical protein OsI_17180 [Oryza sativa Indica Group]
Length = 379
Score = 238 bits (608), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 135/221 (61%), Positives = 162/221 (73%), Gaps = 15/221 (6%)
Query: 71 EPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGL--SSPGGGPLSPDSIKKSR 128
E +K+KRGRPRKYGPDG + L L P+ ++ T A G +G S+P G K R
Sbjct: 98 ELVKKKRGRPRKYGPDGNIGLGLKPAAAAGTEAGGPSGGAGSNSNPDG---------KRR 148
Query: 129 GRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGA 188
GRPPGSG KK QL+ALGS+G FTPH+ITVK EDV+SKIM+FSQ GPR CI+SANGA
Sbjct: 149 GRPPGSG--KKKQLDALGSSGTSFTPHIITVKPNEDVASKIMAFSQQGPRTTCIISANGA 206
Query: 189 ISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGG 248
+ TLRQ ATSGG VTYEG F+ILSLSGSFLL+E RSRTGGLSV+L+G DGR++GG
Sbjct: 207 LCTATLRQPATSGGIVTYEGHFDILSLSGSFLLAEDGDTRSRTGGLSVALAGSDGRIVGG 266
Query: 249 SVAGLLTAATPVQVVVGSFLADGRKESKSSHRME--SLPVP 287
VAG+L AATPVQVVVGSF+A+G+K + + E S P P
Sbjct: 267 CVAGMLMAATPVQVVVGSFIAEGKKGKEEHLKREPTSAPTP 307
>gi|225454180|ref|XP_002272142.1| PREDICTED: uncharacterized protein LOC100265498 [Vitis vinifera]
gi|297745264|emb|CBI40344.3| unnamed protein product [Vitis vinifera]
Length = 345
Score = 236 bits (603), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 139/217 (64%), Positives = 168/217 (77%), Gaps = 25/217 (11%)
Query: 70 SEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRG 129
SEP+KRKRGRPRKYGPDGT+SLAL SPSS T+ G L+ + K+ RG
Sbjct: 89 SEPVKRKRGRPRKYGPDGTVSLAL--SPSSATSP-------------GTLTASTQKRGRG 133
Query: 130 RPPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
RPPG+G +K QL +LG SAG+GFTPHVITV GEDV++KIMSFSQ GPRA+CILS
Sbjct: 134 RPPGTG--RKQQLASLGEWLSGSAGMGFTPHVITVAVGEDVATKIMSFSQQGPRAICILS 191
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
ANGA+S VTLRQ +TSGGTVTYEGRFEIL LSGS+LL+++ G R+RTGGLSVSL+ PDGR
Sbjct: 192 ANGAVSTVTLRQPSTSGGTVTYEGRFEILCLSGSYLLTDNGGSRNRTGGLSVSLASPDGR 251
Query: 245 VLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRM 281
V+GG V G+LTAA+PVQV+VGSF+ SK+ ++M
Sbjct: 252 VIGGGVGGMLTAASPVQVIVGSFIWG---NSKTKNKM 285
>gi|226499032|ref|NP_001148506.1| LOC100282121 [Zea mays]
gi|223943259|gb|ACN25713.1| unknown [Zea mays]
gi|413944406|gb|AFW77055.1| DNA-binding protein [Zea mays]
Length = 357
Score = 235 bits (599), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 172/351 (49%), Positives = 204/351 (58%), Gaps = 59/351 (16%)
Query: 24 LAFSADGTAVYKPITATSPTYQPSGAGGDGAIPQAQGLNVMNMGSGSEPMKRKRGRPRKY 83
L ++ DG AVY+ P + AG + +P A G + + SEP KRKRGRPRKY
Sbjct: 34 LFYTHDGVAVYR--NPVMPAFYQQPAGSNVVVPAAPG--PAHSPASSEPFKRKRGRPRKY 89
Query: 84 GP-DGTMSLALVPSPSSVTTATGGTGSGLSS---PGGGPLSPDS---------------- 123
P DG + LA+VP PS TA S S PG P SP S
Sbjct: 90 APADGAVPLAIVP-PSQPPTARAPATSEASPTVPPGFSP-SPQSGGVVSRQASPAPAPAS 147
Query: 124 ----IKKSRGRPPGSGSGKKH-QLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPR 178
+KK RGRP G S K+ Q A G G PH+ TV+AGEDV+S+ MSFS NG
Sbjct: 148 GAPDVKK-RGRPSGPSSKKQQPQAAAPGPGWTGLKPHIFTVQAGEDVASRAMSFSGNG-W 205
Query: 179 AVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSL 238
AVCIL+ANG +SNVTLRQ +SGGTVTYEGRFEILSL+GS+LLSES+G SRTGGLSVSL
Sbjct: 206 AVCILTANGTVSNVTLRQGESSGGTVTYEGRFEILSLAGSYLLSESTGMSSRTGGLSVSL 265
Query: 239 SGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVPPKLAPGGQPAG 298
+ PDG VLGG+VAG LTAA+PVQVV+GSFLAD + E L PG P
Sbjct: 266 ASPDGHVLGGAVAGPLTAASPVQVVIGSFLADTKME---------------LDPGSAPEK 310
Query: 299 QC-------SPPSRGTLSESSGGPGSPLNHSTGACNNNHLPQGMATGIPWK 342
S PSRGT ESSGG SP N +TG+ + + P G + WK
Sbjct: 311 HVFSRFQTTSSPSRGT--ESSGGHASPPN-TTGSFSTSTQP-GFPSFPTWK 357
>gi|195619874|gb|ACG31767.1| DNA-binding protein [Zea mays]
Length = 354
Score = 235 bits (599), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 172/351 (49%), Positives = 204/351 (58%), Gaps = 59/351 (16%)
Query: 24 LAFSADGTAVYKPITATSPTYQPSGAGGDGAIPQAQGLNVMNMGSGSEPMKRKRGRPRKY 83
L ++ DG AVY+ P + AG + +P A G + + SEP KRKRGRPRKY
Sbjct: 31 LFYTHDGVAVYR--NPVMPAFYQQPAGSNVVVPAAPG--PAHSPASSEPFKRKRGRPRKY 86
Query: 84 GP-DGTMSLALVPSPSSVTTATGGTGSGLSS---PGGGPLSPDS---------------- 123
P DG + LA+VP PS TA S S PG P SP S
Sbjct: 87 APADGAVPLAIVP-PSQPPTARAPATSEASPTVPPGFSP-SPQSGGVVSRQASPAPAPAS 144
Query: 124 ----IKKSRGRPPGSGSGKKH-QLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPR 178
+KK RGRP G S K+ Q A G G PH+ TV+AGEDV+S+ MSFS NG
Sbjct: 145 GAPDVKK-RGRPSGPSSKKQQPQAAAPGPGWTGLKPHIFTVQAGEDVASRAMSFSGNG-W 202
Query: 179 AVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSL 238
AVCIL+ANG +SNVTLRQ +SGGTVTYEGRFEILSL+GS+LLSES+G SRTGGLSVSL
Sbjct: 203 AVCILTANGTVSNVTLRQGESSGGTVTYEGRFEILSLAGSYLLSESTGMSSRTGGLSVSL 262
Query: 239 SGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVPPKLAPGGQPAG 298
+ PDG VLGG+VAG LTAA+PVQVV+GSFLAD + E L PG P
Sbjct: 263 ASPDGHVLGGAVAGPLTAASPVQVVIGSFLADTKME---------------LDPGSAPEK 307
Query: 299 QC-------SPPSRGTLSESSGGPGSPLNHSTGACNNNHLPQGMATGIPWK 342
S PSRGT ESSGG SP N +TG+ + + P G + WK
Sbjct: 308 HVFSRFQTTSSPSRGT--ESSGGHASPPN-TTGSFSTSTQP-GFPSFPTWK 354
>gi|212722288|ref|NP_001131389.1| uncharacterized protein LOC100192715 [Zea mays]
gi|194691394|gb|ACF79781.1| unknown [Zea mays]
Length = 307
Score = 233 bits (594), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 137/205 (66%), Positives = 154/205 (75%), Gaps = 7/205 (3%)
Query: 122 DSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVC 181
D K RGRPPGSG KK Q EALGS G+ FTPH++ VKAGEDV+SKIM+FSQ GPR VC
Sbjct: 56 DPSAKRRGRPPGSG--KKKQFEALGSWGIAFTPHILAVKAGEDVASKIMTFSQQGPRTVC 113
Query: 182 ILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGP 241
ILSANGAISNVTLRQ ATSGG VTYEGRFEI+SLSGSFLL+E RSRTGGLSV+L+G
Sbjct: 114 ILSANGAISNVTLRQPATSGGLVTYEGRFEIISLSGSFLLAEDGDTRSRTGGLSVALAGS 173
Query: 242 DGRVLGGSVAGLLTAATPVQVVVGSFLADGRK----ESKSSHRMESLPVPPKLAPGGQPA 297
DGRVLGG VAG+L AATPVQVVV SF+A+G+K E++ M + P PP PA
Sbjct: 174 DGRVLGGCVAGMLMAATPVQVVVASFIAEGKKSKPAEARKVEPMAAPPPPPPQMAAFVPA 233
Query: 298 -GQCSPPSRGTLSESSGGPGSPLNH 321
SPPS GT S SS GSP+NH
Sbjct: 234 PVATSPPSEGTSSASSDDSGSPINH 258
>gi|219887663|gb|ACL54206.1| unknown [Zea mays]
Length = 290
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 137/205 (66%), Positives = 154/205 (75%), Gaps = 7/205 (3%)
Query: 122 DSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVC 181
D K RGRPPGSG KK Q EALGS G+ FTPH++ VKAGEDV+SKIM+FSQ GPR VC
Sbjct: 39 DPSAKRRGRPPGSG--KKKQFEALGSWGIAFTPHILAVKAGEDVASKIMTFSQQGPRTVC 96
Query: 182 ILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGP 241
ILSANGAISNVTLRQ ATSGG VTYEGRFEI+SLSGSFLL+E RSRTGGLSV+L+G
Sbjct: 97 ILSANGAISNVTLRQPATSGGLVTYEGRFEIISLSGSFLLAEDGDTRSRTGGLSVALAGS 156
Query: 242 DGRVLGGSVAGLLTAATPVQVVVGSFLADGRK----ESKSSHRMESLPVPPKLAPGGQPA 297
DGRVLGG VAG+L AATPVQVVV SF+A+G+K E++ M + P PP PA
Sbjct: 157 DGRVLGGCVAGMLMAATPVQVVVASFIAEGKKSKPAEARKVEPMAAPPPPPPQMAAFVPA 216
Query: 298 -GQCSPPSRGTLSESSGGPGSPLNH 321
SPPS GT S SS GSP+NH
Sbjct: 217 PVATSPPSEGTSSASSDDSGSPINH 241
>gi|224031515|gb|ACN34833.1| unknown [Zea mays]
Length = 267
Score = 231 bits (589), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 156/231 (67%), Positives = 176/231 (76%), Gaps = 13/231 (5%)
Query: 89 MSLALVPSPSS---VTTATGGTG----SGLSSPGGGP-LSPDSIKKSRGRPPGSGSGKKH 140
MSLALVP+ + A G +G +G +P P SPD KK RGRP GS + KKH
Sbjct: 1 MSLALVPASMAGEPAPAALGASGPFSPNGPKAPNTAPSASPDGAKK-RGRPKGS-TNKKH 58
Query: 141 QLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATS 200
+ ALG AG GFTPH+I VKAGEDVS+KIMSFSQ+G RAVCILSANGAISNVTLRQ+ATS
Sbjct: 59 -VAALGPAGAGFTPHLIFVKAGEDVSAKIMSFSQHGTRAVCILSANGAISNVTLRQSATS 117
Query: 201 GGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPV 260
GGTVTYEGRFEILSLSGSFLLSE+ GQRSRTGGLSVSL+GPDGRVLGG VAGLLTAA+PV
Sbjct: 118 GGTVTYEGRFEILSLSGSFLLSENGGQRSRTGGLSVSLAGPDGRVLGGCVAGLLTAASPV 177
Query: 261 QVVVGSFLADG--RKESKSSHRMESLPVPPKLAPGGQPAGQCSPPSRGTLS 309
Q+VVGSF A G + + + ++ P P LAP G AG SPPSRGTLS
Sbjct: 178 QIVVGSFDAGGKKQPKQQQQQQLAPSPAPLNLAPTGVAAGPSSPPSRGTLS 228
>gi|302784042|ref|XP_002973793.1| hypothetical protein SELMODRAFT_36429 [Selaginella moellendorffii]
gi|302803700|ref|XP_002983603.1| hypothetical protein SELMODRAFT_36449 [Selaginella moellendorffii]
gi|300148846|gb|EFJ15504.1| hypothetical protein SELMODRAFT_36449 [Selaginella moellendorffii]
gi|300158125|gb|EFJ24748.1| hypothetical protein SELMODRAFT_36429 [Selaginella moellendorffii]
Length = 186
Score = 230 bits (587), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 116/149 (77%), Positives = 134/149 (89%), Gaps = 2/149 (1%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
+K RGRPPG+G KK QL ALGSAG GFTPHVIT+ AGEDV+++I+SF+Q GPRA C+LS
Sbjct: 37 EKKRGRPPGTG--KKQQLAALGSAGQGFTPHVITIAAGEDVATRIISFAQIGPRATCVLS 94
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
ANGAISNVTLRQ ATSGGTVTYEGRFEILSLSGSFLL+E+ +SR+GGLSVSL+GPDGR
Sbjct: 95 ANGAISNVTLRQPATSGGTVTYEGRFEILSLSGSFLLTENGNTKSRSGGLSVSLAGPDGR 154
Query: 245 VLGGSVAGLLTAATPVQVVVGSFLADGRK 273
V+GGSVAGLL AA+PVQVVVGSF+A+ RK
Sbjct: 155 VIGGSVAGLLVAASPVQVVVGSFIAETRK 183
>gi|133907524|gb|ABO42262.1| AT-hook DNA-binding protein [Gossypium hirsutum]
Length = 340
Score = 229 bits (585), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 130/238 (54%), Positives = 167/238 (70%), Gaps = 34/238 (14%)
Query: 71 EPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGR 130
E +KRKRGRPRKYGPDGT+SLAL P+ S+ G ++P K+ RGR
Sbjct: 87 ETVKRKRGRPRKYGPDGTVSLALTPA---------------SATHPGTITPIQ-KRGRGR 130
Query: 131 PPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSA 185
PPG+G +K QL +LG SAG+GFTPHVIT+ GED+++K+MSFSQ GPR VCILSA
Sbjct: 131 PPGTG--RKQQLSSLGELLSGSAGMGFTPHVITIAIGEDIATKLMSFSQQGPREVCILSA 188
Query: 186 NGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRV 245
NGA+S VTLR+ ++SGGTVTYEGRFEIL LSGS+LL+ ++G R+RTGGLSVSL+ PDGR
Sbjct: 189 NGAVSTVTLRKPSSSGGTVTYEGRFEILCLSGSYLLTSNTGSRNRTGGLSVSLASPDGRA 248
Query: 246 LGGSVAGLLTAATPVQVVVGSFLADGRK-----------ESKSSHRMESLPVPPKLAP 292
+GG V G+L AA+PVQV+VGSF+ G K + +++L PP ++P
Sbjct: 249 IGGGVGGMLIAASPVQVIVGSFIWGGSKAKNKKGGQEGIKDSDDQMVDNLVAPPGISP 306
>gi|224130006|ref|XP_002320727.1| predicted protein [Populus trichocarpa]
gi|222861500|gb|EEE99042.1| predicted protein [Populus trichocarpa]
Length = 324
Score = 229 bits (584), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 127/218 (58%), Positives = 156/218 (71%), Gaps = 16/218 (7%)
Query: 74 KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPG 133
K+KRGRPRKYGPDG ++ AL P P ++ + TG S+ G + P S +K +
Sbjct: 57 KKKRGRPRKYGPDGAVARALSPMP--ISASAPHTGGDYSAGKPGKVWPGSYEKKKY---- 110
Query: 134 SGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGA 188
KK +E LG S G FTPHVITV AGEDV+ K++SFSQ GPRA+CILSANG
Sbjct: 111 ----KKMGMENLGEWAANSVGTNFTPHVITVNAGEDVTMKVISFSQQGPRAICILSANGV 166
Query: 189 ISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGG 248
ISNVTLRQ +SGGT+TYEGRFEILSLSGSF+ +E G RSR+GG+SVSL+ PDGRV+GG
Sbjct: 167 ISNVTLRQPDSSGGTLTYEGRFEILSLSGSFMPTEIQGTRSRSGGMSVSLASPDGRVVGG 226
Query: 249 SVAGLLTAATPVQVVVGSFLADGRKESKSSH-RMESLP 285
SVAGLL AA+PVQVVVGSFL +E K +++S+P
Sbjct: 227 SVAGLLVAASPVQVVVGSFLPGNHQEQKPKKPKIDSIP 264
>gi|255541558|ref|XP_002511843.1| DNA binding protein, putative [Ricinus communis]
gi|223549023|gb|EEF50512.1| DNA binding protein, putative [Ricinus communis]
Length = 340
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 137/241 (56%), Positives = 167/241 (69%), Gaps = 34/241 (14%)
Query: 71 EPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGR 130
EP+KRKRGRPRKYGPDGT+SLAL PS S+ G ++P K+ RGR
Sbjct: 87 EPVKRKRGRPRKYGPDGTVSLALSPSLSTHP---------------GTITPTQ-KRGRGR 130
Query: 131 PPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSA 185
PPG+G +K QL +LG SAG+GFTPH+IT+ GED+++KIMSFSQ GPRA+CILSA
Sbjct: 131 PPGTG--RKQQLASLGEWLSGSAGMGFTPHIITIAVGEDIATKIMSFSQQGPRAICILSA 188
Query: 186 NGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRV 245
NGA+S VTLRQ +TSGG+VTYEGRFEIL LSGS+L++ + G R+RTGGLSVSL+ PDGRV
Sbjct: 189 NGAVSTVTLRQPSTSGGSVTYEGRFEILCLSGSYLVTSNGGSRNRTGGLSVSLASPDGRV 248
Query: 246 LGGSVAGLLTAATPVQVVVGSFLADGRKESK-----------SSHRMESLPVPPKLAPGG 294
+GG V G+L AA+PVQV+VGSFL G K S H+ PV P P
Sbjct: 249 IGGGVGGMLIAASPVQVIVGSFLWGGSKAKNKKGEGPEGARDSDHQTVENPVTPSSVPPS 308
Query: 295 Q 295
Q
Sbjct: 309 Q 309
>gi|125603988|gb|EAZ43313.1| hypothetical protein OsJ_27909 [Oryza sativa Japonica Group]
Length = 242
Score = 228 bits (580), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 138/197 (70%), Positives = 157/197 (79%), Gaps = 2/197 (1%)
Query: 146 GSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVT 205
GSAGVGFTPHVITV AGEDVS+KIMSF+Q+G RAVC+LSANGAISNVTLRQ ATSGGTVT
Sbjct: 48 GSAGVGFTPHVITVLAGEDVSAKIMSFAQHGNRAVCVLSANGAISNVTLRQTATSGGTVT 107
Query: 206 YEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVG 265
YEGRFEILSLSGSFLL++ GQRSRTGGLSVSL+GPDGR+LGG VAGLL AATPVQ+VVG
Sbjct: 108 YEGRFEILSLSGSFLLTDHGGQRSRTGGLSVSLAGPDGRLLGGGVAGLLIAATPVQIVVG 167
Query: 266 SFLADGRKESKSSHRMESLPVPPKLAPGGQPAGQCSPPSRGTLSESSGGPGSPLNHSTGA 325
SF ++G+KE K E P K P G SPPSRGTLSESSGG GSPL+
Sbjct: 168 SFNSEGKKEPKQHAHSEPASAPSKAVPTAG-MGPNSPPSRGTLSESSGGAGSPLHPGIAP 226
Query: 326 CNNNHLPQGMATGIPWK 342
++N P +++ +PWK
Sbjct: 227 PSSNSQPPFLSS-MPWK 242
>gi|449518609|ref|XP_004166329.1| PREDICTED: uncharacterized LOC101203138 [Cucumis sativus]
Length = 334
Score = 227 bits (579), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 141/216 (65%), Positives = 162/216 (75%), Gaps = 23/216 (10%)
Query: 70 SEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRG 129
SEP+KRKRGRPRKYG +GT+SLAL PSPS+V AT SSP K+ RG
Sbjct: 79 SEPVKRKRGRPRKYGTEGTVSLALSPSPSAVNPATVA-----SSP----------KRGRG 123
Query: 130 RPPGSGSGKKHQLEAL-----GSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
RPPGSG KK QL +L GSAG+GFTPHVIT+ GEDV++KIMSFSQ GPR VCILS
Sbjct: 124 RPPGSG--KKQQLASLCETLSGSAGMGFTPHVITIGIGEDVAAKIMSFSQQGPRVVCILS 181
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
ANGA+S VTLRQ +TSGGTVTYEGRFEI+ LSGS+ L E +G R+RTGGLSVSL+ PDGR
Sbjct: 182 ANGAVSTVTLRQPSTSGGTVTYEGRFEIICLSGSYALGEIAGSRNRTGGLSVSLASPDGR 241
Query: 245 VLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHR 280
V+GG V G L AATPVQV+VGSF+ G +SK R
Sbjct: 242 VIGGGVGGALVAATPVQVIVGSFMW-GSSKSKYKKR 276
>gi|449441474|ref|XP_004138507.1| PREDICTED: uncharacterized protein LOC101203138 [Cucumis sativus]
Length = 334
Score = 227 bits (579), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 141/216 (65%), Positives = 162/216 (75%), Gaps = 23/216 (10%)
Query: 70 SEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRG 129
SEP+KRKRGRPRKYG +GT+SLAL PSPS+V AT SSP K+ RG
Sbjct: 79 SEPVKRKRGRPRKYGTEGTVSLALSPSPSAVNPATVA-----SSP----------KRGRG 123
Query: 130 RPPGSGSGKKHQLEAL-----GSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
RPPGSG KK QL +L GSAG+GFTPHVIT+ GEDV++KIMSFSQ GPR VCILS
Sbjct: 124 RPPGSG--KKQQLASLCETLSGSAGMGFTPHVITIGIGEDVAAKIMSFSQQGPRVVCILS 181
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
ANGA+S VTLRQ +TSGGTVTYEGRFEI+ LSGS+ L E +G R+RTGGLSVSL+ PDGR
Sbjct: 182 ANGAVSTVTLRQPSTSGGTVTYEGRFEIICLSGSYALGEIAGSRNRTGGLSVSLASPDGR 241
Query: 245 VLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHR 280
V+GG V G L AATPVQV+VGSF+ G +SK R
Sbjct: 242 VIGGGVGGALVAATPVQVIVGSFMW-GSSKSKYKKR 276
>gi|225441014|ref|XP_002277536.1| PREDICTED: uncharacterized protein LOC100254577 [Vitis vinifera]
Length = 361
Score = 227 bits (578), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 131/268 (48%), Positives = 169/268 (63%), Gaps = 24/268 (8%)
Query: 74 KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPG 133
++KRGRPRKY DG + L+ SP T + + S K+ RGRPPG
Sbjct: 75 RKKRGRPRKYDADGNLRLSYAVSPPPGFTLSSPSSDFSS------------KRGRGRPPG 122
Query: 134 SGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGA 188
SG+ + L +LG +AG FTPHV+TV GEDV+SKI+SFSQ GPR +C+LSANGA
Sbjct: 123 SGNWQ--LLASLGELFANTAGGDFTPHVVTVNTGEDVASKILSFSQKGPRGICVLSANGA 180
Query: 189 ISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGG 248
+SNVT+RQ +SGG +TYEGRFEILSLSGSF +S+S G RSRTGGLSVSL+GPDGRV+GG
Sbjct: 181 VSNVTIRQPGSSGGILTYEGRFEILSLSGSFTVSDSGGARSRTGGLSVSLAGPDGRVIGG 240
Query: 249 SVAGLLTAATPVQVVVGSFLADGRKESKSSHRME-----SLPVPPKLAPGGQPAGQCSPP 303
+AG+LTAA P+Q+VVGSF+ +G K K H E +P P +P Q +P
Sbjct: 241 GIAGILTAAGPIQIVVGSFMPNGYKTHKRKHHREPTTTSIIPPAPDTVTAARPISQAAPE 300
Query: 304 SRGTLSESSGGPGSPLNHSTGACNNNHL 331
L+ +S G + + NN +
Sbjct: 301 VGPCLNSTSPSHGQSHGEADDSVNNKQI 328
>gi|297740052|emb|CBI30234.3| unnamed protein product [Vitis vinifera]
Length = 324
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 131/268 (48%), Positives = 169/268 (63%), Gaps = 24/268 (8%)
Query: 74 KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPG 133
++KRGRPRKY DG + L+ SP T + + S K+ RGRPPG
Sbjct: 38 RKKRGRPRKYDADGNLRLSYAVSPPPGFTLSSPSSDFSS------------KRGRGRPPG 85
Query: 134 SGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGA 188
SG+ + L +LG +AG FTPHV+TV GEDV+SKI+SFSQ GPR +C+LSANGA
Sbjct: 86 SGNWQ--LLASLGELFANTAGGDFTPHVVTVNTGEDVASKILSFSQKGPRGICVLSANGA 143
Query: 189 ISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGG 248
+SNVT+RQ +SGG +TYEGRFEILSLSGSF +S+S G RSRTGGLSVSL+GPDGRV+GG
Sbjct: 144 VSNVTIRQPGSSGGILTYEGRFEILSLSGSFTVSDSGGARSRTGGLSVSLAGPDGRVIGG 203
Query: 249 SVAGLLTAATPVQVVVGSFLADGRKESKSSHRME-----SLPVPPKLAPGGQPAGQCSPP 303
+AG+LTAA P+Q+VVGSF+ +G K K H E +P P +P Q +P
Sbjct: 204 GIAGILTAAGPIQIVVGSFMPNGYKTHKRKHHREPTTTSIIPPAPDTVTAARPISQAAPE 263
Query: 304 SRGTLSESSGGPGSPLNHSTGACNNNHL 331
L+ +S G + + NN +
Sbjct: 264 VGPCLNSTSPSHGQSHGEADDSVNNKQI 291
>gi|357159090|ref|XP_003578335.1| PREDICTED: uncharacterized protein LOC100826497 [Brachypodium
distachyon]
Length = 383
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 158/353 (44%), Positives = 201/353 (56%), Gaps = 34/353 (9%)
Query: 20 QSMRLAFSADGTAVYKPITATSPTYQPSGAGGDGAIPQAQGLNVMNMGSGSEPMKRKRGR 79
Q+M LA++A+G Y AG D + G + + G+ E ++K G+
Sbjct: 35 QNMHLAYTAEGRPYYAQTAQNQSGGGDGAAGPDADAAEGNG-SPEHQGNMEEMARKKSGQ 93
Query: 80 PRKYGPDGTMSLALVP--SPSSVTTATGGTGSGLSSPGGGPL---SPDSIKKSRGRPPGS 134
P DG+MS ALVP +P+ VT GT S + G + +P +KK RGRP GS
Sbjct: 94 PSNEDSDGSMSAALVPVPNPAEVTPGASGTLSPAARNTAGTVPSAAPVGMKK-RGRPKGS 152
Query: 135 GSGKKHQL---EALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISN 191
+ K Q + G G FTPH I V AGEDV++KIMSFSQ+G R VC+LSANGAISN
Sbjct: 153 TNKVKKQKSVPDTTGFVGAHFTPHAICVNAGEDVAAKIMSFSQHGSRGVCVLSANGAISN 212
Query: 192 VTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVA 251
VT+RQA TSGGTVTYEGRFEILSLSGSFL SE+ G RSRTGGLSVSL+ +GRVLGG VA
Sbjct: 213 VTIRQADTSGGTVTYEGRFEILSLSGSFLESENGGHRSRTGGLSVSLASSNGRVLGGGVA 272
Query: 252 GLLTAATPVQVVVGSFLADGRKESKSSHRME-------------------SLPVPPKLAP 292
GLLTAATP+Q++VGSF K++ R ++ VP P
Sbjct: 273 GLLTAATPIQIIVGSFDTATEKKAPKKQRAPSDPSSSSAPPQMAPVIASAAMAVPAVTTP 332
Query: 293 GGQPAGQC---SPPSRGTLSESSGGPGSPLNHSTGACNNNHLPQGMATGIPWK 342
+P + G ESS G+ LNH A N+N QG+++ + WK
Sbjct: 333 VAEPIAPVPLSVAMAAGPSGESSSAAGNQLNHGATA-NDNTQNQGLSS-MSWK 383
>gi|356512006|ref|XP_003524712.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Glycine max]
Length = 288
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 137/241 (56%), Positives = 172/241 (71%), Gaps = 7/241 (2%)
Query: 74 KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDS-IKKSRGRPP 132
K+KRGRPRKY PDG ++L L P+ +S A G G G S G S D+ KK RGRPP
Sbjct: 8 KKKRGRPRKYSPDGNIALRLAPTHASPPAAASGGGGGGDSAGMA--SADAPAKKHRGRPP 65
Query: 133 GSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNV 192
GSG K QL+ALG+ GVGFTPHVI V++GED+++KIM+FSQ GPR VCILSA GAI NV
Sbjct: 66 GSG---KKQLDALGAGGVGFTPHVILVESGEDITAKIMAFSQQGPRTVCILSAIGAIGNV 122
Query: 193 TLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAG 252
TL+Q+A +GG TYEGRFEI+SLSGS SE++ +RSRT L+V+L+G DGRVLGG VAG
Sbjct: 123 TLQQSAMTGGIATYEGRFEIISLSGSLQQSENNSERSRTCTLNVTLAGSDGRVLGGGVAG 182
Query: 253 LLTAATPVQVVVGSFLADGRKESKSSHRMESLPV-PPKLAPGGQPAGQCSPPSRGTLSES 311
L AA+ VQV+VGSF+AD +K S ++ + S PP++ G SP S+G +ES
Sbjct: 183 TLIAASTVQVIVGSFIADAKKSSSNALKSGSSSAPPPQMLTFGSSMTPNSPTSQGPSTES 242
Query: 312 S 312
S
Sbjct: 243 S 243
>gi|358249184|ref|NP_001239751.1| uncharacterized protein LOC100814615 [Glycine max]
gi|255636132|gb|ACU18409.1| unknown [Glycine max]
Length = 341
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 132/208 (63%), Positives = 161/208 (77%), Gaps = 21/208 (10%)
Query: 71 EPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGR 130
EP+KRKRGRPRKYG DG++SLAL P+P+S S PG LS S K+ RGR
Sbjct: 82 EPVKRKRGRPRKYGTDGSVSLALTPTPTSS-----------SHPGA--LS-QSQKRGRGR 127
Query: 131 PPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSA 185
PPG+G KK QL +LG SAG+GFTPH+I + +GED+++KIM+FSQ GPR VCILSA
Sbjct: 128 PPGTG--KKQQLASLGELMSGSAGMGFTPHIINIASGEDIATKIMAFSQQGPRVVCILSA 185
Query: 186 NGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRV 245
NGA+S VTLRQ +TSGGTVTYEGRFEI+ LSGS+L++E+ G R+RTGGLSVSL+ PDGRV
Sbjct: 186 NGAVSTVTLRQPSTSGGTVTYEGRFEIVCLSGSYLVTENGGSRNRTGGLSVSLASPDGRV 245
Query: 246 LGGSVAGLLTAATPVQVVVGSFLADGRK 273
+GG V G+L A++PVQVVVGSFL G K
Sbjct: 246 IGGGVGGVLIASSPVQVVVGSFLWGGSK 273
>gi|125575772|gb|EAZ17056.1| hypothetical protein OsJ_32550 [Oryza sativa Japonica Group]
Length = 274
Score = 224 bits (570), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 122/178 (68%), Positives = 143/178 (80%), Gaps = 2/178 (1%)
Query: 146 GSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVT 205
GS G+ FTPH++TVKAGEDV+SKIM+FSQ GPR VCILSANGAISNVTLRQ ATSGG VT
Sbjct: 51 GSWGIAFTPHILTVKAGEDVASKIMAFSQQGPRTVCILSANGAISNVTLRQPATSGGLVT 110
Query: 206 YEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVG 265
YEGRFEI+SLSGSFLL+E RSRTGGLSV+L+G DGRVLGG VAG+L AATPVQVVV
Sbjct: 111 YEGRFEIISLSGSFLLAEDGDTRSRTGGLSVALAGSDGRVLGGCVAGMLMAATPVQVVVA 170
Query: 266 SFLADGRK-ESKSSHRMESLPVPPKLAPGGQPAGQCSPPSRGTLSESSGGPGSPLNHS 322
SF+A+G+K + + ++E + PP++A PA SPPS GT S SS GSP+NHS
Sbjct: 171 SFIAEGKKSKPVETRKVEPMSAPPQMA-TYVPAPVASPPSEGTSSGSSDDSGSPINHS 227
>gi|224120210|ref|XP_002318273.1| predicted protein [Populus trichocarpa]
gi|222858946|gb|EEE96493.1| predicted protein [Populus trichocarpa]
Length = 345
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 130/230 (56%), Positives = 156/230 (67%), Gaps = 27/230 (11%)
Query: 74 KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPD--SIKKSRGRP 131
K+KRGRPRKY PDGT++LAL P P S + PL+ D + K+ RGRP
Sbjct: 92 KKKRGRPRKYAPDGTLALALSPMPISSSI---------------PLTGDYYAWKRGRGRP 136
Query: 132 PGSGSGKKHQLEALGS-------AGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
S K+H E + G F PHVITV AGEDV+ K+MSFSQ G RA+CILS
Sbjct: 137 LES-VKKQHNYEYESTGDKIAYFVGTNFMPHVITVNAGEDVTMKVMSFSQQGARAICILS 195
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
ANG ISNVTLRQ +SGGT+TYEGRFEILSLSGSF+ SE+ G + R+GG+SVSL+GPDGR
Sbjct: 196 ANGTISNVTLRQPTSSGGTLTYEGRFEILSLSGSFMPSENGGTKGRSGGMSVSLAGPDGR 255
Query: 245 VLGGSVAGLLTAATPVQVVVGSFLADGRKESK-SSHRME-SLPVPPKLAP 292
V+GG +AGLL AA PVQVVVGSFL ++ESK R+E +L V P P
Sbjct: 256 VVGGGLAGLLVAAGPVQVVVGSFLLGHQQESKHKKQRIEPALAVIPATIP 305
>gi|356568374|ref|XP_003552386.1| PREDICTED: uncharacterized protein LOC100802542 isoform 1 [Glycine
max]
gi|356568376|ref|XP_003552387.1| PREDICTED: uncharacterized protein LOC100802542 isoform 2 [Glycine
max]
Length = 342
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 129/208 (62%), Positives = 159/208 (76%), Gaps = 21/208 (10%)
Query: 71 EPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGR 130
EP+KRKRGRPRKYG DG++SLAL P+P+S S PG S K+ RGR
Sbjct: 82 EPVKRKRGRPRKYGTDGSVSLALTPTPTSS-----------SYPGA---LTQSQKRGRGR 127
Query: 131 PPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSA 185
PPG+G KK QL +LG SAG+GFTPH+I + +GED+++KIM+FSQ G RAVCILSA
Sbjct: 128 PPGTG--KKQQLASLGELMSGSAGMGFTPHIINIASGEDITTKIMAFSQQGARAVCILSA 185
Query: 186 NGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRV 245
NGA+S VTLRQ +TSGGTVTYEGRFEI+ LSGS+L++++ G R+RTGGLSVSL+ PDGRV
Sbjct: 186 NGAVSTVTLRQPSTSGGTVTYEGRFEIVCLSGSYLVTDNGGSRNRTGGLSVSLASPDGRV 245
Query: 246 LGGSVAGLLTAATPVQVVVGSFLADGRK 273
+GG V G+L A++PVQVVVGSFL G K
Sbjct: 246 IGGGVGGVLIASSPVQVVVGSFLWGGSK 273
>gi|168066999|ref|XP_001785415.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162662973|gb|EDQ49767.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 483
Score = 221 bits (562), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 125/226 (55%), Positives = 154/226 (68%), Gaps = 36/226 (15%)
Query: 69 GSEPMKRKRGRPRKYGPDGTMSL-----------ALVPSPSSVTTATGGTGSGLSSPGGG 117
G +P+KRKRGRPRK+ S A++P+PSS
Sbjct: 116 GEQPLKRKRGRPRKFSTGSEFSPGTPGAGYPVFPAIMPAPSS------------------ 157
Query: 118 PLSPDSIKKSRGRPPGSGSGKKHQLEALGSA----GVGFTPHVITVKAGEDVSSKIMSFS 173
P +P K+ RGRP +GSGK+ QL ALG G GFTPH++TV GEDV++KIM F+
Sbjct: 158 PYTPSPDKRGRGRP--TGSGKRQQLAALGVVLAGTGQGFTPHILTVNTGEDVATKIMQFA 215
Query: 174 QNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSE-SSGQRSRTG 232
Q+GPRA+C+LSANGAISNVTLRQ +SGGTVTYEGR+EILSLSGS+L ++ G R RTG
Sbjct: 216 QHGPRAMCVLSANGAISNVTLRQQLSSGGTVTYEGRYEILSLSGSYLPTDLGGGARQRTG 275
Query: 233 GLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSS 278
GLSVSL+G DGRV+GG VAG+LTAA+P+QVVVGSFL+D K S
Sbjct: 276 GLSVSLAGSDGRVIGGGVAGMLTAASPIQVVVGSFLSDAYKSQPKS 321
>gi|255575345|ref|XP_002528575.1| DNA binding protein, putative [Ricinus communis]
gi|223531971|gb|EEF33783.1| DNA binding protein, putative [Ricinus communis]
Length = 408
Score = 220 bits (560), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 124/227 (54%), Positives = 156/227 (68%), Gaps = 23/227 (10%)
Query: 74 KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSP-------GGGPLSPD---- 122
K+KRGRPRKY +G + + +V+ ATG L+SP P PD
Sbjct: 92 KKKRGRPRKYDSEGNLRVQPFNHYQAVSAATGA----LTSPPPTTPAFSFSPSPPDHGFN 147
Query: 123 -SIKKSRGRPPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNG 176
S K+ RGRPPGSG+ + L +LG +AG FTPHV+TV GEDV+ KI SF+Q G
Sbjct: 148 SSSKRGRGRPPGSGNWQL--LASLGELFANTAGGDFTPHVVTVNTGEDVAGKIHSFAQKG 205
Query: 177 PRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSV 236
PR +CILSANGA+SNVT+RQ +SGG +TYEGRFEILSLSGSF +SE+ G RSRTGGLSV
Sbjct: 206 PRGICILSANGAVSNVTIRQPGSSGGILTYEGRFEILSLSGSFTVSENGGVRSRTGGLSV 265
Query: 237 SLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMES 283
SL+ PDGRV+GG +AGLL AA+P+Q+V+GSF+ +G K K H E+
Sbjct: 266 SLASPDGRVIGGGIAGLLLAASPIQIVMGSFMPNGYKVHKKKHHREN 312
>gi|22328578|ref|NP_192945.2| AT-hook motif nuclear-localized protein 1 [Arabidopsis thaliana]
gi|17979485|gb|AAL50079.1| AT4g12080/F16J13_150 [Arabidopsis thaliana]
gi|23506149|gb|AAN31086.1| At4g12080/F16J13_150 [Arabidopsis thaliana]
gi|118420990|dbj|BAF37220.1| AT-hook motif nuclear localized protein 1 [Arabidopsis thaliana]
gi|332657694|gb|AEE83094.1| AT-hook motif nuclear-localized protein 1 [Arabidopsis thaliana]
Length = 356
Score = 220 bits (560), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 128/221 (57%), Positives = 156/221 (70%), Gaps = 17/221 (7%)
Query: 73 MKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPD-SIKKSRGRP 131
MK+KRGRPRKYGPDGT+ +AL P P S A S L P + S K+S+ +P
Sbjct: 88 MKKKRGRPRKYGPDGTV-VALSPKPISSAPAP----SHLPPPSSHVIDFSASEKRSKVKP 142
Query: 132 PGSGSGKK--HQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
S + K HQ+E LG S G FTPH+ITV GEDV+ KI+SFSQ GPR++C+LS
Sbjct: 143 TNSFNRTKYHHQVENLGEWAPCSVGGNFTPHIITVNTGEDVTMKIISFSQQGPRSICVLS 202
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
ANG IS+VTLRQ +SGGT+TYEGRFEILSLSGSF+ ++S G RSRTGG+SVSL+ PDGR
Sbjct: 203 ANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPNDSGGTRSRTGGMSVSLASPDGR 262
Query: 245 VLGGSVAGLLTAATPVQVVVGSFLA----DGRKESKSSHRM 281
V+GG +AGLL AA+PVQVVVGSFLA +K K+ H
Sbjct: 263 VVGGGLAGLLVAASPVQVVVGSFLAGTDHQDQKPKKNKHDF 303
>gi|297800288|ref|XP_002868028.1| hypothetical protein ARALYDRAFT_914905 [Arabidopsis lyrata subsp.
lyrata]
gi|297313864|gb|EFH44287.1| hypothetical protein ARALYDRAFT_914905 [Arabidopsis lyrata subsp.
lyrata]
Length = 404
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 124/235 (52%), Positives = 165/235 (70%), Gaps = 14/235 (5%)
Query: 64 MNMGSGSEPMKRKRGRPRKYGPDG---TMSLALVPS---PSSVTTATGGTGSGLSSPGGG 117
M G + +K+KRGRPRKY DG ++L L P+ P++ + GG G + G
Sbjct: 86 MRFGIDHQQVKKKRGRPRKYAADGGGSNIALGLAPTSPLPTASNSYGGGNEGGGTGGDSG 145
Query: 118 PLSPDS----IKKSRGRPPGSGSGKKHQLEALG-SAGVGFTPHVITVKAGEDVSSKIMSF 172
+ +S K++RGRPPGSG K QL+ALG + GVGFTPHVI VK GED+++K+M+F
Sbjct: 146 GANANSSDPPAKRNRGRPPGSG---KKQLDALGGTGGVGFTPHVIEVKTGEDIATKVMAF 202
Query: 173 SQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTG 232
+ GPRA+CILSA GA++NV LRQA G V YEGRFEI+SLSGSFL SES+G ++TG
Sbjct: 203 TNQGPRAICILSATGAVTNVKLRQATNPSGIVKYEGRFEIISLSGSFLNSESNGTVTKTG 262
Query: 233 GLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVP 287
LSVSL+G DG ++GGSVAG+L A + VQV+VGSF+ DGRK+ +S+ R ++ P P
Sbjct: 263 NLSVSLAGQDGGIVGGSVAGMLVAGSQVQVIVGSFVPDGRKQKQSAGRAQNTPEP 317
>gi|224067876|ref|XP_002302577.1| predicted protein [Populus trichocarpa]
gi|222844303|gb|EEE81850.1| predicted protein [Populus trichocarpa]
Length = 328
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 129/208 (62%), Positives = 151/208 (72%), Gaps = 23/208 (11%)
Query: 71 EPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGR 130
+P KRKRGRPRKYGPDG +SLAL PS S T S P S K+ RGR
Sbjct: 75 QPEKRKRGRPRKYGPDGAVSLALSPSLS--------THPETSIP--------SQKRGRGR 118
Query: 131 PPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSA 185
PPG+G +K QL +LG SAG+GFTPH+IT+ GED+++KIMSFSQ GPRA+CILSA
Sbjct: 119 PPGTG--RKQQLASLGEWLSGSAGMGFTPHIITIAVGEDIATKIMSFSQQGPRAICILSA 176
Query: 186 NGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRV 245
NGA+S VTL Q +TSGGTVTYEGRFEIL LSGS+L S G R+RTGGLSVSL+ PDG V
Sbjct: 177 NGAVSTVTLHQPSTSGGTVTYEGRFEILCLSGSYLFSNDGGSRNRTGGLSVSLASPDGCV 236
Query: 246 LGGSVAGLLTAATPVQVVVGSFLADGRK 273
+GG V G+L AA+PVQV+ GSFL G K
Sbjct: 237 IGGGVGGVLIAASPVQVIAGSFLWGGSK 264
>gi|297794575|ref|XP_002865172.1| hypothetical protein ARALYDRAFT_494313 [Arabidopsis lyrata subsp.
lyrata]
gi|297311007|gb|EFH41431.1| hypothetical protein ARALYDRAFT_494313 [Arabidopsis lyrata subsp.
lyrata]
Length = 391
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 120/214 (56%), Positives = 155/214 (72%), Gaps = 11/214 (5%)
Query: 73 MKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSI-------K 125
+K+KRGRPRKY PDG+++L L P+ ++ A+ G G G + K
Sbjct: 104 VKKKRGRPRKYTPDGSIALGLAPTSPLLSAASNSYGGGDGGVGDSGGGGGNGNSADPPAK 163
Query: 126 KSRGRPPGSGSGKKHQLEALG-SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++RGRPPGS K QL+ALG +AGVGFTPHVI VK GED++SK+M+FS+ GPR +CILS
Sbjct: 164 RNRGRPPGS---SKKQLDALGGTAGVGFTPHVIEVKTGEDIASKVMAFSEQGPRTICILS 220
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
A+GA+ VTLRQA+ S G VTYEGRFEI++LSGSFL E +G +R+G LSVSL+GPDGR
Sbjct: 221 ASGAVGRVTLRQASHSSGIVTYEGRFEIITLSGSFLNYEVNGSTNRSGNLSVSLAGPDGR 280
Query: 245 VLGGSVAGLLTAATPVQVVVGSFLADGRKESKSS 278
++GGSV G L AAT VQV+VGSF+A+ +K SS
Sbjct: 281 IVGGSVVGPLVAATQVQVIVGSFVAEAKKPKPSS 314
>gi|449461555|ref|XP_004148507.1| PREDICTED: uncharacterized protein LOC101205370 [Cucumis sativus]
gi|449522829|ref|XP_004168428.1| PREDICTED: uncharacterized LOC101205370 [Cucumis sativus]
Length = 363
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 118/215 (54%), Positives = 155/215 (72%), Gaps = 14/215 (6%)
Query: 64 MNMGSGSEPMKRKRGRPRKYGPD-GTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPD 122
N+ SG K++RGRPRKY PD ++L L P+P+ ++ G + P S
Sbjct: 82 FNVDSG----KKRRGRPRKYAPDANNIALGLAPTPTVASSLPHGDLTAT------PDSEQ 131
Query: 123 SIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCI 182
+K+RGRPPGSG K Q ++GS G GFTPHV+ K GEDV++KI+SFSQ GPR V I
Sbjct: 132 PARKTRGRPPGSG---KKQSNSIGSGGTGFTPHVLLAKPGEDVAAKILSFSQQGPRTVFI 188
Query: 183 LSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPD 242
LSANG +SN TLR +A+SGG+V+YEG ++I+SLSGSFLLSE++G RSRTGGLSV L+G +
Sbjct: 189 LSANGTLSNATLRHSASSGGSVSYEGHYDIISLSGSFLLSENNGTRSRTGGLSVLLAGSN 248
Query: 243 GRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKS 277
G+VLGG VAG+L A++ VQV+VGSFL D +K + S
Sbjct: 249 GQVLGGGVAGMLMASSQVQVIVGSFLEDDKKSNTS 283
>gi|356504535|ref|XP_003521051.1| PREDICTED: uncharacterized protein LOC100783475 [Glycine max]
Length = 340
Score = 218 bits (556), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 126/202 (62%), Positives = 153/202 (75%), Gaps = 23/202 (11%)
Query: 71 EPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGR 130
E +KRKRGRPRKYG DG +SLAL P+P+S G G K+ RGR
Sbjct: 84 ETVKRKRGRPRKYGSDGAVSLALTPTPAS---HPGALAQGQ-------------KRGRGR 127
Query: 131 PPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSA 185
PPGSG KK QL +LG SAG+GFTPH+IT+ GED+++KIMSFSQ GPRA+CILSA
Sbjct: 128 PPGSG--KKQQLASLGELMSGSAGMGFTPHIITIAVGEDIATKIMSFSQQGPRAICILSA 185
Query: 186 NGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRV 245
NGA+S VTLRQ +TSGGTVTYEGRFEI+ LSGS+L+++S G R+RTGGLSVSL+ PDGRV
Sbjct: 186 NGAVSTVTLRQPSTSGGTVTYEGRFEIVCLSGSYLVADSGGSRNRTGGLSVSLASPDGRV 245
Query: 246 LGGSVAGLLTAATPVQVVVGSF 267
+GG V G+L AA+PVQV++GSF
Sbjct: 246 VGGGVGGVLIAASPVQVILGSF 267
>gi|449451944|ref|XP_004143720.1| PREDICTED: uncharacterized protein LOC101211908 [Cucumis sativus]
gi|449488677|ref|XP_004158140.1| PREDICTED: uncharacterized LOC101211908 [Cucumis sativus]
Length = 333
Score = 218 bits (554), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 125/224 (55%), Positives = 153/224 (68%), Gaps = 24/224 (10%)
Query: 74 KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPG 133
K+KRGRPRKYGPDGT++ L P P S + G +G G S +SIKKSR
Sbjct: 57 KKKRGRPRKYGPDGTVAPTLSPMPISSSIPLAGEFAGWKRGRG--RSVESIKKSR----- 109
Query: 134 SGSGKKHQLEALGS-----AGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGA 188
K + E G+ AG FTPHVITV GEDV+ K+MSFSQ G RA+CILSANG
Sbjct: 110 -----KFEYEIPGNKVAFFAGADFTPHVITVNIGEDVNLKVMSFSQQGSRAICILSANGM 164
Query: 189 ISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGG 248
+SNVTLRQ+ +SGGT+TYEGRFEILSLSGS++ SE G +SR+GG+SVSL+GPDGRV+GG
Sbjct: 165 VSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSEIGGTKSRSGGMSVSLAGPDGRVMGG 224
Query: 249 SVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVPPKLAP 292
+AG+L AA PVQVVVGSFL G H+ E+ P ++ P
Sbjct: 225 GLAGMLIAAGPVQVVVGSFLPPG-------HQQENKPRKSRMEP 261
>gi|226530805|ref|NP_001151895.1| DNA binding protein [Zea mays]
gi|195650693|gb|ACG44814.1| DNA binding protein [Zea mays]
Length = 388
Score = 217 bits (552), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 135/263 (51%), Positives = 172/263 (65%), Gaps = 31/263 (11%)
Query: 65 NMGSG---SEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATG-GTGSGLSSPGGGPLS 120
+ GSG E +K+KRGRPRKY PDG ++L L P+ SS ++ G G+ +++PG G S
Sbjct: 109 DQGSGPGQDEQVKKKRGRPRKYKPDGAVTLGLSPTSSSTPHSSSSGMGTMVNTPGSGFGS 168
Query: 121 PD---------SIKKSRGRPPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVS 166
S K+ RGRPPGSG K QL +LG S G GFTPHVI ++ GEDV+
Sbjct: 169 GGSGGSGSGAPSEKRGRGRPPGSG--KMQQLASLGKWFLGSVGTGFTPHVIIIQPGEDVA 226
Query: 167 SKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSG 226
++IM+FSQ GPRAVCI+SA GAIS TL Q + SGG VTYEGRFEIL LSGS+L+ E G
Sbjct: 227 ARIMAFSQQGPRAVCIISATGAISTATLHQDSDSGGVVTYEGRFEILCLSGSYLVVEDGG 286
Query: 227 QRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLP- 285
RSR+GGL ++L GPD RV+GGSV G+LTAA VQV+VGSF+ G K++K ++ P
Sbjct: 287 TRSRSGGLCIALCGPDHRVIGGSVGGVLTAAGTVQVIVGSFMYGGSKKNKVKAEVDMEPE 346
Query: 286 ----------VPPKLAPGGQPAG 298
VPP ++ GG AG
Sbjct: 347 EVALAEHSGMVPPAMSGGGWEAG 369
>gi|356520420|ref|XP_003528860.1| PREDICTED: uncharacterized protein LOC100799791 [Glycine max]
Length = 340
Score = 216 bits (551), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 124/202 (61%), Positives = 152/202 (75%), Gaps = 23/202 (11%)
Query: 71 EPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGR 130
E +KRKRGRPRKYGPDG +SLAL P+P+S G G K+ RGR
Sbjct: 84 ETVKRKRGRPRKYGPDGAVSLALTPTPAS---HPGALAQGQ-------------KRGRGR 127
Query: 131 PPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSA 185
PPGSG KK QL +LG SAG+GFTPH+IT+ GED+++KIM+FSQ GPRA+CILSA
Sbjct: 128 PPGSG--KKQQLASLGELMSGSAGMGFTPHIITIAVGEDIATKIMAFSQQGPRAICILSA 185
Query: 186 NGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRV 245
NGA+S VTLRQ +TSGGTVTYEGRFEI+ LSGS+L+++S G R+RT LSVSL+ PDGRV
Sbjct: 186 NGAVSTVTLRQPSTSGGTVTYEGRFEIVCLSGSYLVADSGGTRNRTVALSVSLASPDGRV 245
Query: 246 LGGSVAGLLTAATPVQVVVGSF 267
+GG V G+L AA+PVQV++GSF
Sbjct: 246 IGGGVGGVLIAASPVQVILGSF 267
>gi|115449881|ref|NP_001048574.1| Os02g0824300 [Oryza sativa Japonica Group]
gi|48716318|dbj|BAD22931.1| putative AT-hook protein 1 [Oryza sativa Japonica Group]
gi|48717090|dbj|BAD22863.1| putative AT-hook protein 1 [Oryza sativa Japonica Group]
gi|113538105|dbj|BAF10488.1| Os02g0824300 [Oryza sativa Japonica Group]
gi|125541688|gb|EAY88083.1| hypothetical protein OsI_09514 [Oryza sativa Indica Group]
gi|125584210|gb|EAZ25141.1| hypothetical protein OsJ_08940 [Oryza sativa Japonica Group]
Length = 394
Score = 216 bits (551), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 127/237 (53%), Positives = 164/237 (69%), Gaps = 20/237 (8%)
Query: 65 NMGSGS---EPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSP 121
+ GSG+ EP+K+KRGRPRKY PDG ++L L PS S+ ++T G+ +++PG G S
Sbjct: 107 DQGSGAGQDEPVKKKRGRPRKYKPDGAVTLGLSPSSSTPHSSTSAMGTMVTTPGSGFGSG 166
Query: 122 D----------SIKKSRGRPPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVS 166
+ K+ RGRPPGSG K QL +LG S G GFTPHVI + GEDV+
Sbjct: 167 AGSGGSGSGALTEKRGRGRPPGSG--KMQQLASLGKWFLGSVGTGFTPHVIIISPGEDVA 224
Query: 167 SKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSG 226
++IMSFSQ GPRAVCI+SA GA+S TL Q + SGG VTYEGRFEIL LSGS+L+ E G
Sbjct: 225 ARIMSFSQQGPRAVCIISATGAVSTATLHQDSNSGGVVTYEGRFEILCLSGSYLVIEEGG 284
Query: 227 QRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMES 283
R+R+GGL ++L GPD RV+GGSV G+LTAA VQV+VGSF+ G K++K+ E+
Sbjct: 285 SRTRSGGLCIALCGPDHRVIGGSVGGVLTAAGTVQVIVGSFMYGGTKKNKAKAEQET 341
>gi|194700836|gb|ACF84502.1| unknown [Zea mays]
gi|194701606|gb|ACF84887.1| unknown [Zea mays]
gi|223975655|gb|ACN32015.1| unknown [Zea mays]
gi|413939549|gb|AFW74100.1| DNA binding protein [Zea mays]
Length = 388
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 134/263 (50%), Positives = 173/263 (65%), Gaps = 31/263 (11%)
Query: 65 NMGSG---SEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATG-GTGSGLSSPGGGPLS 120
+ GSG E +K+KRGRPRKY PDG+++L L P+ SS ++ G G+ +++PG G S
Sbjct: 109 DQGSGPGQDEQVKKKRGRPRKYKPDGSVTLGLSPTSSSTPHSSSSGMGTMVNTPGSGFGS 168
Query: 121 PD---------SIKKSRGRPPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVS 166
S K+ RGRPPGSG K QL +LG S G GFTPHVI ++ GEDV+
Sbjct: 169 GGSGGSGSGAPSEKRGRGRPPGSG--KMQQLASLGKWFLGSVGTGFTPHVIIIQPGEDVA 226
Query: 167 SKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSG 226
++IM+FSQ GPRAVCI+SA GAIS TL Q + SGG VTYEGRFEIL LSGS+L+ E G
Sbjct: 227 ARIMAFSQQGPRAVCIISATGAISTATLHQDSDSGGVVTYEGRFEILCLSGSYLVVEDGG 286
Query: 227 QRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLP- 285
R+R+GGL ++L GPD RV+GGSV G+LTAA VQV+VGSF+ G K++K ++ P
Sbjct: 287 TRTRSGGLCIALCGPDHRVIGGSVGGVLTAAGTVQVIVGSFMYGGSKKNKVKAEVDMEPE 346
Query: 286 ----------VPPKLAPGGQPAG 298
VPP ++ GG AG
Sbjct: 347 EVAPAEHSGMVPPAMSGGGWEAG 369
>gi|18414996|ref|NP_567546.1| AT hook motif DNA-binding family protein [Arabidopsis thaliana]
gi|15451060|gb|AAK96801.1| putative protein [Arabidopsis thaliana]
gi|20148333|gb|AAM10057.1| putative protein [Arabidopsis thaliana]
gi|119657370|tpd|FAA00284.1| TPA: AT-hook motif nuclear localized protein 13 [Arabidopsis
thaliana]
gi|332658571|gb|AEE83971.1| AT hook motif DNA-binding family protein [Arabidopsis thaliana]
Length = 439
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 125/241 (51%), Positives = 162/241 (67%), Gaps = 20/241 (8%)
Query: 64 MNMGSGSEPMKRKRGRPRKYGPD--------GTMSLAL-----VPSPSSVTTATGGTGSG 110
M G + +K+KRGRPRKY D ++L L +PS S+ G G
Sbjct: 121 MRFGIDHQQVKKKRGRPRKYAADGGGGGGGGSNIALGLAPTSPLPSASNSYGGGNEGGGG 180
Query: 111 LSSPGGGPLSPDS-IKKSRGRPPGSGSGKKHQLEALG-SAGVGFTPHVITVKAGEDVSSK 168
S G S D K++RGRPPGSG K QL+ALG + GVGFTPHVI VK GED+++K
Sbjct: 181 GDSAGANANSSDPPAKRNRGRPPGSG---KKQLDALGGTGGVGFTPHVIEVKTGEDIATK 237
Query: 169 IMSFSQNGPRAVCILSANGAISNVTLRQAATSG--GTVTYEGRFEILSLSGSFLLSESSG 226
I++F+ GPRA+CILSA GA++NV LRQA S GTV YEGRFEI+SLSGSFL SES+G
Sbjct: 238 ILAFTNQGPRAICILSATGAVTNVMLRQANNSNPTGTVKYEGRFEIISLSGSFLNSESNG 297
Query: 227 QRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPV 286
++TG LSVSL+G +GR++GG V G+L A + VQV+VGSF+ DGRK+ +S+ R ++ P
Sbjct: 298 TVTKTGNLSVSLAGHEGRIVGGCVDGMLVAGSQVQVIVGSFVPDGRKQKQSAGRAQNTPE 357
Query: 287 P 287
P
Sbjct: 358 P 358
>gi|2916772|emb|CAA11837.1| AT-hook protein 2 [Arabidopsis thaliana]
Length = 439
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 125/241 (51%), Positives = 162/241 (67%), Gaps = 20/241 (8%)
Query: 64 MNMGSGSEPMKRKRGRPRKYGPD--------GTMSLAL-----VPSPSSVTTATGGTGSG 110
M G + +K+KRGRPRKY D ++L L +PS S+ G G
Sbjct: 121 MRFGIDHQQVKKKRGRPRKYAADGGGGGGGGSNIALGLAPTSPLPSASNSYGGGNEGGGG 180
Query: 111 LSSPGGGPLSPDS-IKKSRGRPPGSGSGKKHQLEALG-SAGVGFTPHVITVKAGEDVSSK 168
S G S D K++RGRPPGSG K QL+ALG + GVGFTPHVI VK GED+++K
Sbjct: 181 GDSAGANANSSDPPAKRNRGRPPGSG---KKQLDALGGTGGVGFTPHVIEVKTGEDIATK 237
Query: 169 IMSFSQNGPRAVCILSANGAISNVTLRQAATSG--GTVTYEGRFEILSLSGSFLLSESSG 226
I++F+ GPRA+CILSA GA++NV LRQA S GTV YEGRFEI+SLSGSFL SES+G
Sbjct: 238 ILAFTNQGPRAICILSATGAVTNVMLRQANNSNPTGTVKYEGRFEIISLSGSFLNSESNG 297
Query: 227 QRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPV 286
++TG LSVSL+G +GR++GG V G+L A + VQV+VGSF+ DGRK+ +S+ R ++ P
Sbjct: 298 TVTKTGNLSVSLAGHEGRIVGGCVDGMLVAGSQVQVIVGSFVPDGRKQKQSAGRAQNTPE 357
Query: 287 P 287
P
Sbjct: 358 P 358
>gi|224130232|ref|XP_002320785.1| predicted protein [Populus trichocarpa]
gi|222861558|gb|EEE99100.1| predicted protein [Populus trichocarpa]
Length = 336
Score = 214 bits (546), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 138/251 (54%), Positives = 170/251 (67%), Gaps = 38/251 (15%)
Query: 71 EPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGR 130
EP+KRKRGRPRKYGPDG +SLA +++ T G +P S K+ RGR
Sbjct: 82 EPVKRKRGRPRKYGPDGAVSLA--------LSSSLSTHPGTITP--------SQKRGRGR 125
Query: 131 PPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSA 185
PPG+G +K QL +LG SAG+GFTPH+IT+ GED+++KIMSFSQ GPRAVCILSA
Sbjct: 126 PPGTG--RKQQLASLGEWLSGSAGMGFTPHIITIAVGEDIATKIMSFSQQGPRAVCILSA 183
Query: 186 NGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRV 245
NGA+S VTLRQ +TSGGTVTYEGRFEIL LSGS+LL+ G R+R+GGLSVSL+ PDGRV
Sbjct: 184 NGAVSTVTLRQPSTSGGTVTYEGRFEILCLSGSYLLTNDGGSRNRSGGLSVSLASPDGRV 243
Query: 246 LGGSVAGLLTAATPVQVVVGSFLADGRKESK------------SSHRMESLPVPPKLAPG 293
+GG V G+L AA+PVQV+VGSFL G ++K S H+ PV P
Sbjct: 244 IGGGVGGVLIAASPVQVIVGSFLWGGGSKTKNKKVEGPEGARDSDHQTVENPVTPTSV-- 301
Query: 294 GQPAGQCSPPS 304
QP+ +P S
Sbjct: 302 -QPSQNLTPTS 311
>gi|255645533|gb|ACU23261.1| unknown [Glycine max]
Length = 340
Score = 214 bits (544), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 126/208 (60%), Positives = 152/208 (73%), Gaps = 23/208 (11%)
Query: 71 EPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGR 130
E +KRKRGRPRKYG DG +SLAL P+P+S G G K+ RGR
Sbjct: 84 ETVKRKRGRPRKYGSDGAVSLALTPTPAS---HPGALAQGQ-------------KRGRGR 127
Query: 131 PPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSA 185
PPGSG KK QL +LG SAG+GFTPH+IT+ GED+++KIMSFSQ GPRA+CILSA
Sbjct: 128 PPGSG--KKQQLASLGELMSGSAGMGFTPHIITIAVGEDIATKIMSFSQRGPRAICILSA 185
Query: 186 NGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRV 245
NGA+S VTLRQ +TSGGTV YEG FEI+ LSGS L+++S G R+RTGGLSVSL+ PDGRV
Sbjct: 186 NGAVSTVTLRQPSTSGGTVAYEGCFEIVCLSGSHLVADSGGSRNRTGGLSVSLASPDGRV 245
Query: 246 LGGSVAGLLTAATPVQVVVGSFLADGRK 273
+GG V G+L AA+PVQV++GSF D K
Sbjct: 246 VGGGVGGVLIAASPVQVILGSFSWDASK 273
>gi|297742667|emb|CBI34816.3| unnamed protein product [Vitis vinifera]
Length = 261
Score = 213 bits (543), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 130/211 (61%), Positives = 155/211 (73%), Gaps = 18/211 (8%)
Query: 72 PMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSI-KKSRGR 130
P K+KRGRPRKYGPDGT+++AL P P S SS G P+ S+ K+ + R
Sbjct: 3 PAKKKRGRPRKYGPDGTVTMALSPKPIS------------SSAPGPPVIDFSVEKRGKIR 50
Query: 131 PPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSA 185
P GS S K +LE LG S G FTPH+ITV +GEDV+ KI+SFSQ GPRA+CILSA
Sbjct: 51 PVGSASKSKMELENLGEWVACSVGANFTPHIITVNSGEDVTMKIISFSQQGPRAICILSA 110
Query: 186 NGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRV 245
NG IS+VTLRQ +SGGT+TYEGRFEILSLSGSF+ S+S G RSR+GG+SVSL+ PDGRV
Sbjct: 111 NGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPSDSGGTRSRSGGMSVSLASPDGRV 170
Query: 246 LGGSVAGLLTAATPVQVVVGSFLADGRKESK 276
+GG VAGLL AA+PVQVVVGSFL + E K
Sbjct: 171 VGGGVAGLLVAASPVQVVVGSFLTGNQHEQK 201
>gi|356561759|ref|XP_003549146.1| PREDICTED: uncharacterized protein LOC100803208 [Glycine max]
Length = 348
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 119/203 (58%), Positives = 143/203 (70%), Gaps = 6/203 (2%)
Query: 66 MGSGSEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIK 125
M SGS K+KRGRPRKYGPDG +AL P P S + G S G PL +SIK
Sbjct: 61 MNSGSTEGKKKRGRPRKYGPDG--KVALSPMPISASIPFTGDFSAWKRGRGKPL--ESIK 116
Query: 126 KSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSA 185
K+ G+G + S G FTPH++TV GEDV+ KIMSFSQ G RA+CILSA
Sbjct: 117 KTFKFYEAGGAGSGDGIAY--SVGANFTPHILTVNDGEDVTMKIMSFSQQGYRAICILSA 174
Query: 186 NGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRV 245
NG ISNVTLRQ +SGGT+TYEGRFEILSLSGS++ +E+ +SR+GG+S+SL+GPDGRV
Sbjct: 175 NGTISNVTLRQPTSSGGTLTYEGRFEILSLSGSYITTENGLTKSRSGGMSISLAGPDGRV 234
Query: 246 LGGSVAGLLTAATPVQVVVGSFL 268
+GG +AGLL AA PVQVVV SFL
Sbjct: 235 MGGGLAGLLVAAGPVQVVVASFL 257
>gi|147794107|emb|CAN62363.1| hypothetical protein VITISV_031923 [Vitis vinifera]
Length = 457
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 129/209 (61%), Positives = 154/209 (73%), Gaps = 18/209 (8%)
Query: 74 KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKS-RGRPP 132
K+KRGRPRKYGPDGT+++AL P P S SS G P+ S++K + RP
Sbjct: 74 KKKRGRPRKYGPDGTVTMALSPKPIS------------SSAPGPPVIDFSVEKRGKIRPV 121
Query: 133 GSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANG 187
GS S K +LE LG S G FTPH+ITV +GEDV+ KI+SFSQ GPRA+CILSANG
Sbjct: 122 GSASKSKMELENLGEWVACSVGANFTPHIITVNSGEDVTMKIISFSQQGPRAICILSANG 181
Query: 188 AISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLG 247
IS+VTLRQ +SGGT+TYEGRFEILSLSGSF+ S+S G RSR+GG+SVSL+ PDGRV+G
Sbjct: 182 VISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPSDSGGTRSRSGGMSVSLASPDGRVVG 241
Query: 248 GSVAGLLTAATPVQVVVGSFLADGRKESK 276
G VAGLL AA+PVQVVVGSFL + E K
Sbjct: 242 GGVAGLLVAASPVQVVVGSFLTGNQHEQK 270
>gi|225426649|ref|XP_002274756.1| PREDICTED: uncharacterized protein LOC100244375 [Vitis vinifera]
Length = 346
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 129/210 (61%), Positives = 152/210 (72%), Gaps = 20/210 (9%)
Query: 74 KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRG--RP 131
K+KRGRPRKYGPDGT+++AL P P +SS GP D + RG RP
Sbjct: 74 KKKRGRPRKYGPDGTVTMALSPKP-------------ISSSAPGPPVIDFSVEKRGKIRP 120
Query: 132 PGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSAN 186
GS S K +LE LG S G FTPH+ITV +GEDV+ KI+SFSQ GPRA+CILSAN
Sbjct: 121 VGSASKSKMELENLGEWVACSVGANFTPHIITVNSGEDVTMKIISFSQQGPRAICILSAN 180
Query: 187 GAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVL 246
G IS+VTLRQ +SGGT+TYEGRFEILSLSGSF+ S+S G RSR+GG+SVSL+ PDGRV+
Sbjct: 181 GVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPSDSGGTRSRSGGMSVSLASPDGRVV 240
Query: 247 GGSVAGLLTAATPVQVVVGSFLADGRKESK 276
GG VAGLL AA+PVQVVVGSFL + E K
Sbjct: 241 GGGVAGLLVAASPVQVVVGSFLTGNQHEQK 270
>gi|297803590|ref|XP_002869679.1| hypothetical protein ARALYDRAFT_914048 [Arabidopsis lyrata subsp.
lyrata]
gi|297315515|gb|EFH45938.1| hypothetical protein ARALYDRAFT_914048 [Arabidopsis lyrata subsp.
lyrata]
Length = 404
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 125/245 (51%), Positives = 159/245 (64%), Gaps = 38/245 (15%)
Query: 70 SEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRG 129
SE +K+KRGRPRKY PDGT+++ L P P S + PL+ + + RG
Sbjct: 79 SEQLKKKRGRPRKYNPDGTLAVTLSPMPISSSV---------------PLTSEFPPRKRG 123
Query: 130 RPPGSGSG--KKHQLEALGS-------AGVG--------FTPHVITVKAGEDVSSKIMSF 172
R G + KK Q+ AGVG FTPHV+ V AGEDV+ KIM+F
Sbjct: 124 RGRGKSNRWLKKSQMFQFDRSPVDTNLAGVGTADFVGANFTPHVLIVNAGEDVTMKIMTF 183
Query: 173 SQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTG 232
SQ G RA+CILSANG ISNVTLRQ+ TSGGT+TYEGRFEILSL+GSF+ ++S G RSR G
Sbjct: 184 SQQGSRAICILSANGPISNVTLRQSMTSGGTLTYEGRFEILSLTGSFMQNDSGGTRSRAG 243
Query: 233 GLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVPPKLAP 292
G+SV L+GPDGRV GG +AGL AA PVQV+VG+F+A G+++S+ +E L +L
Sbjct: 244 GMSVCLAGPDGRVFGGGLAGLFLAAGPVQVMVGTFIA-GQEQSQ----LE-LARERRLRF 297
Query: 293 GGQPA 297
G QP+
Sbjct: 298 GAQPS 302
>gi|4586113|emb|CAB40949.1| putative DNA-binding protein [Arabidopsis thaliana]
gi|7267909|emb|CAB78251.1| putative DNA-binding protein [Arabidopsis thaliana]
Length = 365
Score = 211 bits (536), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 128/234 (54%), Positives = 156/234 (66%), Gaps = 30/234 (12%)
Query: 73 MKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPD-SIKKSRGRP 131
MK+KRGRPRKYGPDGT+ +AL P P S A S L P + S K+S+ +P
Sbjct: 84 MKKKRGRPRKYGPDGTV-VALSPKPISSAPAP----SHLPPPSSHVIDFSASEKRSKVKP 138
Query: 132 PGSGSGKK--HQLEALG-----SAGVGFTPHVITVKAGE-------------DVSSKIMS 171
S + K HQ+E LG S G FTPH+ITV GE DV+ KI+S
Sbjct: 139 TNSFNRTKYHHQVENLGEWAPCSVGGNFTPHIITVNTGEVISSEFFFRSRHQDVTMKIIS 198
Query: 172 FSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRT 231
FSQ GPR++C+LSANG IS+VTLRQ +SGGT+TYEGRFEILSLSGSF+ ++S G RSRT
Sbjct: 199 FSQQGPRSICVLSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPNDSGGTRSRT 258
Query: 232 GGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLA----DGRKESKSSHRM 281
GG+SVSL+ PDGRV+GG +AGLL AA+PVQVVVGSFLA +K K+ H
Sbjct: 259 GGMSVSLASPDGRVVGGGLAGLLVAASPVQVVVGSFLAGTDHQDQKPKKNKHDF 312
>gi|15235023|ref|NP_194262.1| AT hook motif DNA-binding family protein [Arabidopsis thaliana]
gi|4454020|emb|CAA23073.1| putative protein [Arabidopsis thaliana]
gi|7269383|emb|CAB81343.1| putative protein [Arabidopsis thaliana]
gi|20466213|gb|AAM20424.1| putative protein [Arabidopsis thaliana]
gi|28059577|gb|AAO30071.1| putative protein [Arabidopsis thaliana]
gi|119657350|tpd|FAA00274.1| TPA: AT-hook motif nuclear localized protein 3 [Arabidopsis
thaliana]
gi|332659641|gb|AEE85041.1| AT hook motif DNA-binding family protein [Arabidopsis thaliana]
Length = 404
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 124/246 (50%), Positives = 155/246 (63%), Gaps = 34/246 (13%)
Query: 67 GSGSEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKK 126
+ +E +K+KRGRPRKY PDGT+ + L P P S S P P K+
Sbjct: 79 NTSAEQLKKKRGRPRKYNPDGTLVVTLSPMPISS-----------SVPLTSEFPPR--KR 125
Query: 127 SRGRPPGSGSGKKHQLEALGS-------AGVG--------FTPHVITVKAGEDVSSKIMS 171
RGR + KK Q+ AGVG FTPHV+ V AGEDV+ KIM+
Sbjct: 126 GRGRGKSNRWLKKSQMFQFDRSPVDTNLAGVGTADFVGANFTPHVLIVNAGEDVTMKIMT 185
Query: 172 FSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRT 231
FSQ G RA+CILSANG ISNVTLRQ+ TSGGT+TYEGRFEILSL+GSF+ ++S G RSR
Sbjct: 186 FSQQGSRAICILSANGPISNVTLRQSMTSGGTLTYEGRFEILSLTGSFMQNDSGGTRSRA 245
Query: 232 GGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVPPKLA 291
GG+SV L+GPDGRV GG +AGL AA PVQV+VG+F+A G+++S+ L +L
Sbjct: 246 GGMSVCLAGPDGRVFGGGLAGLFLAAGPVQVMVGTFIA-GQEQSQL-----ELAKERRLR 299
Query: 292 PGGQPA 297
G QP+
Sbjct: 300 FGAQPS 305
>gi|356535317|ref|XP_003536193.1| PREDICTED: uncharacterized protein LOC100776862 isoform 2 [Glycine
max]
Length = 330
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 123/215 (57%), Positives = 147/215 (68%), Gaps = 16/215 (7%)
Query: 74 KRKRGRPRKYGPDGTMSL----ALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRG 129
K+KRGRPRKYGPDG +L AL P P S + G S G P+ +SIKKS
Sbjct: 46 KKKRGRPRKYGPDGKPALGAVTALSPMPISSSIPLTGEFSAWKRGRGRPV--ESIKKSSF 103
Query: 130 R----PPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSA 185
+ PG G G + S G FTPHV+TV AGEDV+ KIMSFSQ G RA+CILSA
Sbjct: 104 KFEVESPGPGEGIAY------SVGANFTPHVLTVNAGEDVTMKIMSFSQQGSRAICILSA 157
Query: 186 NGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRV 245
G ISNVTLRQ ++ GGT+TYEGRFEILSLSGSF+ +E+ RSR+GG+SVSL+GPDGRV
Sbjct: 158 TGTISNVTLRQPSSCGGTLTYEGRFEILSLSGSFMPTENGVTRSRSGGMSVSLAGPDGRV 217
Query: 246 LGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHR 280
+GG +AGLL AA PVQVVV SFL + E K+ +
Sbjct: 218 MGGGLAGLLVAAGPVQVVVASFLPGHQLEHKTKKQ 252
>gi|356563280|ref|XP_003549892.1| PREDICTED: uncharacterized protein LOC100794202 [Glycine max]
Length = 331
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 129/233 (55%), Positives = 158/233 (67%), Gaps = 27/233 (11%)
Query: 72 PMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSP----DSIKKS 127
P K+KRGRPRKY PDG++++AL P P S PL P S K+
Sbjct: 68 PAKKKRGRPRKYAPDGSVTMALSPKPIS---------------SSAPLPPVIDFSSEKRG 112
Query: 128 RGRPPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCI 182
+ +P S S K +LE LG S G FTPH+ITV +GEDV+ K++SFSQ GPRA+CI
Sbjct: 113 KIKPTSSVSKAKFELENLGEWVACSVGANFTPHIITVNSGEDVTMKVISFSQQGPRAICI 172
Query: 183 LSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPD 242
LSANG IS+VTLRQ +SGGT+TYEGRFEILSLSGSF+ SES G RSR+GG+SVSL+ PD
Sbjct: 173 LSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPSESGGTRSRSGGMSVSLASPD 232
Query: 243 GRVLGGSVAGLLTAATPVQVVVGSFLADGRKES---KSSHRMESLPVPPKLAP 292
GRV+GG VAGLL AA+PVQVVVGSFLA + E K H + + +P + P
Sbjct: 233 GRVVGGGVAGLLVAASPVQVVVGSFLAGNQHEQKPRKQRHEVITSVIPAAVVP 285
>gi|223943273|gb|ACN25720.1| unknown [Zea mays]
Length = 306
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 128/242 (52%), Positives = 165/242 (68%), Gaps = 24/242 (9%)
Query: 65 NMGSG---SEPMKRKRGRPRKYGPDGTMSLALVPSPSSVT--TATGGTGSGLSSPGGGPL 119
+GSG E +K+KRGRPRKY PDG ++L L PS SS+T +A+ G G+ +S+PG G
Sbjct: 16 ELGSGPAQDEQVKKKRGRPRKYKPDGAVTLGLSPS-SSLTPHSASLGMGTMISAPGSGFG 74
Query: 120 SPD---------SIKKSRGRPPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDV 165
S S K+ RGRPPGSG K QL +LG S G GFTPHVI ++ GEDV
Sbjct: 75 SEGSGASGLGAPSEKRGRGRPPGSG--KMQQLASLGKWFLGSVGTGFTPHVIIIQPGEDV 132
Query: 166 SSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL-LSES 224
+++IM+FSQ GPRAVCI+SA GA+S TL Q + SG VTYEGRFEIL LSGS+L + E
Sbjct: 133 AARIMAFSQQGPRAVCIISATGAVSAATLHQDSESGSVVTYEGRFEILCLSGSYLVVDEG 192
Query: 225 SGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFL-ADGRKESKSSHRMES 283
G R+R+GGL ++L GPD RV+GGSV G+L AA VQV+VGSF+ G K++K +++
Sbjct: 193 GGARTRSGGLCIALCGPDNRVIGGSVGGVLMAAGAVQVIVGSFMYGGGSKKNKVKAELDA 252
Query: 284 LP 285
P
Sbjct: 253 EP 254
>gi|356514170|ref|XP_003525779.1| PREDICTED: uncharacterized protein LOC100801730 [Glycine max]
Length = 327
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 126/222 (56%), Positives = 153/222 (68%), Gaps = 24/222 (10%)
Query: 64 MNMGSGSEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSP-- 121
M + P K+KRGRPRKY PDG++++AL P P S PL P
Sbjct: 60 MEAYPATMPAKKKRGRPRKYAPDGSVTMALSPKPIS---------------SSAPLPPVI 104
Query: 122 --DSIKKSRGRPPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQ 174
S K+ + +P S S K +LE LG S G FTPH+ITV +GEDV+ K++SFSQ
Sbjct: 105 DFSSEKRGKIKPASSVSKAKFELENLGEWVACSVGANFTPHIITVNSGEDVTMKVISFSQ 164
Query: 175 NGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGL 234
GPRA+CILSANG IS+VTLRQ +SGGT+TYEGRFEILSLSGSF+ +ES G RSR+GG+
Sbjct: 165 QGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPNESGGTRSRSGGM 224
Query: 235 SVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESK 276
SVSL+ PDGRV+GG VAGLL AA+PVQVVVGSFLA + E K
Sbjct: 225 SVSLASPDGRVVGGGVAGLLVAASPVQVVVGSFLAGNQHEQK 266
>gi|212275808|ref|NP_001130578.1| uncharacterized protein LOC100191677 [Zea mays]
gi|194689534|gb|ACF78851.1| unknown [Zea mays]
gi|413923988|gb|AFW63920.1| DNA binding protein [Zea mays]
Length = 400
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 128/242 (52%), Positives = 165/242 (68%), Gaps = 24/242 (9%)
Query: 65 NMGSG---SEPMKRKRGRPRKYGPDGTMSLALVPSPSSVT--TATGGTGSGLSSPGGGPL 119
+GSG E +K+KRGRPRKY PDG ++L L PS SS+T +A+ G G+ +S+PG G
Sbjct: 110 ELGSGPAQDEQVKKKRGRPRKYKPDGAVTLGLSPS-SSLTPHSASLGMGTMISAPGSGFG 168
Query: 120 SPD---------SIKKSRGRPPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDV 165
S S K+ RGRPPGSG K QL +LG S G GFTPHVI ++ GEDV
Sbjct: 169 SEGSGASGLGAPSEKRGRGRPPGSG--KMQQLASLGKWFLGSVGTGFTPHVIIIQPGEDV 226
Query: 166 SSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL-LSES 224
+++IM+FSQ GPRAVCI+SA GA+S TL Q + SG VTYEGRFEIL LSGS+L + E
Sbjct: 227 AARIMAFSQQGPRAVCIISATGAVSAATLHQDSESGSVVTYEGRFEILCLSGSYLVVDEG 286
Query: 225 SGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFL-ADGRKESKSSHRMES 283
G R+R+GGL ++L GPD RV+GGSV G+L AA VQV+VGSF+ G K++K +++
Sbjct: 287 GGARTRSGGLCIALCGPDNRVIGGSVGGVLMAAGAVQVIVGSFMYGGGSKKNKVKAELDA 346
Query: 284 LP 285
P
Sbjct: 347 EP 348
>gi|356535315|ref|XP_003536192.1| PREDICTED: uncharacterized protein LOC100776862 isoform 1 [Glycine
max]
Length = 324
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 121/211 (57%), Positives = 144/211 (68%), Gaps = 14/211 (6%)
Query: 74 KRKRGRPRKYGPDGTMSL----ALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRG 129
K+KRGRPRKYGPDG +L AL P P S + G S G P+ +SIKKS
Sbjct: 46 KKKRGRPRKYGPDGKPALGAVTALSPMPISSSIPLTGEFSAWKRGRGRPV--ESIKKSSF 103
Query: 130 RPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAI 189
+ G G S G FTPHV+TV AGEDV+ KIMSFSQ G RA+CILSA G I
Sbjct: 104 KFLGEGIAY--------SVGANFTPHVLTVNAGEDVTMKIMSFSQQGSRAICILSATGTI 155
Query: 190 SNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGS 249
SNVTLRQ ++ GGT+TYEGRFEILSLSGSF+ +E+ RSR+GG+SVSL+GPDGRV+GG
Sbjct: 156 SNVTLRQPSSCGGTLTYEGRFEILSLSGSFMPTENGVTRSRSGGMSVSLAGPDGRVMGGG 215
Query: 250 VAGLLTAATPVQVVVGSFLADGRKESKSSHR 280
+AGLL AA PVQVVV SFL + E K+ +
Sbjct: 216 LAGLLVAAGPVQVVVASFLPGHQLEHKTKKQ 246
>gi|356532097|ref|XP_003534610.1| PREDICTED: uncharacterized protein LOC100791563 [Glycine max]
Length = 337
Score = 208 bits (530), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 116/204 (56%), Positives = 142/204 (69%), Gaps = 17/204 (8%)
Query: 74 KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPG 133
K+KRGRPRKYGPDG S+AL P P S S+P S K RG+P
Sbjct: 61 KKKRGRPRKYGPDGLNSMALSPIPISS-----------SAPFANEFSS---GKQRGKPRA 106
Query: 134 SGSG--KKHQLEALG-SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAIS 190
KK ++ G S G F PH+ITV GED++ K++SFSQ GPRA+CILSA+G IS
Sbjct: 107 MEYKLPKKVGVDLFGDSVGTNFMPHIITVNTGEDITMKVISFSQQGPRAICILSASGVIS 166
Query: 191 NVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSV 250
NVTLRQ +SGGT+TYEGRFEILSLSGSF+ +++ G RSR+GG+SVSLS PDGR++GG V
Sbjct: 167 NVTLRQPDSSGGTLTYEGRFEILSLSGSFMPTDNQGTRSRSGGMSVSLSSPDGRIVGGGV 226
Query: 251 AGLLTAATPVQVVVGSFLADGRKE 274
AGLL AA PVQVVVGSFL + ++
Sbjct: 227 AGLLVAAGPVQVVVGSFLPNNPQD 250
>gi|224053919|ref|XP_002298038.1| predicted protein [Populus trichocarpa]
gi|222845296|gb|EEE82843.1| predicted protein [Populus trichocarpa]
Length = 343
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 115/211 (54%), Positives = 151/211 (71%), Gaps = 18/211 (8%)
Query: 71 EPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGR 130
EP+K+KRGRPRKYG G +SL L P P+ ++G S + K++RGR
Sbjct: 90 EPVKKKRGRPRKYGLVGQVSLGLSPLPNKPKPSSGEDSS-------------TSKRNRGR 136
Query: 131 PPGSGSGKKHQLEALG-SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAI 189
PPGSG +K QL LG SAGV F+PHVI+++ GED+ SK++SFSQ PRAVCILS G +
Sbjct: 137 PPGSG--RKQQLATLGNSAGVAFSPHVISIEVGEDIVSKLLSFSQQRPRAVCILSGTGTV 194
Query: 190 SNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGS 249
S+VTLRQ A+SG ++TYEGRFEIL LSGS+L++E G R+RTGG+S SLS PDG V+GG+
Sbjct: 195 SSVTLRQPASSGSSITYEGRFEILCLSGSYLVAEDGGPRNRTGGISASLSSPDGHVIGGA 254
Query: 250 VAGLLTAATPVQVVVGSFLAD-GRKESKSSH 279
+A +L AA+PVQVV SF+ +K+ + SH
Sbjct: 255 IA-MLIAASPVQVVACSFVYGVSKKDKQVSH 284
>gi|255537127|ref|XP_002509630.1| DNA binding protein, putative [Ricinus communis]
gi|223549529|gb|EEF51017.1| DNA binding protein, putative [Ricinus communis]
Length = 322
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 127/209 (60%), Positives = 154/209 (73%), Gaps = 18/209 (8%)
Query: 73 MKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPP 132
+K+KR RPRKYGPDGT++ AL P P ++TA +P P+ S +K R P
Sbjct: 68 IKKKRERPRKYGPDGTVTKALSPKP--ISTA---------APAPPPVIDFSAEKQRKIKP 116
Query: 133 GSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANG 187
S + K++LE LG S G FTPH+ITV AGEDV+ KI+SFSQ GPRA+CILSANG
Sbjct: 117 VSKT--KYELENLGEWVACSVGANFTPHIITVNAGEDVTMKIISFSQQGPRAICILSANG 174
Query: 188 AISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLG 247
IS+VTLRQ +SGGT+TYEGRFEILSLSGSF+ +ES G RSR+GG+SVSL+ PDGRV+G
Sbjct: 175 VISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPTESGGTRSRSGGMSVSLASPDGRVVG 234
Query: 248 GSVAGLLTAATPVQVVVGSFLADGRKESK 276
G VAGLL AA+PVQVVVGSFLA + E K
Sbjct: 235 GGVAGLLVAASPVQVVVGSFLAGNQHEQK 263
>gi|297828307|ref|XP_002882036.1| DNA-binding family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327875|gb|EFH58295.1| DNA-binding family protein [Arabidopsis lyrata subsp. lyrata]
Length = 340
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 121/241 (50%), Positives = 157/241 (65%), Gaps = 28/241 (11%)
Query: 49 AGGDGAIPQAQGLNVM--NMGSGSEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGG 106
AGG GA+P G+N++ PMKRKRGRPRKYG DG +SLAL SP S T
Sbjct: 64 AGGAGALPHHIGVNMIAPPPPPSETPMKRKRGRPRKYGQDGPVSLALSSSPVSTITPN-- 121
Query: 107 TGSGLSSPGGGPLSPDSIKKSRGRPPGSGSGKKHQLEALG-----SAGVGFTPHVITVKA 161
+S K+ RGRPPGSG KK ++ ++G S+G+ FTPHVI V
Sbjct: 122 ---------------NSNKRGRGRPPGSG--KKQRMASIGELMPSSSGMSFTPHVIAVSI 164
Query: 162 GEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLL 221
GED++SK++SFSQ GPRA+C+LSA+GA+S TL Q + G + YEGRFEIL+LS S+L+
Sbjct: 165 GEDIASKVISFSQQGPRAICVLSASGAVSTATLLQPSAPGA-IKYEGRFEILALSTSYLV 223
Query: 222 SESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRM 281
+ R+RTG LSVSL+ PDGRV+GG++ G L AA+PVQV++GSF+ K KS R
Sbjct: 224 ATDGSFRNRTGNLSVSLASPDGRVIGGAIGGPLIAASPVQVIIGSFIWAAPK-IKSKKRE 282
Query: 282 E 282
E
Sbjct: 283 E 283
>gi|147801443|emb|CAN77019.1| hypothetical protein VITISV_039795 [Vitis vinifera]
Length = 1029
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 139/251 (55%), Positives = 169/251 (67%), Gaps = 16/251 (6%)
Query: 70 SEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRG 129
S MK+KRGRPRKYGP G++++AL P P S + G S G P+ DS KK +
Sbjct: 753 SSEMKKKRGRPRKYGPGGSLTMALSPMPISSSIPLTGEFSAWKRGRGRPV--DSFKK-QH 809
Query: 130 RPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAI 189
+ +G++ S G FTPHVITV AGEDV+ KI+SFSQ G RA+CILSANGAI
Sbjct: 810 KSESESAGER----VAYSVGANFTPHVITVNAGEDVTMKIISFSQQGSRAICILSANGAI 865
Query: 190 SNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGS 249
SNVTLRQ +SGGT+TYEGRFEILSLSGSF+ SES G +SR+GG+SVSL+GPDGRVLGG
Sbjct: 866 SNVTLRQPNSSGGTLTYEGRFEILSLSGSFMPSESGGTKSRSGGMSVSLAGPDGRVLGGG 925
Query: 250 VAGLLTAATPVQVVVGSFLADGRKESK-SSHRMESLPVPPKLAPGGQPAGQCSPPSRGTL 308
+AGLL AA PVQV+VGSFL ++E K R+E PV + PA S P TL
Sbjct: 926 LAGLLVAAGPVQVLVGSFLPGHQQEQKPKKQRIE--PVQAAI-----PATVNSMPREETL 978
Query: 309 SESSGGPGSPL 319
++GGP L
Sbjct: 979 G-ANGGPNLNL 988
>gi|224074919|ref|XP_002304491.1| predicted protein [Populus trichocarpa]
gi|222841923|gb|EEE79470.1| predicted protein [Populus trichocarpa]
Length = 346
Score = 207 bits (527), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 118/231 (51%), Positives = 154/231 (66%), Gaps = 26/231 (11%)
Query: 56 PQAQGLNVMNMGSGSE-----PMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSG 110
P A + +NM + SE P+K+KRGRPRKYG DG +SL L P ++G S
Sbjct: 70 PNANFGHGINMAATSEVQVGEPVKKKRGRPRKYGLDGQVSLGLSSFPDKAKPSSGEDSS- 128
Query: 111 LSSPGGGPLSPDSIKKSRGRPPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDV 165
+ K++RGRPPGSG +K QL LG SAG+ F+PHV+++ GED+
Sbjct: 129 ------------TSKRNRGRPPGSG--RKQQLATLGEWMNSSAGLAFSPHVVSIGVGEDI 174
Query: 166 SSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESS 225
SK++SFSQ PRAVCILS G +S+VTLRQ A+SG +TYEGRFEIL LSGS+L++E
Sbjct: 175 VSKLLSFSQQRPRAVCILSGTGTVSSVTLRQPASSGPPITYEGRFEILCLSGSYLIAEDG 234
Query: 226 GQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESK 276
G R+RTGG+S S S PDG V+GG++A +L AA+PVQVVV +FL G K+ K
Sbjct: 235 GPRNRTGGISASFSSPDGHVIGGAIA-MLIAASPVQVVVCTFLYGGSKKDK 284
>gi|294461605|gb|ADE76363.1| unknown [Picea sitchensis]
Length = 395
Score = 207 bits (526), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 132/207 (63%), Positives = 149/207 (71%), Gaps = 24/207 (11%)
Query: 68 SGSEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKS 127
+GSE +KRKRGRPRKYG D G G GLSSP SP S KK
Sbjct: 90 AGSETLKRKRGRPRKYGTD--------------VDGFGNVGLGLSSPS----SPFSDKKG 131
Query: 128 RGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANG 187
RG SGKK Q+ ALG AG GF PHVIT+ AGEDV KIM+F Q+GP AVC+LSANG
Sbjct: 132 RG------SGKKAQMVALGCAGHGFIPHVITIAAGEDVCKKIMAFMQHGPWAVCVLSANG 185
Query: 188 AISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLG 247
AISNVTLRQ A SGGTVTYEGRFEILSLSGSFLL+++ G +RTGGLSVSL+G DGRV+G
Sbjct: 186 AISNVTLRQPAMSGGTVTYEGRFEILSLSGSFLLTDTGGTHTRTGGLSVSLAGSDGRVIG 245
Query: 248 GSVAGLLTAATPVQVVVGSFLADGRKE 274
G V GLL AA+PVQVVVG+FL D +K+
Sbjct: 246 GGVGGLLMAASPVQVVVGTFLVDNKKD 272
>gi|449460854|ref|XP_004148159.1| PREDICTED: uncharacterized protein LOC101217222 [Cucumis sativus]
Length = 350
Score = 207 bits (526), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 111/203 (54%), Positives = 143/203 (70%), Gaps = 5/203 (2%)
Query: 74 KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPG 133
K+KRGRPRKYGPDG SL L SP ++++ TG + +S +KK + R
Sbjct: 70 KKKRGRPRKYGPDGKRSLTLALSPMPISSSIPLTGEFPNWKRDNEISQAIVKKPQ-RFEF 128
Query: 134 SGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVT 193
G++ S G FTPHVITV AGED++ K+MSFSQ RA+CILSANG ISNVT
Sbjct: 129 ENPGQRLAY----SVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVT 184
Query: 194 LRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGL 253
LRQA +SGGT+TYEGRFEIL+L+GS++ +++ +SR GG+SVSL+G DGRV+GG +AGL
Sbjct: 185 LRQATSSGGTLTYEGRFEILALTGSYMPTQNGATKSRCGGMSVSLAGQDGRVVGGGLAGL 244
Query: 254 LTAATPVQVVVGSFLADGRKESK 276
L AA PVQ+VVGSFL ++E K
Sbjct: 245 LVAAGPVQIVVGSFLPGHQQEQK 267
>gi|168002503|ref|XP_001753953.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694929|gb|EDQ81275.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 386
Score = 207 bits (526), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 121/227 (53%), Positives = 156/227 (68%), Gaps = 11/227 (4%)
Query: 54 AIPQAQGLNVMNMGSGSEPMKRKRGRPRKY--GPDGTMSLALVPSPSSVTTATGGTGSGL 111
A+ + G+ + + S E +KRKRGRPRKY G + A +P ++ A SG
Sbjct: 28 ALVMSMGMALGGVSSRGETVKRKRGRPRKYVGNEPGGAASAAGGTPVNMQLALHTPNSG- 86
Query: 112 SSPGGGPLSPDSIKKSRGRPPGSGSGKKHQLEALGSAG----VGFTPHVITVKAGEDVSS 167
P G P +P +K+ RGRP GS S K HQL + SAG FTPH+IT+ AGED+++
Sbjct: 87 --PSGSPFTPTGVKRGRGRPLGS-SRKLHQLVSFPSAGSWAGQNFTPHIITIAAGEDIAA 143
Query: 168 KIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSES-SG 226
KI SF+Q+GPRAVC++SANGAIS LRQ ++SGG VTYEGR+EILSL GSFL +E +
Sbjct: 144 KIYSFAQHGPRAVCVMSANGAISTAILRQQSSSGGNVTYEGRYEILSLMGSFLPTEQGAN 203
Query: 227 QRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRK 273
R RTGGLSVSL+ DGRV+GG VAG+LTAA+P+QVVVGSF+ + K
Sbjct: 204 SRQRTGGLSVSLACSDGRVIGGGVAGVLTAASPIQVVVGSFIFEPEK 250
>gi|242067042|ref|XP_002454810.1| hypothetical protein SORBIDRAFT_04g037880 [Sorghum bicolor]
gi|241934641|gb|EES07786.1| hypothetical protein SORBIDRAFT_04g037880 [Sorghum bicolor]
Length = 401
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 126/239 (52%), Positives = 159/239 (66%), Gaps = 20/239 (8%)
Query: 65 NMGSG---SEPMKRKRGRPRKYGPDGTMSLALVPSPSS----------VTTATGGTGSGL 111
+ GSG E +K+KRGRPRKY PDG ++L L PS SS T G+G G
Sbjct: 111 DQGSGPGQDEQVKKKRGRPRKYKPDGAVTLGLSPSSSSTPHSSSPGMGTMVCTPGSGFGS 170
Query: 112 SSPGGGPLSPDSIKKSRGRPPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVS 166
+ GG S K+ RGRPPGSG K QL +LG S G GFTPHVI ++ GEDV+
Sbjct: 171 GASGGSGSGAPSEKRGRGRPPGSG--KMQQLASLGKWFLGSVGTGFTPHVIIIQPGEDVA 228
Query: 167 SKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSG 226
++IM+FSQ GPRAVCI+SA GA+S TL Q + SGG VTYEGRFEIL LSGS+L+ + G
Sbjct: 229 ARIMAFSQQGPRAVCIISATGAVSTATLHQDSDSGGVVTYEGRFEILCLSGSYLVLDDGG 288
Query: 227 QRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLP 285
R+R+GGL ++L GPD RV+GGSV G+LTAA VQV+VGSF+ G K++K+ + P
Sbjct: 289 TRTRSGGLCIALCGPDHRVIGGSVGGVLTAAGTVQVIVGSFMYGGSKKNKAKAEADIEP 347
>gi|356574795|ref|XP_003555530.1| PREDICTED: uncharacterized protein LOC100789179 [Glycine max]
Length = 330
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 123/220 (55%), Positives = 149/220 (67%), Gaps = 17/220 (7%)
Query: 74 KRKRGRPRKYGPDGTMSL----ALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRG 129
K+KRGRPRKYGPDG +L AL P P S + G S S G P+ +SIKKS
Sbjct: 46 KKKRGRPRKYGPDGKPALGAVTALSPMPISSSIPLTGEFSAWKSGRGRPV--ESIKKSSF 103
Query: 130 R----PPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSA 185
+ PG G + S G FTPHV+TV AGEDV+ KIM+FSQ G RA+CILSA
Sbjct: 104 KFEVESPGPVEGIAY------SVGANFTPHVLTVNAGEDVTMKIMTFSQQGSRAICILSA 157
Query: 186 NGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRV 245
G ISNVTLRQ ++ GGT+TYEG FEILSLSGSF+ +E+ RSR+GG+SVSL+GPDGRV
Sbjct: 158 TGTISNVTLRQPSSCGGTLTYEGLFEILSLSGSFMPTENGVTRSRSGGMSVSLAGPDGRV 217
Query: 246 LGGSVAGLLTAATPVQVVVGSFLADGRKESKS-SHRMESL 284
+GG +AGLL AA PVQVVV SFL + E K+ R+E +
Sbjct: 218 MGGGLAGLLVAAGPVQVVVASFLPGHQLEHKTKKQRVEHV 257
>gi|255541324|ref|XP_002511726.1| DNA binding protein, putative [Ricinus communis]
gi|223548906|gb|EEF50395.1| DNA binding protein, putative [Ricinus communis]
Length = 324
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 129/220 (58%), Positives = 156/220 (70%), Gaps = 9/220 (4%)
Query: 74 KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPG 133
K+KRGRPRKYGPDGT++ AL P P S + GG SS G + +K + + G
Sbjct: 55 KKKRGRPRKYGPDGTVARALSPMPISSSAPPGGD---FSSGKPGKVWSGGFEKKKYKKMG 111
Query: 134 -SGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNV 192
SG A GS G FTPHVITV AGEDV+ K++SFSQ GPRA+CILSANG ISNV
Sbjct: 112 MENSGD----WASGSVGTNFTPHVITVNAGEDVTMKVISFSQQGPRAICILSANGVISNV 167
Query: 193 TLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAG 252
TLRQ +SGGT+TYEGRFEILSLSGSF+ +ES G RSR+GG+SVSL+ PDGRV+GG VAG
Sbjct: 168 TLRQPDSSGGTLTYEGRFEILSLSGSFMPTESQGTRSRSGGMSVSLASPDGRVVGGGVAG 227
Query: 253 LLTAATPVQVVVGSFLADGRKESKSSHRMESLPVPPKLAP 292
LL AA+PVQVVVGSFL G + + +++ PVP + P
Sbjct: 228 LLVAASPVQVVVGSFLP-GNHQDQKPKKIKIDPVPASITP 266
>gi|326516268|dbj|BAJ88157.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 555
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 116/214 (54%), Positives = 153/214 (71%), Gaps = 18/214 (8%)
Query: 70 SEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPD------- 122
E +K+KRGRPRKY PDG+++L L PSPS+ +++ G G+ +++PG G
Sbjct: 209 DEQVKKKRGRPRKYKPDGSVTLGLSPSPSTPHSSSPGMGTMVTTPGSGFGQGTGSGGSGS 268
Query: 123 ---SIKKSRGRPPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQ 174
+ K+ RGRPPGSG + QL +LG S G GFTPHVI + AGEDV+++IMSFSQ
Sbjct: 269 GALTEKRGRGRPPGSG--RMQQLASLGKWFLGSVGTGFTPHVIIISAGEDVAARIMSFSQ 326
Query: 175 NGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGL 234
GPRA+CI+SA GA+S TL Q + SG VTYEGRFEIL LSGS+L+ E G R+R+GGL
Sbjct: 327 QGPRAICIISATGAVSTATLHQDSDSG-VVTYEGRFEILCLSGSYLVLEEGGTRTRSGGL 385
Query: 235 SVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFL 268
++L GPD RV+GG+V+G+LTAA VQV+VGSF+
Sbjct: 386 CIALCGPDHRVIGGTVSGVLTAAGTVQVIVGSFM 419
>gi|359490175|ref|XP_002268693.2| PREDICTED: uncharacterized protein LOC100254941 [Vitis vinifera]
Length = 327
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 129/221 (58%), Positives = 154/221 (69%), Gaps = 22/221 (9%)
Query: 70 SEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRG 129
S MK+KRGRPRKYGP G++++AL P P S + G S K+ RG
Sbjct: 51 SSEMKKKRGRPRKYGPGGSLTMALSPMPISSSIPLTGEFSAW-------------KRGRG 97
Query: 130 RPPGSGSGKKHQLEALG-------SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCI 182
RP S K+H+ E+ S G FTPHVITV AGEDV+ KI+SFSQ G RA+CI
Sbjct: 98 RPVDSFK-KQHKSESESAGERVAYSVGANFTPHVITVNAGEDVTMKIISFSQQGSRAICI 156
Query: 183 LSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPD 242
LSANGAISNVTLRQ +SGGT+TYEGRFEILSLSGSF+ SES G +SR+GG+SVSL+GPD
Sbjct: 157 LSANGAISNVTLRQPNSSGGTLTYEGRFEILSLSGSFMPSESGGTKSRSGGMSVSLAGPD 216
Query: 243 GRVLGGSVAGLLTAATPVQVVVGSFLADGRKESK-SSHRME 282
GRVLGG +AGLL AA PVQV+VGSFL ++E K R+E
Sbjct: 217 GRVLGGGLAGLLVAAGPVQVLVGSFLPGHQQEQKPKKQRIE 257
>gi|296084126|emb|CBI24514.3| unnamed protein product [Vitis vinifera]
Length = 323
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 140/254 (55%), Positives = 168/254 (66%), Gaps = 30/254 (11%)
Query: 70 SEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRG 129
S MK+KRGRPRKYGP G++++AL P P S + G S K+ RG
Sbjct: 47 SSEMKKKRGRPRKYGPGGSLTMALSPMPISSSIPLTGEFSAW-------------KRGRG 93
Query: 130 RPPGSGSGKKHQLEALG-------SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCI 182
RP S K+H+ E+ S G FTPHVITV AGEDV+ KI+SFSQ G RA+CI
Sbjct: 94 RPVDSFK-KQHKSESESAGERVAYSVGANFTPHVITVNAGEDVTMKIISFSQQGSRAICI 152
Query: 183 LSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPD 242
LSANGAISNVTLRQ +SGGT+TYEGRFEILSLSGSF+ SES G +SR+GG+SVSL+GPD
Sbjct: 153 LSANGAISNVTLRQPNSSGGTLTYEGRFEILSLSGSFMPSESGGTKSRSGGMSVSLAGPD 212
Query: 243 GRVLGGSVAGLLTAATPVQVVVGSFLADGRKESK-SSHRMESLPVPPKLAPGGQPAGQCS 301
GRVLGG +AGLL AA PVQV+VGSFL ++E K R+E PV + PA S
Sbjct: 213 GRVLGGGLAGLLVAAGPVQVLVGSFLPGHQQEQKPKKQRIE--PVQAAI-----PATVNS 265
Query: 302 PPSRGTLSESSGGP 315
P TL ++GGP
Sbjct: 266 MPREETLG-ANGGP 278
>gi|449499695|ref|XP_004160890.1| PREDICTED: uncharacterized LOC101217222 [Cucumis sativus]
Length = 356
Score = 204 bits (519), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 111/204 (54%), Positives = 142/204 (69%), Gaps = 1/204 (0%)
Query: 74 KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPG 133
K+KRGRPRKYGPDG SL L SP ++++ TG + +S +KK +
Sbjct: 70 KKKRGRPRKYGPDGKRSLTLALSPMPISSSIPLTGEFPNWKRDNEISQAIVKKPQRFEFE 129
Query: 134 SGSGKKHQLEALG-SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNV 192
+ G L S G FTPHVITV AGED++ K+MSFSQ RA+CILSANG ISNV
Sbjct: 130 NPVGSNIIGARLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNV 189
Query: 193 TLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAG 252
TLRQA +SGGT+TYEGRFEIL+L+GS++ +++ +SR GG+SVSL+G DGRV+GG +AG
Sbjct: 190 TLRQATSSGGTLTYEGRFEILALTGSYMPTQNGATKSRCGGMSVSLAGQDGRVVGGGLAG 249
Query: 253 LLTAATPVQVVVGSFLADGRKESK 276
LL AA PVQ+VVGSFL ++E K
Sbjct: 250 LLVAAGPVQIVVGSFLPGHQQEQK 273
>gi|224067757|ref|XP_002302537.1| predicted protein [Populus trichocarpa]
gi|222844263|gb|EEE81810.1| predicted protein [Populus trichocarpa]
Length = 247
Score = 204 bits (518), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 110/172 (63%), Positives = 133/172 (77%), Gaps = 13/172 (7%)
Query: 127 SRGRP----PGSGSGKKHQ---LEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQ 174
S G+P PGS KK++ +E LG S G FTPHVITV AGEDV+ K++SFSQ
Sbjct: 15 SAGKPGKVWPGSYEKKKYKKLGMENLGEWAANSVGTNFTPHVITVNAGEDVTMKVISFSQ 74
Query: 175 NGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGL 234
GPRA+CILSANG ISNVTLRQ +SGGT+TYEGRFEILSLSGSF+ +ES G RSR+GG+
Sbjct: 75 QGPRAICILSANGVISNVTLRQPDSSGGTLTYEGRFEILSLSGSFMPTESQGTRSRSGGM 134
Query: 235 SVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSH-RMESLP 285
SVSL+ PDGRV+GGSVAGLL AA+PVQVVVGSFLA ++ K +++S+P
Sbjct: 135 SVSLASPDGRVVGGSVAGLLVAASPVQVVVGSFLAGNHQDQKPKKPKIDSIP 186
>gi|326511427|dbj|BAJ87727.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 366
Score = 204 bits (518), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 121/237 (51%), Positives = 152/237 (64%), Gaps = 26/237 (10%)
Query: 50 GGDGAIPQAQGLNVMNMGSGSEPMKRKRGRPRKYGPDGTMSLALVPSPS-----SVTTAT 104
G D IP A S + ++KRGRPRKY PDG+ L+PSPS ++ T
Sbjct: 66 GPDPHIPHAPCPPATATASQDDLGRKKRGRPRKYKPDGS---GLIPSPSPSPCTAIVPVT 122
Query: 105 GGTGSGLSSPGGGPLSPDSIKKSRGRPPGSGSGKKHQLEALG-----SAGVGFTPHVITV 159
G+G G SS +K RGRPPGSG K QL +LG + G GFTPHVI +
Sbjct: 123 PGSGGGPSS-----------EKRRGRPPGSG--KMQQLASLGKSFLGTVGTGFTPHVIII 169
Query: 160 KAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSF 219
+GEDV+++IMSFSQ GPRAVCI+SA+GA+S TL Q A SG V YEGRFEIL LSGS+
Sbjct: 170 PSGEDVAARIMSFSQQGPRAVCIMSASGAVSTATLHQDAGSGSVVKYEGRFEILCLSGSY 229
Query: 220 LLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESK 276
L+ + R+R GGL ++L G D RV+GGSV G+LTAA VQV+VGSF+ G K+S+
Sbjct: 230 LVIDDGVSRTRNGGLCIALCGADHRVIGGSVGGVLTAAGTVQVIVGSFMYAGSKKSR 286
>gi|326502392|dbj|BAJ95259.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 358
Score = 203 bits (517), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 121/237 (51%), Positives = 152/237 (64%), Gaps = 26/237 (10%)
Query: 50 GGDGAIPQAQGLNVMNMGSGSEPMKRKRGRPRKYGPDGTMSLALVPSPS-----SVTTAT 104
G D IP A S + ++KRGRPRKY PDG+ L+PSPS ++ T
Sbjct: 58 GPDPHIPHAPCPPATATASQDDLGRKKRGRPRKYKPDGS---GLIPSPSPSPCTAIVPVT 114
Query: 105 GGTGSGLSSPGGGPLSPDSIKKSRGRPPGSGSGKKHQLEALG-----SAGVGFTPHVITV 159
G+G G SS +K RGRPPGSG K QL +LG + G GFTPHVI +
Sbjct: 115 PGSGGGPSS-----------EKRRGRPPGSG--KMQQLASLGKSFLGTVGTGFTPHVIII 161
Query: 160 KAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSF 219
+GEDV+++IMSFSQ GPRAVCI+SA+GA+S TL Q A SG V YEGRFEIL LSGS+
Sbjct: 162 PSGEDVAARIMSFSQQGPRAVCIMSASGAVSTATLHQDAGSGSVVKYEGRFEILCLSGSY 221
Query: 220 LLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESK 276
L+ + R+R GGL ++L G D RV+GGSV G+LTAA VQV+VGSF+ G K+S+
Sbjct: 222 LVIDDGVSRTRNGGLCIALCGADHRVIGGSVGGVLTAAGTVQVIVGSFMYAGSKKSR 278
>gi|255537455|ref|XP_002509794.1| DNA binding protein, putative [Ricinus communis]
gi|223549693|gb|EEF51181.1| DNA binding protein, putative [Ricinus communis]
Length = 347
Score = 203 bits (517), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 111/203 (54%), Positives = 144/203 (70%), Gaps = 21/203 (10%)
Query: 71 EPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGR 130
EP+K+KRGRPRKY PDG +SL L P P ++G PLSP K++RGR
Sbjct: 90 EPVKKKRGRPRKYAPDGQVSLGLSPLPVKPKPSSGQD----------PLSP---KRARGR 136
Query: 131 PPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSA 185
PPG+G +K QL LG SAG+ F+PHVI + GED+ +K++SF+Q PRA+CILS
Sbjct: 137 PPGTG--RKQQLALLGEWMNSSAGIAFSPHVIRIGVGEDIVAKVLSFAQQRPRALCILSG 194
Query: 186 NGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRV 245
G +S+VTLRQ A+SG T+T+EGRFEIL LSGS+L++E G R+RTGG+S SLS PDG V
Sbjct: 195 TGTVSSVTLRQPASSGPTLTFEGRFEILCLSGSYLVAEDGGPRNRTGGISASLSSPDGHV 254
Query: 246 LGGSVAGLLTAATPVQVVVGSFL 268
+GG++ G+L AA PVQVV SF+
Sbjct: 255 IGGAI-GMLIAAGPVQVVACSFV 276
>gi|357137691|ref|XP_003570433.1| PREDICTED: uncharacterized protein LOC100843775 [Brachypodium
distachyon]
Length = 450
Score = 203 bits (517), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 121/232 (52%), Positives = 160/232 (68%), Gaps = 21/232 (9%)
Query: 70 SEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPD------- 122
E +K+KRGRPRKY PD ++L L PSPS+ +++ G G+ +++PG G S
Sbjct: 111 DEQVKKKRGRPRKYKPDRAVTLGLSPSPSTPHSSSSGMGAMVTTPGAGFGSGTGSGGSGS 170
Query: 123 ---SIKKSRGRPPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQ 174
+ K+ RGRPPGSG K QL +LG S G GFTPHVI + AGEDV+++IMSFSQ
Sbjct: 171 GALTEKRGRGRPPGSG--KMQQLASLGTWFLGSVGTGFTPHVIIISAGEDVAARIMSFSQ 228
Query: 175 NGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGL 234
GPRA+CI+SA GA+S TL Q + SG VTYEGRFEIL LSGS+L+ + G R R+GGL
Sbjct: 229 QGPRAICIISATGAVSTATLYQDSDSG-AVTYEGRFEILCLSGSYLVLDEGGTRKRSGGL 287
Query: 235 SVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADG---RKESKSSHRMES 283
++L GPD RV+GGSV+G+LTAA VQV+VGSF+ G + ++K+ ME+
Sbjct: 288 CIALCGPDHRVIGGSVSGVLTAAGTVQVIVGSFMYGGGSKKSKAKAEQDMEN 339
>gi|359489416|ref|XP_002273440.2| PREDICTED: uncharacterized protein LOC100262627 [Vitis vinifera]
Length = 328
Score = 203 bits (517), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 126/204 (61%), Positives = 147/204 (72%), Gaps = 20/204 (9%)
Query: 74 KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPG 133
K+KRGRPRKY PDG S+ L P P S S+P G S S K+ RGRP G
Sbjct: 49 KKKRGRPRKYQPDGMASMTLSPMPISS-----------SAPLSGNFS--SGKRGRGRPVG 95
Query: 134 SGSGKKHQL--EALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSAN 186
S S +K ++ E G S GV FTPH+ITV AGEDV+ K++SFSQ GPRAVCILSAN
Sbjct: 96 SESKQKQKVGSENSGNWSAISDGVNFTPHIITVNAGEDVTMKLISFSQQGPRAVCILSAN 155
Query: 187 GAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVL 246
G ISNVTLRQ +SGGT+TYEGRFEILSL+GSF+ +ES G R+R GG+SVSL+ PDGRV+
Sbjct: 156 GVISNVTLRQQDSSGGTLTYEGRFEILSLTGSFVPTESGGTRNRAGGMSVSLASPDGRVV 215
Query: 247 GGSVAGLLTAATPVQVVVGSFLAD 270
GG VAGLL AA+PV VVVGSFL D
Sbjct: 216 GGGVAGLLIAASPVLVVVGSFLPD 239
>gi|125537896|gb|EAY84291.1| hypothetical protein OsI_05670 [Oryza sativa Indica Group]
Length = 388
Score = 203 bits (517), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 124/252 (49%), Positives = 153/252 (60%), Gaps = 45/252 (17%)
Query: 74 KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDS-----IKKSR 128
KRKRGRPRKYGPDG++ L +P S + GGG +P + +K+ R
Sbjct: 67 KRKRGRPRKYGPDGSLLRPLKATPISASVP--------DDSGGGQYTPAAAVGAVMKRGR 118
Query: 129 GRPPGSGSGKK----------------------HQLEALG--------SAGVGFTPHVIT 158
GRP G S H LG ++G FTPH+I
Sbjct: 119 GRPVGFVSRASPVSVAVTAATSTAAVVVSSPATHTQTPLGPLGELVACASGANFTPHIIN 178
Query: 159 VKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGS 218
V AGEDV+ K++SFSQ GPRA+CILSANG ISNVTLRQ T GGTVTYEGRFE+LSLSGS
Sbjct: 179 VAAGEDVNMKVISFSQQGPRAICILSANGVISNVTLRQQDTLGGTVTYEGRFELLSLSGS 238
Query: 219 FLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSS 278
F ++S G RSR+GG+SVSL+ DGRV+GG VAGLL AA+PVQVVVGSFL + + ++
Sbjct: 239 FTPTDSGGTRSRSGGMSVSLAATDGRVIGGGVAGLLVAASPVQVVVGSFLPSYQLDQNAT 298
Query: 279 HR--MESLPVPP 288
+ +E VPP
Sbjct: 299 KKPVIEITTVPP 310
>gi|115443929|ref|NP_001045744.1| Os02g0125200 [Oryza sativa Japonica Group]
gi|41053039|dbj|BAD07970.1| putative AT-hook DNA-binding protein [Oryza sativa Japonica Group]
gi|113535275|dbj|BAF07658.1| Os02g0125200 [Oryza sativa Japonica Group]
Length = 388
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 124/252 (49%), Positives = 153/252 (60%), Gaps = 45/252 (17%)
Query: 74 KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDS-----IKKSR 128
KRKRGRPRKYGPDG++ L +P S + GGG +P + +K+ R
Sbjct: 67 KRKRGRPRKYGPDGSLLRPLKATPISASVP--------DDSGGGQYTPAAAVGAVMKRGR 118
Query: 129 GRPPGSGSGKK----------------------HQLEALG--------SAGVGFTPHVIT 158
GRP G S H LG ++G FTPH+I
Sbjct: 119 GRPVGFVSRASPVSVAVTAATSTAAVVVSSPATHTQTPLGPLGELVACASGANFTPHIIN 178
Query: 159 VKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGS 218
V AGEDV+ K++SFSQ GPRA+CILSANG ISNVTLRQ T GGTVTYEGRFE+LSLSGS
Sbjct: 179 VAAGEDVNMKVISFSQQGPRAICILSANGVISNVTLRQQDTLGGTVTYEGRFELLSLSGS 238
Query: 219 FLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSS 278
F ++S G RSR+GG+SVSL+ DGRV+GG VAGLL AA+PVQVVVGSFL + + ++
Sbjct: 239 FTPTDSGGTRSRSGGMSVSLAATDGRVIGGGVAGLLVAASPVQVVVGSFLPSYQLDQNAT 298
Query: 279 HR--MESLPVPP 288
+ +E VPP
Sbjct: 299 KKPVIEITTVPP 310
>gi|297809519|ref|XP_002872643.1| DNA-binding family protein [Arabidopsis lyrata subsp. lyrata]
gi|297318480|gb|EFH48902.1| DNA-binding family protein [Arabidopsis lyrata subsp. lyrata]
Length = 353
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 124/205 (60%), Positives = 151/205 (73%), Gaps = 13/205 (6%)
Query: 73 MKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLS-PDSIKKSRGRP 131
+K+KRGRPRKYGPDGT+ +AL P P S A S L P + S K+S+ +P
Sbjct: 86 IKKKRGRPRKYGPDGTV-VALSPKPISSAPAP----SHLPPPSSNVIDFSASEKRSKMKP 140
Query: 132 PGSGSGKK--HQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
+ + K HQ+E LG S G FTPHVITV AGEDV+ KI+SFSQ GPR++C+LS
Sbjct: 141 TNTFNRTKYHHQVENLGEWAPCSVGGNFTPHVITVNAGEDVTMKIISFSQQGPRSICVLS 200
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
ANG IS+VTLRQ +SGGT+TYEGRFEILSLSGSF+ ++S G RSRTGG+SVSL+ PDGR
Sbjct: 201 ANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPNDSGGTRSRTGGMSVSLASPDGR 260
Query: 245 VLGGSVAGLLTAATPVQVVVGSFLA 269
V+GG + GLL AA+PVQVVVGSFLA
Sbjct: 261 VVGGGLGGLLVAASPVQVVVGSFLA 285
>gi|449459666|ref|XP_004147567.1| PREDICTED: uncharacterized protein LOC101210208 [Cucumis sativus]
gi|449523579|ref|XP_004168801.1| PREDICTED: uncharacterized LOC101210208 [Cucumis sativus]
Length = 330
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 126/229 (55%), Positives = 160/229 (69%), Gaps = 19/229 (8%)
Query: 70 SEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRG 129
S P K+KRGRPRKYGPDG++S+AL P P S++ + + KK +
Sbjct: 70 SVPGKKKRGRPRKYGPDGSVSMALSPKPISLSVPPPV------------IDFSTEKKGKV 117
Query: 130 RPPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
RP + S K +++ LG S G FTPH+ITV AGEDV+ KI+SFSQ GPRA+CILS
Sbjct: 118 RPASAVSKSKFEVDNLGDWVPCSLGANFTPHIITVNAGEDVTMKIISFSQQGPRAICILS 177
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
ANG IS+VTLRQ +SGGT+TYEGRFEILSLSGSF+ S++ RSR+GG+SVSL+ PDGR
Sbjct: 178 ANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPSDNGATRSRSGGMSVSLASPDGR 237
Query: 245 VLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSH-RMESL-PVPPKLA 291
V+GG VAGLL AA+PVQVVVGSFL+ + E K + +++ P PP A
Sbjct: 238 VVGGGVAGLLVAASPVQVVVGSFLSGNQHEQKPKKPKHDTISPAPPTAA 286
>gi|296089154|emb|CBI38857.3| unnamed protein product [Vitis vinifera]
Length = 334
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 126/204 (61%), Positives = 147/204 (72%), Gaps = 20/204 (9%)
Query: 74 KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPG 133
K+KRGRPRKY PDG S+ L P P S S+P G S S K+ RGRP G
Sbjct: 55 KKKRGRPRKYQPDGMASMTLSPMPISS-----------SAPLSGNFS--SGKRGRGRPVG 101
Query: 134 SGSGKKHQL--EALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSAN 186
S S +K ++ E G S GV FTPH+ITV AGEDV+ K++SFSQ GPRAVCILSAN
Sbjct: 102 SESKQKQKVGSENSGNWSAISDGVNFTPHIITVNAGEDVTMKLISFSQQGPRAVCILSAN 161
Query: 187 GAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVL 246
G ISNVTLRQ +SGGT+TYEGRFEILSL+GSF+ +ES G R+R GG+SVSL+ PDGRV+
Sbjct: 162 GVISNVTLRQQDSSGGTLTYEGRFEILSLTGSFVPTESGGTRNRAGGMSVSLASPDGRVV 221
Query: 247 GGSVAGLLTAATPVQVVVGSFLAD 270
GG VAGLL AA+PV VVVGSFL D
Sbjct: 222 GGGVAGLLIAASPVLVVVGSFLPD 245
>gi|195620754|gb|ACG32207.1| DNA binding protein [Zea mays]
Length = 400
Score = 202 bits (515), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 124/242 (51%), Positives = 157/242 (64%), Gaps = 24/242 (9%)
Query: 65 NMGSG---SEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGL---------- 111
+GSG E +K+KRGRPRKY PDG ++L L PS SS+T + G G
Sbjct: 110 ELGSGPAQDEQVKKKRGRPRKYKPDGAVTLGLSPS-SSLTPHSASLGMGTMVSAPGSGFG 168
Query: 112 -SSPGGGPLSPDSIKKSRGRPPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDV 165
G L S K+ RGRPPGSG K QL +LG S G GFTPHVI ++ GEDV
Sbjct: 169 SGGSGASGLGAPSEKRGRGRPPGSG--KMQQLASLGKWFLGSVGTGFTPHVIIIQPGEDV 226
Query: 166 SSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL-LSES 224
+++IM+FSQ GPRAVCI+SA GA+S TL Q + SG VTYEGRFEIL LSGS+L + E
Sbjct: 227 AARIMAFSQQGPRAVCIISATGAVSAATLHQDSESGSVVTYEGRFEILCLSGSYLVVDEG 286
Query: 225 SGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFL-ADGRKESKSSHRMES 283
G R+R+GGL ++L GPD RV+GGSV G+L AA VQV+VGSF+ G K++K +++
Sbjct: 287 GGARTRSGGLCIALCGPDNRVIGGSVGGVLMAAGAVQVIVGSFMYGGGSKKNKVKAELDA 346
Query: 284 LP 285
P
Sbjct: 347 GP 348
>gi|356533463|ref|XP_003535283.1| PREDICTED: uncharacterized protein LOC100812673 [Glycine max]
Length = 396
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 111/219 (50%), Positives = 144/219 (65%), Gaps = 20/219 (9%)
Query: 74 KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPG 133
K+KRGRPRKY DG + ++ P+P T +G LS+P S K+ RG+
Sbjct: 96 KKKRGRPRKYDADGNLRVSARPTP------TPPSGFTLSTPS----EYSSSKRERGKHYN 145
Query: 134 SGSGKKHQLEALGSAGVG----------FTPHVITVKAGEDVSSKIMSFSQNGPRAVCIL 183
+ + L S+ +G F HV+ GEDV+ KI+SF+Q GPR +CIL
Sbjct: 146 TTFANNSYQQQLYSSSLGDVFAITAAGDFVAHVLNAYTGEDVAGKILSFAQKGPRGICIL 205
Query: 184 SANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDG 243
SANGAISNVT+RQ +SGG +TYEGRFEILSLSGSF + ++SG +SRTGGLSVSL+GPDG
Sbjct: 206 SANGAISNVTIRQPGSSGGILTYEGRFEILSLSGSFTVVDNSGMKSRTGGLSVSLAGPDG 265
Query: 244 RVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRME 282
RV+GG VAGLLTAA P+Q+VVGSF+ + K K ++ E
Sbjct: 266 RVIGGGVAGLLTAAGPIQIVVGSFMQNCCKTQKRKYQRE 304
>gi|222622088|gb|EEE56220.1| hypothetical protein OsJ_05202 [Oryza sativa Japonica Group]
Length = 388
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 124/252 (49%), Positives = 153/252 (60%), Gaps = 45/252 (17%)
Query: 74 KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDS-----IKKSR 128
KRKRGRPRKYGPDG++ L +P S + GGG +P + +K+ R
Sbjct: 67 KRKRGRPRKYGPDGSLLRPLKATPISASVP--------DDSGGGQYTPAAAVGAVMKRGR 118
Query: 129 GRPPGSGSGKK----------------------HQLEALG--------SAGVGFTPHVIT 158
GRP G S H LG ++G FTPH+I
Sbjct: 119 GRPVGFVSRASPVSVAVTAATSTAAVVVSSPATHTQTPLGPLGELVACASGANFTPHIIN 178
Query: 159 VKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGS 218
V AGEDV+ K++SFSQ GPRA+CILSANG ISNVTLRQ T GGTVTYEGRFE+LSLSGS
Sbjct: 179 VAAGEDVNMKVISFSQQGPRAICILSANGVISNVTLRQQDTLGGTVTYEGRFELLSLSGS 238
Query: 219 FLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSS 278
F ++S G RSR+GG+SVSL+ DGRV+GG VAGLL AA+PVQVVVGSFL + + ++
Sbjct: 239 FTPTDSGGTRSRSGGMSVSLAATDGRVIGGGVAGLLVAASPVQVVVGSFLPSYQLDQNAT 298
Query: 279 HR--MESLPVPP 288
+ +E VPP
Sbjct: 299 KKPVIEITTVPP 310
>gi|449522149|ref|XP_004168090.1| PREDICTED: uncharacterized LOC101212918 [Cucumis sativus]
Length = 369
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 121/200 (60%), Positives = 143/200 (71%), Gaps = 16/200 (8%)
Query: 74 KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPG 133
K+KRGRPRKYGPDGT+++AL P P S + G G ++ G G L K
Sbjct: 84 KKKRGRPRKYGPDGTVTMALSPLPLSSSAPAAG-GFSITKRGKGRLGGSEFKHH------ 136
Query: 134 SGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGA 188
KK +E +G + G F PH+ITV AGEDV+ KI+SFSQ GPRA+CILSANG
Sbjct: 137 ----KKMGMEYIGEWNACAVGTNFMPHIITVNAGEDVTMKIISFSQQGPRAICILSANGV 192
Query: 189 ISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGG 248
ISNVTLRQ +SGGT+TYEGRFEILSLSGSF+ +E+ G RSRTGG+SVSL+ PDGRV+GG
Sbjct: 193 ISNVTLRQPDSSGGTLTYEGRFEILSLSGSFMPTENQGTRSRTGGMSVSLASPDGRVVGG 252
Query: 249 SVAGLLTAATPVQVVVGSFL 268
VAGLL AA PVQVVVGSFL
Sbjct: 253 GVAGLLIAAGPVQVVVGSFL 272
>gi|449432243|ref|XP_004133909.1| PREDICTED: uncharacterized protein LOC101212918 [Cucumis sativus]
Length = 348
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 124/202 (61%), Positives = 146/202 (72%), Gaps = 20/202 (9%)
Query: 74 KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPG 133
K+KRGRPRKYGPDGT+++AL P P S S+P G S K+ +GR G
Sbjct: 63 KKKRGRPRKYGPDGTVTMALSPLPLSS-----------SAPAAGGFS--ITKRGKGRLGG 109
Query: 134 S--GSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSAN 186
S KK +E +G + G F PH+ITV AGEDV+ KI+SFSQ GPRA+CILSAN
Sbjct: 110 SEFKHHKKMGMEYIGEWNACAVGTNFMPHIITVNAGEDVTMKIISFSQQGPRAICILSAN 169
Query: 187 GAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVL 246
G ISNVTLRQ +SGGT+TYEGRFEILSLSGSF+ +E+ G RSRTGG+SVSL+ PDGRV+
Sbjct: 170 GVISNVTLRQPDSSGGTLTYEGRFEILSLSGSFMPTENQGTRSRTGGMSVSLASPDGRVV 229
Query: 247 GGSVAGLLTAATPVQVVVGSFL 268
GG VAGLL AA PVQVVVGSFL
Sbjct: 230 GGGVAGLLIAAGPVQVVVGSFL 251
>gi|168045748|ref|XP_001775338.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673283|gb|EDQ59808.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 449
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 125/249 (50%), Positives = 151/249 (60%), Gaps = 53/249 (21%)
Query: 69 GSEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSR 128
G +P KRKRGRPRK+ G +S + S V A L P +P K+ R
Sbjct: 109 GEQPPKRKRGRPRKFATGGELSSGALGSVYPVLPA-------LMPASSSPYTPSPEKRGR 161
Query: 129 GRPPGSGSGKKHQLEALGSA----GVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
GRPPGSG KK QL ALG G GFTPH++TV GEDVS++IM F+Q+GPRA+C+LS
Sbjct: 162 GRPPGSG--KKQQLAALGVVLAGTGQGFTPHILTVSTGEDVSTRIMQFAQHGPRAMCVLS 219
Query: 185 ANGAISNVTLRQAATSGGTVTYE------------------------------------- 207
ANGAISNVTLRQ ++SGGTVTYE
Sbjct: 220 ANGAISNVTLRQQSSSGGTVTYEVNVPSDYIEDCYDMLQHWFSAFINMWFTFYIVNTCTV 279
Query: 208 --GRFEILSLSGSFLLSE-SSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVV 264
GR+EILSL+GS+L +E G R RTGGLSVSL+G DGRV+GG VAG+LTAA+P+QVVV
Sbjct: 280 NYGRYEILSLTGSYLSTELGGGARQRTGGLSVSLAGSDGRVIGGGVAGMLTAASPIQVVV 339
Query: 265 GSFLADGRK 273
SFL+D K
Sbjct: 340 ASFLSDTFK 348
>gi|356540448|ref|XP_003538701.1| PREDICTED: uncharacterized protein LOC100790569 [Glycine max]
Length = 352
Score = 200 bits (508), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 114/206 (55%), Positives = 145/206 (70%), Gaps = 21/206 (10%)
Query: 71 EPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGR 130
EP+K+KRGRPRKYGPDG++SL L P + TA GSG SS K+ RGR
Sbjct: 89 EPVKKKRGRPRKYGPDGSVSLMLSPMSA---TANSTPGSGTSSE----------KRPRGR 135
Query: 131 PPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSA 185
PPGSG +K QL LG SAG+ F+PHVITV GED+ +K++SF++ PRAVCIL+
Sbjct: 136 PPGSG--RKQQLATLGEWMNNSAGLAFSPHVITVGVGEDIVAKLLSFARQRPRAVCILTG 193
Query: 186 NGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRV 245
G IS+VTLRQ A++ +VTYEGRF+IL LSGS+L++E G +RTGG+SVSLS PDG +
Sbjct: 194 TGTISSVTLRQPASTSISVTYEGRFQILCLSGSYLVAEEGGPHNRTGGMSVSLSSPDGHI 253
Query: 246 LGGSVAGLLTAATPVQVVVGSFLADG 271
+GG V L+ AA+PVQVV SF+ G
Sbjct: 254 IGGGVTRLV-AASPVQVVACSFVYGG 278
>gi|359807105|ref|NP_001241091.1| uncharacterized protein LOC100796830 [Glycine max]
gi|255644758|gb|ACU22881.1| unknown [Glycine max]
Length = 346
Score = 198 bits (504), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 121/211 (57%), Positives = 147/211 (69%), Gaps = 22/211 (10%)
Query: 73 MKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPP 132
+K+KRGRPRKYGPDG++++AL P P S P S D RG+
Sbjct: 63 VKKKRGRPRKYGPDGSVTMALSPMPIS---------------SSAPPSNDFSSGKRGKMR 107
Query: 133 GSGS--GKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSA 185
G KK L+ LG S G F PH+ITV AGED++ K++SFSQ GPRA+CILSA
Sbjct: 108 GMDYKPSKKVGLDYLGDLNACSDGTNFMPHIITVNAGEDITMKVISFSQQGPRAICILSA 167
Query: 186 NGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRV 245
NG ISNVTLRQ +SGGT+TYEGRFEILSLSGSF+ +++ G RSRTGG+SVSL+ PDGRV
Sbjct: 168 NGVISNVTLRQPDSSGGTLTYEGRFEILSLSGSFMPTDNQGTRSRTGGMSVSLASPDGRV 227
Query: 246 LGGSVAGLLTAATPVQVVVGSFLADGRKESK 276
+GG VAGLL AA+PVQVVVGSFL ++E K
Sbjct: 228 VGGGVAGLLVAASPVQVVVGSFLPSSQQEQK 258
>gi|356513399|ref|XP_003525401.1| PREDICTED: uncharacterized protein LOC100798706 [Glycine max]
Length = 352
Score = 198 bits (503), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 117/227 (51%), Positives = 153/227 (67%), Gaps = 28/227 (12%)
Query: 59 QGLNVMNMGSG-----SEPMKRKRGRPRKYGPDGTMSLALVP--SPSSVTTATGGTGSGL 111
QG N G G EP+K+KRGRPRKYGPDG +SL L P +P++ T T
Sbjct: 73 QGHANFNHGIGIGAPSREPVKKKRGRPRKYGPDGAVSLRLSPMSAPANSTQDASET---- 128
Query: 112 SSPGGGPLSPDSIKKSRGRPPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVS 166
+P S KK+RGRPPGSG +K QL ALG SAG+ F+PHVIT+ GED+
Sbjct: 129 --------TP-SQKKARGRPPGSG--RKQQLAALGEWMNSSAGLAFSPHVITIGVGEDIV 177
Query: 167 SKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSG 226
+K++S SQ PRA+CI+S G +S+VTLRQ A++ +VT+EGRF+IL LSGS+L++E G
Sbjct: 178 AKLLSLSQQRPRALCIMSGTGTVSSVTLRQPASTNASVTFEGRFQILCLSGSYLVAEDGG 237
Query: 227 QRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRK 273
+RTGG+SVSLS PDG V+GG VA +L A +PVQV++ SF+ G K
Sbjct: 238 PLNRTGGISVSLSSPDGHVIGGGVA-VLIAGSPVQVMLCSFVYGGSK 283
>gi|449462812|ref|XP_004149134.1| PREDICTED: uncharacterized protein LOC101205374 [Cucumis sativus]
gi|449494644|ref|XP_004159607.1| PREDICTED: uncharacterized LOC101205374 [Cucumis sativus]
Length = 305
Score = 197 bits (502), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 121/216 (56%), Positives = 153/216 (70%), Gaps = 16/216 (7%)
Query: 67 GSGSEPMKRKRGRPRKYGPDGTMSLA-LVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIK 125
GS +E K+KRGRPRKYGPDG +++A L P P S + S+ K
Sbjct: 37 GSAAEAGKKKRGRPRKYGPDGKLNVAALSPKPISASAPAPAAVIDFSAE----------K 86
Query: 126 KSRGRPPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAV 180
+ + RP S + K+++E LG S G FTPH+ITV +GEDV+ K++SFSQ GPRA+
Sbjct: 87 RGKVRPASSLTKTKYEVENLGEWVPCSVGANFTPHIITVSSGEDVTMKVLSFSQQGPRAI 146
Query: 181 CILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSG 240
CILSANG IS+VTLRQ +SGGT+TYEGRFEILSLSGSF+ S+S G +SR GG+SVSL+
Sbjct: 147 CILSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPSDSIGTKSRIGGMSVSLAS 206
Query: 241 PDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESK 276
PDGRV+GG VAGLL AA+PVQVVVGSF++ + E K
Sbjct: 207 PDGRVVGGGVAGLLVAASPVQVVVGSFISGNQHEQK 242
>gi|357117022|ref|XP_003560275.1| PREDICTED: uncharacterized protein LOC100833750 [Brachypodium
distachyon]
Length = 336
Score = 197 bits (501), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 115/217 (52%), Positives = 141/217 (64%), Gaps = 27/217 (12%)
Query: 70 SEPMKRKRGRPRKYGP--DGTM---SLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSI 124
SE +K+KRGRPRKY P DG S ALV P++ G S
Sbjct: 76 SEQVKKKRGRPRKYNPPPDGLSPPSSSALVKVPATPGPGGSGGPS--------------- 120
Query: 125 KKSRGRPPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRA 179
+K RGRPPGSG K QL +LG S G GFTPHVI + +GED++++IMSFSQ GPRA
Sbjct: 121 EKRRGRPPGSG--KMQQLASLGKWFLGSVGTGFTPHVIIIPSGEDIAARIMSFSQQGPRA 178
Query: 180 VCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLS 239
VCI+SA GA+S TL Q A+SG +TYEGRFEIL LSGS+L+ + G R+R GGL ++L
Sbjct: 179 VCIMSATGAVSTPTLHQDASSGSAITYEGRFEILCLSGSYLVIDDGGSRTRNGGLCIALC 238
Query: 240 GPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESK 276
G D RV+GGSV G+LTAA VQV+VGSF+ G K K
Sbjct: 239 GADHRVIGGSVGGVLTAAGTVQVIVGSFMYAGSKNKK 275
>gi|413939548|gb|AFW74099.1| hypothetical protein ZEAMMB73_836102 [Zea mays]
Length = 327
Score = 197 bits (501), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 120/218 (55%), Positives = 151/218 (69%), Gaps = 20/218 (9%)
Query: 65 NMGSG---SEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATG-GTGSGLSSPGGGPLS 120
+ GSG E +K+KRGRPRKY PDG+++L L P+ SS ++ G G+ +++PG G S
Sbjct: 109 DQGSGPGQDEQVKKKRGRPRKYKPDGSVTLGLSPTSSSTPHSSSSGMGTMVNTPGSGFGS 168
Query: 121 PD---------SIKKSRGRPPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVS 166
S K+ RGRPPGSG K QL +LG S G GFTPHVI ++ GEDV+
Sbjct: 169 GGSGGSGSGAPSEKRGRGRPPGSG--KMQQLASLGKWFLGSVGTGFTPHVIIIQPGEDVA 226
Query: 167 SKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSG 226
++IM+FSQ GPRAVCI+SA GAIS TL Q + SGG VTYEGRFEIL LSGS+L+ E G
Sbjct: 227 ARIMAFSQQGPRAVCIISATGAISTATLHQDSDSGGVVTYEGRFEILCLSGSYLVVEDGG 286
Query: 227 QRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVV 264
R+R+GGL ++L GPD RV+GGSV G+LTAA VQV V
Sbjct: 287 TRTRSGGLCIALCGPDHRVIGGSVGGVLTAAGTVQVSV 324
>gi|297742528|emb|CBI34677.3| unnamed protein product [Vitis vinifera]
Length = 309
Score = 197 bits (500), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 117/209 (55%), Positives = 147/209 (70%), Gaps = 20/209 (9%)
Query: 70 SEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRG 129
+EP+KRKRGRPRKYGPDG +SL L P +A GSG +P + K+ RG
Sbjct: 53 AEPVKRKRGRPRKYGPDGNVSLGLSP-----MSARPSLGSGSVTP--------TQKRGRG 99
Query: 130 RPPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
RPPG+G +K QL LG SAG+ F PHVI++ GED++++I+SFSQ PRA+CILS
Sbjct: 100 RPPGTG--RKQQLATLGEWMNSSAGLAFAPHVISMAVGEDIATRILSFSQQRPRALCILS 157
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
A+G +S VTLRQ +S GTVTYEGRFEIL LSGS+L +E+ G R+R GG+SVSL PDG
Sbjct: 158 ASGTVSAVTLRQPTSSSGTVTYEGRFEILCLSGSYLPAETGGPRNRIGGISVSLCSPDGH 217
Query: 245 VLGGSVAGLLTAATPVQVVVGSFLADGRK 273
V+GG V G+L AA+PVQVV SF+ G K
Sbjct: 218 VIGGGVGGMLIAASPVQVVACSFVYGGSK 246
>gi|225426407|ref|XP_002273061.1| PREDICTED: uncharacterized protein LOC100249560 [Vitis vinifera]
Length = 346
Score = 197 bits (500), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 117/209 (55%), Positives = 147/209 (70%), Gaps = 20/209 (9%)
Query: 70 SEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRG 129
+EP+KRKRGRPRKYGPDG +SL L P +A GSG +P + K+ RG
Sbjct: 90 AEPVKRKRGRPRKYGPDGNVSLGLSP-----MSARPSLGSGSVTP--------TQKRGRG 136
Query: 130 RPPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
RPPG+G +K QL LG SAG+ F PHVI++ GED++++I+SFSQ PRA+CILS
Sbjct: 137 RPPGTG--RKQQLATLGEWMNSSAGLAFAPHVISMAVGEDIATRILSFSQQRPRALCILS 194
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
A+G +S VTLRQ +S GTVTYEGRFEIL LSGS+L +E+ G R+R GG+SVSL PDG
Sbjct: 195 ASGTVSAVTLRQPTSSSGTVTYEGRFEILCLSGSYLPAETGGPRNRIGGISVSLCSPDGH 254
Query: 245 VLGGSVAGLLTAATPVQVVVGSFLADGRK 273
V+GG V G+L AA+PVQVV SF+ G K
Sbjct: 255 VIGGGVGGMLIAASPVQVVACSFVYGGSK 283
>gi|357440217|ref|XP_003590386.1| hypothetical protein MTR_1g061530 [Medicago truncatula]
gi|355479434|gb|AES60637.1| hypothetical protein MTR_1g061530 [Medicago truncatula]
Length = 362
Score = 197 bits (500), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 112/241 (46%), Positives = 151/241 (62%), Gaps = 20/241 (8%)
Query: 74 KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPG 133
K+KRGRPRKY DG ++ PS + T L+SP G LS + +GR
Sbjct: 65 KKKRGRPRKYDADGNLN----PSYKKIVKTTTPI---LTSPPGFTLSTNEFASKKGRGKS 117
Query: 134 SGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGA 188
+G + G +A V F PHV+TV AGEDV KI+SF+Q PR +CILSANGA
Sbjct: 118 TGFVNYQTFSSFGEVFPSTAAVDFAPHVVTVYAGEDVGGKILSFAQKSPRGICILSANGA 177
Query: 189 ISNVTLRQAATSGGTV-TYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLG 247
IS V L Q ++GG++ TYEGRFEILSLSGS+ S++SG R+R GGLSVSL+GPDGRV+G
Sbjct: 178 ISKVALGQPGSTGGSILTYEGRFEILSLSGSYTASDNSGIRTREGGLSVSLAGPDGRVIG 237
Query: 248 GSVAGLLTAATPVQVVVGSFLADGRKE---SKSSHRMESLPVP----PKLAPGGQPAGQC 300
G+VAG+L AA P+Q+VVGSF+++G + R +++ P P++ +P Q
Sbjct: 238 GAVAGVLIAAGPIQIVVGSFMSNGNNSKPLKRKYQREQTVASPTSTGPEIVTAARPISQA 297
Query: 301 S 301
+
Sbjct: 298 N 298
>gi|357123004|ref|XP_003563203.1| PREDICTED: uncharacterized protein LOC100826632 [Brachypodium
distachyon]
Length = 340
Score = 197 bits (500), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 114/206 (55%), Positives = 139/206 (67%), Gaps = 16/206 (7%)
Query: 70 SEPMKRKRGRPRKYGP--DGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKS 127
SE +K+KRGRPRKY P DG SP S T+A + S G S +K
Sbjct: 74 SEQVKKKRGRPRKYKPPPDGL-------SPPSSTSALVTVPATPGSGPGPGGSGGPSEKR 126
Query: 128 RGRPPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCI 182
RGRPPGSG K QL +LG S G GFTPHVI + +GEDV+++IMSFSQ GPRAVCI
Sbjct: 127 RGRPPGSG--KMQQLASLGKCFLGSVGTGFTPHVIIIPSGEDVAARIMSFSQQGPRAVCI 184
Query: 183 LSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPD 242
+SA GA+S TL Q A+SG +TYEGRFEIL LSGS+L+ + G R+R GGL ++L G D
Sbjct: 185 MSATGAVSTATLHQDASSGSVITYEGRFEILCLSGSYLVIDDGGSRTRNGGLCIALCGAD 244
Query: 243 GRVLGGSVAGLLTAATPVQVVVGSFL 268
RV+GGSV G+LTAA VQV+VGSF+
Sbjct: 245 HRVIGGSVGGVLTAAGTVQVIVGSFM 270
>gi|15237481|ref|NP_199476.1| AT hook motif DNA-binding family protein [Arabidopsis thaliana]
gi|9758500|dbj|BAB08908.1| unnamed protein product [Arabidopsis thaliana]
gi|51315384|gb|AAT99797.1| At5g46640 [Arabidopsis thaliana]
gi|52627131|gb|AAU84692.1| At5g46640 [Arabidopsis thaliana]
gi|119657360|tpd|FAA00279.1| TPA: AT-hook motif nuclear localized protein 8 [Arabidopsis
thaliana]
gi|225879094|dbj|BAH30617.1| hypothetical protein [Arabidopsis thaliana]
gi|332008026|gb|AED95409.1| AT hook motif DNA-binding family protein [Arabidopsis thaliana]
Length = 386
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 129/242 (53%), Positives = 166/242 (68%), Gaps = 21/242 (8%)
Query: 41 SPTYQPSGAGGDGAIPQAQGLNVMNMGSGSEPMKRKRGRPRKYGPDGTMSLALVP-SP-- 97
SP+ QP G D Q Q L V K+KRGRPRKY PDG+++L L P SP
Sbjct: 82 SPSSQPMRFGIDD---QNQQLQV----------KKKRGRPRKYTPDGSIALGLAPTSPLL 128
Query: 98 SSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPGSGSGKKHQLEAL-GSAGVGFTPHV 156
S+ + + G G G S G + P +K++RGRPPGS K QL+AL G++GVGFTPHV
Sbjct: 129 SAASNSYGEGGVGDSGGNGNSVDP-PVKRNRGRPPGSS---KKQLDALGGTSGVGFTPHV 184
Query: 157 ITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLS 216
I V GED++SK+M+FS G R +CILSA+GA+S V LRQA+ S G VTYEGRFEI++LS
Sbjct: 185 IEVNTGEDIASKVMAFSDQGSRTICILSASGAVSRVMLRQASHSSGIVTYEGRFEIITLS 244
Query: 217 GSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESK 276
GS L E +G +R+G LSV+L+GPDG ++GGSV G L AAT VQV+VGSF+A+ +K +
Sbjct: 245 GSVLNYEVNGSTNRSGNLSVALAGPDGGIVGGSVVGNLVAATQVQVIVGSFVAEAKKPKQ 304
Query: 277 SS 278
SS
Sbjct: 305 SS 306
>gi|115474539|ref|NP_001060866.1| Os08g0118000 [Oryza sativa Japonica Group]
gi|42407899|dbj|BAD09039.1| putative AT-hook protein 1 [Oryza sativa Japonica Group]
gi|50725642|dbj|BAD33109.1| putative AT-hook protein 1 [Oryza sativa Japonica Group]
gi|113622835|dbj|BAF22780.1| Os08g0118000 [Oryza sativa Japonica Group]
gi|119657404|tpd|FAA00301.1| TPA: AT-hook motif nuclear localized protein 1 [Oryza sativa
Japonica Group]
gi|125602001|gb|EAZ41326.1| hypothetical protein OsJ_25837 [Oryza sativa Japonica Group]
gi|215687040|dbj|BAG90886.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 372
Score = 196 bits (497), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 121/234 (51%), Positives = 156/234 (66%), Gaps = 25/234 (10%)
Query: 75 RKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPGS 134
+KRGRPRKYGPDG++ L +P S + + G +P + ++K+ RGRP
Sbjct: 71 KKRGRPRKYGPDGSLIRPLNATPISASVPMAASAVGPYTPASAVGA--AMKRGRGRPLDF 128
Query: 135 GSGKK-----------------HQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSF 172
S K +++G SAG FTPH+ITV GEDV+ K++SF
Sbjct: 129 ASTAKLHHHHQHQHHHQQQQFGFHFDSIGEMVACSAGANFTPHIITVAPGEDVTMKVISF 188
Query: 173 SQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTG 232
SQ GPRA+CILSANG ISNVTLRQ +SGGT+TYEGRFE+LSLSGSF+ +E+SG RSR+G
Sbjct: 189 SQQGPRAICILSANGVISNVTLRQPDSSGGTLTYEGRFELLSLSGSFMPTENSGTRSRSG 248
Query: 233 GLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSH-RMESLP 285
G+SVSL+ PDGRV+GG VAGLL AA+PVQ+VVGSFL + E K+ R+E+ P
Sbjct: 249 GMSVSLASPDGRVVGGGVAGLLVAASPVQIVVGSFLPSYQMEQKNKKPRVEAAP 302
>gi|125559961|gb|EAZ05409.1| hypothetical protein OsI_27618 [Oryza sativa Indica Group]
Length = 372
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 121/234 (51%), Positives = 156/234 (66%), Gaps = 25/234 (10%)
Query: 75 RKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPGS 134
+KRGRPRKYGPDG++ L +P S + + G +P + ++K+ RGRP
Sbjct: 71 KKRGRPRKYGPDGSLIRPLNATPISASVPMAASAVGPYTPASAVGA--AMKRGRGRPLDF 128
Query: 135 GSGKK-----------------HQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSF 172
S K +++G SAG FTPH+ITV GEDV+ K++SF
Sbjct: 129 ASTAKLHHHHQHQHHHQQQQFGFHFDSIGEMVACSAGANFTPHIITVAPGEDVTMKVISF 188
Query: 173 SQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTG 232
SQ GPRA+CILSANG ISNVTLRQ +SGGT+TYEGRFE+LSLSGSF+ +E+SG RSR+G
Sbjct: 189 SQQGPRAICILSANGVISNVTLRQPDSSGGTLTYEGRFELLSLSGSFMPTENSGTRSRSG 248
Query: 233 GLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSH-RMESLP 285
G+SVSL+ PDGRV+GG VAGLL AA+PVQ+VVGSFL + E K+ R+E+ P
Sbjct: 249 GMSVSLASPDGRVVGGGVAGLLVAASPVQIVVGSFLPSYQIEQKNKKPRVEAAP 302
>gi|356540605|ref|XP_003538778.1| PREDICTED: uncharacterized protein LOC100789687 [Glycine max]
Length = 339
Score = 195 bits (495), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 106/204 (51%), Positives = 134/204 (65%), Gaps = 24/204 (11%)
Query: 68 SGSEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKS 127
+ SE K+KRGRPRKY PDG ++L L P+ + ++A KK
Sbjct: 75 AASESSKKKRGRPRKYSPDGNIALGLGPTHAPASSAD-----------------PPAKKH 117
Query: 128 RGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANG 187
RGRPPGSG K Q++ALG G GFTPHVIT + GED+++K+++F + GPR VC LSANG
Sbjct: 118 RGRPPGSG---KKQMDALGIPGTGFTPHVITAEVGEDIAAKLVAFCEQGPRTVCTLSANG 174
Query: 188 AISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLG 247
A NVT+R GTV YEG FEI+SL + L S++ +R LSVSL+GPDGRVLG
Sbjct: 175 ATRNVTIRAPDMPAGTVAYEGPFEIISLKAATLQSDN----NRMAALSVSLAGPDGRVLG 230
Query: 248 GSVAGLLTAATPVQVVVGSFLADG 271
G V G LTAAT VQ+V+GSF+ADG
Sbjct: 231 GEVVGALTAATAVQIVLGSFIADG 254
>gi|357481621|ref|XP_003611096.1| DNA-binding PD1-like protein [Medicago truncatula]
gi|355512431|gb|AES94054.1| DNA-binding PD1-like protein [Medicago truncatula]
Length = 321
Score = 194 bits (492), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 113/211 (53%), Positives = 144/211 (68%), Gaps = 25/211 (11%)
Query: 68 SGSEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKS 127
SG + +K+KRGRPRKYGPD +SL L SP S T + +PDS K+
Sbjct: 64 SGEQSVKKKRGRPRKYGPDVPVSLRL--SPMSATANS---------------TPDSEKRP 106
Query: 128 RGRPPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCI 182
RGRPPGSG +K QL ALG SAG F+PHVIT+ ED+ K++ FSQ+ PRA+C+
Sbjct: 107 RGRPPGSG--RKQQLAALGEWMNSSAGQAFSPHVITIGPQEDIVEKLLLFSQHRPRALCV 164
Query: 183 LSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPD 242
LS G +S+VTLRQ A++ +VTYEGRF+IL LSGS+L++E G +RTGG+SVSLS D
Sbjct: 165 LSGTGTVSSVTLRQPASTSVSVTYEGRFQILCLSGSYLVAEDGGPHNRTGGISVSLSSMD 224
Query: 243 GRVLGGSVAGLLTAATPVQVVVGSFLADGRK 273
G V+GG VA L+ AA+PVQVVV SF+ G K
Sbjct: 225 GHVIGGGVARLI-AASPVQVVVCSFVYGGSK 254
>gi|297795967|ref|XP_002865868.1| hypothetical protein ARALYDRAFT_495229 [Arabidopsis lyrata subsp.
lyrata]
gi|297311703|gb|EFH42127.1| hypothetical protein ARALYDRAFT_495229 [Arabidopsis lyrata subsp.
lyrata]
Length = 418
Score = 194 bits (492), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 118/233 (50%), Positives = 147/233 (63%), Gaps = 28/233 (12%)
Query: 64 MNMGSGSEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTT------------------ATG 105
M + + S +K+KRGRPRKY PDG++++ L P P S +
Sbjct: 70 MPVENSSSDLKKKRGRPRKYNPDGSLAVTLSPMPISSSVPLTSELGSRKRGRGRGRGRGR 129
Query: 106 GTGSGLSSPGGGPLSPDSIKKSR----GRPPGSGSGKKHQLEALGSAGVGFTPHVITVKA 161
G G G P + + +K + P SG G + FTPHV+TV A
Sbjct: 130 GRGQGSREPNNDNNNNNWLKNPQMFEFNNTPSSGGGGPAEF-----VSPSFTPHVLTVNA 184
Query: 162 GEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLL 221
GEDV+ KIM+FSQ G RA+CILSANG ISNVTLRQ+ TSGGT+TYEG FEILSL+GSF+
Sbjct: 185 GEDVTMKIMTFSQQGSRAICILSANGPISNVTLRQSMTSGGTLTYEGHFEILSLTGSFIP 244
Query: 222 SESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKE 274
SES G RSR GG+SVSL+GPDGRV GG +AGL AA PVQV+VGSF+A G++E
Sbjct: 245 SESGGTRSRAGGMSVSLAGPDGRVFGGGLAGLFIAAGPVQVMVGSFIA-GQEE 296
>gi|224074727|ref|XP_002304442.1| predicted protein [Populus trichocarpa]
gi|222841874|gb|EEE79421.1| predicted protein [Populus trichocarpa]
Length = 333
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 120/212 (56%), Positives = 147/212 (69%), Gaps = 11/212 (5%)
Query: 72 PMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGS-GLSSPGGGPLSPDSIKKSRGR 130
P+K+KRGRPRKYGPDG++++AL P P S S + P S+ +
Sbjct: 66 PLKKKRGRPRKYGPDGSVTMALSPKPISSAAPAPSPPVIDFSVVKQKKIKP----VSKAK 121
Query: 131 PPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSA 185
S Q + LG S G FTPH+ITV AGEDV+ KI+SFSQ GPRA+C+LSA
Sbjct: 122 ISVSWLLMLWQFDLLGEWVACSVGANFTPHIITVNAGEDVTMKIISFSQQGPRAICVLSA 181
Query: 186 NGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRV 245
NG IS+VTLRQ +SGGT+TYEGRFEILSLSGSF+ +E+ G RSR+GG+SVSL+ PDGRV
Sbjct: 182 NGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPTETGGTRSRSGGMSVSLASPDGRV 241
Query: 246 LGGSVAGLLTAATPVQ-VVVGSFLADGRKESK 276
+GG VAGLL AA+PVQ VVVGSFLA + E K
Sbjct: 242 VGGGVAGLLVAASPVQVVVVGSFLAGNQHEQK 273
>gi|357476667|ref|XP_003608619.1| AT-hook protein [Medicago truncatula]
gi|355509674|gb|AES90816.1| AT-hook protein [Medicago truncatula]
Length = 334
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 120/240 (50%), Positives = 150/240 (62%), Gaps = 32/240 (13%)
Query: 38 TATSPTYQPSGAGGDGAIPQAQGLNVMNMGSGSEPM--KRKRGRPRKYGPDGTMSLALVP 95
T T+ P+ A A PQ++ +V + G S K+KRGRPRKY PDG ++L
Sbjct: 35 TTTANIMAPATARFPFASPQSEPFSVTHDGPSSPSTLGKKKRGRPRKYSPDGNIAL---- 90
Query: 96 SPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPH 155
G GS S GRPPGSG K QL+ALG+ G GFTPH
Sbjct: 91 ----------GFGSCF-------FSCCCYVCCFGRPPGSG---KKQLDALGAGGTGFTPH 130
Query: 156 VITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYE-----GRF 210
VI V++GED++ K+M+FSQ GPR VCILSA GAIS+V LRQ A SG YE G+F
Sbjct: 131 VILVESGEDITEKVMAFSQTGPRTVCILSAIGAISSVILRQPA-SGSIARYEVQLVNGQF 189
Query: 211 EILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLAD 270
EI+SLSG LSE++G++SRT L VS++G DGRVLGG+VAG LTAA+ VQV+VGSF+ D
Sbjct: 190 EIVSLSGPMPLSENNGEQSRTSSLYVSVAGADGRVLGGAVAGELTAASTVQVIVGSFIVD 249
>gi|356568280|ref|XP_003552341.1| PREDICTED: uncharacterized protein LOC100777213 [Glycine max]
Length = 338
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 121/220 (55%), Positives = 151/220 (68%), Gaps = 14/220 (6%)
Query: 58 AQGLNVMNMGSGSEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGG 117
A G++ +++G K+KRGRPRKYGPDG S+AL P P S S+P
Sbjct: 47 AIGVSPVSVGLDGTAAKKKRGRPRKYGPDGLNSMALSPMPISS-----------SAPFAN 95
Query: 118 PLSPDSIKKSRGRPPGSGSGKKHQLEALG-SAGVGFTPHVITVKAGEDVSSKIMSFSQNG 176
S KSRG KK ++ G S G F PH+ITV GED++ K++SFSQ G
Sbjct: 96 NFSSGKRGKSRGME--YKLLKKVGVDLFGDSVGTNFMPHIITVNTGEDITMKVISFSQQG 153
Query: 177 PRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSV 236
PRA+CILSA+G ISNVTLRQ +SGGT+TYEGRFEILSLSGSF+ +++ G RSR+GG+SV
Sbjct: 154 PRAICILSASGVISNVTLRQPDSSGGTLTYEGRFEILSLSGSFMPTDNQGSRSRSGGMSV 213
Query: 237 SLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESK 276
SLS PDGRV+GG VAGLL AA PVQVVVGSFL + +++ K
Sbjct: 214 SLSSPDGRVVGGGVAGLLVAAGPVQVVVGSFLPNNQQDQK 253
>gi|356528260|ref|XP_003532722.1| PREDICTED: uncharacterized protein LOC100813888 [Glycine max]
Length = 352
Score = 191 bits (486), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 115/227 (50%), Positives = 151/227 (66%), Gaps = 28/227 (12%)
Query: 59 QGLNVMNMGSG-----SEPMKRKRGRPRKYGPDGTMSLALVP--SPSSVTTATGGTGSGL 111
QG N G G SEP+K+KRGRPRKYGPDG +SL L P +P++ T T
Sbjct: 73 QGHTNFNHGIGIGAPSSEPVKKKRGRPRKYGPDGAVSLRLSPMSAPANSTQDASET---- 128
Query: 112 SSPGGGPLSPDSIKKSRGRPPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVS 166
+P S KK+RGRPPGSG +K QL ALG SAG+ F+PHV+T+ GED+
Sbjct: 129 --------TP-SQKKARGRPPGSG--RKQQLAALGEWMNSSAGLAFSPHVVTIGVGEDIV 177
Query: 167 SKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSG 226
+K++S SQ RA+CI+S G +S+VTLRQ A++ +VT+EGRF+IL LSGS+L++E G
Sbjct: 178 AKLLSLSQQRSRALCIMSGTGTVSSVTLRQPASTNASVTFEGRFQILCLSGSYLVAEDGG 237
Query: 227 QRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRK 273
+RTGG+SVSLS DG V+GG VA +L A PVQV++ SF+ G K
Sbjct: 238 PSNRTGGISVSLSSHDGHVIGGGVA-VLIAGGPVQVMLCSFVYGGSK 283
>gi|356497236|ref|XP_003517468.1| PREDICTED: uncharacterized protein LOC100795781 [Glycine max]
Length = 357
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 110/208 (52%), Positives = 142/208 (68%), Gaps = 21/208 (10%)
Query: 71 EPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGR 130
EP+K+KRGRPRKYGPDG++SL L SP S T ++ S K+ RGR
Sbjct: 97 EPVKKKRGRPRKYGPDGSVSLML--SPMSATASSTPGSGTSSE-----------KRPRGR 143
Query: 131 PPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSA 185
PPGSG +K QL LG SAG+ F+PHVITV ED+ +K++SF++ PRAVCIL+
Sbjct: 144 PPGSG--RKQQLATLGEWMNSSAGLAFSPHVITVGVDEDIVAKLLSFARQRPRAVCILTG 201
Query: 186 NGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRV 245
G IS+VTLRQ A++ VTYEGRF+IL LSGS+L++E G +RTGG+SVSLS PDG +
Sbjct: 202 TGTISSVTLRQPASTSIGVTYEGRFQILCLSGSYLVAEEGGPHNRTGGMSVSLSSPDGHI 261
Query: 246 LGGSVAGLLTAATPVQVVVGSFLADGRK 273
+GG V L+ A++PVQVV SF+ G K
Sbjct: 262 IGGGVTRLV-ASSPVQVVACSFVYGGSK 288
>gi|297799736|ref|XP_002867752.1| DNA-binding family protein [Arabidopsis lyrata subsp. lyrata]
gi|297313588|gb|EFH44011.1| DNA-binding family protein [Arabidopsis lyrata subsp. lyrata]
Length = 332
Score = 191 bits (484), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 121/233 (51%), Positives = 157/233 (67%), Gaps = 20/233 (8%)
Query: 67 GSGSEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKK 126
G S P+K++RGRPRKY DG A+ SP+ ++TA T + S + K+
Sbjct: 65 GFSSGPIKKRRGRPRKYRHDGA---AVTLSPNPISTAAPTTSHVID------FSTTAEKR 115
Query: 127 SRGRP--PGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRA 179
+ +P P S K+Q+E LG SA FTPH+ITV AGEDV+ +I+SFSQ G A
Sbjct: 116 GKMKPATPSSFIRPKYQVENLGEWAPSSAAANFTPHIITVNAGEDVTKRIISFSQQGSLA 175
Query: 180 VCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLS 239
+C+L ANG +S+VTLRQ +SGGT+TYEGRFEILSLSG+F+ S+S G RSRTGG+SVSL+
Sbjct: 176 ICVLCANGVVSSVTLRQPHSSGGTLTYEGRFEILSLSGTFMPSDSDGTRSRTGGMSVSLA 235
Query: 240 GPDGRVLGGSVAGLLTAATPVQVVVGSFLA----DGRKESKSSHRMESLPVPP 288
PDGRV+GG VAGLL AATP+QVVVGSFLA ++ + +H S P+ P
Sbjct: 236 SPDGRVVGGGVAGLLVAATPIQVVVGSFLAGTNQQDQRPKQQNHNFMSSPLMP 288
>gi|15225902|ref|NP_182109.1| AT hook motif DNA-binding family protein [Arabidopsis thaliana]
gi|30690145|ref|NP_850442.1| AT hook motif DNA-binding family protein [Arabidopsis thaliana]
gi|14194131|gb|AAK56260.1|AF367271_1 At2g45850/F4I18.17 [Arabidopsis thaliana]
gi|3386609|gb|AAC28539.1| putative AT-hook DNA-binding protein [Arabidopsis thaliana]
gi|16323338|gb|AAL15382.1| At2g45850/F4I18.17 [Arabidopsis thaliana]
gi|17065246|gb|AAL32777.1| putative AT-hook DNA-binding protein [Arabidopsis thaliana]
gi|21387187|gb|AAM47997.1| putative AT-hook DNA-binding protein [Arabidopsis thaliana]
gi|119657362|tpd|FAA00280.1| TPA: AT-hook motif nuclear localized protein 9 [Arabidopsis
thaliana]
gi|330255515|gb|AEC10609.1| AT hook motif DNA-binding family protein [Arabidopsis thaliana]
gi|330255516|gb|AEC10610.1| AT hook motif DNA-binding family protein [Arabidopsis thaliana]
Length = 348
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 115/235 (48%), Positives = 153/235 (65%), Gaps = 27/235 (11%)
Query: 55 IPQAQGLNVM--NMGSGSEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLS 112
+P G+N++ PMKRKRGRPRKYG DG++SLAL S S T
Sbjct: 77 LPHHIGVNMIAPPPPPSETPMKRKRGRPRKYGQDGSVSLALSSSSVSTITPN-------- 128
Query: 113 SPGGGPLSPDSIKKSRGRPPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSS 167
+S K+ RGRPPGSG KK ++ ++G S+G+ FTPHVI V GED++S
Sbjct: 129 ---------NSNKRGRGRPPGSG--KKQRMASVGELMPSSSGMSFTPHVIAVSIGEDIAS 177
Query: 168 KIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQ 227
K+++FSQ GPRA+C+LSA+GA+S TL Q + S G + YEGRFEIL+LS S++++
Sbjct: 178 KVIAFSQQGPRAICVLSASGAVSTATLIQPSASPGAIKYEGRFEILALSTSYIVATDGSF 237
Query: 228 RSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRME 282
R+RTG LSVSL+ PDGRV+GG++ G L AA+PVQV+VGSF+ K KS R E
Sbjct: 238 RNRTGNLSVSLASPDGRVIGGAIGGPLIAASPVQVIVGSFIWAAPK-IKSKKREE 291
>gi|356497039|ref|XP_003517372.1| PREDICTED: uncharacterized protein LOC100788026 [Glycine max]
Length = 338
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 102/204 (50%), Positives = 134/204 (65%), Gaps = 24/204 (11%)
Query: 68 SGSEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKS 127
+ SE K+KRGRPRKY PDG ++L L P+ + ++A KK
Sbjct: 74 AASESSKKKRGRPRKYSPDGNIALGLGPTHAPASSAD-----------------PPAKKH 116
Query: 128 RGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANG 187
RGRPPGSG K Q++ALG G GFTPHVIT + GED++SK+++F + G R VC LSA+G
Sbjct: 117 RGRPPGSG---KKQMDALGIPGTGFTPHVITAEVGEDIASKLVAFCEQGRRTVCTLSASG 173
Query: 188 AISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLG 247
AI NVT+R G + YEG+FEI+SL + L S++ +R LSVS++GPDGR+LG
Sbjct: 174 AIRNVTIRAPDMPAGILAYEGQFEIISLKAATLQSDN----NRMAALSVSIAGPDGRLLG 229
Query: 248 GSVAGLLTAATPVQVVVGSFLADG 271
G V G LTAAT VQV++GSF+ADG
Sbjct: 230 GEVVGALTAATAVQVILGSFIADG 253
>gi|15235790|ref|NP_194008.1| AT hook motif DNA-binding family protein [Arabidopsis thaliana]
gi|2827554|emb|CAA16562.1| putative DNA binding protein [Arabidopsis thaliana]
gi|7269124|emb|CAB79232.1| putative DNA binding protein [Arabidopsis thaliana]
gi|21537115|gb|AAM61456.1| putative DNA binding protein [Arabidopsis thaliana]
gi|111074368|gb|ABH04557.1| At4g22770 [Arabidopsis thaliana]
gi|119657348|tpd|FAA00273.1| TPA: AT-hook motif nuclear localized protein 2 [Arabidopsis
thaliana]
gi|225898799|dbj|BAH30530.1| hypothetical protein [Arabidopsis thaliana]
gi|332659256|gb|AEE84656.1| AT hook motif DNA-binding family protein [Arabidopsis thaliana]
Length = 334
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 120/235 (51%), Positives = 158/235 (67%), Gaps = 22/235 (9%)
Query: 67 GSGSEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKK 126
G S P+K++RGRPRKYG DG A+ SP+ +++A T + S S K+
Sbjct: 65 GFSSGPIKKRRGRPRKYGHDGA---AVTLSPNPISSAAPTTSHVID------FSTTSEKR 115
Query: 127 SRGRP----PGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGP 177
+ +P P S K+Q+E LG SA FTPH+ITV AGEDV+ +I+SFSQ G
Sbjct: 116 GKMKPATPTPSSFIRPKYQVENLGEWSPSSAAANFTPHIITVNAGEDVTKRIISFSQQGS 175
Query: 178 RAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVS 237
A+C+L ANG +S+VTLRQ +SGGT+TYEGRFEILSLSG+F+ S+S G RSRTGG+SVS
Sbjct: 176 LAICVLCANGVVSSVTLRQPDSSGGTLTYEGRFEILSLSGTFMPSDSDGTRSRTGGMSVS 235
Query: 238 LSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKS----SHRMESLPVPP 288
L+ PDGRV+GG VAGLL AATP+QVVVG+FL ++ ++ +H S P+ P
Sbjct: 236 LASPDGRVVGGGVAGLLVAATPIQVVVGTFLGGTNQQEQTPKPHNHNFMSSPLMP 290
>gi|30696854|ref|NP_176536.2| AT hook motif DNA-binding family protein [Arabidopsis thaliana]
gi|26451696|dbj|BAC42943.1| putative DNA-binding protein [Arabidopsis thaliana]
gi|28973281|gb|AAO63965.1| putative DNA-binding protein [Arabidopsis thaliana]
gi|119657354|tpd|FAA00276.1| TPA: AT-hook motif nuclear localized protein 5 [Arabidopsis
thaliana]
gi|332195982|gb|AEE34103.1| AT hook motif DNA-binding family protein [Arabidopsis thaliana]
Length = 378
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 105/204 (51%), Positives = 143/204 (70%), Gaps = 18/204 (8%)
Query: 71 EPM-KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRG 129
+PM K+KRGRPRKY PDG +SL L P P + + P++ K++RG
Sbjct: 101 QPMVKKKRGRPRKYVPDGQVSLGLSPMPCVSKKSKDSSSMS---------DPNAPKRARG 151
Query: 130 RPPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
RPPG+G +K +L LG SAG+ F PHVI+V +GED+ SK++SFSQ PRA+CI+S
Sbjct: 152 RPPGTG--RKQRLANLGEWMNTSAGLAFAPHVISVGSGEDIVSKVLSFSQKRPRALCIMS 209
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
G +S+VTLR+ A++ ++T+EGRFEILSL GS+L++E G +SRTGGLSVSLSGP+G
Sbjct: 210 GTGTVSSVTLREPASTTPSLTFEGRFEILSLGGSYLVNEEGGSKSRTGGLSVSLSGPEGH 269
Query: 245 VLGGSVAGLLTAATPVQVVVGSFL 268
V+GG + G+L AA+ VQVV SF+
Sbjct: 270 VIGGGI-GMLIAASLVQVVACSFV 292
>gi|168050233|ref|XP_001777564.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162671049|gb|EDQ57607.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 277
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 117/228 (51%), Positives = 142/228 (62%), Gaps = 41/228 (17%)
Query: 72 PMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTG--SGLSSPGGGPLSP---DSIKK 126
P+KRKRGRPRKY T P + G S L+ P +P S K+
Sbjct: 20 PLKRKRGRPRKYATGDT--------PQVTASGLGNISLFSALAKQIAAPYTPPPNKSEKR 71
Query: 127 SRGRPPGSGSGKKHQLEALGS--AGVG--FTPHVITVKAGEDVSSKIMSFSQNGPRAVCI 182
RGRP GS KK QL LG AG G FTPH++TV GED SSKIM F+Q+GPRA+C+
Sbjct: 72 GRGRP--VGSTKKQQLANLGVVLAGTGKSFTPHILTVSTGEDASSKIMQFAQHGPRAMCV 129
Query: 183 LSANGAISNVTLRQAATSGGTVTYE---------------------GRFEILSLSGSFLL 221
LSANGA+SNV LRQ ++SGGTVTYE GR+EILSLSGS+L
Sbjct: 130 LSANGAVSNVMLRQDSSSGGTVTYEVQTGYSEECLALETLQWSNFKGRYEILSLSGSYLP 189
Query: 222 SE-SSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFL 268
++ G++ RTG +SVSL+G DGRV GG VAG+L AA+P+QVVVGSFL
Sbjct: 190 TDGEDGEKQRTGSVSVSLAGSDGRVFGGRVAGVLMAASPIQVVVGSFL 237
>gi|388500614|gb|AFK38373.1| unknown [Lotus japonicus]
Length = 357
Score = 188 bits (477), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 119/239 (49%), Positives = 152/239 (63%), Gaps = 22/239 (9%)
Query: 74 KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPG 133
K+KRGRPRKY DG ++ P+ +AT LS+ + S K+ RG+P
Sbjct: 65 KKKRGRPRKYDADGNLN------PAYKKSATPPQRFTLSATA----NEFSAKRGRGKP-A 113
Query: 134 SGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGA 188
+G G H + G SA FTPHV+TV GEDV+ KIMSF+Q PR +CILSANG
Sbjct: 114 TGFGNYHLFASFGEVFASSASGDFTPHVVTVYTGEDVAGKIMSFAQKSPRGICILSANGP 173
Query: 189 ISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGG 248
ISNV LRQ + GG +TYEGRFEILSLSGSF +S+SSG +SR+ GLSVSL+GPDGRV+GG
Sbjct: 174 ISNVILRQPGSCGGILTYEGRFEILSLSGSFSVSDSSGMKSRSAGLSVSLAGPDGRVIGG 233
Query: 249 SVAGLLTAATPVQVVVGSFLADG--RKESKSSHRMESLPVP----PKLAPGGQPAGQCS 301
VAGLLTAA P+Q+VVGSF+ +G + + R ++ P P+ G P Q +
Sbjct: 234 GVAGLLTAAGPIQIVVGSFMPNGYLKTHKRKYQREHTVASPTSTGPETVTGATPISQAN 292
>gi|449452330|ref|XP_004143912.1| PREDICTED: uncharacterized protein LOC101219973 [Cucumis sativus]
Length = 343
Score = 187 bits (476), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 123/256 (48%), Positives = 157/256 (61%), Gaps = 35/256 (13%)
Query: 57 QAQGLNVMNMGSGSEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGG 116
+ G+NV + SEP+K+KRGRPRKY PDG +SL L P + G
Sbjct: 76 RGMGINVSAGVNSSEPVKKKRGRPRKYAPDGQVSLGLSPMSA-----------------G 118
Query: 117 GPLSP--DSIKKSRGRPPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKI 169
L+P +S R R GSG+K QL LG SAG+ F PHVI V AGED+ +K+
Sbjct: 119 SKLTPGSNSSTPRRRRGRPPGSGRKQQLALLGDWMNNSAGLAFAPHVIHVGAGEDIVAKV 178
Query: 170 MSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRS 229
+SF+Q PRAVC+LS NG +S+VTLRQ A++G +VTYEG F+IL LSGS+L++E G RS
Sbjct: 179 LSFAQQRPRAVCVLSGNGTVSSVTLRQPASTGVSVTYEGHFQILCLSGSYLVAEDGGPRS 238
Query: 230 RTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESK----------SSH 279
RTGG+SVSL+ PDG V+GG VA +LTAA PVQVVV SF+ + ++K S H
Sbjct: 239 RTGGISVSLASPDGHVIGGGVA-VLTAAGPVQVVVCSFVYGPKIKNKQVAGPKSNDGSGH 297
Query: 280 RMESLPVPPKLAPGGQ 295
V P AP Q
Sbjct: 298 EHHDNLVSPTSAPSTQ 313
>gi|297820982|ref|XP_002878374.1| hypothetical protein ARALYDRAFT_324562 [Arabidopsis lyrata subsp.
lyrata]
gi|297324212|gb|EFH54633.1| hypothetical protein ARALYDRAFT_324562 [Arabidopsis lyrata subsp.
lyrata]
Length = 346
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 109/206 (52%), Positives = 142/206 (68%), Gaps = 26/206 (12%)
Query: 68 SGSEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKS 127
SG +KRKRGRPRKYG DG++SLAL PS S+V SP+S K+
Sbjct: 89 SGDTSLKRKRGRPRKYGQDGSVSLALSPSVSNV-------------------SPNSNKRG 129
Query: 128 RGRPPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCI 182
RGRPPGSG KK +L ++G S+G+ FTPHVI V GED++SK++SFS GPRA+C+
Sbjct: 130 RGRPPGSG--KKQRLSSIGEMMPSSSGMSFTPHVIVVSIGEDIASKVISFSHQGPRAICV 187
Query: 183 LSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPD 242
LSA+GA+S TL Q A S GT+TYEG FE++SLS S+L + + +RTG L+VSL+ D
Sbjct: 188 LSASGAVSTATLLQPAPSHGTITYEGLFELISLSTSYLNTTDNDYPNRTGSLAVSLASSD 247
Query: 243 GRVLGGSVAGLLTAATPVQVVVGSFL 268
GRV+GG + G L AA+ VQV+VGSF+
Sbjct: 248 GRVIGGGIGGPLIAASQVQVIVGSFI 273
>gi|6633838|gb|AAF19697.1|AC008047_4 F2K11.15 [Arabidopsis thaliana]
Length = 826
Score = 187 bits (475), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 105/208 (50%), Positives = 143/208 (68%), Gaps = 22/208 (10%)
Query: 71 EPM-KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRG 129
+PM K+KRGRPRKY PDG +SL L P P + + P++ K++RG
Sbjct: 456 QPMVKKKRGRPRKYVPDGQVSLGLSPMPCVSKKSKDSSSMS---------DPNAPKRARG 506
Query: 130 RPPGSGSGKKHQLEALG---------SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAV 180
RPPG+G +K +L LG SAG+ F PHVI+V +GED+ SK++SFSQ PRA+
Sbjct: 507 RPPGTG--RKQRLANLGEISSEWMNTSAGLAFAPHVISVGSGEDIVSKVLSFSQKRPRAL 564
Query: 181 CILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSG 240
CI+S G +S+VTLR+ A++ ++T+EGRFEILSL GS+L++E G +SRTGGLSVSLSG
Sbjct: 565 CIMSGTGTVSSVTLREPASTTPSLTFEGRFEILSLGGSYLVNEEGGSKSRTGGLSVSLSG 624
Query: 241 PDGRVLGGSVAGLLTAATPVQVVVGSFL 268
P+G V+GG + G+L AA+ VQVV SF+
Sbjct: 625 PEGHVIGGGI-GMLIAASLVQVVACSFV 651
Score = 164 bits (416), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 96/212 (45%), Positives = 133/212 (62%), Gaps = 28/212 (13%)
Query: 73 MKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPP 132
+KRKRGRPRKYG + + SP S P+ K++RGRPP
Sbjct: 98 VKRKRGRPRKYGEPMVSNKSRDSSPMS--------------------DPNEPKRARGRPP 137
Query: 133 GSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANG 187
G+G +K +L LG SAG+ F PHVI++ AGED+++K++SFSQ PRA+CI+S G
Sbjct: 138 GTG--RKQRLANLGEWMNTSAGLAFAPHVISIGAGEDIAAKVLSFSQQRPRALCIMSGTG 195
Query: 188 AISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLG 247
IS+VTL + ++ +TYEG FEI+S GS+L++E G RSRTGGLSVSLS PDG ++
Sbjct: 196 TISSVTLCKPGSTDRHLTYEGPFEIISFGGSYLVNEEGGSRSRTGGLSVSLSRPDGSIIA 255
Query: 248 GSVAGLLTAATPVQVVVGSFLADGRKESKSSH 279
G V +L AA VQVV SF+ R ++ +++
Sbjct: 256 GGV-DMLIAANLVQVVACSFVYGARAKTHNNN 286
>gi|222635485|gb|EEE65617.1| hypothetical protein OsJ_21176 [Oryza sativa Japonica Group]
Length = 354
Score = 187 bits (474), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 174/381 (45%), Positives = 213/381 (55%), Gaps = 73/381 (19%)
Query: 4 SETAVNVMTSQPPASIQSMRLAFSADGTAVYKPITATSPTYQPSGAG--------GDGAI 55
S ++ T+ PPA +RLA++ DG AVYK T P YQ A G+G
Sbjct: 5 SAACLSRRTAPPPA----VRLAYTHDGIAVYK-HTPPPPVYQTPAAVAAPSPPVRGNGGA 59
Query: 56 PQAQGLNVMNMGSGSEPMKRKRGRPRKYGPDGTMSLALV--------------------- 94
P + +E KRKRGRPRKY + LA+V
Sbjct: 60 P-----------ASAEQHKRKRGRPRKYAVT-DVPLAVVPPSPPKAAAAAGASAAQSPAT 107
Query: 95 ----PSPSSVTTATGGTGSGLSSPGGGPLSPDSI--KKSRGRPPGSGSGKKHQLE----- 143
P SS A GG + +P P + + K RGRPPGSG+ ++ + +
Sbjct: 108 PTLPPGFSSGLAAYGGAAASQPAPRQAPPASGRVLPHKKRGRPPGSGNKQQQRPQHKKAA 167
Query: 144 ALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGT 203
A GS+ +G P VITV+ GEDV S++MSF++NG AVC+LSANGA+SN+TLRQA +SG T
Sbjct: 168 APGSSVIGLKPSVITVQVGEDVVSRVMSFTKNG-WAVCVLSANGAVSNMTLRQAGSSGAT 226
Query: 204 -VTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQV 262
V YEG FEILSLSGS+LLSES G SR GGLSVSL+GPDGRVLGG VAG L AATPVQV
Sbjct: 227 TVNYEGHFEILSLSGSYLLSESVGLSSRAGGLSVSLAGPDGRVLGGGVAGPLNAATPVQV 286
Query: 263 VVGSFLADGRKESKSSHRMESLPVPPKLAPGGQPA-GQCSPPSRGTLSESSGGPGSPLNH 321
V+GSFLAD +K K + P G P G +P SRGT S SSGGPGSPLN
Sbjct: 287 VIGSFLADVKKGHKQA------------MPSGAPYPGVSTPTSRGTPSGSSGGPGSPLNQ 334
Query: 322 STGACNNNHLPQGMATGIPWK 342
S N Q +A PW+
Sbjct: 335 SASGSFNTSNQQALAD-FPWR 354
>gi|125555140|gb|EAZ00746.1| hypothetical protein OsI_22774 [Oryza sativa Indica Group]
Length = 373
Score = 187 bits (474), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 172/375 (45%), Positives = 211/375 (56%), Gaps = 70/375 (18%)
Query: 11 MTSQPP-ASIQSMRLAFSADGTAVYKPITATSPTYQPSGAG--------GDGAIPQAQGL 61
+ QPP + ++RLA++ DG AVYK T P YQ A G+G P
Sbjct: 26 LQPQPPHGAAAAVRLAYTHDGIAVYK-HTPPPPVYQTPAAVAAPSPPVRGNGGAP----- 79
Query: 62 NVMNMGSGSEPMKRKRGRPRKYGPDGTMSLALV-------------------------PS 96
+ +E KRKRGRPRKY + LA+V P
Sbjct: 80 ------ASAEQHKRKRGRPRKYAVT-DVPLAVVPPSPPKAAAAAGAGAAQSPATPTLPPG 132
Query: 97 PSSVTTATGGTGSGLSSPGGGPLSPDSI--KKSRGRPPGSGSGKKHQLE-----ALGSAG 149
SS A GG + +P P + + K RGRPPGSG+ ++ + + A GS+
Sbjct: 133 FSSGLAAYGGAAASQPAPRQAPPASGRVLPHKKRGRPPGSGNKQQQRPQHKKAAAPGSSV 192
Query: 150 VGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGT-VTYEG 208
+G P VITV+ GEDV S++MSF++NG AVC+LSANGA+SN+TLRQA +SG T V YEG
Sbjct: 193 IGLKPSVITVQVGEDVVSRVMSFTKNG-WAVCVLSANGAVSNMTLRQAGSSGATTVNYEG 251
Query: 209 RFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFL 268
FEILSLSGS+LLSES G SR GGLSVSL+GPDGRVLGG VAG L AATPVQVV+GSFL
Sbjct: 252 HFEILSLSGSYLLSESVGLSSRAGGLSVSLAGPDGRVLGGGVAGPLNAATPVQVVIGSFL 311
Query: 269 ADGRKESKSSHRMESLPVPPKLAPGGQPA-GQCSPPSRGTLSESSGGPGSPLNHSTGACN 327
AD +K K + P G P G +P SRGT S SSGGPGSPLN S
Sbjct: 312 ADVKKGHKQA------------MPSGAPYPGVSTPTSRGTPSGSSGGPGSPLNQSASGSF 359
Query: 328 NNHLPQGMATGIPWK 342
N Q +A PW+
Sbjct: 360 NTSNQQALAD-FPWR 373
>gi|115467856|ref|NP_001057527.1| Os06g0326000 [Oryza sativa Japonica Group]
gi|50725730|dbj|BAD33241.1| putative AT-hook DNA-binding protein [Oryza sativa Japonica Group]
gi|113595567|dbj|BAF19441.1| Os06g0326000 [Oryza sativa Japonica Group]
Length = 378
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 172/375 (45%), Positives = 211/375 (56%), Gaps = 70/375 (18%)
Query: 11 MTSQPP-ASIQSMRLAFSADGTAVYKPITATSPTYQPSGAG--------GDGAIPQAQGL 61
+ QPP + ++RLA++ DG AVYK T P YQ A G+G P
Sbjct: 31 LQPQPPHGAAAAVRLAYTHDGIAVYK-HTPPPPVYQTPAAVAAPSPPVRGNGGAP----- 84
Query: 62 NVMNMGSGSEPMKRKRGRPRKYGPDGTMSLALV-------------------------PS 96
+ +E KRKRGRPRKY + LA+V P
Sbjct: 85 ------ASAEQHKRKRGRPRKYAVT-DVPLAVVPPSPPKAAAAAGASAAQSPATPTLPPG 137
Query: 97 PSSVTTATGGTGSGLSSPGGGPLSPDSI--KKSRGRPPGSGSGKKHQLE-----ALGSAG 149
SS A GG + +P P + + K RGRPPGSG+ ++ + + A GS+
Sbjct: 138 FSSGLAAYGGAAASQPAPRQAPPASGRVLPHKKRGRPPGSGNKQQQRPQHKKAAAPGSSV 197
Query: 150 VGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGT-VTYEG 208
+G P VITV+ GEDV S++MSF++NG AVC+LSANGA+SN+TLRQA +SG T V YEG
Sbjct: 198 IGLKPSVITVQVGEDVVSRVMSFTKNG-WAVCVLSANGAVSNMTLRQAGSSGATTVNYEG 256
Query: 209 RFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFL 268
FEILSLSGS+LLSES G SR GGLSVSL+GPDGRVLGG VAG L AATPVQVV+GSFL
Sbjct: 257 HFEILSLSGSYLLSESVGLSSRAGGLSVSLAGPDGRVLGGGVAGPLNAATPVQVVIGSFL 316
Query: 269 ADGRKESKSSHRMESLPVPPKLAPGGQPA-GQCSPPSRGTLSESSGGPGSPLNHSTGACN 327
AD +K K + P G P G +P SRGT S SSGGPGSPLN S
Sbjct: 317 ADVKKGHKQA------------MPSGAPYPGVSTPTSRGTPSGSSGGPGSPLNQSASGSF 364
Query: 328 NNHLPQGMATGIPWK 342
N Q +A PW+
Sbjct: 365 NTSNQQALAD-FPWR 378
>gi|297793789|ref|XP_002864779.1| hypothetical protein ARALYDRAFT_496402 [Arabidopsis lyrata subsp.
lyrata]
gi|297310614|gb|EFH41038.1| hypothetical protein ARALYDRAFT_496402 [Arabidopsis lyrata subsp.
lyrata]
Length = 399
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 107/228 (46%), Positives = 143/228 (62%), Gaps = 13/228 (5%)
Query: 68 SGSEPMKRKRGRPRKYGPDGTMSL-----ALVPSPSSVTTATGGTGS---GLSSPGGGPL 119
+GS+P K+KRGRPRKY PDG+++ L P+P S + G G + PL
Sbjct: 70 TGSDPTKKKRGRPRKYAPDGSLNPRFSRPTLSPTPISSSIPLSGDYQWKRGKAQQQHQPL 129
Query: 120 SPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRA 179
+ +KKS GS + G FT H TV AGEDV+ K+M +SQ G RA
Sbjct: 130 --EFVKKSHKFEYGSPAPTPPPPGLSCYVGANFTTHQFTVNAGEDVTMKVMPYSQQGSRA 187
Query: 180 VCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLS 239
+CILSA G+ISNVTL Q +GGT+TYEGRFEILSLSGSF+ +E+ G + RTGG+S+SL+
Sbjct: 188 ICILSATGSISNVTLGQPTNAGGTLTYEGRFEILSLSGSFMPTENGGTKGRTGGMSISLA 247
Query: 240 GPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHR---MESL 284
GP+G++ GG +AG+L AA PVQVV+GSF+ + E + ME+
Sbjct: 248 GPNGKIFGGGLAGMLIAAGPVQVVMGSFIVMHQAEQNQKKKPRVMEAF 295
>gi|168040997|ref|XP_001772979.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675712|gb|EDQ62204.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 170
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 105/172 (61%), Positives = 129/172 (75%), Gaps = 18/172 (10%)
Query: 107 TGSGLS--SPGGG-PLSPDSI------------KKSRGRPPGSGSGKKHQLEALGSAGVG 151
TG+GL+ PGGG P+ P + ++ RGRP GSG KK QL AL +G G
Sbjct: 1 TGAGLTPGVPGGGFPVLPSLLPGPSSSPYSSPDRRGRGRPLGSG--KKQQLAALAGSGQG 58
Query: 152 FTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFE 211
FTPH++TV GEDV++KIM F+Q+GPRA C+LSANGAISNVT RQ ++SGGTVTYEGRFE
Sbjct: 59 FTPHILTVNTGEDVATKIMQFAQHGPRATCVLSANGAISNVTFRQQSSSGGTVTYEGRFE 118
Query: 212 ILSLSGSFLLSE-SSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQV 262
ILSLSGS+L ++ G R RTGGLSVSL+G DG V+GG VAG+LTAA+P+QV
Sbjct: 119 ILSLSGSYLPTDLGGGARQRTGGLSVSLAGIDGSVIGGGVAGMLTAASPIQV 170
>gi|357138571|ref|XP_003570864.1| PREDICTED: uncharacterized protein LOC100828198 [Brachypodium
distachyon]
Length = 374
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 111/232 (47%), Positives = 141/232 (60%), Gaps = 39/232 (16%)
Query: 74 KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPG 133
KRKRGRPRKYGPDG + L +P S + G G +P + +K+ GRP G
Sbjct: 52 KRKRGRPRKYGPDGGLVRPLKATPISASVPDDDGGGGRYTPAAAVGA--VMKRGGGRPVG 109
Query: 134 SGS---------------------------------GKKH---QLEALGSA-GVGFTPHV 156
S ++H Q + +G A G F PH+
Sbjct: 110 FVSRAAPVVPVTAAAPTAVVVVSPPPPPPAAANVQTHQQHGPPQGDLVGCASGANFMPHI 169
Query: 157 ITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLS 216
+ V AGED++ K++SFSQ GP+A+CILSANG ISNVTLRQ + GGTVTYEGRFE+LSLS
Sbjct: 170 LNVAAGEDINMKVISFSQQGPKAICILSANGLISNVTLRQHDSLGGTVTYEGRFELLSLS 229
Query: 217 GSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFL 268
GSF +++ G R R+GG+SVSL+ DGRV+GG VAGLL AA+PVQVVVGSF+
Sbjct: 230 GSFTPTDNGGTRDRSGGMSVSLAAADGRVIGGGVAGLLVAASPVQVVVGSFV 281
>gi|449495813|ref|XP_004159952.1| PREDICTED: uncharacterized protein LOC101224467 [Cucumis sativus]
Length = 343
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 122/256 (47%), Positives = 156/256 (60%), Gaps = 35/256 (13%)
Query: 57 QAQGLNVMNMGSGSEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGG 116
+ G+NV + EP+K+KRGRPRKY PDG +SL L P + G
Sbjct: 76 RGMGINVSAGVNSGEPVKKKRGRPRKYAPDGQVSLGLSPMSA-----------------G 118
Query: 117 GPLSP--DSIKKSRGRPPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKI 169
L+P +S R R GSG+K QL LG SAG+ F PHVI V AGED+ +K+
Sbjct: 119 SKLTPGSNSSTPRRRRGRPPGSGRKQQLALLGDWMNNSAGLAFAPHVIHVGAGEDIVAKV 178
Query: 170 MSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRS 229
+SF+Q PRAVC+LS NG +S+VTLRQ A++G +VTYEG F+IL LSGS+L++E G RS
Sbjct: 179 LSFAQQRPRAVCVLSGNGTVSSVTLRQPASTGVSVTYEGHFQILCLSGSYLVAEDGGPRS 238
Query: 230 RTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESK----------SSH 279
RTGG+SVSL+ PDG V+GG VA +LTAA PVQVVV SF+ + ++K S H
Sbjct: 239 RTGGISVSLASPDGHVIGGGVA-VLTAAGPVQVVVCSFVYGPKIKNKQVAGPKSNDGSGH 297
Query: 280 RMESLPVPPKLAPGGQ 295
V P AP Q
Sbjct: 298 EHHDNLVSPTSAPSTQ 313
>gi|297837037|ref|XP_002886400.1| hypothetical protein ARALYDRAFT_315069 [Arabidopsis lyrata subsp.
lyrata]
gi|297332241|gb|EFH62659.1| hypothetical protein ARALYDRAFT_315069 [Arabidopsis lyrata subsp.
lyrata]
Length = 780
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 104/208 (50%), Positives = 142/208 (68%), Gaps = 22/208 (10%)
Query: 71 EPM-KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRG 129
+PM K+KRGRPRKY PDG +SL L P P + + P++ K++RG
Sbjct: 461 QPMVKKKRGRPRKYAPDGQVSLGLSPMPCVSKKSKDSSSMS---------DPNAPKRARG 511
Query: 130 RPPGSGSGKKHQLEALG---------SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAV 180
RPPG+G +K +L LG SAG+ F PHVI+V +GED+ SK++SFSQ RA+
Sbjct: 512 RPPGTG--RKQRLANLGEISSEWMNTSAGLAFAPHVISVGSGEDIVSKVLSFSQKRSRAL 569
Query: 181 CILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSG 240
CI+S G +S+VTLR+ A++ ++T+EGRFEILSL GS+L++E G +SRTGGLSVSLSG
Sbjct: 570 CIMSGTGTVSSVTLREPASTTPSLTFEGRFEILSLGGSYLVNEEGGSKSRTGGLSVSLSG 629
Query: 241 PDGRVLGGSVAGLLTAATPVQVVVGSFL 268
P+G V+GG + G+L AA+ VQVV SF+
Sbjct: 630 PEGHVIGGGI-GMLIAASLVQVVACSFV 656
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 106/222 (47%), Positives = 142/222 (63%), Gaps = 21/222 (9%)
Query: 73 MKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPP 132
+K+KRGRPRKY DG +SL L P P + + P++ K++RGRPP
Sbjct: 102 VKKKRGRPRKYVADGQVSLGLSPVPCVSNKSKDSSSMS---------DPNAPKRARGRPP 152
Query: 133 GSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANG 187
G+G +K +L LG SAG+ F PHVI+V AGED+ SKI+SFSQ PRA+CI+S G
Sbjct: 153 GTG--RKQRLANLGEWMNTSAGLAFAPHVISVGAGEDIVSKILSFSQQRPRALCIMSGTG 210
Query: 188 AISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLG 247
IS+ TL + A++ ++T+EGR+EILS GS+L++E G RSRTGGLSVSLSG DGR++
Sbjct: 211 TISSATLCEPASTAPSITFEGRYEILSFGGSYLVNEEGGSRSRTGGLSVSLSGSDGRIIA 270
Query: 248 GSVAGLLTAATPVQVVVGSFL----ADGRKESKSSHRMESLP 285
G V G+L AA+ VQVV SF+ A + + R E P
Sbjct: 271 GGV-GMLIAASLVQVVACSFVYGASAKSHNNNNKTIRQEKEP 311
>gi|297793791|ref|XP_002864780.1| hypothetical protein ARALYDRAFT_496402 [Arabidopsis lyrata subsp.
lyrata]
gi|297310615|gb|EFH41039.1| hypothetical protein ARALYDRAFT_496402 [Arabidopsis lyrata subsp.
lyrata]
Length = 771
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 104/210 (49%), Positives = 137/210 (65%), Gaps = 10/210 (4%)
Query: 68 SGSEPMKRKRGRPRKYGPDGTMSL-----ALVPSPSSVTTATGGT---GSGLSSPGGGPL 119
+GS+P K+KRGRPRKY PDG+++ L P+P S + G G + PL
Sbjct: 70 TGSDPTKKKRGRPRKYAPDGSLNPRFSRPTLSPTPISSSIPLSGDYQWKRGKAQQQHQPL 129
Query: 120 SPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRA 179
+ +KKS GS + G FT H TV AGEDV+ K+M +SQ G RA
Sbjct: 130 --EFVKKSHKFEYGSPAPTPPPPGLSCYVGANFTTHQFTVNAGEDVTMKVMPYSQQGSRA 187
Query: 180 VCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLS 239
+CILSA G+ISNVTL Q +GGT+TYEGRFEILSLSGSF+ +E+ G + RTGG+S+SL+
Sbjct: 188 ICILSATGSISNVTLGQPTNAGGTLTYEGRFEILSLSGSFMPTENGGTKGRTGGMSISLA 247
Query: 240 GPDGRVLGGSVAGLLTAATPVQVVVGSFLA 269
GP+G++ GG +AG+L AA PVQVV+GSF+
Sbjct: 248 GPNGKIFGGGLAGMLIAAGPVQVVMGSFIV 277
>gi|242078017|ref|XP_002443777.1| hypothetical protein SORBIDRAFT_07g001760 [Sorghum bicolor]
gi|241940127|gb|EES13272.1| hypothetical protein SORBIDRAFT_07g001760 [Sorghum bicolor]
Length = 363
Score = 185 bits (469), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 119/239 (49%), Positives = 153/239 (64%), Gaps = 34/239 (14%)
Query: 75 RKRGRPRKYGPDGTMSLALVPSPSSV-----TTATGGTGSGLSSPGGGPLSPDSIKKSRG 129
+KRGRPRKYGPDG++ L +P S T G + S+ G ++K+ RG
Sbjct: 62 KKRGRPRKYGPDGSLIRPLNATPISASAPMPTAVAPGQYTPASAVGA------AMKRGRG 115
Query: 130 RPPGSGSGKKHQLEALG----------------------SAGVGFTPHVITVKAGEDVSS 167
RP + Q + SAG FTPH+ITV GEDV+
Sbjct: 116 RPLDFAAAAAKQQQQQQQHHHQHHHLQHPNVLAGDMVACSAGANFTPHIITVAPGEDVTM 175
Query: 168 KIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQ 227
K++SFSQ GPRA+CILSANG ISNVTLRQ +SGGT+TYEGRFE+LSLSGSF+ +E++G
Sbjct: 176 KVISFSQQGPRAICILSANGVISNVTLRQPDSSGGTLTYEGRFELLSLSGSFMPTENNGT 235
Query: 228 RSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSH-RMESLP 285
RSR+GG+SVSL+ PDGRV+GG VAGLL AA+PVQ+VVGSFL + E K+ R+++ P
Sbjct: 236 RSRSGGMSVSLASPDGRVVGGGVAGLLVAASPVQIVVGSFLPSYQMEQKNKKPRVDAAP 294
>gi|110289623|gb|ABG66282.1| AT-hook protein 1, putative, expressed [Oryza sativa Japonica
Group]
gi|215765047|dbj|BAG86744.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 200
Score = 185 bits (469), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 104/154 (67%), Positives = 120/154 (77%), Gaps = 2/154 (1%)
Query: 170 MSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRS 229
M+FSQ GPR VCILSANGAISNVTLRQ ATSGG VTYEGRFEI+SLSGSFLL+E RS
Sbjct: 1 MAFSQQGPRTVCILSANGAISNVTLRQPATSGGLVTYEGRFEIISLSGSFLLAEDGDTRS 60
Query: 230 RTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKES-KSSHRMESLPVPP 288
RTGGLSV+L+G DGRVLGG VAG+L AATPVQVVV SF+A+G+K + ++E + PP
Sbjct: 61 RTGGLSVALAGSDGRVLGGCVAGMLMAATPVQVVVASFIAEGKKSKPVETRKVEPMSAPP 120
Query: 289 KLAPGGQPAGQCSPPSRGTLSESSGGPGSPLNHS 322
++A PA SPPS GT S SS GSP+NHS
Sbjct: 121 QMA-TYVPAPVASPPSEGTSSGSSDDSGSPINHS 153
>gi|6850898|emb|CAB71061.1| putative DNA-binding protein [Arabidopsis thaliana]
Length = 348
Score = 184 bits (466), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 109/207 (52%), Positives = 141/207 (68%), Gaps = 27/207 (13%)
Query: 68 SGSEPMKRKRGRPRKYGPDG-TMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKK 126
SG +KRKRGRPRKYG DG ++SLAL PS S+V SP+S K+
Sbjct: 89 SGDTSVKRKRGRPRKYGQDGGSVSLALSPSISNV-------------------SPNSNKR 129
Query: 127 SRGRPPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVC 181
RGRPPGSG KK +L ++G S G+ FTPHVI V GED++SK++SFS GPRA+C
Sbjct: 130 GRGRPPGSG--KKQRLSSIGEMMPSSTGMSFTPHVIVVSIGEDIASKVISFSHQGPRAIC 187
Query: 182 ILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGP 241
+LSA+GA+S TL Q A S GT+ YEG FE++SLS S+L + + +RTG L+VSL+ P
Sbjct: 188 VLSASGAVSTATLLQPAPSHGTIIYEGLFELISLSTSYLNTTDNDYPNRTGSLAVSLASP 247
Query: 242 DGRVLGGSVAGLLTAATPVQVVVGSFL 268
DGRV+GG + G L AA+ VQV+VGSF+
Sbjct: 248 DGRVIGGGIGGPLIAASQVQVIVGSFI 274
>gi|30695388|ref|NP_191690.2| AT hook motif DNA-binding family protein [Arabidopsis thaliana]
gi|22136014|gb|AAM91589.1| putative DNA-binding protein [Arabidopsis thaliana]
gi|31711840|gb|AAP68276.1| At3g61310 [Arabidopsis thaliana]
gi|119657366|tpd|FAA00282.1| TPA: AT-hook motif nuclear localized protein 11 [Arabidopsis
thaliana]
gi|332646665|gb|AEE80186.1| AT hook motif DNA-binding family protein [Arabidopsis thaliana]
Length = 354
Score = 183 bits (465), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 109/207 (52%), Positives = 141/207 (68%), Gaps = 27/207 (13%)
Query: 68 SGSEPMKRKRGRPRKYGPDG-TMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKK 126
SG +KRKRGRPRKYG DG ++SLAL PS S+V SP+S K+
Sbjct: 95 SGDTSVKRKRGRPRKYGQDGGSVSLALSPSISNV-------------------SPNSNKR 135
Query: 127 SRGRPPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVC 181
RGRPPGSG KK +L ++G S G+ FTPHVI V GED++SK++SFS GPRA+C
Sbjct: 136 GRGRPPGSG--KKQRLSSIGEMMPSSTGMSFTPHVIVVSIGEDIASKVISFSHQGPRAIC 193
Query: 182 ILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGP 241
+LSA+GA+S TL Q A S GT+ YEG FE++SLS S+L + + +RTG L+VSL+ P
Sbjct: 194 VLSASGAVSTATLLQPAPSHGTIIYEGLFELISLSTSYLNTTDNDYPNRTGSLAVSLASP 253
Query: 242 DGRVLGGSVAGLLTAATPVQVVVGSFL 268
DGRV+GG + G L AA+ VQV+VGSF+
Sbjct: 254 DGRVIGGGIGGPLIAASQVQVIVGSFI 280
>gi|326514846|dbj|BAJ99784.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 393
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 116/224 (51%), Positives = 149/224 (66%), Gaps = 24/224 (10%)
Query: 75 RKRGRPRKYGPDGTMSLALVPSPSSVTT-ATGGTGSGLSSPGGGPLSPDSIKKSRGRPP- 132
+KRGRPRKYGPDG++ L +P S + + +G +P + ++K+ RGRP
Sbjct: 89 KKRGRPRKYGPDGSLIQPLNATPISASAPMSAAVAAGQYTPAAAVGA--AMKRGRGRPLD 146
Query: 133 -GSGSGKKHQLEALG-------------------SAGVGFTPHVITVKAGEDVSSKIMSF 172
+ + K + + SAG FTPH+ITV GEDV+ K++SF
Sbjct: 147 FAAAAAKPYHHQLQQPQQQQFGFHFSSIGDMVACSAGGNFTPHIITVAPGEDVTMKVISF 206
Query: 173 SQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTG 232
SQ GPRA+CILSANG ISNVTLRQ +SGGT+TYEGRFE+LSLSGSF+ +E+SG RSR+G
Sbjct: 207 SQQGPRAICILSANGVISNVTLRQPDSSGGTLTYEGRFELLSLSGSFMPTENSGARSRSG 266
Query: 233 GLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESK 276
G+SVSL+ PDGRV+GG VAGLL AA+PVQ+VVGSFL E K
Sbjct: 267 GMSVSLASPDGRVVGGGVAGLLVAASPVQIVVGSFLPSYLMEPK 310
>gi|2213536|emb|CAA67290.1| DNA-binding protein PD1 [Pisum sativum]
gi|119657408|tpd|FAA00303.1| TPA: AT-hook motif nuclear localized protein 1 [Pisum sativum]
Length = 347
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 110/217 (50%), Positives = 144/217 (66%), Gaps = 21/217 (9%)
Query: 70 SEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRG 129
SEP+K+KRGRPRKYGPDG++SL L P +S+P + + RG
Sbjct: 91 SEPVKKKRGRPRKYGPDGSVSLKLTP---------------MSAPANSTQDSGTPSEKRG 135
Query: 130 RPPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
R GSG+K QL ALG SAG+ F+PHVIT+ AGED+++K++ SQ PRA+CILS
Sbjct: 136 RGRPRGSGRKQQLAALGDWMTSSAGLAFSPHVITIAAGEDIAAKLLLLSQQRPRALCILS 195
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
G S VTLRQ A++ VTYEG+F+ILSLSGS+L+SE G +RTGG+SVSLS DG
Sbjct: 196 GTGIASKVTLRQPASTNAGVTYEGKFQILSLSGSYLVSEDGGPTNRTGGISVSLSSRDGH 255
Query: 245 VLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRM 281
V+GGSVA +L A +P+Q+VV SF+ G + K+ M
Sbjct: 256 VIGGSVA-MLIAGSPIQLVVCSFVYGGGSKVKTKQGM 291
>gi|255640322|gb|ACU20449.1| unknown [Glycine max]
Length = 231
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 100/181 (55%), Positives = 125/181 (69%), Gaps = 9/181 (4%)
Query: 138 KKHQLEALG-SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQ 196
KK ++ G S G F PH+ITV GED++ K++SFSQ GPRA+CILSA+G ISNVTLRQ
Sbjct: 7 KKVGVDLFGDSVGTNFMPHIITVNTGEDITMKVISFSQQGPRAICILSASGVISNVTLRQ 66
Query: 197 AATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTA 256
+SGGT+TYEGRFEILSLSGSF+ +++ G RSR+GG+SVSLS PDGR++GG VAGLL A
Sbjct: 67 PDSSGGTLTYEGRFEILSLSGSFMPTDNQGTRSRSGGMSVSLSSPDGRIVGGGVAGLLVA 126
Query: 257 ATPVQVVVGSFLADGRKESKSSHRMESLP---VPPKLAPGGQPAGQCSPPSRGTLSESSG 313
A PVQVVVGSFL + ++ K V P +A P PP+ G + G
Sbjct: 127 AGPVQVVVGSFLPNNPQDKKPKKPKSDYAPANVTPSIAVSSAP-----PPTNGEKEDVMG 181
Query: 314 G 314
G
Sbjct: 182 G 182
>gi|2213534|emb|CAA67291.1| DNA-binding PD1-like protein [Pisum sativum]
Length = 334
Score = 181 bits (460), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 110/217 (50%), Positives = 144/217 (66%), Gaps = 21/217 (9%)
Query: 70 SEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRG 129
SEP+K+KRGRPRKYGPDG++SL L P +S+P + + RG
Sbjct: 91 SEPVKKKRGRPRKYGPDGSVSLKLSP---------------MSAPANSTQDSGTPSEKRG 135
Query: 130 RPPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
R GSG+K QL ALG SAG+ F+PHVIT+ AGED+++K++ SQ PRA+CILS
Sbjct: 136 RGRPRGSGRKQQLAALGDWMTSSAGLAFSPHVITIAAGEDIAAKLLLLSQQRPRALCILS 195
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
G S VTLRQ A++ VTYEG+F+ILSLSGS+L+SE G +RTGG+SVSLS DG
Sbjct: 196 GTGIASKVTLRQPASTNAGVTYEGKFQILSLSGSYLVSEDGGPTNRTGGISVSLSSRDGH 255
Query: 245 VLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRM 281
V+GGSVA +L A +P+Q+VV SF+ G + K+ M
Sbjct: 256 VIGGSVA-MLIAGSPIQLVVCSFVYGGGSKVKTKQGM 291
>gi|357139520|ref|XP_003571329.1| PREDICTED: uncharacterized protein LOC100824915 [Brachypodium
distachyon]
Length = 397
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 119/234 (50%), Positives = 149/234 (63%), Gaps = 32/234 (13%)
Query: 75 RKRGRPRKYGPDGTM--------------SLALVPSPSSVTTATGGTGSGLSSPGGGPLS 120
+KRGRPRKYGPDG++ LA SP T A+ + G PL
Sbjct: 76 KKRGRPRKYGPDGSLIRPLNATPISASAPMLAAAVSPGQYTPASAVGAAMKRGRGSRPLD 135
Query: 121 PDSIKKSRGRP-------------PGSGSG---KKHQLEAL--GSAGVGFTPHVITVKAG 162
S + +P S SG + H++ + SAG FTPH+ITV G
Sbjct: 136 FSSSTAAMAKPYHHYQQPPPPQADSSSSSGFPLRLHRVSDMVACSAGGNFTPHIITVAPG 195
Query: 163 EDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLS 222
EDV+ K++SFSQ GPRA+CILSANG ISNVTLRQ +SGGT+TYEGRFE+LSLSGSF+ +
Sbjct: 196 EDVTMKVISFSQQGPRAICILSANGVISNVTLRQPDSSGGTLTYEGRFELLSLSGSFMPT 255
Query: 223 ESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESK 276
ES+G RSR+GG+SVSL+ PDGRV+GG VAGLL AA+PVQ+VVG+FL + E K
Sbjct: 256 ESNGARSRSGGMSVSLASPDGRVVGGGVAGLLVAASPVQIVVGTFLPSYQMEQK 309
>gi|359807562|ref|NP_001240898.1| uncharacterized protein LOC100793726 [Glycine max]
gi|255644376|gb|ACU22693.1| unknown [Glycine max]
Length = 264
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 111/196 (56%), Positives = 137/196 (69%), Gaps = 22/196 (11%)
Query: 73 MKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPP 132
+K+KRGRPRKYGPDG++++AL P P S + P S D RG+
Sbjct: 63 VKKKRGRPRKYGPDGSVTMALSPMPISSS---------------APPSNDFSSGKRGKMR 107
Query: 133 GSGS--GKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSA 185
G KK L+ +G S G F PH+ITV AGED++ K++SFSQ GPRA+CILSA
Sbjct: 108 GMDYKPSKKVGLDYIGDLNVCSDGTNFMPHIITVNAGEDITMKVISFSQQGPRAICILSA 167
Query: 186 NGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRV 245
NG ISNVTLRQ +SGGT+TYEGRFEILSLSGSF+ +++ G RSRTGG+SVSL+ PDGRV
Sbjct: 168 NGVISNVTLRQPDSSGGTLTYEGRFEILSLSGSFMPTDNQGTRSRTGGMSVSLASPDGRV 227
Query: 246 LGGSVAGLLTAATPVQ 261
+GG VAGLL AA+PVQ
Sbjct: 228 VGGGVAGLLVAASPVQ 243
>gi|168012741|ref|XP_001759060.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689759|gb|EDQ76129.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 519
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 120/212 (56%), Positives = 146/212 (68%), Gaps = 35/212 (16%)
Query: 73 MKRKRGRPRKYG-------PDGTMSL--ALVPSPSSVTTATGGTGSGLSSPGGGPLSPDS 123
+KRKRGRPRK+ P G + AL+P SS P +P S
Sbjct: 228 LKRKRGRPRKFSTGESSPIPSGAYPVFPALMPGSSS------------------PYTP-S 268
Query: 124 IKKSRGRPPGSGSGKKHQLEALGSA----GVGFTPHVITVKAGEDVSSKIMSFSQNGPRA 179
K+ RGR SGK QL ALG G GFTPH++TV GEDV++KIM F+Q+GPRA
Sbjct: 269 EKRGRGR--SQFSGKNQQLAALGVVLAGTGQGFTPHILTVNTGEDVATKIMQFAQHGPRA 326
Query: 180 VCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSE-SSGQRSRTGGLSVSL 238
+C+LSANGAISNVTLRQ ++SGGTVTYEGR+EILSLSGS+L ++ G R RTGGLSVSL
Sbjct: 327 MCVLSANGAISNVTLRQQSSSGGTVTYEGRYEILSLSGSYLPTDLGGGARQRTGGLSVSL 386
Query: 239 SGPDGRVLGGSVAGLLTAATPVQVVVGSFLAD 270
+G DG V+GG VAG+LTAA+P+QVVVGSFL+D
Sbjct: 387 AGIDGGVIGGGVAGMLTAASPIQVVVGSFLSD 418
>gi|79544830|ref|NP_201032.2| AT hook motif DNA-binding protein [Arabidopsis thaliana]
gi|8809639|dbj|BAA97190.1| unnamed protein product [Arabidopsis thaliana]
gi|26451694|dbj|BAC42942.1| unknown protein [Arabidopsis thaliana]
gi|28973553|gb|AAO64101.1| unknown protein [Arabidopsis thaliana]
gi|119657356|tpd|FAA00277.1| TPA: AT-hook motif nuclear localized protein 6 [Arabidopsis
thaliana]
gi|332010204|gb|AED97587.1| AT hook motif DNA-binding protein [Arabidopsis thaliana]
Length = 404
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 105/236 (44%), Positives = 140/236 (59%), Gaps = 29/236 (12%)
Query: 68 SGSEPMKRKRGRPRKYGPDGTMS---LALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSI 124
+GS+P K+KRGRPRKY PDG+++ L SP+ +++ S P G
Sbjct: 70 TGSDPTKKKRGRPRKYAPDGSLNPRFLRPTLSPTPISS---------SIPLSGDYQWKRG 120
Query: 125 KKSRGRPPGSGSGKKHQLEALGS-------------AGVGFTPHVITVKAGEDVSSKIMS 171
K + P K H+ E GS G FT H TV GEDV+ K+M
Sbjct: 121 KAQQQHQPLEFVKKSHKFE-YGSPAPTPPLPGLSCYVGANFTTHQFTVNGGEDVTMKVMP 179
Query: 172 FSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRT 231
+SQ G RA+CILSA G+ISNVTL Q +GGT+TYEGRFEILSLSGSF+ +E+ G + R
Sbjct: 180 YSQQGSRAICILSATGSISNVTLGQPTNAGGTLTYEGRFEILSLSGSFMPTENGGTKGRA 239
Query: 232 GGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHR---MESL 284
GG+S+SL+GP+G + GG +AG+L AA PVQVV+GSF+ + E + ME+
Sbjct: 240 GGMSISLAGPNGNIFGGGLAGMLIAAGPVQVVMGSFIVMHQAEQNQKKKPRVMEAF 295
>gi|15242131|ref|NP_199972.1| AT hook motif DNA-binding family protein [Arabidopsis thaliana]
gi|9758201|dbj|BAB08675.1| unnamed protein product [Arabidopsis thaliana]
gi|119657352|tpd|FAA00275.1| TPA: AT-hook motif nuclear localized protein 4 [Arabidopsis
thaliana]
gi|225879112|dbj|BAH30626.1| hypothetical protein [Arabidopsis thaliana]
gi|332008718|gb|AED96101.1| AT hook motif DNA-binding family protein [Arabidopsis thaliana]
Length = 419
Score = 178 bits (451), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 88/118 (74%), Positives = 101/118 (85%)
Query: 152 FTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFE 211
FTPHV+TV AGEDV+ KIM+FSQ G RA+CILSANG ISNVTLRQ+ TSGGT+TYEG FE
Sbjct: 178 FTPHVLTVNAGEDVTMKIMTFSQQGSRAICILSANGPISNVTLRQSMTSGGTLTYEGHFE 237
Query: 212 ILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLA 269
ILSL+GSF+ SES G RSR GG+SVSL+G DGRV GG +AGL AA PVQV+VGSF+A
Sbjct: 238 ILSLTGSFIPSESGGTRSRAGGMSVSLAGQDGRVFGGGLAGLFIAAGPVQVMVGSFIA 295
>gi|357477009|ref|XP_003608790.1| hypothetical protein MTR_4g101990 [Medicago truncatula]
gi|355509845|gb|AES90987.1| hypothetical protein MTR_4g101990 [Medicago truncatula]
Length = 332
Score = 178 bits (451), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 119/223 (53%), Positives = 149/223 (66%), Gaps = 25/223 (11%)
Query: 72 PMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSP----DSIKKS 127
P K+KRGRPRKY DG+++ AL P P S PL P + K++
Sbjct: 70 PEKKKRGRPRKYAADGSVTAALSPKPIS---------------SSAPLPPVIDFTAEKRA 114
Query: 128 RGRPPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCI 182
+ +P S S +LE +G S G FTPH+ITV AGEDV+ K++SFSQ GPRAVCI
Sbjct: 115 KVKPVSSVSKANFELENIGEWVPCSVGSNFTPHIITVNAGEDVTMKVISFSQQGPRAVCI 174
Query: 183 LSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSE-SSGQRSRTGGLSVSLSGP 241
LSANG I +VTLRQ +SGGT+TYEG FEILSLSGSF+ +E G RSR+GG+SVSL+ P
Sbjct: 175 LSANGVIKSVTLRQPDSSGGTLTYEGLFEILSLSGSFMPNESGGGTRSRSGGMSVSLASP 234
Query: 242 DGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESL 284
DGRV+GG VAGLL AA+PVQVVVGSF+A + E K ++ +
Sbjct: 235 DGRVVGGGVAGLLVAASPVQVVVGSFMAGNQHEQKPRNQKHDV 277
>gi|357520457|ref|XP_003630517.1| AT-hook motif nuclear localized protein [Medicago truncatula]
gi|355524539|gb|AET04993.1| AT-hook motif nuclear localized protein [Medicago truncatula]
Length = 351
Score = 178 bits (451), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 110/218 (50%), Positives = 147/218 (67%), Gaps = 24/218 (11%)
Query: 70 SEPMKRKRGRPRKYGPDGTMSLALVPS--PSSVTTATGGTGSGLSSPGGGPLSPDSIKKS 127
S+P+K+KRGRPRKYGPDG++SL L P+ P+ T T S +
Sbjct: 93 SDPVKKKRGRPRKYGPDGSVSLKLSPTSAPAKSTQEDSTTPS----------------EK 136
Query: 128 RGRPPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCI 182
RGR GSG+K QL ALG SAG+ F+PHVIT+ GED+++K++S SQ PRA+CI
Sbjct: 137 RGRGRPRGSGRKQQLAALGDWMTSSAGLAFSPHVITIGVGEDIAAKLLSLSQQRPRALCI 196
Query: 183 LSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPD 242
LS NG +++VTLRQ A++ VTYEG+F+ILSLSGS+L++E SG +RTGG+SVSLS D
Sbjct: 197 LSGNGIVTSVTLRQPASTNIGVTYEGKFQILSLSGSYLVAEDSGPSNRTGGISVSLSSRD 256
Query: 243 GRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHR 280
G V+GGSVA L+ A + +QVVV SF+ G + K+
Sbjct: 257 GHVIGGSVAKLI-AGSLIQVVVCSFVYGGGSKVKTKQE 293
>gi|2894604|emb|CAA17138.1| putative protein [Arabidopsis thaliana]
gi|7268547|emb|CAB78797.1| putative protein [Arabidopsis thaliana]
Length = 455
Score = 177 bits (448), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 120/270 (44%), Positives = 158/270 (58%), Gaps = 36/270 (13%)
Query: 64 MNMGSGSEPMKRKRGRPRKYGPD--------GTMSLAL-----VPSPSSVTTATGGTGSG 110
M G + +K+KRGRPRKY D ++L L +PS S+ G G
Sbjct: 121 MRFGIDHQQVKKKRGRPRKYAADGGGGGGGGSNIALGLAPTSPLPSASNSYGGGNEGGGG 180
Query: 111 LSSPGGGPLSPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIM 170
S G S D P + + + G+ GVGFTPHVI VK GED+++KI+
Sbjct: 181 GDSAGANANSSD---------PPAKRNRGRPPGSGGTGGVGFTPHVIEVKTGEDIATKIL 231
Query: 171 SFSQNGPRAVCILSANGAISNVTLRQAATSG--GTVTYEGRFEILSLSGSFLLSESSGQR 228
+F+ GPRA+CILSA GA++NV LRQA S GTV YEGRFEI+SLSGSFL SES+G
Sbjct: 232 AFTNQGPRAICILSATGAVTNVMLRQANNSNPTGTVKYEGRFEIISLSGSFLNSESNGTV 291
Query: 229 SRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVPP 288
++TG LSVSL+G +GR++GG V G+L A + VQV+VGSF+ DGRK+ +S+ R ++ P P
Sbjct: 292 TKTGNLSVSLAGHEGRIVGGCVDGMLVAGSQVQVIVGSFVPDGRKQKQSAGRAQNTPEP- 350
Query: 289 KLAPGGQPAGQCSPPSRGTLSESSGGPGSP 318
S P+ GGPGSP
Sbjct: 351 -----------ASAPANMLSFGGVGGPGSP 369
>gi|224124924|ref|XP_002329847.1| predicted protein [Populus trichocarpa]
gi|222871084|gb|EEF08215.1| predicted protein [Populus trichocarpa]
Length = 297
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 102/155 (65%), Positives = 123/155 (79%), Gaps = 6/155 (3%)
Query: 136 SGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAIS 190
S K++LE LG S G FTPH+ITV AGEDV+ K++SFSQ GPRA+CILSANG IS
Sbjct: 91 SKAKYELENLGEWVACSVGANFTPHIITVNAGEDVTMKVISFSQQGPRAICILSANGVIS 150
Query: 191 NVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSV 250
+VTLRQ +SGGT+TYEGRFEILSLSGSF+ +E+ G RSR+GG+SVSL+ PDGRV+GG V
Sbjct: 151 SVTLRQPDSSGGTLTYEGRFEILSLSGSFMPTETGGTRSRSGGMSVSLASPDGRVVGGGV 210
Query: 251 AGLLTAATPVQVVVGSFLADGRKESK-SSHRMESL 284
AGLL AA+PVQVVVGSFLA + E K + +SL
Sbjct: 211 AGLLVAASPVQVVVGSFLAGNQHEQKPKKQKHDSL 245
>gi|357504087|ref|XP_003622332.1| DNA-binding protein [Medicago truncatula]
gi|355497347|gb|AES78550.1| DNA-binding protein [Medicago truncatula]
Length = 340
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 120/220 (54%), Positives = 144/220 (65%), Gaps = 13/220 (5%)
Query: 73 MKRKRGRPRKYGPDGT----MSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSR 128
+K+KRGRPRKYGPDG AL P P S + G S G P+ +S+KKS
Sbjct: 48 LKKKRGRPRKYGPDGKPAPGAVTALSPMPISSSIPLTGEFSAWKRGRGKPV--ESMKKSS 105
Query: 129 GR-----PPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCIL 183
+ PP G S G FT +V+TV +GEDV+ KIMS SQ G RA+CIL
Sbjct: 106 FKFDFESPPVQVVGGGVSEGIAYSVGANFTAYVLTVNSGEDVTMKIMS-SQQGSRAICIL 164
Query: 184 SANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDG 243
SA G ISNVTLRQ+ +SGGT+TYEGRFEILSLSGSF+ +E+ RSR+GG+SVSL+GPDG
Sbjct: 165 SATGTISNVTLRQSTSSGGTLTYEGRFEILSLSGSFMPTENGITRSRSGGMSVSLAGPDG 224
Query: 244 RVLGGSVAGLLTAATPVQVVVGSFLADGRKESKS-SHRME 282
RVLGG +AGLL A+ PVQVVVGSFL E S R+E
Sbjct: 225 RVLGGGLAGLLIASGPVQVVVGSFLPGHHLEHNSKKQRVE 264
>gi|357507279|ref|XP_003623928.1| hypothetical protein MTR_7g077120 [Medicago truncatula]
gi|355498943|gb|AES80146.1| hypothetical protein MTR_7g077120 [Medicago truncatula]
Length = 346
Score = 174 bits (440), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 110/218 (50%), Positives = 152/218 (69%), Gaps = 21/218 (9%)
Query: 71 EPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGR 130
E +KRKRGRPRKYG D +SLAL SPS ++ GT + GGP K+ RGR
Sbjct: 86 ETVKRKRGRPRKYGADRVVSLAL--SPSPTPSSNPGTMTQ-----GGP------KRGRGR 132
Query: 131 PPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSA 185
PPGSG KK QL + G SAG GF PHVI + +GED+++KI++FSQ RA+C+LS+
Sbjct: 133 PPGSG--KKQQLASFGELMSGSAGTGFIPHVIEIASGEDIAAKILTFSQVRARALCVLSS 190
Query: 186 NGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRV 245
+G++S+V +R+ + SGGT+ YEG F I+S+SG ++ +E+ R+R GGLS+SL GPDGR+
Sbjct: 191 SGSVSSVIIREPSISGGTLKYEGHFHIMSMSGCYVPTENGSSRNRDGGLSISLLGPDGRL 250
Query: 246 LGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMES 283
GG+V G L AA+PVQV++GSFL GR ++K+ + S
Sbjct: 251 FGGAVGGPLVAASPVQVMIGSFLW-GRLKAKNKKKESS 287
>gi|297810159|ref|XP_002872963.1| hypothetical protein ARALYDRAFT_490548 [Arabidopsis lyrata subsp.
lyrata]
gi|297318800|gb|EFH49222.1| hypothetical protein ARALYDRAFT_490548 [Arabidopsis lyrata subsp.
lyrata]
Length = 279
Score = 173 bits (439), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 125/260 (48%), Positives = 159/260 (61%), Gaps = 26/260 (10%)
Query: 74 KRKRGRPRKYGP-DGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPP 132
K++RGRPRKYG +GT P PSS T L G L+ +IK +
Sbjct: 27 KKRRGRPRKYGEANGT------PLPSSSTPL-------LKKRAKGKLNGFAIKMHKTI-- 71
Query: 133 GSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNV 192
+ S + G AG FTPH+ITV GED++ +I+SFSQ GPRA+CILSANG ISNV
Sbjct: 72 -NSSATGERFGVGGGAGSNFTPHIITVHTGEDITMRIISFSQQGPRAICILSANGVISNV 130
Query: 193 TLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAG 252
TLR + GGT+TYEGRFEILSLSGSF+ +E+ G R R+GG+SVSL+GPDGRV+GG VAG
Sbjct: 131 TLRHPESCGGTLTYEGRFEILSLSGSFMETENQGSRGRSGGMSVSLAGPDGRVVGGGVAG 190
Query: 253 LLTAATPVQVVVGSFLADGRKESK--SSHRME-------SLPVPPKLAPGGQPAGQCSPP 303
LL AATP+QVVVGSF+ +++ + R+E SLP PP + PP
Sbjct: 191 LLIAATPIQVVVGSFITSDQQDHQIPRKQRVEHTPPTVTSLPPPPASVFSSTNPEREQPP 250
Query: 304 SRGTLSESSGGPGSPLNHST 323
S +S + G P N +T
Sbjct: 251 SSFGISSWTNGQDMPRNSAT 270
>gi|226503753|ref|NP_001140867.1| uncharacterized protein LOC100272943 [Zea mays]
gi|194701518|gb|ACF84843.1| unknown [Zea mays]
gi|195609746|gb|ACG26703.1| DNA-binding protein [Zea mays]
gi|413921421|gb|AFW61353.1| DNA-binding protein [Zea mays]
Length = 391
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 112/228 (49%), Positives = 145/228 (63%), Gaps = 27/228 (11%)
Query: 75 RKRGRPRKYGPDGTMSLALVPSPSSVTTAT-GGTGSGLSSPGGGPLSPDSIKKSRGRP-- 131
+KRGRPRKYGPDG++ L +P S + G +P + ++K+ RGRP
Sbjct: 79 KKRGRPRKYGPDGSLIRPLNATPISASAPLPAAVAPGHYTPASAVGA--AMKRGRGRPLD 136
Query: 132 -----------------PGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKI 169
+++G SAG FTPH+ITV GEDV +K+
Sbjct: 137 FAAAAAKQHQQHHHQLYQHQQQQFGFHFDSIGDMGACSAGANFTPHIITVAPGEDVMTKV 196
Query: 170 MSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRS 229
+SFSQ GPRA+C+LSANG IS VTL Q +SGGT+TYEGRFE+LSLSGSF+ +E+ G RS
Sbjct: 197 ISFSQQGPRAICVLSANGVISTVTLCQPDSSGGTLTYEGRFELLSLSGSFMPTENGGTRS 256
Query: 230 RTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKS 277
R+GG+SVSL+ PDGRV+GG VAGLL AA+PVQ+VVGSFL + E K+
Sbjct: 257 RSGGMSVSLASPDGRVVGGGVAGLLVAASPVQIVVGSFLPSYQMEQKN 304
>gi|225463960|ref|XP_002270792.1| PREDICTED: uncharacterized protein LOC100261576 [Vitis vinifera]
gi|296087886|emb|CBI35169.3| unnamed protein product [Vitis vinifera]
Length = 357
Score = 168 bits (425), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 108/215 (50%), Positives = 136/215 (63%), Gaps = 26/215 (12%)
Query: 71 EPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGR 130
E ++RKRGRPRKYG L+ SPSS P KK +G
Sbjct: 74 ETVRRKRGRPRKYG-TSEQGLSAKKSPSSSV-------------------PVPKKKEQGL 113
Query: 131 PPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAIS 190
GS KK QL +LG+AG FTPHVITV +GEDV+ KIM F Q R +CI+SA+G+IS
Sbjct: 114 ---GGSSKKSQLVSLGNAGQSFTPHVITVASGEDVAQKIMFFMQQSKREICIMSASGSIS 170
Query: 191 NVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSV 250
N +LRQ ATSGG V YEGRFEILSL+GS++ +E G RTGGLSV LS DG ++GG V
Sbjct: 171 NASLRQPATSGGNVAYEGRFEILSLTGSYVRTEIGG---RTGGLSVCLSNTDGEIIGGGV 227
Query: 251 AGLLTAATPVQVVVGSFLADGRKESKSSHRMESLP 285
G L AA PVQV+VG+FL D +K++ + + ++ P
Sbjct: 228 GGPLKAAGPVQVIVGTFLVDSKKDTSTGLKADASP 262
>gi|147835652|emb|CAN72947.1| hypothetical protein VITISV_034305 [Vitis vinifera]
Length = 285
Score = 167 bits (424), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 108/215 (50%), Positives = 136/215 (63%), Gaps = 26/215 (12%)
Query: 71 EPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGR 130
E ++RKRGRPRKYG L+ SPSS P KK +G
Sbjct: 29 ETVRRKRGRPRKYGTS-EQGLSAKKSPSSSV-------------------PVPKKKEQGL 68
Query: 131 PPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAIS 190
GS KK QL +LG+AG FTPHVITV +GEDV+ KIM F Q R +CI+SA+G+IS
Sbjct: 69 ---GGSSKKSQLVSLGNAGQSFTPHVITVASGEDVAQKIMFFMQQSKREICIMSASGSIS 125
Query: 191 NVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSV 250
N +LRQ ATSGG V YEGRFEILSL+GS++ +E G RTGGLSV LS DG ++GG V
Sbjct: 126 NASLRQPATSGGNVAYEGRFEILSLTGSYVRTEIGG---RTGGLSVCLSNTDGEIIGGGV 182
Query: 251 AGLLTAATPVQVVVGSFLADGRKESKSSHRMESLP 285
G L AA PVQV+VG+FL D +K++ + + ++ P
Sbjct: 183 GGPLKAAGPVQVIVGTFLVDSKKDTSTGLKADASP 217
>gi|145339839|ref|NP_191931.2| AT hook motif DNA-binding family protein [Arabidopsis thaliana]
gi|66792610|gb|AAY56407.1| At4g00200 [Arabidopsis thaliana]
gi|110737183|dbj|BAF00540.1| putative transcription factor [Arabidopsis thaliana]
gi|332656437|gb|AEE81837.1| AT hook motif DNA-binding family protein [Arabidopsis thaliana]
Length = 318
Score = 167 bits (423), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 111/228 (48%), Positives = 140/228 (61%), Gaps = 41/228 (17%)
Query: 74 KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPG 133
K++RGRPRKY +G P PSS P K+ RG+ G
Sbjct: 56 KKRRGRPRKYEANG------APLPSSSV-------------------PLVKKRVRGKLNG 90
Query: 134 SGSGKKHQLEALGSAGV--------------GFTPHVITVKAGEDVSSKIMSFSQNGPRA 179
K H+ S+G FTPHVITV GED++ +I+SFSQ GPRA
Sbjct: 91 FDMKKMHKTIGFHSSGERFGVGGGVGGGVGSNFTPHVITVNTGEDITMRIISFSQQGPRA 150
Query: 180 VCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLS 239
+CILSANG ISNVTLRQ + GGT+TYEGRFEILSLSGSF+ +E+ G + R+GG+SVSL+
Sbjct: 151 ICILSANGVISNVTLRQPDSCGGTLTYEGRFEILSLSGSFMETENQGSKGRSGGMSVSLA 210
Query: 240 GPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKS--SHRMESLP 285
GPDGRV+GG VAGLL AATP+QVVVGSF+ +++ + R+E P
Sbjct: 211 GPDGRVVGGGVAGLLIAATPIQVVVGSFITSDQQDHQKPRKQRVEHAP 258
>gi|357481887|ref|XP_003611229.1| hypothetical protein MTR_5g011680 [Medicago truncatula]
gi|355512564|gb|AES94187.1| hypothetical protein MTR_5g011680 [Medicago truncatula]
Length = 288
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 103/239 (43%), Positives = 131/239 (54%), Gaps = 28/239 (11%)
Query: 72 PMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGR- 130
P K+KRGRPRKY PDG++SLA+ P P+S + L +PG L+ + S G
Sbjct: 44 PAKKKRGRPRKYRPDGSLSLAIPPKPTSSSIGEAAKFE-LENPGSRMLNYVVVSSSLGNE 102
Query: 131 ----------------------PPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSK 168
PP S +G QL A S FTPH+I V AGEDV K
Sbjct: 103 QSEQMLKTQENEVTPTSTPTAAPPVSTAG---QLPA-SSVSATFTPHIIIVNAGEDVPMK 158
Query: 169 IMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQR 228
IMSF Q GP A+CIL NG IS V + + +S TYE ++EI +LSGSF+ E G+R
Sbjct: 159 IMSFCQQGPEAICILYVNGVISKVVISRPQSSRTLFTYEVKYEIRTLSGSFMPKEKCGRR 218
Query: 229 SRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVP 287
S +GG+SVSL G V+GG VAG L AA+PV VVVGSFL ++ + E + P
Sbjct: 219 SISGGMSVSLVDLHGHVVGGRVAGPLVAASPVNVVVGSFLPSEHEQKLKTQNNEVISTP 277
>gi|17979309|gb|AAL49880.1| putative DNA-binding protein [Arabidopsis thaliana]
Length = 355
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 99/222 (44%), Positives = 136/222 (61%), Gaps = 32/222 (14%)
Query: 73 MKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPP 132
+KRKRGRPRKYG + + SP S P+ K++RGRPP
Sbjct: 92 VKRKRGRPRKYGEPMVSNKSRDSSPMS--------------------DPNEPKRARGRPP 131
Query: 133 GSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANG 187
G+G +K +L LG SAG+ F PHVI++ AGED+++K++SFSQ PRA+CI+S G
Sbjct: 132 GTG--RKQRLANLGEWMNTSAGLAFAPHVISIGAGEDIAAKVLSFSQQRPRALCIMSGTG 189
Query: 188 AISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLG 247
IS+VTL + ++ +TYEG FEI+S GS+L++E G RSRTGGLSVSLS PDG ++
Sbjct: 190 TISSVTLCKPGSTDRHLTYEGPFEIISFGGSYLVNEEGGSRSRTGGLSVSLSRPDGSIIA 249
Query: 248 GSVAGLLTAATPVQVVVGSFLADGRKESKSSH----RMESLP 285
G V +L AA VQVV SF+ R ++ +++ R E P
Sbjct: 250 GGV-DMLIAANLVQVVACSFVYGARAKTHNNNNKTIRQEKEP 290
>gi|115484183|ref|NP_001065753.1| Os11g0149100 [Oryza sativa Japonica Group]
gi|62701672|gb|AAX92745.1| expressed protein [Oryza sativa Japonica Group]
gi|77548692|gb|ABA91489.1| expressed protein [Oryza sativa Japonica Group]
gi|113644457|dbj|BAF27598.1| Os11g0149100 [Oryza sativa Japonica Group]
Length = 366
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 97/156 (62%), Positives = 118/156 (75%), Gaps = 8/156 (5%)
Query: 126 KSRGRPPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAV 180
+ RGRP GSG++ L LG SAG FTPHVI V GEDV+ +IMSFSQ GPR++
Sbjct: 108 RRRGRP--KGSGRRQILATLGEWYALSAGGSFTPHVIIVGTGEDVAGRIMSFSQKGPRSI 165
Query: 181 CILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSG 240
CILSANG ISNV L Q +SG T TYEGRFEIL L+GSF ++E G+R RTGGLSVSL+G
Sbjct: 166 CILSANGTISNVALSQPGSSGSTFTYEGRFEILQLTGSFTMAEEGGRR-RTGGLSVSLAG 224
Query: 241 PDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESK 276
PDGRV+GG VAG+L AA+P+QV+VGSFL + K+ +
Sbjct: 225 PDGRVVGGVVAGMLRAASPIQVIVGSFLPNSLKQHQ 260
>gi|22330402|ref|NP_176537.2| AT hook motif DNA-binding family protein [Arabidopsis thaliana]
gi|20466009|gb|AAM20226.1| putative DNA-binding protein [Arabidopsis thaliana]
gi|119657368|tpd|FAA00283.1| TPA: AT-hook motif nuclear localized protein 12 [Arabidopsis
thaliana]
gi|332195983|gb|AEE34104.1| AT hook motif DNA-binding family protein [Arabidopsis thaliana]
Length = 361
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 99/222 (44%), Positives = 136/222 (61%), Gaps = 32/222 (14%)
Query: 73 MKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPP 132
+KRKRGRPRKYG + + SP S P+ K++RGRPP
Sbjct: 98 VKRKRGRPRKYGEPMVSNKSRDSSPMS--------------------DPNEPKRARGRPP 137
Query: 133 GSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANG 187
G+G +K +L LG SAG+ F PHVI++ AGED+++K++SFSQ PRA+CI+S G
Sbjct: 138 GTG--RKQRLANLGEWMNTSAGLAFAPHVISIGAGEDIAAKVLSFSQQRPRALCIMSGTG 195
Query: 188 AISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLG 247
IS+VTL + ++ +TYEG FEI+S GS+L++E G RSRTGGLSVSLS PDG ++
Sbjct: 196 TISSVTLCKPGSTDRHLTYEGPFEIISFGGSYLVNEEGGSRSRTGGLSVSLSRPDGSIIA 255
Query: 248 GSVAGLLTAATPVQVVVGSFLADGRKESKSSH----RMESLP 285
G V +L AA VQVV SF+ R ++ +++ R E P
Sbjct: 256 GGV-DMLIAANLVQVVACSFVYGARAKTHNNNNKTIRQEKEP 296
>gi|140052431|gb|ABE80131.2| HMG-I and HMG-Y, DNA-binding [Medicago truncatula]
Length = 270
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 99/213 (46%), Positives = 125/213 (58%), Gaps = 31/213 (14%)
Query: 74 KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPG 133
K+KRGRPRKY DG ++ PS + T L+SP G LS + +GR
Sbjct: 65 KKKRGRPRKYDADGNLN----PSYKKIVKTTTPI---LTSPPGFTLSTNEFASKKGRGKS 117
Query: 134 SGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGA 188
+G + G +A V F PHV+TV AGEDV KI+SF+Q PR +CILSANGA
Sbjct: 118 TGFVNYQTFSSFGEVFPSTAAVDFAPHVVTVYAGEDVGGKILSFAQKSPRGICILSANGA 177
Query: 189 ISNVTLRQAATSGGT-------------------VTYEGRFEILSLSGSFLLSESSGQRS 229
IS V L Q ++G V +GRFEILSLSGS+ S++SG R+
Sbjct: 178 ISKVALGQPGSTGVNSKKQCNGKAYHRQCPLAREVVTQGRFEILSLSGSYTASDNSGIRT 237
Query: 230 RTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQV 262
R GGLSVSL+GPDGRV+GG+VAG+L AA P+QV
Sbjct: 238 REGGLSVSLAGPDGRVIGGAVAGVLIAAGPIQV 270
>gi|346703416|emb|CBX25513.1| hypothetical_protein [Oryza glaberrima]
Length = 366
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 97/158 (61%), Positives = 118/158 (74%), Gaps = 10/158 (6%)
Query: 126 KSRGRPPGSGSGKKHQLEALG-------SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPR 178
+ RGRP GSG++ L LG SAG FTPHVI V GEDV+ +IMSFSQ GPR
Sbjct: 106 RRRGRP--KGSGRRQILATLGQGEWYALSAGGSFTPHVIIVGTGEDVAGRIMSFSQKGPR 163
Query: 179 AVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSL 238
++CILSANG ISNV L Q +SG T TYEGRFEIL L+GSF ++E G+R RTGGLSVSL
Sbjct: 164 SICILSANGTISNVALSQPGSSGSTFTYEGRFEILQLTGSFTMAEEGGRR-RTGGLSVSL 222
Query: 239 SGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESK 276
+GPDGRV+GG VAG+L AA+P+QV+VGSFL + K+ +
Sbjct: 223 AGPDGRVVGGVVAGMLRAASPIQVIVGSFLPNSLKQHQ 260
>gi|255539687|ref|XP_002510908.1| DNA binding protein, putative [Ricinus communis]
gi|223550023|gb|EEF51510.1| DNA binding protein, putative [Ricinus communis]
Length = 198
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 81/125 (64%), Positives = 102/125 (81%), Gaps = 2/125 (1%)
Query: 168 KIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQ 227
K+MSFSQ G RA+CILSANG ISNVTLRQ +SGGT+TYEGRFEILSLSGS++ +S G
Sbjct: 2 KVMSFSQQGARAICILSANGTISNVTLRQPTSSGGTLTYEGRFEILSLSGSYMPIDSGGT 61
Query: 228 RSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVP 287
+SR+GG+S+SL+GPDGRV+GG +AGLL AA PVQVVVGSFL ++E K H+ + + +P
Sbjct: 62 KSRSGGMSISLAGPDGRVVGGGLAGLLVAAGPVQVVVGSFLPGHQQEQK--HKKQRIELP 119
Query: 288 PKLAP 292
P + P
Sbjct: 120 PAVTP 124
>gi|242067421|ref|XP_002448987.1| hypothetical protein SORBIDRAFT_05g002940 [Sorghum bicolor]
gi|241934830|gb|EES07975.1| hypothetical protein SORBIDRAFT_05g002940 [Sorghum bicolor]
Length = 362
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 92/141 (65%), Positives = 108/141 (76%), Gaps = 2/141 (1%)
Query: 147 SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTY 206
SAG FTPHVI V GEDV+++IMSFSQ GPR+VCILSANG ISNVTLRQ SG T TY
Sbjct: 141 SAGGSFTPHVIIVGTGEDVAARIMSFSQKGPRSVCILSANGTISNVTLRQPDASGSTFTY 200
Query: 207 EGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGS 266
EGRFEIL L GSF ++E R RTGGLSVSL+GPDGRV+GG VAG+L AA+P+QV+VGS
Sbjct: 201 EGRFEILQLMGSFTMAEEG--RRRTGGLSVSLAGPDGRVVGGVVAGMLRAASPIQVIVGS 258
Query: 267 FLADGRKESKSSHRMESLPVP 287
FL + K+ + M+ P P
Sbjct: 259 FLPNSLKQHQRRMSMQQQPSP 279
>gi|212721472|ref|NP_001131540.1| hypothetical protein [Zea mays]
gi|194691798|gb|ACF79983.1| unknown [Zea mays]
gi|413935384|gb|AFW69935.1| hypothetical protein ZEAMMB73_977343 [Zea mays]
Length = 265
Score = 161 bits (407), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 91/145 (62%), Positives = 113/145 (77%), Gaps = 3/145 (2%)
Query: 147 SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTY 206
++G FTPH+I V AGEDVS K++SFSQ GPRA+CILSANG I+NVTLRQ + GGTVTY
Sbjct: 54 ASGANFTPHIINVAAGEDVSMKVISFSQQGPRAICILSANGVIANVTLRQQDSLGGTVTY 113
Query: 207 EGRFEILSLSGSFLLSE-SSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVG 265
EGRFE+LSLSGSF ++ G RSR+GG+SVSL+ DGRV+GG VAGLL AA+PVQVVVG
Sbjct: 114 EGRFELLSLSGSFTPTDSGGGTRSRSGGMSVSLAAADGRVIGGGVAGLLVAASPVQVVVG 173
Query: 266 SFLADGRKESKSSHR--MESLPVPP 288
SFL + + ++ + +E VPP
Sbjct: 174 SFLPSYQLDQGANKKPVIEITTVPP 198
>gi|414589836|tpg|DAA40407.1| TPA: hypothetical protein ZEAMMB73_591820 [Zea mays]
Length = 268
Score = 160 bits (405), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 113/201 (56%), Positives = 126/201 (62%), Gaps = 54/201 (26%)
Query: 163 EDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYE--------------- 207
+DVS+KIMSFSQ+G RAVCILSANGAISNVTLRQ+ATSGGTVTYE
Sbjct: 29 DDVSAKIMSFSQHGTRAVCILSANGAISNVTLRQSATSGGTVTYEVRILNATSYEYRVHF 88
Query: 208 -------------------------------------GRFEILSLSGSFLLSESSGQRSR 230
GRFEILSLSGSFLLSE+ GQRSR
Sbjct: 89 DTDSQLEYFTARYTGTAIQKSDLTDVYCLYRESSLSLGRFEILSLSGSFLLSENGGQRSR 148
Query: 231 TGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADG--RKESKSSHRMESLPVPP 288
TGGLSVSL+GPDGRVLGG VAGLLTAA+PVQ+VVGSF A G + + + ++ P P
Sbjct: 149 TGGLSVSLAGPDGRVLGGCVAGLLTAASPVQIVVGSFDAGGKKQPKQQQQQQLAPSPAPL 208
Query: 289 KLAPGGQPAGQCSPPSRGTLS 309
LAP G AG SPPSRGTLS
Sbjct: 209 NLAPTGVAAGPSSPPSRGTLS 229
>gi|356506003|ref|XP_003521778.1| PREDICTED: uncharacterized protein LOC100809675 [Glycine max]
Length = 346
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 108/231 (46%), Positives = 137/231 (59%), Gaps = 30/231 (12%)
Query: 52 DGAIPQAQGLNVMN-------MGSGSEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTAT 104
+G +P A G +++ + S EP KRKRGRPRKYG P +
Sbjct: 36 NGLLPNADGSHILYPHSVASAVSSQLEPAKRKRGRPRKYG---------TPEQALAAKKA 86
Query: 105 GGTGSGLSSPGGGPLSPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGED 164
T S S P SP S KK ALG+AG GFTPHVI+V AGED
Sbjct: 87 ATTLSHSFSVDKKPHSPTF-----------PSSKKSHSFALGNAGQGFTPHVISVAAGED 135
Query: 165 VSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSES 224
V KIM F Q R +CILSA+G+ISN +LRQ ATSGG++ YEGRFEI+SL+GS++ +E
Sbjct: 136 VGQKIMLFMQQSRREMCILSASGSISNASLRQPATSGGSIAYEGRFEIISLTGSYVRNEL 195
Query: 225 SGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKES 275
+RTGGLSV LS DG+++GG V G L AA PVQV+VG+F D +K++
Sbjct: 196 G---TRTGGLSVCLSNTDGQIIGGGVGGPLKAAGPVQVIVGTFFIDNKKDT 243
>gi|413921420|gb|AFW61352.1| hypothetical protein ZEAMMB73_404625 [Zea mays]
Length = 298
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 105/215 (48%), Positives = 136/215 (63%), Gaps = 27/215 (12%)
Query: 75 RKRGRPRKYGPDGTMSLALVPSPSSVTTAT-GGTGSGLSSPGGGPLSPDSIKKSRGRP-- 131
+KRGRPRKYGPDG++ L +P S + G +P + ++K+ RGRP
Sbjct: 79 KKRGRPRKYGPDGSLIRPLNATPISASAPLPAAVAPGHYTPASAVGA--AMKRGRGRPLD 136
Query: 132 -----------------PGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKI 169
+++G SAG FTPH+ITV GEDV +K+
Sbjct: 137 FAAAAAKQHQQHHHQLYQHQQQQFGFHFDSIGDMGACSAGANFTPHIITVAPGEDVMTKV 196
Query: 170 MSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRS 229
+SFSQ GPRA+C+LSANG IS VTL Q +SGGT+TYEGRFE+LSLSGSF+ +E+ G RS
Sbjct: 197 ISFSQQGPRAICVLSANGVISTVTLCQPDSSGGTLTYEGRFELLSLSGSFMPTENGGTRS 256
Query: 230 RTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVV 264
R+GG+SVSL+ PDGRV+GG VAGLL AA+PVQV +
Sbjct: 257 RSGGMSVSLASPDGRVVGGGVAGLLVAASPVQVCI 291
>gi|357160917|ref|XP_003578918.1| PREDICTED: uncharacterized protein LOC100823323 [Brachypodium
distachyon]
Length = 388
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 90/167 (53%), Positives = 111/167 (66%), Gaps = 13/167 (7%)
Query: 147 SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQ-AATSGGTVT 205
SAG FTPHVI V GEDV ++IMS SQ GPR+VCILSANG ISNV + Q + SG TVT
Sbjct: 156 SAGGSFTPHVIIVPRGEDVVTRIMSCSQKGPRSVCILSANGTISNVAINQPGSASGDTVT 215
Query: 206 YEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVG 265
+EG FEIL L+GSF ++E R RTGGLSVSL+ PDGRV GG VAG+L A TP+QV++G
Sbjct: 216 FEGLFEILQLTGSFTMAEEG--RRRTGGLSVSLAHPDGRVFGGVVAGMLRAGTPIQVILG 273
Query: 266 SFLADGRKESKSSHRME-------SLPV---PPKLAPGGQPAGQCSP 302
SFL + K+ + + +LPV PP + P Q +P
Sbjct: 274 SFLPNSLKQHQRRMGLNQQPSTVPALPVIAAPPPVLTAAMPVSQAAP 320
>gi|219362695|ref|NP_001137004.1| DNA binding protein [Zea mays]
gi|195639104|gb|ACG39020.1| DNA binding protein [Zea mays]
gi|224034497|gb|ACN36324.1| unknown [Zea mays]
gi|413924870|gb|AFW64802.1| DNA binding protein [Zea mays]
Length = 353
Score = 158 bits (399), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 89/130 (68%), Positives = 105/130 (80%), Gaps = 2/130 (1%)
Query: 147 SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTY 206
SAG FTPHVI V GEDV+++IMSFSQ GPR+VCILSANG+ISNVTLRQ SG T TY
Sbjct: 128 SAGGSFTPHVIIVGTGEDVAARIMSFSQKGPRSVCILSANGSISNVTLRQPDASGSTFTY 187
Query: 207 EGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGS 266
EGRFEIL L GSF ++E R RTGGLSVSL+GPDGRV+GG VAG+L AA+P+QV+VGS
Sbjct: 188 EGRFEILQLMGSFTMAEEG--RRRTGGLSVSLAGPDGRVVGGVVAGMLRAASPIQVIVGS 245
Query: 267 FLADGRKESK 276
FL + K+ +
Sbjct: 246 FLPNSLKQHQ 255
>gi|194697936|gb|ACF83052.1| unknown [Zea mays]
gi|413924871|gb|AFW64803.1| hypothetical protein ZEAMMB73_859441 [Zea mays]
Length = 351
Score = 157 bits (398), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 89/130 (68%), Positives = 105/130 (80%), Gaps = 2/130 (1%)
Query: 147 SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTY 206
SAG FTPHVI V GEDV+++IMSFSQ GPR+VCILSANG+ISNVTLRQ SG T TY
Sbjct: 126 SAGGSFTPHVIIVGTGEDVAARIMSFSQKGPRSVCILSANGSISNVTLRQPDASGSTFTY 185
Query: 207 EGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGS 266
EGRFEIL L GSF ++E R RTGGLSVSL+GPDGRV+GG VAG+L AA+P+QV+VGS
Sbjct: 186 EGRFEILQLMGSFTMAEEG--RRRTGGLSVSLAGPDGRVVGGVVAGMLRAASPIQVIVGS 243
Query: 267 FLADGRKESK 276
FL + K+ +
Sbjct: 244 FLPNSLKQHQ 253
>gi|357481879|ref|XP_003611225.1| AT-hook protein [Medicago truncatula]
gi|355512560|gb|AES94183.1| AT-hook protein [Medicago truncatula]
Length = 720
Score = 157 bits (398), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 92/209 (44%), Positives = 128/209 (61%), Gaps = 10/209 (4%)
Query: 72 PMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRP 131
P K+KRGRPRKY PDG++SLA+ P P S + L +P G ++ D +++
Sbjct: 44 PAKKKRGRPRKYRPDGSLSLAIPPKPKSSSIGEAAKFE-LENPVGAIVNLDPHEEAIEDK 102
Query: 132 PGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISN 191
++H++ S G FTP +ITV +GE+++ K+MSF Q GP A+CILSANG IS+
Sbjct: 103 TQHSQEREHKV----SEGTTFTPRIITVNSGENIAMKVMSFCQQGPEAICILSANGVISS 158
Query: 192 VTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVA 251
T+ Q ++ TYEG++E +SLSGS S SR+ G+SVSL+G G V+GG VA
Sbjct: 159 ATISQPQSAEKLSTYEGKYENISLSGS-----SMPNGSRSVGMSVSLAGLYGHVVGGCVA 213
Query: 252 GLLTAATPVQVVVGSFLADGRKESKSSHR 280
L A+PV VVV SFLA+ + E K R
Sbjct: 214 CPLVGASPVNVVVSSFLANEQSEQKLRTR 242
>gi|224103017|ref|XP_002312891.1| predicted protein [Populus trichocarpa]
gi|222849299|gb|EEE86846.1| predicted protein [Populus trichocarpa]
Length = 349
Score = 157 bits (397), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 97/224 (43%), Positives = 130/224 (58%), Gaps = 22/224 (9%)
Query: 73 MKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPP 132
+KRKRGRPRKY + + SP +++ + K+ RGRP
Sbjct: 62 LKRKRGRPRKYDAGANLVSSPPLSPPPGLSSSLSSCE---------------KRVRGRP- 105
Query: 133 GSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANG 187
GSGK L +LG +AG FTPHV+ V GED+ +K++ FSQ G RAVCILSA G
Sbjct: 106 -RGSGKLQLLASLGGFAAETAGGSFTPHVVPVHTGEDIVTKLLVFSQKGARAVCILSATG 164
Query: 188 AISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLG 247
+S+V +RQ +SGG + Y+G FEILSLSGSF S++ G + G LS+SL+ P+GRV G
Sbjct: 165 VVSSVIMRQPGSSGGILRYDGPFEILSLSGSFTFSKTGGSNRKNGMLSISLAKPNGRVFG 224
Query: 248 GSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVPPKLA 291
G VAG L AA P+Q+++ SF + KE K + P LA
Sbjct: 225 GGVAGSLIAAGPIQLIIASFKQNIGKEIKRRQSADPPTAPSLLA 268
>gi|125533398|gb|EAY79946.1| hypothetical protein OsI_35110 [Oryza sativa Indica Group]
gi|125576224|gb|EAZ17446.1| hypothetical protein OsJ_32974 [Oryza sativa Japonica Group]
Length = 337
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 87/130 (66%), Positives = 105/130 (80%), Gaps = 1/130 (0%)
Query: 147 SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTY 206
SAG FTPHVI V GEDV+ +IMSFSQ GPR++CILSANG ISNV L Q +SG T TY
Sbjct: 103 SAGGSFTPHVIIVGTGEDVAGRIMSFSQKGPRSICILSANGTISNVALSQPGSSGSTFTY 162
Query: 207 EGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGS 266
EGRFEIL L+GSF ++E G+R RTGGLSVSL+GPDGRV+GG VAG+L AA+P+QV+VGS
Sbjct: 163 EGRFEILQLTGSFTMAEEGGRR-RTGGLSVSLAGPDGRVVGGVVAGMLRAASPIQVIVGS 221
Query: 267 FLADGRKESK 276
FL + K+ +
Sbjct: 222 FLPNSLKQHQ 231
>gi|357166788|ref|XP_003580851.1| PREDICTED: uncharacterized protein LOC100832411 [Brachypodium
distachyon]
Length = 405
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 107/237 (45%), Positives = 138/237 (58%), Gaps = 27/237 (11%)
Query: 66 MGSGSEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGS------GLSSPGGGPL 119
M S EP+KRKRGRPRKYGPDG M+ S SS G+ L GG +
Sbjct: 112 MASPPEPVKRKRGRPRKYGPDGAMNKMSSSSLSSSHHQQQMMGAPPPRLGSLDMVGGMDV 171
Query: 120 SPDSIKKSRGRPPGSGSG-----KKHQLEAL-GSAGVGFTPHVITVKAGEDVSSKIMSFS 173
+ KK RGRPPG+G KK A GSAG FTPH+IT EDV+ KI +F+
Sbjct: 172 DAAN-KKRRGRPPGTGKKLSSPTKKPSGNAFSGSAGTSFTPHIITASPSEDVAGKIAAFA 230
Query: 174 QNGPRAVCILSANGAISNVTLRQAATSGGTVT-----------YEGRFEILSLSGSFLLS 222
PRAVC+LSA G++S V LR A +V+ YEG +EILSLSGS+ L+
Sbjct: 231 TQSPRAVCVLSAMGSVSRVVLRHPADHASSVSRAPPSYNNPAIYEGLYEILSLSGSYNLN 290
Query: 223 ESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADG-RKESKSS 278
E Q++++ G+SV+L P+ V+GG + G L AA+ VQVV+GSF+ G R +SK S
Sbjct: 291 ED--QQNQSDGISVTLCSPERHVIGGVLGGALVAASTVQVVLGSFVHGGSRAKSKKS 345
>gi|255636324|gb|ACU18501.1| unknown [Glycine max]
Length = 191
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 80/118 (67%), Positives = 102/118 (86%)
Query: 150 VGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGR 209
+GFTPH+IT+ GED+++KIM+FSQ GPRA+CILSANGA+S VTLRQ +TSGGT TYE R
Sbjct: 1 MGFTPHIITIAVGEDIATKIMAFSQQGPRAICILSANGAVSTVTLRQPSTSGGTATYEER 60
Query: 210 FEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
FEI+ LSGS+L+++S G R+RT LSVSL+ PDGRV+GG V G+L AA+PVQV++GSF
Sbjct: 61 FEIVCLSGSYLVADSGGARNRTVALSVSLASPDGRVIGGGVGGVLIAASPVQVILGSF 118
>gi|297833142|ref|XP_002884453.1| hypothetical protein ARALYDRAFT_477717 [Arabidopsis lyrata subsp.
lyrata]
gi|297330293|gb|EFH60712.1| hypothetical protein ARALYDRAFT_477717 [Arabidopsis lyrata subsp.
lyrata]
Length = 408
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 93/209 (44%), Positives = 135/209 (64%), Gaps = 17/209 (8%)
Query: 71 EPMKRKRGRPRKY-GPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRG 129
EP+KRKRGRPRKY P+ +LA SS ++++ L++ GG +S +S
Sbjct: 100 EPLKRKRGRPRKYVTPE--QALAAKKMASSASSSSAKERRELAAVTGGTVSTNS------ 151
Query: 130 RPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAI 189
GS KK QL ++G G FTPH++ + GEDV+ KIM F+ +C+LSA+G I
Sbjct: 152 -----GSSKKSQLGSVGKTGQCFTPHIVNIAPGEDVAQKIMIFANQSKHELCVLSASGTI 206
Query: 190 SNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGS 249
SN +LRQ AT+G + +EG++EILSLSGS++ +E G +TGGLS SLS DG+++GG+
Sbjct: 207 SNASLRQPATAGVNLPHEGQYEILSLSGSYIRTEQGG---KTGGLSASLSASDGQIIGGA 263
Query: 250 VAGLLTAATPVQVVVGSFLADGRKESKSS 278
+ LTAA PVQV++G+F D +K++ S
Sbjct: 264 IGTHLTAAGPVQVILGTFQLDRKKDAAGS 292
>gi|79596510|ref|NP_850512.2| AT hook motif DNA-binding family protein [Arabidopsis thaliana]
gi|332640580|gb|AEE74101.1| AT hook motif DNA-binding family protein [Arabidopsis thaliana]
Length = 309
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 94/206 (45%), Positives = 134/206 (65%), Gaps = 17/206 (8%)
Query: 71 EPMKRKRGRPRKY-GPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRG 129
EP+KRKRGRPRKY P+ +LA SS ++++ L++ GG +S +S
Sbjct: 102 EPVKRKRGRPRKYVTPE--QALAAKKLASSASSSSAKQRRELAAVTGGTVSTNS------ 153
Query: 130 RPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAI 189
GS KK QL ++G G FTPH++ + GEDV KIM F+ +C+LSA+G I
Sbjct: 154 -----GSSKKSQLGSVGKTGQCFTPHIVNIAPGEDVVQKIMMFANQSKHELCVLSASGTI 208
Query: 190 SNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGS 249
SN +LRQ A SGG + YEG++EILSLSGS++ +E G ++GGLSVSLS DG+++GG+
Sbjct: 209 SNASLRQPAPSGGNLPYEGQYEILSLSGSYIRTEQGG---KSGGLSVSLSASDGQIIGGA 265
Query: 250 VAGLLTAATPVQVVVGSFLADGRKES 275
+ LTAA PVQV++G+F D +K++
Sbjct: 266 IGSHLTAAGPVQVILGTFQLDRKKDA 291
>gi|30679188|ref|NP_187109.2| AT hook motif DNA-binding family protein [Arabidopsis thaliana]
gi|119935918|gb|ABM06034.1| At3g04590 [Arabidopsis thaliana]
gi|225898615|dbj|BAH30438.1| hypothetical protein [Arabidopsis thaliana]
gi|332640581|gb|AEE74102.1| AT hook motif DNA-binding family protein [Arabidopsis thaliana]
Length = 411
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 95/209 (45%), Positives = 135/209 (64%), Gaps = 17/209 (8%)
Query: 71 EPMKRKRGRPRKY-GPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRG 129
EP+KRKRGRPRKY P+ +LA SS ++++ L++ GG +S +S
Sbjct: 102 EPVKRKRGRPRKYVTPE--QALAAKKLASSASSSSAKQRRELAAVTGGTVSTNS------ 153
Query: 130 RPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAI 189
GS KK QL ++G G FTPH++ + GEDV KIM F+ +C+LSA+G I
Sbjct: 154 -----GSSKKSQLGSVGKTGQCFTPHIVNIAPGEDVVQKIMMFANQSKHELCVLSASGTI 208
Query: 190 SNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGS 249
SN +LRQ A SGG + YEG++EILSLSGS++ +E G ++GGLSVSLS DG+++GG+
Sbjct: 209 SNASLRQPAPSGGNLPYEGQYEILSLSGSYIRTEQGG---KSGGLSVSLSASDGQIIGGA 265
Query: 250 VAGLLTAATPVQVVVGSFLADGRKESKSS 278
+ LTAA PVQV++G+F D +K++ S
Sbjct: 266 IGSHLTAAGPVQVILGTFQLDRKKDAAGS 294
>gi|224132080|ref|XP_002328180.1| predicted protein [Populus trichocarpa]
gi|222837695|gb|EEE76060.1| predicted protein [Populus trichocarpa]
Length = 344
Score = 154 bits (390), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 116/255 (45%), Positives = 140/255 (54%), Gaps = 37/255 (14%)
Query: 73 MKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPP 132
+KRKRGRPRKY D LV SP G S LSS + RGR
Sbjct: 65 VKRKRGRPRKYDVDAN----LVSSPP----PPQGLSSSLSS-----------YEKRGRGR 105
Query: 133 GSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANG 187
GSGK L +LG +AG FTPHV+ V GED+ SKI+ SQ G RAVCILSA G
Sbjct: 106 PRGSGKLQLLASLGGFAAETAGGSFTPHVVPVYTGEDIVSKIIELSQKGARAVCILSATG 165
Query: 188 AISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLG 247
+S+V +RQ SGG + Y+GRFEILSLSGSF E+ G + G LSVSL+ PDGRV G
Sbjct: 166 VVSSVIMRQPGPSGGILRYDGRFEILSLSGSFTFGETGGSNRKNGMLSVSLAKPDGRVFG 225
Query: 248 GSVAGLLTAATPVQVVVGSFLAD-----GRKESKSSHRMESLP-------VPPKLAPGGQ 295
G VAG L AA P+Q+V+ SF + R++S SLP VP K+A
Sbjct: 226 GGVAGSLIAAGPIQLVIASFKQNIGKGIKRRQSADPPAAPSLPANSDVVRVPVKIAGTTD 285
Query: 296 PAGQCSPPSRGTLSE 310
C+ P+ LSE
Sbjct: 286 GEDNCTTPT-SALSE 299
>gi|356573149|ref|XP_003554726.1| PREDICTED: uncharacterized protein LOC100816781 [Glycine max]
Length = 356
Score = 154 bits (389), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 109/233 (46%), Positives = 143/233 (61%), Gaps = 29/233 (12%)
Query: 52 DGAIPQAQGLNVMN-------MGSGSEPMKRKRGRPRKYG-PDGTMSLALVPSPSSVTTA 103
+G +P A G +++ + S EP KRKRGRPRKYG P+ ++ + SS +
Sbjct: 41 NGLLPNADGSHMLYPHSVASAVSSQLEPAKRKRGRPRKYGTPEQALAAKKAATTSSQS-- 98
Query: 104 TGGTGSGLSSPGGGPLSPDSIKKSRGRPPGSGSGKKHQLE-ALGSAGVGFTPHVITVKAG 162
S D S P S + K L ALG+AG GFTPHVI+V AG
Sbjct: 99 ---------------FSADKKPHSPTFPSSSFTSSKKSLSFALGNAGQGFTPHVISVAAG 143
Query: 163 EDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLS 222
EDV KIM F Q R +CILSA+G+ISN +LRQ ATSGG++TYEGRFEI+SL+GS++ +
Sbjct: 144 EDVGQKIMLFMQQSRREMCILSASGSISNASLRQPATSGGSITYEGRFEIISLTGSYVRN 203
Query: 223 ESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKES 275
E +RTGGLSV LS DG+++GG V G L AA PVQV+VG+F D +K++
Sbjct: 204 ELG---TRTGGLSVCLSNTDGQIIGGGVGGPLKAAGPVQVIVGTFFIDNKKDN 253
>gi|77552992|gb|ABA95788.1| DNA-binding family protein, putative, expressed [Oryza sativa
Japonica Group]
Length = 280
Score = 153 bits (387), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 88/139 (63%), Positives = 109/139 (78%), Gaps = 2/139 (1%)
Query: 138 KKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQA 197
+K + AL SAG FTPHVI V GEDV+++IMSFSQ GPR+VCIL+ANG ISNV L Q
Sbjct: 34 RKGEWYAL-SAGGSFTPHVIIVATGEDVAARIMSFSQKGPRSVCILAANGTISNVVLNQP 92
Query: 198 ATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAA 257
+SG T +YEG FEIL L+GSF ++E G R RTGGLSVSL+GPDGRV+GG VAG+L AA
Sbjct: 93 GSSGSTFSYEGCFEILQLTGSFTIAE-EGVRRRTGGLSVSLAGPDGRVVGGVVAGMLRAA 151
Query: 258 TPVQVVVGSFLADGRKESK 276
+P+QV+VGSFL + K+ +
Sbjct: 152 SPIQVIVGSFLPNNLKQHQ 170
>gi|346703792|emb|CBX24460.1| hypothetical_protein [Oryza glaberrima]
Length = 278
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 88/139 (63%), Positives = 109/139 (78%), Gaps = 2/139 (1%)
Query: 138 KKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQA 197
+K + AL SAG FTPHVI V GEDV+++IMSFSQ GPR+VCIL+ANG ISNV L Q
Sbjct: 33 RKGEWYAL-SAGGSFTPHVIIVATGEDVAARIMSFSQKGPRSVCILAANGTISNVVLNQP 91
Query: 198 ATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAA 257
+SG T +YEG FEIL L+GSF ++E G R RTGGLSVSL+GPDGRV+GG VAG+L AA
Sbjct: 92 GSSGSTFSYEGCFEILQLTGSFTIAE-EGVRRRTGGLSVSLAGPDGRVVGGVVAGMLRAA 150
Query: 258 TPVQVVVGSFLADGRKESK 276
+P+QV+VGSFL + K+ +
Sbjct: 151 SPIQVIVGSFLPNNLKQHQ 169
>gi|115461412|ref|NP_001054306.1| Os04g0683900 [Oryza sativa Japonica Group]
gi|113565877|dbj|BAF16220.1| Os04g0683900 [Oryza sativa Japonica Group]
gi|215686331|dbj|BAG87592.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215704650|dbj|BAG94278.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218195855|gb|EEC78282.1| hypothetical protein OsI_17980 [Oryza sativa Indica Group]
Length = 419
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 99/235 (42%), Positives = 136/235 (57%), Gaps = 43/235 (18%)
Query: 71 EPMKRKRGRPRKYGPDGTMSLA----------LVPSPSSVTTATGGTGSGLSSPGGGPLS 120
EP+KRKRGRPRKYGPDGTM ++ ++ +P + + +G G GG +
Sbjct: 123 EPVKRKRGRPRKYGPDGTMKVSTAAAAQHQQQMLSAPPRMGSVSGADMVG----GGSGMD 178
Query: 121 PDSIKKSRGRPPGSGSGKKHQLEA----------LGSAGVGFTPHVITVKAGEDVSSKIM 170
+ KK RGRPPG+G KK QL + GSAG FTPH+IT EDV+ KI+
Sbjct: 179 DSAQKKRRGRPPGTG--KKQQLSSPVKLSGGNAFSGSAGTSFTPHIITASPSEDVAGKIV 236
Query: 171 SFSQNGPRAVCILSANGAISNVTLRQAATSGGT-----------VTYEGRFEILSLSGSF 219
+F+ + RAVC+LSA G++S V LR A + YEG +EILS+SG +
Sbjct: 237 AFANHSSRAVCVLSATGSVSRVVLRHPADGAMSRVHASSHYKNPAIYEGLYEILSMSGCY 296
Query: 220 -LLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRK 273
L++E GQ + GLSV+L P+ ++GG + G L AA+ VQVV+GSF+ G K
Sbjct: 297 NLMNE--GQ---SDGLSVTLCSPERHIIGGVLGGALVAASTVQVVLGSFVQGGSK 346
>gi|32488704|emb|CAE03447.1| OSJNBa0088H09.5 [Oryza sativa Japonica Group]
gi|90399216|emb|CAH68288.1| H0306F12.9 [Oryza sativa Indica Group]
Length = 356
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 97/235 (41%), Positives = 135/235 (57%), Gaps = 43/235 (18%)
Query: 71 EPMKRKRGRPRKYGPDGTMSLA----------LVPSPSSVTTATGGTGSGLSSPGGGPLS 120
EP+KRKRGRPRKYGPDGTM ++ ++ +P + + +G G GG +
Sbjct: 60 EPVKRKRGRPRKYGPDGTMKVSTAAAAQHQQQMLSAPPRMGSVSGADMVG----GGSGMD 115
Query: 121 PDSIKKSRGRPPGSGSGKKHQLEA----------LGSAGVGFTPHVITVKAGEDVSSKIM 170
+ KK RGRPPG+G KK QL + GSAG FTPH+IT EDV+ KI+
Sbjct: 116 DSAQKKRRGRPPGTG--KKQQLSSPVKLSGGNAFSGSAGTSFTPHIITASPSEDVAGKIV 173
Query: 171 SFSQNGPRAVCILSANGAISNVTLRQAATSGGT-----------VTYEGRFEILSLSGSF 219
+F+ + RAVC+LSA G++S V LR A + YEG +EILS+SG +
Sbjct: 174 AFANHSSRAVCVLSATGSVSRVVLRHPADGAMSRVHASSHYKNPAIYEGLYEILSMSGCY 233
Query: 220 -LLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRK 273
L++E ++ GLSV+L P+ ++GG + G L AA+ VQVV+GSF+ G K
Sbjct: 234 NLMNE-----GQSDGLSVTLCSPERHIIGGVLGGALVAASTVQVVLGSFVQGGSK 283
>gi|222629803|gb|EEE61935.1| hypothetical protein OsJ_16679 [Oryza sativa Japonica Group]
Length = 418
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 97/235 (41%), Positives = 135/235 (57%), Gaps = 43/235 (18%)
Query: 71 EPMKRKRGRPRKYGPDGTMSLA----------LVPSPSSVTTATGGTGSGLSSPGGGPLS 120
EP+KRKRGRPRKYGPDGTM ++ ++ +P + + +G G GG +
Sbjct: 122 EPVKRKRGRPRKYGPDGTMKVSTAAAAQHQQQMLSAPPRMGSVSGADMVG----GGSGMD 177
Query: 121 PDSIKKSRGRPPGSGSGKKHQLEA----------LGSAGVGFTPHVITVKAGEDVSSKIM 170
+ KK RGRPPG+G KK QL + GSAG FTPH+IT EDV+ KI+
Sbjct: 178 DSAQKKRRGRPPGTG--KKQQLSSPVKLSGGNAFSGSAGTSFTPHIITASPSEDVAGKIV 235
Query: 171 SFSQNGPRAVCILSANGAISNVTLRQAATSGGT-----------VTYEGRFEILSLSGSF 219
+F+ + RAVC+LSA G++S V LR A + YEG +EILS+SG +
Sbjct: 236 AFANHSSRAVCVLSATGSVSRVVLRHPADGAMSRVHASSHYKNPAIYEGLYEILSMSGCY 295
Query: 220 -LLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRK 273
L++E ++ GLSV+L P+ ++GG + G L AA+ VQVV+GSF+ G K
Sbjct: 296 NLMNE-----GQSDGLSVTLCSPERHIIGGVLGGALVAASTVQVVLGSFVQGGSK 345
>gi|242082798|ref|XP_002441824.1| hypothetical protein SORBIDRAFT_08g002940 [Sorghum bicolor]
gi|241942517|gb|EES15662.1| hypothetical protein SORBIDRAFT_08g002940 [Sorghum bicolor]
Length = 356
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 86/130 (66%), Positives = 104/130 (80%), Gaps = 2/130 (1%)
Query: 147 SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTY 206
+AG FTPHVI V GEDV+++IMSFS+ GPR+VCILSANG ISNVTLRQ SG T TY
Sbjct: 126 TAGGSFTPHVIIVGTGEDVAARIMSFSKKGPRSVCILSANGTISNVTLRQPDPSGSTFTY 185
Query: 207 EGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGS 266
EG FEIL L+GSF ++E R RTGGLSVSL+GPDGRV+GG VAG+L AA+P+QV+VGS
Sbjct: 186 EGLFEILQLTGSFTMAEEG--RKRTGGLSVSLAGPDGRVVGGVVAGMLRAASPIQVIVGS 243
Query: 267 FLADGRKESK 276
FL + K+ +
Sbjct: 244 FLPNSLKQHQ 253
>gi|449443249|ref|XP_004139392.1| PREDICTED: uncharacterized protein LOC101221844 [Cucumis sativus]
gi|449520142|ref|XP_004167093.1| PREDICTED: uncharacterized protein LOC101229030 [Cucumis sativus]
Length = 362
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 99/204 (48%), Positives = 126/204 (61%), Gaps = 10/204 (4%)
Query: 71 EPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGR 130
EP +RKRGRPRKYG A + +S +++ L+S S+
Sbjct: 68 EPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKELASS-------SSLNAVSAS 120
Query: 131 PPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAIS 190
S KK QL ALG+AG GF PHVI V AGEDV KIM F Q R +CILSA+G+IS
Sbjct: 121 SSFSTPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMQFMQQCKREICILSASGSIS 180
Query: 191 NVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSV 250
N +LRQ A SGG + YEGRFEI+SL GS++ ++ G +TGGLSV LS +G ++GG V
Sbjct: 181 NASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDLGG---KTGGLSVCLSSAEGHIIGGGV 237
Query: 251 AGLLTAATPVQVVVGSFLADGRKE 274
G L AA PVQV+VG+F+ D +KE
Sbjct: 238 GGPLKAAGPVQVIVGTFVIDPKKE 261
>gi|115487330|ref|NP_001066152.1| Os12g0147000 [Oryza sativa Japonica Group]
gi|113648659|dbj|BAF29171.1| Os12g0147000 [Oryza sativa Japonica Group]
Length = 387
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 85/130 (65%), Positives = 104/130 (80%), Gaps = 1/130 (0%)
Query: 147 SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTY 206
SAG FTPHVI V GEDV+++IMSFSQ GPR+VCIL+ANG ISNV L Q +SG T +Y
Sbjct: 143 SAGGSFTPHVIIVATGEDVAARIMSFSQKGPRSVCILAANGTISNVVLNQPGSSGSTFSY 202
Query: 207 EGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGS 266
EG FEIL L+GSF ++E G R RTGGLSVSL+GPDGRV+GG VAG+L AA+P+QV+VGS
Sbjct: 203 EGCFEILQLTGSFTIAE-EGVRRRTGGLSVSLAGPDGRVVGGVVAGMLRAASPIQVIVGS 261
Query: 267 FLADGRKESK 276
FL + K+ +
Sbjct: 262 FLPNNLKQHQ 271
>gi|168047842|ref|XP_001776378.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672338|gb|EDQ58877.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 383
Score = 150 bits (380), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 99/199 (49%), Positives = 120/199 (60%), Gaps = 29/199 (14%)
Query: 72 PMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTG--SGLSSPGGGPLSP--DSIKKS 127
PMKRKRGRPRKY SP + G T S L+ P +P S K+
Sbjct: 109 PMKRKRGRPRKYTTGD--------SPQVTVSGFGNTSLFSALAKQIAAPYTPPDKSEKRG 160
Query: 128 RGRPPGSGSGKKHQLEALGS--AGVG--FTPHVITVKAGEDVSSKIMSFSQNGPRAVCIL 183
RGRP GS +K QL LG AG G FTPH++TV GED SSKIM F+Q+GPRA+C+L
Sbjct: 161 RGRP--VGSTRKQQLANLGVVLAGTGKSFTPHILTVHTGEDASSKIMQFAQHGPRAMCVL 218
Query: 184 SANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL-LSESSGQRSRTGGLSVSLSGPD 242
SANGA+SNV LRQ ++S GTVTYEGR+EILSLSGS+L LS G + RTG +
Sbjct: 219 SANGAVSNVMLRQDSSSEGTVTYEGRYEILSLSGSYLPLSGEDGAKQRTGIV-------- 270
Query: 243 GRVLGGSVAGLLTAATPVQ 261
V+G + GLL + V
Sbjct: 271 --VVGSFLLGLLKTDSKVD 287
>gi|326511204|dbj|BAJ87616.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 408
Score = 150 bits (379), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 100/225 (44%), Positives = 128/225 (56%), Gaps = 28/225 (12%)
Query: 71 EPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATG--------------GTGSGLSSPGG 116
EP+KRKRGRPRKYGPDG M + S SS G SG GG
Sbjct: 119 EPVKRKRGRPRKYGPDGAMKHHMSSSSSSAHHHQQQHQHQMMGAPQQRMGPMSGQGMAGG 178
Query: 117 GPLSPDSIKKSRGRPPGSG------SGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIM 170
L + KK RGRPPG+G + K GSAG FTPH+IT EDV+ KI
Sbjct: 179 --LDDAAQKKKRGRPPGTGKKLSSTTSKPSGNAFPGSAGTSFTPHIITASPSEDVAGKIA 236
Query: 171 SFSQNGPRAVCILSANGAISNVTLRQAA----TSGGTVTYEGRFEILSLSGSFLLSESSG 226
+F+ PRAVC+LSA G++S LR A + YEG +EILSLSGS+ L+E G
Sbjct: 237 AFASQSPRAVCVLSAMGSVSRAVLRHPADHPPSYNNPSIYEGLYEILSLSGSYNLNE--G 294
Query: 227 QRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADG 271
Q+++T G+SV+L P+ V+GG + G L AA+ VQVV+G+F+ G
Sbjct: 295 QQNQTDGISVTLCSPERHVIGGVLGGALVAASTVQVVLGTFVQGG 339
>gi|357498789|ref|XP_003619683.1| hypothetical protein MTR_6g061670 [Medicago truncatula]
gi|355494698|gb|AES75901.1| hypothetical protein MTR_6g061670 [Medicago truncatula]
Length = 314
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 100/210 (47%), Positives = 126/210 (60%), Gaps = 6/210 (2%)
Query: 73 MKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPP 132
+K+KRGRPRK + AL P P S + G SG S GGG +S +P
Sbjct: 50 VKKKRGRPRK--SESGSKPALSPMPISASIPLTGDFSGWKSGGGGGGGVVKPFESIKKPL 107
Query: 133 GSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNV 192
+ + G F HV+TV +GEDVS KIMS SQ + ILSA G ISNV
Sbjct: 108 KLNDFDEDN--GISPFGSNFKTHVLTVNSGEDVSMKIMSLSQQEYHTISILSATGTISNV 165
Query: 193 TLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAG 252
TLRQ+ GGT TYEG FEILSLSGSF+ +E+ +SR+G +SVSL+GP+GRV GG++AG
Sbjct: 166 TLRQSDACGGTSTYEGVFEILSLSGSFVPTENGLTKSRSGRMSVSLAGPNGRVFGGALAG 225
Query: 253 LLTAATPVQVVVGSFLADGRKESKSSHRME 282
LL AA VQVVV SF + KE+ R++
Sbjct: 226 LLVAAGSVQVVVASFFPE--KENPKRQRVD 253
>gi|357512373|ref|XP_003626475.1| hypothetical protein MTR_7g116320 [Medicago truncatula]
gi|355501490|gb|AES82693.1| hypothetical protein MTR_7g116320 [Medicago truncatula]
Length = 367
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 110/249 (44%), Positives = 147/249 (59%), Gaps = 23/249 (9%)
Query: 71 EPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGR 130
EP KRKRGRPRKYG +P A + S S + + K +
Sbjct: 71 EPAKRKRGRPRKYG-----------TPEQALAAKKASTSSFSPTPPTLDTTTNNKNTHSF 119
Query: 131 PPGSGSGKK---HQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANG 187
P S S H L +LG+AG GF+ HVI V AGEDV KIM F Q +CI+SA+G
Sbjct: 120 SPSSSSFTTKKSHSL-SLGNAGQGFSAHVIAVAAGEDVGQKIMQFMQQHRGEICIMSASG 178
Query: 188 AISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLG 247
+ISN +LRQ A+SGG + YEGRF+I+SL+GS++ +E+ G R+GGLSV LS DG+++G
Sbjct: 179 SISNASLRQPASSGGNIMYEGRFDIISLTGSYVRNETGG---RSGGLSVCLSNSDGQIIG 235
Query: 248 GSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVPPKL-APGGQPAGQCSPPSRG 306
G V G L AA PVQV+VG+F D +K++ + + + P KL +P G+PA S R
Sbjct: 236 GGVGGPLKAAGPVQVIVGTFFIDNKKDTSAGGKGD--PSAGKLPSPVGEPA--SSLGFRQ 291
Query: 307 TLSESSGGP 315
T+ SSG P
Sbjct: 292 TVDSSSGNP 300
>gi|6175163|gb|AAF04889.1|AC011437_4 unknown protein [Arabidopsis thaliana]
gi|119657372|tpd|FAA00285.1| TPA: AT-hook motif nuclear localized protein 14 [Arabidopsis
thaliana]
Length = 418
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 95/216 (43%), Positives = 135/216 (62%), Gaps = 24/216 (11%)
Query: 71 EPMKRKRGRPRKY-GPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRG 129
EP+KRKRGRPRKY P+ +LA SS ++++ L++ GG +S +S
Sbjct: 102 EPVKRKRGRPRKYVTPE--QALAAKKLASSASSSSAKQRRELAAVTGGTVSTNS------ 153
Query: 130 RPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAI 189
GS KK QL ++G G FTPH++ + GEDV KIM F+ +C+LSA+G I
Sbjct: 154 -----GSSKKSQLGSVGKTGQCFTPHIVNIAPGEDVVQKIMMFANQSKHELCVLSASGTI 208
Query: 190 SNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGS 249
SN +LRQ A SGG + YEG++EILSLSGS++ +E G ++GGLSVSLS DG+++GG+
Sbjct: 209 SNASLRQPAPSGGNLPYEGQYEILSLSGSYIRTEQGG---KSGGLSVSLSASDGQIIGGA 265
Query: 250 VAGLLTAATPVQ-------VVVGSFLADGRKESKSS 278
+ LTAA PVQ V++G+F D +K++ S
Sbjct: 266 IGSHLTAAGPVQVQFCCIIVILGTFQLDRKKDAAGS 301
>gi|3193332|gb|AAC19314.1| similar to Arabidopsis AT-hook protein 1 (GB:AJ222585) [Arabidopsis
thaliana]
gi|7267107|emb|CAB80778.1| putative transcription factor [Arabidopsis thaliana]
gi|119657358|tpd|FAA00278.1| TPA: AT-hook motif nuclear localized protein 7 [Arabidopsis
thaliana]
Length = 345
Score = 148 bits (374), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 111/255 (43%), Positives = 140/255 (54%), Gaps = 68/255 (26%)
Query: 74 KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPG 133
K++RGRPRKY +G P PSS P K+ RG+ G
Sbjct: 56 KKRRGRPRKYEANGA------PLPSSSV-------------------PLVKKRVRGKLNG 90
Query: 134 SGSGKKHQLEALGSAGV--------------GFTPHVITVKAGE---------------- 163
K H+ S+G FTPHVITV GE
Sbjct: 91 FDMKKMHKTIGFHSSGERFGVGGGVGGGVGSNFTPHVITVNTGEVCILEEKGPKLSLGRR 150
Query: 164 -DVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLS 222
D++ +I+SFSQ GPRA+CILSANG ISNVTLRQ + GGT+TYEGRFEILSLSGSF+ +
Sbjct: 151 FDITMRIISFSQQGPRAICILSANGVISNVTLRQPDSCGGTLTYEGRFEILSLSGSFMET 210
Query: 223 ESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQ----------VVVGSFLADGR 272
E+ G + R+GG+SVSL+GPDGRV+GG VAGLL AATP+Q VVVGSF+ +
Sbjct: 211 ENQGSKGRSGGMSVSLAGPDGRVVGGGVAGLLIAATPIQVTHESNNNVYVVVGSFITSDQ 270
Query: 273 KESKS--SHRMESLP 285
++ + R+E P
Sbjct: 271 QDHQKPRKQRVEHAP 285
>gi|255561895|ref|XP_002521956.1| DNA binding protein, putative [Ricinus communis]
gi|223538760|gb|EEF40360.1| DNA binding protein, putative [Ricinus communis]
Length = 364
Score = 148 bits (373), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 86/154 (55%), Positives = 112/154 (72%), Gaps = 7/154 (4%)
Query: 138 KKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQA 197
K QL ALG+AG GFTPHVI+V AGEDV+ KIM F Q R +CILSA+G+ISN +LRQ
Sbjct: 131 KSQQLVALGNAGQGFTPHVISVSAGEDVAQKIMLFMQQCRREMCILSASGSISNASLRQP 190
Query: 198 ATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAA 257
ATSGG +TYEGRFEI+SLSGS++ +E G R GGLSV LS DG+++GG + G L A
Sbjct: 191 ATSGGNITYEGRFEIISLSGSYVRTEIGG---RAGGLSVCLSNSDGQIIGGGIGGPLIAG 247
Query: 258 TPVQVVVGSFLADGRKESKSSHRMES----LPVP 287
PVQV++G+F+ D +K+ S ++++ LP P
Sbjct: 248 GPVQVIIGTFVVDNKKDVGSGGKVDASSSKLPSP 281
>gi|212722592|ref|NP_001132694.1| uncharacterized protein LOC100194172 [Zea mays]
gi|194695112|gb|ACF81640.1| unknown [Zea mays]
Length = 380
Score = 147 bits (372), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 103/258 (39%), Positives = 134/258 (51%), Gaps = 39/258 (15%)
Query: 70 SEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGP-------LSPD 122
+EP+KRKRGRPRKYGPDGTM +S T + S GP +
Sbjct: 92 AEPLKRKRGRPRKYGPDGTMRQQQQQQAASSQQQLVATQPRICSLSSGPDMLGSSGMEDL 151
Query: 123 SIKKSRGRPPGSGSGKKHQLEA--------LGSAGVGFTPHVITVKAGEDVSSKIMSFSQ 174
+ KK RGRPPG+G KKHQ GSAG FTPH+IT EDV++KI++F+
Sbjct: 152 AQKKRRGRPPGTG--KKHQPSTSQGPGNAFAGSAGTSFTPHIITASPSEDVAAKIVAFAS 209
Query: 175 NGPRAVCILSANGAISNVTLRQAAT-------------SGGTVTYEGRFEILSLSGSFLL 221
+AVC+LSA G++S LR A YEG +EILSL+GS+ L
Sbjct: 210 QSSKAVCVLSAMGSVSRAVLRHPADGSPMARVHASPQPYKNPAVYEGFYEILSLTGSYNL 269
Query: 222 SESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADG--RKESKSSH 279
++ GGLSV+L P+ V+GG + G L AA VQVV+GSF G K K+
Sbjct: 270 AQG-------GGLSVTLCSPERNVIGGVLGGPLVAAGTVQVVLGSFYQGGSRSKSKKAGK 322
Query: 280 RMESLPVPPKLAPGGQPA 297
+ ++ P GGQ A
Sbjct: 323 QQQAAAFSPDSLTGGQEA 340
>gi|414584712|tpg|DAA35283.1| TPA: hypothetical protein ZEAMMB73_589559 [Zea mays]
Length = 380
Score = 147 bits (371), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 103/258 (39%), Positives = 134/258 (51%), Gaps = 39/258 (15%)
Query: 70 SEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGP-------LSPD 122
+EP+KRKRGRPRKYGPDGTM +S T + S GP +
Sbjct: 92 AEPLKRKRGRPRKYGPDGTMRQQQQQQAASSQQQLVATQPRICSLSSGPDMLGSSGMEDP 151
Query: 123 SIKKSRGRPPGSGSGKKHQLEA--------LGSAGVGFTPHVITVKAGEDVSSKIMSFSQ 174
+ KK RGRPPG+G KKHQ GSAG FTPH+IT EDV++KI++F+
Sbjct: 152 AQKKRRGRPPGTG--KKHQPSTSQGPGNAFAGSAGTSFTPHIITASPSEDVAAKIVAFAS 209
Query: 175 NGPRAVCILSANGAISNVTLRQAAT-------------SGGTVTYEGRFEILSLSGSFLL 221
+AVC+LSA G++S LR A YEG +EILSL+GS+ L
Sbjct: 210 QSSKAVCVLSAMGSVSRAVLRHPADGSPMARVHASPQPYKNPAVYEGFYEILSLTGSYNL 269
Query: 222 SESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADG--RKESKSSH 279
++ GGLSV+L P+ V+GG + G L AA VQVV+GSF G K K+
Sbjct: 270 AQG-------GGLSVTLCSPERNVIGGVLGGPLVAAGTVQVVLGSFHQGGSRSKSKKAGK 322
Query: 280 RMESLPVPPKLAPGGQPA 297
+ ++ P GGQ A
Sbjct: 323 QQQAAAFSPDSLTGGQEA 340
>gi|226530164|ref|NP_001150147.1| DNA binding protein [Zea mays]
gi|195637110|gb|ACG38023.1| DNA binding protein [Zea mays]
gi|413920027|gb|AFW59959.1| DNA binding protein [Zea mays]
Length = 397
Score = 147 bits (370), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 97/224 (43%), Positives = 126/224 (56%), Gaps = 33/224 (14%)
Query: 70 SEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRG 129
+EP+KRKRGRPRKYGPDGTM + + G +G + S G + S KK RG
Sbjct: 105 TEPVKRKRGRPRKYGPDGTMKQQQL---VAAQPRIGPSGPNMISSAG--IEDSSQKKRRG 159
Query: 130 RPPGSGSGKKHQLE------ALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCIL 183
RPPG+ KKHQ GSAG FTPH+IT EDV++KI++F+ RAVC+L
Sbjct: 160 RPPGTA--KKHQPSPSQGNAFAGSAGTSFTPHIITASPSEDVAAKIVAFATQSSRAVCVL 217
Query: 184 SANGAISNVTLRQAAT--------------SGGTVTYEGRFEILSLSGSFLLSESSGQRS 229
SA G++S LR A + YEG +EI+SL+GS+ L+E S Q
Sbjct: 218 SAMGSVSRAVLRHPADGSPMARVHASPQPYNNSPAIYEGFYEIMSLTGSYNLAEGSQQEQ 277
Query: 230 R------TGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
+GGLSV+L P+ V+GG + G L AA VQVV+GSF
Sbjct: 278 CQGQGQPSGGLSVTLCSPERNVIGGVLGGPLVAAGTVQVVLGSF 321
>gi|224116000|ref|XP_002332023.1| predicted protein [Populus trichocarpa]
gi|222875248|gb|EEF12379.1| predicted protein [Populus trichocarpa]
Length = 365
Score = 147 bits (370), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 101/219 (46%), Positives = 129/219 (58%), Gaps = 50/219 (22%)
Query: 74 KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPG 133
KRKRGRPRKYG T LAL ++ TAT S+ SR R
Sbjct: 78 KRKRGRPRKYG---TPELAL----AAKKTATSA----------------SVAASRERK-- 112
Query: 134 SGSGKKHQL------------------EALGSAGVGFTPHVITVKAGEDVSSKIMSFSQN 175
++HQ LG+AG GFTPHVITV AGEDV KI+ F Q
Sbjct: 113 ----EQHQAGSSSTTSSFSGSSSKKSQHVLGTAGHGFTPHVITVAAGEDVGQKIIQFLQQ 168
Query: 176 GPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLS 235
R +CILSA+G++ NV+LRQ ATSGG ++YEGRFEI+SLSGS++ ++ G R GGLS
Sbjct: 169 STREMCILSASGSVMNVSLRQPATSGGNISYEGRFEIISLSGSYIRTDMGG---RAGGLS 225
Query: 236 VSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKE 274
V LS +G+++GG V G L AA PVQV+VG+F+ D +K+
Sbjct: 226 VCLSDSNGQIIGGGVGGPLKAAGPVQVIVGTFVLDNKKD 264
>gi|167600637|gb|ABZ89179.1| hypothetical protein 46C02.5 [Coffea canephora]
Length = 351
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 81/135 (60%), Positives = 101/135 (74%), Gaps = 3/135 (2%)
Query: 140 HQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAAT 199
+QL A GS G F PHVITV AGEDV KIM F Q R +CILSA+G+ISN +LRQ AT
Sbjct: 139 YQLAASGSTGQSFIPHVITVAAGEDVGQKIMLFMQQSKREICILSASGSISNASLRQPAT 198
Query: 200 SGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATP 259
SGG +TYEGRF+ILSL GS++ +E G RTGGLSV LS DG+++GG V G LTAA P
Sbjct: 199 SGGNITYEGRFDILSLCGSYVRTELGG---RTGGLSVCLSSTDGQIIGGGVGGPLTAAGP 255
Query: 260 VQVVVGSFLADGRKE 274
+Q++VG+F+ D +K+
Sbjct: 256 IQIIVGTFVIDPKKD 270
>gi|324388024|gb|ADY38786.1| DNA-binding protein [Coffea arabica]
Length = 351
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 81/135 (60%), Positives = 101/135 (74%), Gaps = 3/135 (2%)
Query: 140 HQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAAT 199
+QL A GS G F PHVITV AGEDV KIM F Q R +CILSA+G+ISN +LRQ AT
Sbjct: 139 YQLAASGSTGQSFIPHVITVAAGEDVGQKIMLFMQQSKREICILSASGSISNASLRQPAT 198
Query: 200 SGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATP 259
SGG +TYEGRF+ILSL GS++ +E G RTGGLSV LS DG+++GG V G LTAA P
Sbjct: 199 SGGNITYEGRFDILSLCGSYVRTELGG---RTGGLSVCLSSTDGQIIGGGVGGPLTAAGP 255
Query: 260 VQVVVGSFLADGRKE 274
+Q++VG+F+ D +K+
Sbjct: 256 IQIIVGTFVMDPKKD 270
>gi|326367379|gb|ADZ55297.1| DNA-binding family protein [Coffea arabica]
Length = 351
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 81/135 (60%), Positives = 101/135 (74%), Gaps = 3/135 (2%)
Query: 140 HQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAAT 199
+QL A GS G F PHVITV AGEDV KIM F Q R +CILSA+G+ISN +LRQ AT
Sbjct: 139 YQLAASGSTGQSFIPHVITVAAGEDVGQKIMLFMQQSKREICILSASGSISNASLRQPAT 198
Query: 200 SGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATP 259
SGG +TYEGRF+ILSL GS++ +E G RTGGLSV LS DG+++GG V G LTAA P
Sbjct: 199 SGGNITYEGRFDILSLCGSYVRTELGG---RTGGLSVCLSSTDGQIIGGGVGGPLTAAGP 255
Query: 260 VQVVVGSFLADGRKE 274
+Q++VG+F+ D +K+
Sbjct: 256 IQIIVGTFVIDPKKD 270
>gi|118484865|gb|ABK94299.1| unknown [Populus trichocarpa]
Length = 369
Score = 144 bits (363), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 94/200 (47%), Positives = 123/200 (61%), Gaps = 14/200 (7%)
Query: 74 KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPG 133
KRKRGRPRKYG T AL ++ + + G + S
Sbjct: 84 KRKRGRPRKYG---TPEQALAAKKTASSNSAAAYREKKEHQAGSSSTISSFS-------- 132
Query: 134 SGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVT 193
+ S KK Q +LG+AG GFTPHVITV GEDV+ KIM F Q R +CILSA+G+I + +
Sbjct: 133 AYSSKKSQHASLGNAGHGFTPHVITVAEGEDVTQKIMHFLQQSMREMCILSASGSILSAS 192
Query: 194 LRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGL 253
L Q ATSGG ++YEGR+EI+SL GS++ +E G R GGLSV LS +G+++GG V G
Sbjct: 193 LSQPATSGGNISYEGRYEIISLCGSYVRTEMGG---RAGGLSVCLSDTNGQIIGGGVGGP 249
Query: 254 LTAATPVQVVVGSFLADGRK 273
L AA PVQV+VG+F+ D +K
Sbjct: 250 LKAAGPVQVIVGTFMLDNKK 269
>gi|224123500|ref|XP_002319093.1| predicted protein [Populus trichocarpa]
gi|222857469|gb|EEE95016.1| predicted protein [Populus trichocarpa]
Length = 318
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 94/200 (47%), Positives = 123/200 (61%), Gaps = 14/200 (7%)
Query: 74 KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPG 133
KRKRGRPRKYG T AL ++ + + G + S
Sbjct: 33 KRKRGRPRKYG---TPEQALAAKKTASSNSAAAYREKKEHQAGSSSTISSFS-------- 81
Query: 134 SGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVT 193
+ S KK Q +LG+AG GFTPHVITV GEDV+ KIM F Q R +CILSA+G+I + +
Sbjct: 82 AYSSKKSQHASLGNAGHGFTPHVITVAEGEDVTQKIMHFLQQSMREMCILSASGSILSAS 141
Query: 194 LRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGL 253
L Q ATSGG ++YEGR+EI+SL GS++ +E G R GGLSV LS +G+++GG V G
Sbjct: 142 LSQPATSGGNISYEGRYEIISLCGSYVRTEMGG---RAGGLSVCLSDTNGQIIGGGVGGP 198
Query: 254 LTAATPVQVVVGSFLADGRK 273
L AA PVQV+VG+F+ D +K
Sbjct: 199 LKAAGPVQVIVGTFMLDNKK 218
>gi|357482383|ref|XP_003611477.1| DNA binding protein [Medicago truncatula]
gi|355512812|gb|AES94435.1| DNA binding protein [Medicago truncatula]
Length = 384
Score = 141 bits (356), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 120/338 (35%), Positives = 167/338 (49%), Gaps = 52/338 (15%)
Query: 36 PITATSPTYQ--PSGAGGDGAIPQAQGLNVMNM----GSGSEPMKRKRGRPRKYGPDGTM 89
PIT +PT P +PQ + L++M SGS KRKRGRPRKY P+G +
Sbjct: 33 PITMITPTTAQFPLSNINTNPLPQYEHLSLMLFVGASSSGSGSFKRKRGRPRKYFPNGKI 92
Query: 90 SLALVPSPSS-----------VTTATGGTGSG------------------------LSSP 114
+L P+ V T G G G +SP
Sbjct: 93 TLGSSLDPTHAASFASPSSSAVKKNTSGRGRGRPRKYFPNGKITLGSSLDPTHAATFASP 152
Query: 115 GGGPLSPDSIKKSRGRPPGSGSGKKHQLEALG-SAGVGFTPHVITVKAGEDVSSKIMSFS 173
+ ++ + +G+P GS KK +E G + G GF+PHVI V GED+ +K+ +F
Sbjct: 153 SSSAVKKNTSIRGKGKPRGSFK-KKLPIEMSGVTNGSGFSPHVIIVNRGEDIVAKVGAFC 211
Query: 174 QNGPRA-VCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTG 232
Q GP +CILSA+G + N L Q SG VTYEGRFEI+SLSG+ +S+++ + + G
Sbjct: 212 QGGPNTDMCILSAHGLVGNAALYQ---SGSVVTYEGRFEIISLSGNLEVSDNTTKFKKMG 268
Query: 233 GLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVPP-KLA 291
VSL G R+L G VA L AA+ V+V +G F D +K S + ++ S VPP ++A
Sbjct: 269 YFKVSLEGHGSRLLAGVVADKLIAASLVKVTIGVFTLDCKKASSNYLKLGSSSVPPSQIA 328
Query: 292 PGGQPAGQCSPPSRGTLSESSGGPGS-PLNHSTGACNN 328
G S +G S+SSG + P N G NN
Sbjct: 329 AFGT---LTSDAYQGPSSDSSGDNDNIPFNQLPGINNN 363
>gi|346703216|emb|CBX25315.1| hypothetical_protein [Oryza brachyantha]
Length = 344
Score = 141 bits (355), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 92/191 (48%), Positives = 121/191 (63%), Gaps = 31/191 (16%)
Query: 126 KSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSA 185
+ RGRP GSG++ L L DV+++IMSFSQ GPR++CILSA
Sbjct: 110 RRRGRP--KGSGRRQILATL------------------DVAARIMSFSQKGPRSICILSA 149
Query: 186 NGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRV 245
NG ISNV L Q +SG T TYEGRFEIL L+GSF ++E G+R RTGGLSVSL+GPDGRV
Sbjct: 150 NGTISNVALSQPGSSGSTFTYEGRFEILQLTGSFTMAEEGGRR-RTGGLSVSLAGPDGRV 208
Query: 246 LGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLP-----VPPKLAP-----GGQ 295
+GG VAG+L AA+P+QV+VGSFL + K+ + ++ P +P ++AP
Sbjct: 209 VGGVVAGMLRAASPIQVIVGSFLPNSLKQHQRRMGLQQQPSATPALPAQMAPPPVLTAAM 268
Query: 296 PAGQCSPPSRG 306
P Q +P + G
Sbjct: 269 PISQAAPGTNG 279
>gi|357168161|ref|XP_003581513.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Brachypodium
distachyon]
Length = 230
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 84/173 (48%), Positives = 110/173 (63%), Gaps = 10/173 (5%)
Query: 126 KSRGRPPGSGSGKKHQLEALG--SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCIL 183
K RGRPP SG K QL LG S G F PHV+ + GED++SKIMSFS+ +++CIL
Sbjct: 17 KRRGRPPKSGG--KSQLALLGGCSPGNAFAPHVLHINQGEDITSKIMSFSELHAKSICIL 74
Query: 184 SANGAISNVTLRQAATSGG--TVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGP 241
SANG +S VTLR ++ S G Y+G FEI+SL GS LLS+ + GGLS+ +S P
Sbjct: 75 SANGTVSTVTLRLSSHSDGLDNAVYQGHFEIISLKGSCLLSDEGDSGNHGGGLSIVVSTP 134
Query: 242 DGRVLGGSVAGLLTAATPVQVVVGSF---LADGRKESK-SSHRMESLPVPPKL 290
G + GGS+ G L AA PVQV+ GSF + + +KE K S ++ L VP +L
Sbjct: 135 CGTIFGGSIGGPLIAADPVQVIAGSFNYRVTEEKKEPKISDSQLTELKVPWEL 187
>gi|4165183|emb|CAA10643.1| SAP1 protein [Antirrhinum majus]
Length = 300
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 94/208 (45%), Positives = 131/208 (62%), Gaps = 27/208 (12%)
Query: 70 SEPMKRKRGRPRKYG-PDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSR 128
+E +KRKRGRPRKYG P+ + + +P +A+G +S PL+
Sbjct: 43 NESVKRKRGRPRKYGTPEQAAAAKRLSAPKKRDSASGVASVSSASSKKSPLA-------- 94
Query: 129 GRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGA 188
ALG+ G F+PH+ITV AGEDV KIM F Q R +C++SA+G+
Sbjct: 95 ---------------ALGNMGQSFSPHIITVAAGEDVGQKIMMFVQQSKREICVISASGS 139
Query: 189 ISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGG 248
+S+ +LRQ A+SGG+VTYEGRF+ILSLSGSF+ +E G RTGGLSV LS DG+++GG
Sbjct: 140 VSSASLRQQASSGGSVTYEGRFDILSLSGSFIHAEFGG---RTGGLSVCLSSSDGQIIGG 196
Query: 249 SVAGLLTAATPVQVVVGSFLADGRKESK 276
V G LTAA +QV+VG+F+ + +K++
Sbjct: 197 GVGGPLTAAATIQVIVGTFVVETKKDAN 224
>gi|413920026|gb|AFW59958.1| hypothetical protein ZEAMMB73_895910, partial [Zea mays]
Length = 390
Score = 138 bits (347), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 93/219 (42%), Positives = 121/219 (55%), Gaps = 33/219 (15%)
Query: 70 SEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRG 129
+EP+KRKRGRPRKYGPDGTM + + G +G + S G + S KK RG
Sbjct: 105 TEPVKRKRGRPRKYGPDGTMKQQQL---VAAQPRIGPSGPNMISSAG--IEDSSQKKRRG 159
Query: 130 RPPGSGSGKKHQLE------ALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCIL 183
RPPG+ KKHQ GSAG FTPH+IT EDV++KI++F+ RAVC+L
Sbjct: 160 RPPGTA--KKHQPSPSQGNAFAGSAGTSFTPHIITASPSEDVAAKIVAFATQSSRAVCVL 217
Query: 184 SANGAISNVTLRQAA--------------TSGGTVTYEGRFEILSLSGSFLLSESSGQRS 229
SA G++S LR A + YEG +EI+SL+GS+ L+E S Q
Sbjct: 218 SAMGSVSRAVLRHPADGSPMARVHASPQPYNNSPAIYEGFYEIMSLTGSYNLAEGSQQEQ 277
Query: 230 R------TGGLSVSLSGPDGRVLGGSVAGLLTAATPVQV 262
+GGLSV+L P+ V+GG + G L AA VQV
Sbjct: 278 CQGQGQPSGGLSVTLCSPERNVIGGVLGGPLVAAGTVQV 316
>gi|357513671|ref|XP_003627124.1| hypothetical protein MTR_8g017860, partial [Medicago truncatula]
gi|355521146|gb|AET01600.1| hypothetical protein MTR_8g017860, partial [Medicago truncatula]
Length = 247
Score = 137 bits (346), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 87/172 (50%), Positives = 106/172 (61%), Gaps = 12/172 (6%)
Query: 38 TATSPTYQPSGAGGDGAIPQAQGLNVMNMGSGSEPM--KRKRGRPRKYGPDGTMSLALVP 95
T T+ P+ A A PQ++ +V + G S K+KRGRPRKY PDG ++L L P
Sbjct: 35 TTTANIMAPATARFPFASPQSEPFSVTHDGPSSPSTLGKKKRGRPRKYSPDGNIALGLAP 94
Query: 96 SPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPH 155
SS AT +G S P KK RGRPPGSG K QL+ALG+ G GFTPH
Sbjct: 95 V-SSPVAATSAASAGDSGNADAPP-----KKHRGRPPGSG---KKQLDALGAGGTGFTPH 145
Query: 156 VITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYE 207
VI V++GED++ K+M+FSQ GPR VCILSA GAIS+V LRQ A SG YE
Sbjct: 146 VILVESGEDITEKVMAFSQTGPRTVCILSAIGAISSVILRQPA-SGSIARYE 196
>gi|357482197|ref|XP_003611384.1| DNA binding protein [Medicago truncatula]
gi|355512719|gb|AES94342.1| DNA binding protein [Medicago truncatula]
Length = 339
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 106/269 (39%), Positives = 141/269 (52%), Gaps = 33/269 (12%)
Query: 68 SGSEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKS 127
SGS +K+KRGRPRKY D ++L+L P T T + S +KKS
Sbjct: 75 SGSGSIKKKRGRPRKYFLDDNITLSLGSGPIHDATITYPSNS-------------IVKKS 121
Query: 128 ---RGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRA-VCIL 183
RGRP GS KK ++E LG G F PH+I V GED+ K+M+ Q G + IL
Sbjct: 122 TRGRGRPRGSFK-KKQEVEVLGVTGTSFFPHLIIVNPGEDIVEKLMTCCQGGSNTEMSIL 180
Query: 184 SANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDG 243
SA+G + V+L + G VTYE +FE+LSL G+ S++SG + VSL P+
Sbjct: 181 SAHGLVGIVSLHR---EGRIVTYEDKFELLSLLGTLEPSDNSGGCKKMSNFKVSLLTPNS 237
Query: 244 RVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVPPKLAPGGQ---PAGQC 300
+L G V L AA+ V++ VGSF G+K S + +L V P L P Q PAG
Sbjct: 238 HLLAGVVVDKLIAASLVKITVGSFTLSGKKASSN-----NLKVGPSLTPSSQFAAPAGVI 292
Query: 301 SP-PSRGTLSESSGGPGSPLNHSTGACNN 328
S PS G+ SSG SP + +G NN
Sbjct: 293 SQGPSFGS---SSGNETSPFSQGSGIYNN 318
>gi|357438967|ref|XP_003589760.1| AT-hook DNA-binding protein [Medicago truncatula]
gi|355478808|gb|AES60011.1| AT-hook DNA-binding protein [Medicago truncatula]
Length = 359
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 123/339 (36%), Positives = 159/339 (46%), Gaps = 69/339 (20%)
Query: 31 TAVYKPITATSP-----TYQPSGAGGDGAIPQAQGLN---------VMNMGSGSEPMKRK 76
T + +PITA P T QP P ++ LN + SGS + +K
Sbjct: 37 TTMMEPITARFPQLHMNTNQP---------PHSEPLNNNIPSTLKPCVTASSGSGSIHKK 87
Query: 77 RGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPGSGS 136
+GRPRKY PDG ++ALV SP+ T T + S + G RGRP GS
Sbjct: 88 KGRPRKYFPDG--NIALVSSPALDATITSHSSSIANKSTRG----------RGRPRGS-L 134
Query: 137 GKKHQLEALGSAGVGFTPHVITVKAGE---------------DVSSKIMSFSQNGPRA-V 180
KK ++E G +G GF+ HVITV GE D+ K+ +F Q GP +
Sbjct: 135 NKKKKVEVSGVSGTGFSQHVITVNPGETLMMLRRWLLMYVEMDIVMKLKTFCQGGPNTDM 194
Query: 181 CILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSG 240
CILSA+G + V L Q SG V EGRFEILSLSG ++ G VSL
Sbjct: 195 CILSAHGLVGTVALHQ---SGTIVLREGRFEILSLSGMLEEFDNKNGFKTMGYFKVSLVD 251
Query: 241 PDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVPPKLAPGGQPAGQC 300
P+ VLGG VA L AA+ V+V+VGSF DG+ S S+ KL Q
Sbjct: 252 PNLNVLGGVVADKLIAASFVKVIVGSFTLDGKNCSSSNL---------KLGSSSMTISQF 302
Query: 301 SPPSRGTLSESSGGPGSPLNHSTGACNNNHLPQGMATGI 339
+ P T + +S GP S S G NN ++P GI
Sbjct: 303 AAPRTPTSAAASQGPSS---MSYG--NNENIPFDQVLGI 336
>gi|388516365|gb|AFK46244.1| unknown [Medicago truncatula]
Length = 198
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 72/138 (52%), Positives = 107/138 (77%), Gaps = 1/138 (0%)
Query: 146 GSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVT 205
GSAG GF PHVI + +GED+++KI++FSQ RA+C+LS++G++S+V +R+ + SGGT+
Sbjct: 3 GSAGTGFIPHVIEIASGEDIAAKILTFSQVRARALCVLSSSGSVSSVIIREPSISGGTLK 62
Query: 206 YEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVG 265
YEG F I+S+SG ++ +E+ R+R GGLS+SL GPDGR+ GG+V G L AA+PVQV++G
Sbjct: 63 YEGHFHIMSMSGCYVPTENGSSRNRDGGLSISLLGPDGRLFGGAVGGPLVAASPVQVMIG 122
Query: 266 SFLADGRKESKSSHRMES 283
SFL GR ++K+ + S
Sbjct: 123 SFLW-GRLKAKNKKKESS 139
>gi|357441297|ref|XP_003590926.1| SAP1 protein [Medicago truncatula]
gi|355479974|gb|AES61177.1| SAP1 protein [Medicago truncatula]
Length = 329
Score = 134 bits (337), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 70/131 (53%), Positives = 94/131 (71%), Gaps = 4/131 (3%)
Query: 138 KKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRA-VCILSANGAISNVTLRQ 196
KK +LG++ GF H +TV GED+ IM Q R +CILSA+G+IS+ TLRQ
Sbjct: 95 KKFHSSSLGNSREGFNIHFVTVAPGEDIGQNIMMLMQKNSRCEMCILSASGSISSATLRQ 154
Query: 197 AATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTA 256
ATSGG +TYEGRF+I+SL+GS++ +E G R+GGLSV LS DG+++GGS+AG L A
Sbjct: 155 PATSGGNITYEGRFDIISLTGSYVRNELDG---RSGGLSVCLSHSDGQLVGGSIAGPLKA 211
Query: 257 ATPVQVVVGSF 267
A+PVQV+ G+F
Sbjct: 212 ASPVQVIAGTF 222
>gi|218195851|gb|EEC78278.1| hypothetical protein OsI_17974 [Oryza sativa Indica Group]
Length = 471
Score = 134 bits (336), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 88/216 (40%), Positives = 122/216 (56%), Gaps = 43/216 (19%)
Query: 71 EPMKRKRGRPRKYGPDGTMSLA----------LVPSPSSVTTATGGTGSGLSSPGGGPLS 120
EP+KRKRGRPRKYGPDGTM ++ ++ +P + + +G G GG +
Sbjct: 123 EPVKRKRGRPRKYGPDGTMKVSTAAAAQHQQQMLSAPPRMGSVSGADMVG----GGSGMD 178
Query: 121 PDSIKKSRGRPPGSGSGKKHQLEA----------LGSAGVGFTPHVITVKAGEDVSSKIM 170
+ KK RGRPPG+G KK QL + GSAG FTPH+IT EDV+ KI+
Sbjct: 179 DSAQKKRRGRPPGTG--KKQQLSSPVKLSGGNAFSGSAGTSFTPHIITASPSEDVAGKIV 236
Query: 171 SFSQNGPRAVCILSANGAISNVTLRQAATSGGT-----------VTYEGRFEILSLSGSF 219
+F+ + RAVC+LSA G++S V LR A + YEG +EILS+SG +
Sbjct: 237 AFANHSSRAVCVLSATGSVSRVVLRHPADGAMSRVHASSHYKNPAIYEGLYEILSMSGCY 296
Query: 220 -LLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLL 254
L++E GQ + GLSV+L P+ ++GG + G L
Sbjct: 297 NLMNE--GQ---SDGLSVTLCSPERHIIGGVLGGAL 327
>gi|388523041|gb|AFK49582.1| unknown [Medicago truncatula]
Length = 329
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 69/131 (52%), Positives = 94/131 (71%), Gaps = 4/131 (3%)
Query: 138 KKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRA-VCILSANGAISNVTLRQ 196
KK +LG++ GF H +TV GED+ IM Q R +CILSA+G+IS+ TLRQ
Sbjct: 95 KKFHSSSLGNSREGFNIHFVTVAPGEDIGQNIMMLMQKNSRCEMCILSASGSISSATLRQ 154
Query: 197 AATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTA 256
AT+GG +TYEGRF+I+SL+GS++ +E G R+GGLSV LS DG+++GGS+AG L A
Sbjct: 155 PATTGGNITYEGRFDIISLTGSYVRNELDG---RSGGLSVCLSHSDGQLVGGSIAGPLKA 211
Query: 257 ATPVQVVVGSF 267
A+PVQV+ G+F
Sbjct: 212 ASPVQVIAGTF 222
>gi|255573022|ref|XP_002527441.1| DNA binding protein, putative [Ricinus communis]
gi|223533176|gb|EEF34933.1| DNA binding protein, putative [Ricinus communis]
Length = 353
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 95/216 (43%), Positives = 125/216 (57%), Gaps = 16/216 (7%)
Query: 73 MKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPP 132
+KRKRGRPRK+ + ++++ + + + L S + RGR
Sbjct: 59 VKRKRGRPRKFDHHHHHHHIQMDHENTMSNVSPSSSNFLRSC-----------EKRGRGR 107
Query: 133 GSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANG 187
GSG+ L ALG +AG PHVITV GED+ SKI SF+Q GPRAVC+LSA G
Sbjct: 108 PRGSGRLQLLAALGGFAAETAGGILIPHVITVNTGEDIVSKISSFAQRGPRAVCVLSATG 167
Query: 188 AISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLG 247
+S V +RQ +SGG + EG FEILSLSGSF E+S R + G LSV+L+ PDG+V G
Sbjct: 168 VVSCVIIRQPGSSGGLLRCEGHFEILSLSGSFTFRETSTARRKIGVLSVTLAKPDGQVFG 227
Query: 248 GSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMES 283
G V G L A+ P+Q++V SF + KE K ES
Sbjct: 228 GGVVGSLIASGPIQLIVASFKQNISKELKLRQSSES 263
>gi|168000569|ref|XP_001752988.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695687|gb|EDQ82029.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 156
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 72/144 (50%), Positives = 94/144 (65%), Gaps = 3/144 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
+K RGRPPGS + K + + G PH++ V G DVS + SFS+ R VC++
Sbjct: 2 RKPRGRPPGSKNKPKPPIIIMRENGQAMRPHILEVAGGCDVSDSVASFSRRRQRGVCVMG 61
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
A+G +SNVTLRQ T+G T+T+ GRFEI+SLSG+FL SS T GL+VSL+G G+
Sbjct: 62 ASGTVSNVTLRQPTTAGATITFHGRFEIISLSGAFLPHPSS---QPTTGLTVSLAGAAGQ 118
Query: 245 VLGGSVAGLLTAATPVQVVVGSFL 268
VLGGSV G L AA PV V+ SF+
Sbjct: 119 VLGGSVVGTLMAAGPVVVIAASFM 142
>gi|414588596|tpg|DAA39167.1| TPA: hypothetical protein ZEAMMB73_847336 [Zea mays]
Length = 199
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 74/107 (69%), Positives = 87/107 (81%), Gaps = 2/107 (1%)
Query: 170 MSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRS 229
MSFSQ GPR+VCILSANG ISNVTLRQ +SG T TYEGRFEIL L GSF ++E R
Sbjct: 1 MSFSQKGPRSVCILSANGTISNVTLRQPGSSGSTFTYEGRFEILQLMGSFTMAEEG--RK 58
Query: 230 RTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESK 276
RTGGLSVSL+GPDGRV+GG VAG+L AA+P+QV+VGSFL + K+ +
Sbjct: 59 RTGGLSVSLAGPDGRVVGGVVAGMLRAASPIQVIVGSFLPNSLKQHQ 105
>gi|413923989|gb|AFW63921.1| hypothetical protein ZEAMMB73_149666 [Zea mays]
Length = 356
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 100/242 (41%), Positives = 130/242 (53%), Gaps = 68/242 (28%)
Query: 65 NMGSG---SEPMKRKRGRPRKYGPDGTMSLALVPSPSSVT--TATGGTGSGLSSPGGGPL 119
+GSG E +K+KRGRPRKY PDG ++L L PS SS+T +A+ G G+ +S+PG G
Sbjct: 110 ELGSGPAQDEQVKKKRGRPRKYKPDGAVTLGLSPS-SSLTPHSASLGMGTMISAPGSGFG 168
Query: 120 SPD---------SIKKSRGRPPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDV 165
S S K+ RGRPPGSG K QL +LG S G GFTPHVI ++ GE
Sbjct: 169 SEGSGASGLGAPSEKRGRGRPPGSG--KMQQLASLGKWFLGSVGTGFTPHVIIIQPGE-- 224
Query: 166 SSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLL-SES 224
GRFEIL LSGS+L+ E
Sbjct: 225 ------------------------------------------GRFEILCLSGSYLVVDEG 242
Query: 225 SGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFL-ADGRKESKSSHRMES 283
G R+R+GGL ++L GPD RV+GGSV G+L AA VQV+VGSF+ G K++K +++
Sbjct: 243 GGARTRSGGLCIALCGPDNRVIGGSVGGVLMAAGAVQVIVGSFMYGGGSKKNKVKAELDA 302
Query: 284 LP 285
P
Sbjct: 303 EP 304
>gi|357472019|ref|XP_003606294.1| F-box/LRR-repeat protein [Medicago truncatula]
gi|355507349|gb|AES88491.1| F-box/LRR-repeat protein [Medicago truncatula]
Length = 1048
Score = 128 bits (321), Expect = 4e-27, Method: Composition-based stats.
Identities = 72/147 (48%), Positives = 91/147 (61%), Gaps = 8/147 (5%)
Query: 123 SIKKSRGRPPGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGP 177
++ + R S G++ Q+E G +AG +PHV+ VK GEDV KI +F Q GP
Sbjct: 448 TLSSKKRRVEKSLRGQRFQIEVQGGCVGETAGGTMSPHVLIVKPGEDVVGKIFAFYQKGP 507
Query: 178 R-AVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGS--FLLSESSGQRSRTGGL 234
AVCILSA G IS+VT+RQ + S G +TYEG FEILSLSGS F + G + + G L
Sbjct: 508 SSAVCILSATGTISSVTIRQPSASDGFLTYEGHFEILSLSGSCTFTSGAAGGAQRKIGML 567
Query: 235 SVSLSGPDGRVLGGSVAGLLTAATPVQ 261
SVSL+ P+G V GG V L AATP Q
Sbjct: 568 SVSLAKPNGEVFGGGVENTLIAATPTQ 594
>gi|168026651|ref|XP_001765845.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683022|gb|EDQ69436.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 165
Score = 127 bits (319), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 73/144 (50%), Positives = 92/144 (63%), Gaps = 3/144 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
+K RGRPPGS + K + G PH++ V G DVS + SFS+ R VC++
Sbjct: 2 RKPRGRPPGSKNKPKPPVIITRENGNAMRPHILEVAGGCDVSDSVASFSRRRQRGVCVMG 61
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
A+G +SNVTLRQ T G TVT+ GRFEI+SLSG+FL SS T GL+VSL+G G+
Sbjct: 62 ASGTVSNVTLRQPTTPGATVTFHGRFEIISLSGAFLPHPSSAP---TTGLTVSLAGAAGQ 118
Query: 245 VLGGSVAGLLTAATPVQVVVGSFL 268
VLGGSV G L AA PV V+ SF+
Sbjct: 119 VLGGSVVGTLMAAGPVLVIAASFI 142
>gi|168020982|ref|XP_001763021.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685833|gb|EDQ72226.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 162
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 71/144 (49%), Positives = 91/144 (63%), Gaps = 3/144 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
+K RGRPPGS + K + G PH++ V G DV + SFS+ R +C++
Sbjct: 1 RKPRGRPPGSKNKPKPPVIITRENGNAMRPHILEVAGGCDVGDSVASFSRRRQRGICVMG 60
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
A+G +SNVTLRQ T G TVT+ GRFEI+SLSG+FL SS T GL+VSL+G G+
Sbjct: 61 ASGTVSNVTLRQPTTPGATVTFHGRFEIISLSGAFLPHPSSAP---TTGLTVSLAGAAGQ 117
Query: 245 VLGGSVAGLLTAATPVQVVVGSFL 268
VLGGSV G L AA PV V+ SF+
Sbjct: 118 VLGGSVVGTLMAAGPVLVIAASFI 141
>gi|357481877|ref|XP_003611224.1| DNA-binding protein [Medicago truncatula]
gi|355512559|gb|AES94182.1| DNA-binding protein [Medicago truncatula]
Length = 328
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 69/138 (50%), Positives = 93/138 (67%), Gaps = 7/138 (5%)
Query: 141 QLEALGSAGVGFTPHV--ITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAA 198
++E S FTPH+ ITVKAGE+V+ K+MS + P A+CILSA G IS+ T+ Q
Sbjct: 45 EVEHQVSNATAFTPHISIITVKAGENVTMKVMSSCRKEPEAICILSAIGVISSATISQPH 104
Query: 199 TSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAAT 258
+S TYEG++ I+SLSG F+ +ES G GG+S+SL G DG V+ G VAG L A +
Sbjct: 105 SSEKLSTYEGKYCIVSLSGPFMPNESRG-----GGMSISLMGLDGHVVEGCVAGPLMAES 159
Query: 259 PVQVVVGSFLADGRKESK 276
PV+VVVGSF+A+ + E K
Sbjct: 160 PVKVVVGSFMANEQHEQK 177
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 49/98 (50%), Positives = 62/98 (63%), Gaps = 3/98 (3%)
Query: 147 SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTY 206
S G TPH+I V AGEDV+ KIMSF A+ ILSANG S T+ + SG TY
Sbjct: 207 SVGAALTPHIIIVNAGEDVTRKIMSFCCQRHVAISILSANGVASRATINRPQASGTFYTY 266
Query: 207 EGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
EGR++I SLSG F+ ES G R+G ++VSL+ DG+
Sbjct: 267 EGRYDIQSLSGWFMPMESRG---RSGDMNVSLADLDGK 301
>gi|242060318|ref|XP_002451448.1| hypothetical protein SORBIDRAFT_04g002140 [Sorghum bicolor]
gi|241931279|gb|EES04424.1| hypothetical protein SORBIDRAFT_04g002140 [Sorghum bicolor]
Length = 353
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 95/229 (41%), Positives = 114/229 (49%), Gaps = 59/229 (25%)
Query: 74 KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPG 133
KRKRGRPRKYGPDGT L +P S +A G G +P + +K+ RGRP G
Sbjct: 57 KRKRGRPRKYGPDGTPLRPLNATPIS-ASAPDDAGVGQYTPAAAVGA--VMKRGRGRPVG 113
Query: 134 SGS-----------------------------GKKHQLEALG-----SAGVGFTPHVITV 159
S QL LG ++G FTPH+I V
Sbjct: 114 FISRVTPISVAVTAAAPTPAVVVSAPPPAPAPAPHSQLAPLGELVACASGANFTPHIINV 173
Query: 160 KAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSF 219
AGE +I+ T R AAT T GRFE+LSLSGSF
Sbjct: 174 AAGEAPHIEILKEELQ-----------------TSRNAAT-----TLRGRFELLSLSGSF 211
Query: 220 LLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFL 268
++S G RSR+GG+SVSL+ DGRV+GG VAGLL AA+PVQVVVGSFL
Sbjct: 212 TPTDSGGTRSRSGGMSVSLAAADGRVIGGGVAGLLVAASPVQVVVGSFL 260
>gi|297827141|ref|XP_002881453.1| hypothetical protein ARALYDRAFT_482633 [Arabidopsis lyrata subsp.
lyrata]
gi|297327292|gb|EFH57712.1| hypothetical protein ARALYDRAFT_482633 [Arabidopsis lyrata subsp.
lyrata]
Length = 411
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 59/126 (46%), Positives = 87/126 (69%), Gaps = 1/126 (0%)
Query: 144 ALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGT 203
++G G F PH+ TV GED+ +IMSF++NG R + +LSANGA++NV ++ ++S
Sbjct: 69 SMGFGGGDFKPHMFTVNKGEDIIKRIMSFTENGSRGISVLSANGAVANVKIQLHSSSRRV 128
Query: 204 VTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSG-PDGRVLGGSVAGLLTAATPVQV 262
VTY+ +EI+SLS + +SES G + +TGG + + G P V GG++AG L AA+PVQV
Sbjct: 129 VTYKDEYEIVSLSNTMAISESGGVKHKTGGWRIMIGGAPGASVFGGTLAGSLIAASPVQV 188
Query: 263 VVGSFL 268
V+GSF
Sbjct: 189 VIGSFW 194
>gi|147815748|emb|CAN74881.1| hypothetical protein VITISV_001409 [Vitis vinifera]
Length = 313
Score = 120 bits (301), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 74/139 (53%), Positives = 89/139 (64%), Gaps = 6/139 (4%)
Query: 207 EGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGS 266
+GRF+I+SLSGSFLLSE +G R RTGGLSVSL+G DGRVLGG VAG+LTAATPVQVVVGS
Sbjct: 167 QGRFDIISLSGSFLLSEDNGSRHRTGGLSVSLAGSDGRVLGGGVAGMLTAATPVQVVVGS 226
Query: 267 FLADGRKESKSSHRMESLPVPPKLAPGGQPAGQCSPPSRGTLSESSGGPGSPLNHSTGAC 326
F+ADG+K + + S P P ++ G P SP G+ S GSPLN
Sbjct: 227 FIADGKKTNTNQSGSSSAP-PAQMLNFGAPVVPASPSQGGSSESSDENGGSPLNRGPLPY 285
Query: 327 NN-----NHLPQGMATGIP 340
NN + +P A G P
Sbjct: 286 NNVSQPIHQMPMYAAMGWP 304
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 34/45 (75%), Positives = 40/45 (88%)
Query: 163 EDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYE 207
+D++SKIM+FSQ GPR VCILSANGAI NVTLRQ A SGGT++YE
Sbjct: 7 KDIASKIMAFSQQGPRTVCILSANGAICNVTLRQPAMSGGTISYE 51
>gi|168009644|ref|XP_001757515.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691209|gb|EDQ77572.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 156
Score = 120 bits (301), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 70/144 (48%), Positives = 90/144 (62%), Gaps = 3/144 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
+K RGRPPGS + K + G PH++ + G DV + SFS+ R V +L
Sbjct: 1 RKPRGRPPGSKNKPKPPIIITRENGQAMRPHILEIAGGCDVGDSVASFSRRRQRGVHVLG 60
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
A+G +SNVTLRQ T G TVT+ GRFEI+SLSG+FL +S T GL+V+L+G G+
Sbjct: 61 ASGIVSNVTLRQPTTPGATVTFHGRFEIISLSGAFLPHLTS---QPTTGLTVTLAGAAGQ 117
Query: 245 VLGGSVAGLLTAATPVQVVVGSFL 268
VLGGSV G L AA PV V+ SFL
Sbjct: 118 VLGGSVVGTLMAAGPVLVIAASFL 141
>gi|346703299|emb|CBX25397.1| hypothetical_protein [Oryza brachyantha]
Length = 371
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 70/114 (61%), Positives = 91/114 (79%), Gaps = 3/114 (2%)
Query: 163 EDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLS 222
V+++IMSFSQ GPR+VCILSANG IS+V L Q +SG T +YE FEIL L+GSF ++
Sbjct: 155 HHVAARIMSFSQKGPRSVCILSANGTISSVALNQPGSSGSTFSYE--FEILQLTGSFTIA 212
Query: 223 ESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESK 276
+ G+R RTGGLSVSL+GPDGRV+GG VAG+L AA+P+QV+VGSFL + K+ +
Sbjct: 213 KEGGRR-RTGGLSVSLAGPDGRVVGGVVAGMLRAASPIQVIVGSFLPNSLKQHQ 265
>gi|148909040|gb|ABR17623.1| unknown [Picea sitchensis]
Length = 271
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 67/143 (46%), Positives = 89/143 (62%), Gaps = 2/143 (1%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
+K RGRPPGS + K + + PH++ V G DV + F +C+LS
Sbjct: 53 RKPRGRPPGSKNKAKPPVVITRDSEDAMRPHILEVAGGHDVVECLTQFCGRRQVGLCVLS 112
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
G ++NVT+RQA +G TVT+ GRFEILSLSG++ + SG S GLS+SL+G G+
Sbjct: 113 GRGMVTNVTIRQATGTGSTVTFHGRFEILSLSGAY--TAPSGASSSPCGLSISLAGAQGQ 170
Query: 245 VLGGSVAGLLTAATPVQVVVGSF 267
VLGGSVAG+L AA PV V+V SF
Sbjct: 171 VLGGSVAGVLRAAGPVIVIVASF 193
>gi|168067305|ref|XP_001785561.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162662818|gb|EDQ49626.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 155
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 65/143 (45%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
+K RGRPPGS + K + G PHV+ V +G DV + F++ R VC++
Sbjct: 2 RKPRGRPPGSKNKPKPPVIITRENGNAMRPHVLEVASGHDVWESVTDFARRRQRGVCVMG 61
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
+G ++NVTLRQ T G TVT GRFEI+SLSGS+L + + GL++S +G G+
Sbjct: 62 GSGTVTNVTLRQPTTPGATVTIHGRFEIISLSGSYLPPPAPSPPT---GLTISFAGASGQ 118
Query: 245 VLGGSVAGLLTAATPVQVVVGSF 267
VLGG VAG LTAA+PV V+ SF
Sbjct: 119 VLGGCVAGALTAASPVLVIATSF 141
>gi|302772392|ref|XP_002969614.1| hypothetical protein SELMODRAFT_71342 [Selaginella moellendorffii]
gi|302774925|ref|XP_002970879.1| hypothetical protein SELMODRAFT_71343 [Selaginella moellendorffii]
gi|300161590|gb|EFJ28205.1| hypothetical protein SELMODRAFT_71343 [Selaginella moellendorffii]
gi|300163090|gb|EFJ29702.1| hypothetical protein SELMODRAFT_71342 [Selaginella moellendorffii]
Length = 217
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 65/148 (43%), Positives = 93/148 (62%), Gaps = 4/148 (2%)
Query: 122 DSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVC 181
+ ++K RGRPPGS + K + +G PHV+ + G DV + +F++ R +C
Sbjct: 7 EIVRKPRGRPPGSKNKPKPPIIITRDSGNAMRPHVLEIAGGCDVGETLAAFARRRQRGLC 66
Query: 182 ILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGP 241
+L +G ++NVTLRQ A G TVT+ GRFEILSLSG+FL + GL+V+L+G
Sbjct: 67 VLGGSGTVANVTLRQLAAPGSTVTFHGRFEILSLSGAFLPPPAP---VAVAGLTVALAGS 123
Query: 242 D-GRVLGGSVAGLLTAATPVQVVVGSFL 268
G+VLGGSV G+L AA+PV V+ SF+
Sbjct: 124 QPGQVLGGSVVGVLMAASPVLVIAASFV 151
>gi|168016851|ref|XP_001760962.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687971|gb|EDQ74351.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 159
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 64/144 (44%), Positives = 88/144 (61%), Gaps = 1/144 (0%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
+K RGRPPGS + K + G PH++ V +G DV + F++ R +C++
Sbjct: 2 RKPRGRPPGSKNKPKPPVIITRENGNAMRPHILEVASGHDVWESVADFARRRQRGICVMG 61
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRS-RTGGLSVSLSGPDG 243
+G ++NVTLRQ+ T G TVT GRFEI+SLSGS+L S + T GL++S +G G
Sbjct: 62 GSGTVTNVTLRQSTTPGATVTIHGRFEIISLSGSYLPPPSPTPPAGLTTGLTISFAGASG 121
Query: 244 RVLGGSVAGLLTAATPVQVVVGSF 267
+VLGG V G L AA+PV VV SF
Sbjct: 122 QVLGGCVVGALMAASPVLVVATSF 145
>gi|356507995|ref|XP_003522748.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Glycine max]
Length = 280
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 66/173 (38%), Positives = 99/173 (57%), Gaps = 1/173 (0%)
Query: 96 SPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPH 155
S TT T T +S GG + + +++ RGRPPGS + K + +P+
Sbjct: 40 SAEDATTITPSTAQKANSSGGDGATIEVVRRPRGRPPGSKNKPKPPVIITRDPEPAMSPY 99
Query: 156 VITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATS-GGTVTYEGRFEILS 214
++ V G DV I FS +C+L+ +G ++NVTLRQ +T+ G TVT+ GRF+ILS
Sbjct: 100 ILEVSGGNDVVEAIAQFSHRKNMGICVLTGSGTVANVTLRQPSTTPGTTVTFHGRFDILS 159
Query: 215 LSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
+S +FL +S + G ++SL+GP G+++GG VAG L AA V V+ SF
Sbjct: 160 VSATFLPQQSGASPAVPNGFAISLAGPQGQIVGGLVAGGLMAAGTVFVIAASF 212
>gi|357441299|ref|XP_003590927.1| SAP1 protein [Medicago truncatula]
gi|355479975|gb|AES61178.1| SAP1 protein [Medicago truncatula]
Length = 217
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 61/114 (53%), Positives = 84/114 (73%), Gaps = 7/114 (6%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRA-VCILSANGAISNVTLRQAATSGGTVTYEGRFEIL 213
+ +TV+ D+ IM Q R +CILSA+G+IS+ TLRQ ATSGG +TYEGRF+I+
Sbjct: 3 YELTVR---DIGQNIMMLMQKNSRCEMCILSASGSISSATLRQPATSGGNITYEGRFDII 59
Query: 214 SLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
SL+GS++ +E G R+GGLSV LS DG+++GGS+AG L AA+PVQV+ G+F
Sbjct: 60 SLTGSYVRNELDG---RSGGLSVCLSHSDGQLVGGSIAGPLKAASPVQVIAGTF 110
>gi|15227997|ref|NP_181195.1| AT hook motif DNA-binding family protein [Arabidopsis thaliana]
gi|4581154|gb|AAD24638.1| hypothetical protein [Arabidopsis thaliana]
gi|330254174|gb|AEC09268.1| AT hook motif DNA-binding family protein [Arabidopsis thaliana]
Length = 574
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 58/130 (44%), Positives = 90/130 (69%), Gaps = 5/130 (3%)
Query: 152 FTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFE 211
FTPH TV GED+ +IMSF+ NG R + +LS NGA++NVT+ +S +T++ +E
Sbjct: 105 FTPHSFTVNKGEDIIKRIMSFTANGSRGISVLSVNGAVANVTILPHGSSRRVMTFKEEYE 164
Query: 212 ILSLSGSFL-LSESSGQRSRTGGLSVSLSGPD-GRVLGGSVAGLLTAATPVQVVVGSF-- 267
I+SL+ + + +SES G +++TGG +++ G GRV GG++AG L AA+PVQVV+GSF
Sbjct: 165 IVSLTNNTMAISESGGVKNKTGGWRITIGGAAGGRVHGGALAGSLIAASPVQVVIGSFWP 224
Query: 268 -LADGRKESK 276
+ + R++ K
Sbjct: 225 LITNSRQKRK 234
>gi|357438971|ref|XP_003589762.1| AT-hook protein [Medicago truncatula]
gi|355478810|gb|AES60013.1| AT-hook protein [Medicago truncatula]
Length = 395
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 93/275 (33%), Positives = 133/275 (48%), Gaps = 53/275 (19%)
Query: 68 SGSEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKS 127
SGS +++KRGRPR+Y DG ++ S +T G
Sbjct: 76 SGSGSIQKKRGRPREYFLDGYIA-------SIAKRSTRG--------------------- 107
Query: 128 RGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRA-VCILSAN 186
RGRP GS + KK ++EA G G F+ HVITV G+D+ +K+ + Q GP +CILSA+
Sbjct: 108 RGRPHGSLNKKK-KVEAPGVTGTDFSQHVITVNPGDDIVAKLKTCCQGGPNTEMCILSAH 166
Query: 187 GAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVL 246
G + V L Q G EG+FEILSLSG + +++ R +VSL P+ V
Sbjct: 167 GLVGTVALHQP---GRIFICEGQFEILSLSGMLEVFDNNNGFKRMNYFTVSLVEPNSNVF 223
Query: 247 GGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRM--ESLPVPPKLAPGGQPAGQCSPPS 304
GG V L+ AA+ V+V V F D + S S+ + S+P+ + A G P
Sbjct: 224 GGVVDKLI-AASLVKVKVACFTLDDKNGSSSNLNLGPSSIPI-SQFAAFGTPT------- 274
Query: 305 RGTLSESSGGPGSPLNHSTGACNNNHLPQGMATGI 339
S ++ GP S NN ++P G+ GI
Sbjct: 275 ----SATTQGPS-----SISLSNNENIPLGLGHGI 300
>gi|356515688|ref|XP_003526530.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Glycine max]
Length = 284
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 61/157 (38%), Positives = 95/157 (60%), Gaps = 1/157 (0%)
Query: 112 SSPGGGPLSPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMS 171
+S GG + + +++ RGRPPGS + K + +P+++ V G DV I
Sbjct: 61 NSSGGDGATIEVVRRPRGRPPGSKNKPKPPVIITRDPEPAMSPYILEVSGGNDVVEAIAQ 120
Query: 172 FSQNGPRAVCILSANGAISNVTLRQAATS-GGTVTYEGRFEILSLSGSFLLSESSGQRSR 230
FS+ +C+L+ +G ++NVTLRQ +T+ G TVT+ GRF+ILS+S +FL +S +
Sbjct: 121 FSRRKNMGICVLTGSGTVANVTLRQPSTTPGTTVTFHGRFDILSVSATFLPQQSGASPAV 180
Query: 231 TGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
G ++SL+GP G+++GG VAG L AA V V+ SF
Sbjct: 181 PNGFAISLAGPQGQIVGGLVAGGLMAAGTVFVIAASF 217
>gi|224127406|ref|XP_002320066.1| predicted protein [Populus trichocarpa]
gi|222860839|gb|EEE98381.1| predicted protein [Populus trichocarpa]
Length = 300
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 62/143 (43%), Positives = 85/143 (59%), Gaps = 3/143 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + HV+ + G D+ + +F++ R VCILS
Sbjct: 79 RRPRGRPAGSKNKPKPPIIITRDSANALRSHVMEIATGSDIMESVSTFARRRQRGVCILS 138
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
G ++NVTL+Q A+ G VT GRFEILSLSGSFL + S GL+V L+G G+
Sbjct: 139 GTGTVTNVTLKQPASPGAVVTLHGRFEILSLSGSFLPPPAPPAAS---GLTVYLAGGQGQ 195
Query: 245 VLGGSVAGLLTAATPVQVVVGSF 267
V+GGSVAG L A+ PV V+ SF
Sbjct: 196 VIGGSVAGPLLASGPVVVMAASF 218
>gi|413920025|gb|AFW59957.1| hypothetical protein ZEAMMB73_895910 [Zea mays]
Length = 267
Score = 107 bits (267), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 63/135 (46%), Positives = 80/135 (59%), Gaps = 13/135 (9%)
Query: 70 SEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRG 129
+EP+KRKRGRPRKYGPDGTM + + G +G + S G + S KK RG
Sbjct: 105 TEPVKRKRGRPRKYGPDGTMKQQQL---VAAQPRIGPSGPNMISSAG--IEDSSQKKRRG 159
Query: 130 RPPGSGSGKKHQLEA------LGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCIL 183
RPPG+ KKHQ GSAG FTPH+IT EDV++KI++F+ RAVC+L
Sbjct: 160 RPPGTA--KKHQPSPSQGNAFAGSAGTSFTPHIITASPSEDVAAKIVAFATQSSRAVCVL 217
Query: 184 SANGAISNVTLRQAA 198
SA G++S LR A
Sbjct: 218 SAMGSVSRAVLRHPA 232
>gi|326508248|dbj|BAJ99391.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 275
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 76/197 (38%), Positives = 94/197 (47%), Gaps = 57/197 (28%)
Query: 74 KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDS-----IKKSR 128
KRKRGRPRKYGPDG + L +P S + GGG +P S +K+ R
Sbjct: 70 KRKRGRPRKYGPDGGLLRPLNATPISASVPDDS--------GGGHYTPASAVGAAMKRGR 121
Query: 129 GRPPGSGSGK--------------------------------------KHQLEALG---- 146
GRP G S +H LG
Sbjct: 122 GRPVGFISRAAPVVAVPVTAATPTPAVVVSTPPPPAPVSVAAPAAPTPQHLAPPLGDVVG 181
Query: 147 -SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVT 205
++G FTPH++ V GED++ K++SFSQ GPRA+CILSANG ISNVTLRQ + GGTVT
Sbjct: 182 CASGANFTPHILNVATGEDINMKVISFSQQGPRAICILSANGVISNVTLRQHDSLGGTVT 241
Query: 206 YEGRFEILSLSGSFLLS 222
YE +L FL S
Sbjct: 242 YE-VCSLLCKPSIFLYS 257
>gi|302794765|ref|XP_002979146.1| hypothetical protein SELMODRAFT_57074 [Selaginella moellendorffii]
gi|302813662|ref|XP_002988516.1| hypothetical protein SELMODRAFT_47043 [Selaginella moellendorffii]
gi|300143623|gb|EFJ10312.1| hypothetical protein SELMODRAFT_47043 [Selaginella moellendorffii]
gi|300152914|gb|EFJ19554.1| hypothetical protein SELMODRAFT_57074 [Selaginella moellendorffii]
Length = 173
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 58/156 (37%), Positives = 89/156 (57%), Gaps = 11/156 (7%)
Query: 124 IKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCIL 183
++K RGRPPGS + K + G G PHV+ + +G DV I +F++ R++C+L
Sbjct: 2 VRKPRGRPPGSKNKPKPPIIITRETGTGMRPHVLEIASGCDVHECIATFARRRQRSLCVL 61
Query: 184 SANGAISNVTLRQAAT-----SGGTVTYEGRFEILSLSGSFL------LSESSGQRSRTG 232
A+G +SNVTLRQ S +T GRF+ILS+SG+F+ +
Sbjct: 62 GASGTVSNVTLRQPTVPPGGNSASVLTLHGRFDILSMSGTFMQPTAPQPLMPMPLPPTSS 121
Query: 233 GLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFL 268
GL++S++G G+V+GG V G L + +P+ V+ SFL
Sbjct: 122 GLTISMAGAQGQVIGGLVVGALMSVSPILVIAASFL 157
>gi|15232970|ref|NP_191646.1| AT-hook motif nuclear-localized protein 18 [Arabidopsis thaliana]
gi|7329697|emb|CAB82691.1| putative protein [Arabidopsis thaliana]
gi|119657380|tpd|FAA00289.1| TPA: AT-hook motif nuclear localized protein 18 [Arabidopsis
thaliana]
gi|332646598|gb|AEE80119.1| AT-hook motif nuclear-localized protein 18 [Arabidopsis thaliana]
Length = 265
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 66/152 (43%), Positives = 89/152 (58%), Gaps = 7/152 (4%)
Query: 118 PLSPDSIKKSR--GRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQN 175
P ++IKK R GRP GS + K + + F HV+ + DV + F++
Sbjct: 50 PSGEENIKKRRPRGRPAGSKNKPKAPIIVTRDSANAFRCHVMEITNACDVMESLAVFARR 109
Query: 176 GPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLS 235
R VC+L+ NGA++NVT+RQ GG V+ GRFEILSLSGSFL + S GL
Sbjct: 110 RQRGVCVLTGNGAVTNVTVRQPG--GGVVSLHGRFEILSLSGSFLPPPAPPAAS---GLK 164
Query: 236 VSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
V L+G G+V+GGSV G LTA++PV V+ SF
Sbjct: 165 VYLAGGQGQVIGGSVVGPLTASSPVVVMAASF 196
>gi|294461824|gb|ADE76470.1| unknown [Picea sitchensis]
Length = 294
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 67/160 (41%), Positives = 92/160 (57%), Gaps = 9/160 (5%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
+K RGRPPGS + K + PHV+ V G DV ++ F + +CI+S
Sbjct: 76 RKPRGRPPGSKNKPKPPIIITRDNENAMRPHVLEVAVGCDVGESVLQFVRRRQIGLCIMS 135
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSF---LLSESSGQRSRTGGLSVSLSGP 241
+G +++VTLRQ G + + GRFEILSLSG + S SS S +GGL++SL+G
Sbjct: 136 GSGTVASVTLRQPTVPGAPLNFRGRFEILSLSGMYLPSPSSSSSSSSSLSGGLTISLAGA 195
Query: 242 DGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRM 281
G+V+GGSVAG LTAA PV ++ SF S S HR+
Sbjct: 196 QGQVVGGSVAGELTAAGPVTIIAASF------TSPSYHRL 229
>gi|356574748|ref|XP_003555507.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Glycine max]
Length = 324
Score = 104 bits (259), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 58/143 (40%), Positives = 85/143 (59%), Gaps = 3/143 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRPPGS + K + + HV+ V G DV+ + F++ R VC+LS
Sbjct: 103 RRPRGRPPGSKNKPKPPIFVTRDSPNTLRSHVMEVTGGADVAESVAQFARRRQRGVCVLS 162
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
+G+++NVTLRQ + G V GRFEILSL+G+FL + + GL+V L+G G+
Sbjct: 163 GSGSVANVTLRQPSAPGAVVALHGRFEILSLTGTFLPGPAPPGST---GLTVYLTGGQGQ 219
Query: 245 VLGGSVAGLLTAATPVQVVVGSF 267
++GGSV G L AA PV V+ +F
Sbjct: 220 IVGGSVVGSLVAAGPVMVIAATF 242
>gi|449461381|ref|XP_004148420.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Cucumis
sativus]
gi|449529176|ref|XP_004171577.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Cucumis
sativus]
Length = 286
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 68/178 (38%), Positives = 95/178 (53%), Gaps = 11/178 (6%)
Query: 91 LALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPGSGSGKKHQLEALGSAGV 150
+A +PSP T T G + + +++ RGRPPGS + K L
Sbjct: 40 IAALPSPFKHHTDLTSTADGSTI--------EVVRRPRGRPPGSKNKPKPPLVVTREPEP 91
Query: 151 GFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQ-AATSGGTVTYEGR 209
P+V+ V G DV I FS+ +C+L+ +G ++NV+LRQ +AT G TVT+ GR
Sbjct: 92 AMRPYVLEVPGGNDVVEAISRFSRRKNLGLCVLNGSGTVANVSLRQPSATPGATVTFHGR 151
Query: 210 FEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
FEILS+S + S G S+SL+GP G+++GG VAG L AA V VV SF
Sbjct: 152 FEILSISATVF--PQSTPLPLPNGFSISLAGPQGQIVGGLVAGALIAAGTVFVVASSF 207
>gi|449465880|ref|XP_004150655.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Cucumis
sativus]
Length = 281
Score = 103 bits (258), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 77/215 (35%), Positives = 109/215 (50%), Gaps = 16/215 (7%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++SRGRPPGS + +K + + + HVI + G DV+ I F R VC+LS
Sbjct: 71 RRSRGRPPGSKNKRKSPIIVTRDSPHTLSTHVIEIVGGADVADSINQFCCRRQRGVCVLS 130
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
+G + +VT+RQ+A SG + GRFEILS+SGSFL + GL+V L+G G+
Sbjct: 131 GSGTVVDVTVRQSAGSGAVIQLRGRFEILSVSGSFLPGRDPPCST---GLTVYLAGGQGQ 187
Query: 245 VLGGSVAGLLTAATPVQVVVGSFLADGRKES---KSSHRMESLPVPPKLAPGGQPAGQCS 301
V+GG+V G L A PV ++ +F A+ E + H E V +P AG+
Sbjct: 188 VIGGTVVGPLLAGGPVILIAATF-ANATYERLPLQHHHNYEEREV----SPATTSAGELE 242
Query: 302 PPSRGTLSESSGGPGSPLNHSTGACNNNHLPQGMA 336
P E+S P N+ NNNH G A
Sbjct: 243 EPLPYPRIETSIYDLIPPNN-----NNNHALDGYA 272
>gi|225459109|ref|XP_002285689.1| PREDICTED: uncharacterized protein LOC100255831 [Vitis vinifera]
Length = 309
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 82/213 (38%), Positives = 116/213 (54%), Gaps = 18/213 (8%)
Query: 122 DSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVC 181
+ +++ RGRPPGS + K + +P+V+ V G D+ I FS+ +C
Sbjct: 96 EVVRRPRGRPPGSKNKPKPPVIITRDTEPAMSPYVLEVPGGVDIVEAIARFSRRRNIGLC 155
Query: 182 ILSANGAISNVTLRQAATS-GGTVTYEGRFEILSLSGSFL-LSESSGQRSRTGGLSVSLS 239
+L+ +G ++NVTLRQ +T+ G TVT+ GRF+ILS+S + + S SS S G ++SL+
Sbjct: 156 VLNGSGTVANVTLRQPSTTPGATVTFHGRFDILSISATIIPQSASSPIPSSANGFTISLA 215
Query: 240 GPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVPPKLAPGGQPAGQ 299
GP G+++GGSVAG L AA V V+ SF + S HR+ P GG GQ
Sbjct: 216 GPQGQIVGGSVAGTLLAAGTVYVIAASF------NNPSYHRLPGEDEVPNSGSGGN-DGQ 268
Query: 300 CSPPSRGTLSESSGGPGSPLNHSTGACNNNHLP 332
SPP T S SG P P S +C HLP
Sbjct: 269 -SPP---TGSGDSGHP--PAEMSIYSC---HLP 292
>gi|449432311|ref|XP_004133943.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Cucumis
sativus]
gi|449480005|ref|XP_004155773.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Cucumis
sativus]
Length = 254
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 60/144 (41%), Positives = 89/144 (61%), Gaps = 5/144 (3%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++SRGRPPGS + K + + H++ V G DV + ++++ R VCILS
Sbjct: 48 RRSRGRPPGSKNKPKPPVIITRESANTLRAHILEVNTGCDVFDSVATYARKRQRGVCILS 107
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSES-SGQRSRTGGLSVSLSGPDG 243
GA++NVTLRQ +++GG +T GRFEILSL+GSFL + G S L++ L+G G
Sbjct: 108 GTGAVTNVTLRQPSSTGGAITLPGRFEILSLTGSFLPPPAPPGATS----LTIFLAGGQG 163
Query: 244 RVLGGSVAGLLTAATPVQVVVGSF 267
+++GG+V G L A+ PV V+ SF
Sbjct: 164 QIVGGNVVGSLIASGPVIVIASSF 187
>gi|147776522|emb|CAN74013.1| hypothetical protein VITISV_003550 [Vitis vinifera]
Length = 417
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 82/213 (38%), Positives = 116/213 (54%), Gaps = 18/213 (8%)
Query: 122 DSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVC 181
+ +++ RGRPPGS + K + +P+V+ V G D+ I FS+ +C
Sbjct: 204 EVVRRPRGRPPGSKNKPKPPVIITRDTEPAMSPYVLEVPGGVDIVEAIARFSRRRNIGLC 263
Query: 182 ILSANGAISNVTLRQAATS-GGTVTYEGRFEILSLSGSFL-LSESSGQRSRTGGLSVSLS 239
+L+ +G ++NVTLRQ +T+ G TVT+ GRF+ILS+S + + S SS S G ++SL+
Sbjct: 264 VLNGSGTVANVTLRQPSTTPGATVTFHGRFDILSISATIIPQSASSPIPSSANGFTISLA 323
Query: 240 GPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVPPKLAPGGQPAGQ 299
GP G+++GGSVAG L AA V V+ SF + S HR+ P GG GQ
Sbjct: 324 GPQGQIVGGSVAGTLLAAGTVYVIAASF------NNPSYHRLPGEDEVPNSGSGGN-DGQ 376
Query: 300 CSPPSRGTLSESSGGPGSPLNHSTGACNNNHLP 332
SPP T S SG P P S +C HLP
Sbjct: 377 -SPP---TGSGDSGHP--PAEMSIYSC---HLP 400
>gi|449508093|ref|XP_004163216.1| PREDICTED: putative DNA-binding protein ESCAROLA-like, partial
[Cucumis sativus]
Length = 277
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 76/215 (35%), Positives = 109/215 (50%), Gaps = 16/215 (7%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++SRGRPPGS + K + + + HVI + G DV+ I F R VC+LS
Sbjct: 67 RRSRGRPPGSKNKPKSPIIVTRDSPHTLSTHVIEIVGGADVADSINQFCCRRQRGVCVLS 126
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
+G + +VT+RQ+A SG + GRFEILS+SGSFL + GL+V L+G G+
Sbjct: 127 GSGTVVDVTVRQSAGSGAVIQLRGRFEILSVSGSFLPGRDPPCST---GLTVYLAGGQGQ 183
Query: 245 VLGGSVAGLLTAATPVQVVVGSFLADGRKES---KSSHRMESLPVPPKLAPGGQPAGQCS 301
V+GG+V G L A PV ++ +F A+ E + H E +++P AG+
Sbjct: 184 VIGGTVVGPLLAGGPVILIAATF-ANATYERLPLQHHHNYEER----EVSPATTSAGELE 238
Query: 302 PPSRGTLSESSGGPGSPLNHSTGACNNNHLPQGMA 336
P E+S P N+ NNNH G A
Sbjct: 239 EPLPYPRIETSIYDLIPPNN-----NNNHALDGYA 268
>gi|357457297|ref|XP_003598929.1| DNA binding protein [Medicago truncatula]
gi|355487977|gb|AES69180.1| DNA binding protein [Medicago truncatula]
Length = 257
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 66/183 (36%), Positives = 95/183 (51%), Gaps = 11/183 (6%)
Query: 151 GFTPHVITVKAGEDVSSKIMSFSQNGPR---AVCILSANGAISNVTLRQAATSGGTVTYE 207
PHVI V GED+ K+ ++SQ +CI+SA+G + +V L SG YE
Sbjct: 61 DIIPHVIFVNPGEDIIEKVAAYSQAVAEPDTEICIMSAHGLVGSVALHH---SGSIFNYE 117
Query: 208 GRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
G+FEI+SL G+ + +++ R VSL+ D R+L G VA L AA+ V+V+VGSF
Sbjct: 118 GQFEIVSLFGNLEVYDNNSDNIRMSYFKVSLANTDSRLLEGVVADKLIAASLVKVIVGSF 177
Query: 268 LADGRKESKSSHRMESLPVP-PKLAPGGQPAGQCSPPSRGTLSESSGG-PGSPLNHSTGA 325
DG+ S ++ E P PKL G Q ++G S+S G +P + T
Sbjct: 178 TLDGKNASLNNLEYEFSSAPLPKLVNDG---TQTDVTTQGHSSQSLGDKENNPFSQGTAI 234
Query: 326 CNN 328
NN
Sbjct: 235 YNN 237
>gi|357487081|ref|XP_003613828.1| DNA-binding protein [Medicago truncatula]
gi|355515163|gb|AES96786.1| DNA-binding protein [Medicago truncatula]
Length = 323
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 77/198 (38%), Positives = 100/198 (50%), Gaps = 19/198 (9%)
Query: 67 GSGSEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKK 126
GS SE ++R GRP KYG + SP S T G + S+ K
Sbjct: 48 GSNSEQVQRGEGRPPKYG--------VSRSPFSPMTPPSGLATSHSNESEE-------KD 92
Query: 127 SRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPR-AVCILSA 185
GR GS +E + G TP+V+ V E+V KI +F +NGPR AVCIL+A
Sbjct: 93 GNGRSGGSLVSTDGFVEE--TTGESITPYVLIVNPRENVVEKISAFFKNGPRQAVCILAA 150
Query: 186 NGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRV 245
GA+SNVTL Q S G + YEG F ILSL+G Q+ +SVSLS PDG +
Sbjct: 151 TGAVSNVTLYQPGVSDGFLRYEGHFPILSLNGPCTFPGGCAQK-EIEMMSVSLSKPDGSI 209
Query: 246 LGGSVAGLLTAATPVQVV 263
GG + + AATP+ +
Sbjct: 210 FGGGIGRSMIAATPIHFL 227
>gi|224081949|ref|XP_002306539.1| predicted protein [Populus trichocarpa]
gi|222855988|gb|EEE93535.1| predicted protein [Populus trichocarpa]
Length = 304
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 58/162 (35%), Positives = 92/162 (56%), Gaps = 9/162 (5%)
Query: 122 DSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVC 181
+ +++ RGRPPGS + K + +P+++ V G DV + F + +C
Sbjct: 80 EVVRRPRGRPPGSKNKPKPPVIITREPEPAMSPYILEVPGGNDVVEALSRFCRRKNMGIC 139
Query: 182 ILSANGAISNVTLRQAATS-GGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSG 240
+L+ G ++NVTLRQ +T+ G T+T+ GRF+ILS+S +FL +S ++SL+G
Sbjct: 140 VLTGTGTVANVTLRQPSTTPGSTITFHGRFDILSISATFLPQTTS--YPLPNSFTISLAG 197
Query: 241 PDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRME 282
P G+++GG VAG L AA V VV SF + S HR++
Sbjct: 198 PQGQIVGGIVAGGLVAAGTVFVVAASF------NNPSYHRLQ 233
>gi|224101033|ref|XP_002312113.1| predicted protein [Populus trichocarpa]
gi|222851933|gb|EEE89480.1| predicted protein [Populus trichocarpa]
Length = 157
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 61/143 (42%), Positives = 83/143 (58%), Gaps = 3/143 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + HV+ + +G D+ I +FS V ILS
Sbjct: 2 RRPRGRPAGSKNKPKPPVVITKESPNSLRSHVLEISSGSDIVDSIANFSHRRHHGVSILS 61
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
+G + NVTLRQ A GG +T GRFEILSLSGSFL + S +R L+V L+G G+
Sbjct: 62 GSGIVDNVTLRQPAAPGGVITLHGRFEILSLSGSFLPAPSPPGATR---LTVYLAGAQGQ 118
Query: 245 VLGGSVAGLLTAATPVQVVVGSF 267
V+GG+V G L AA PV V+ +F
Sbjct: 119 VVGGTVMGELVAAGPVMVIAATF 141
>gi|297817408|ref|XP_002876587.1| hypothetical protein ARALYDRAFT_486561 [Arabidopsis lyrata subsp.
lyrata]
gi|297322425|gb|EFH52846.1| hypothetical protein ARALYDRAFT_486561 [Arabidopsis lyrata subsp.
lyrata]
Length = 264
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 62/147 (42%), Positives = 87/147 (59%), Gaps = 7/147 (4%)
Query: 123 SIKKSR--GRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAV 180
+IKK R GRP GS + K + + F HV+ + G DV + F++ R V
Sbjct: 54 NIKKRRPRGRPAGSKNKPKAPIIVTRDSANAFRCHVMEITNGCDVMESLAVFARRRQRGV 113
Query: 181 CILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSG 240
C+L+ NGA++NVT+RQ GG V+ GRFEILSLSGSFL + + GL+V L+G
Sbjct: 114 CVLTGNGAVTNVTVRQPG--GGVVSLHGRFEILSLSGSFLPPPAPPAAT---GLTVYLAG 168
Query: 241 PDGRVLGGSVAGLLTAATPVQVVVGSF 267
G+V+GGS+ G L A+ PV ++ SF
Sbjct: 169 GQGQVIGGSLVGPLMASGPVVIMAASF 195
>gi|224067058|ref|XP_002302339.1| predicted protein [Populus trichocarpa]
gi|222844065|gb|EEE81612.1| predicted protein [Populus trichocarpa]
Length = 274
Score = 100 bits (250), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 59/161 (36%), Positives = 94/161 (58%), Gaps = 9/161 (5%)
Query: 122 DSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVC 181
+ +++ RGRPPGS + K + + +P+++ V G DV + F + +C
Sbjct: 52 EVVRRPRGRPPGSKNKPKPPVIITRESEPSMSPYILEVPGGNDVVEALSRFCRRKNMGIC 111
Query: 182 ILSANGAISNVTLRQ-AATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSG 240
+L+ +G ++NVTLRQ +AT G T+T+ GRF+ILS+S +FL +S + ++SL+G
Sbjct: 112 VLTGSGTVANVTLRQPSATPGATITFHGRFDILSISATFLPQTASYPVPNS--FTISLAG 169
Query: 241 PDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRM 281
P G+++GG VAG L AA V VV SF + S HR+
Sbjct: 170 PQGQIVGGIVAGSLVAAGTVFVVAASF------NNPSYHRL 204
>gi|357465293|ref|XP_003602928.1| hypothetical protein MTR_3g100470 [Medicago truncatula]
gi|355491976|gb|AES73179.1| hypothetical protein MTR_3g100470 [Medicago truncatula]
Length = 290
Score = 100 bits (249), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 57/158 (36%), Positives = 90/158 (56%), Gaps = 7/158 (4%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRPPGS + K + +P ++ + G DV I FS+ +C+L+
Sbjct: 74 RRPRGRPPGSKNKPKPPIIITRDPETVMSPFILDISGGNDVVEAISEFSRRKNIGLCVLT 133
Query: 185 ANGAISNVTLRQAATS-GGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDG 243
+G ++NVTLRQ +T+ G TVT+ GRF+ILS++ +F+ + + S+SL+GP G
Sbjct: 134 GSGTVANVTLRQPSTTPGTTVTFHGRFDILSITATFVPQQHGVSPAIPSNFSISLAGPQG 193
Query: 244 RVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRM 281
+++GG VAG L AA V V+ SF + S HR+
Sbjct: 194 QIVGGIVAGNLIAAGTVFVIASSF------NNPSYHRL 225
>gi|225463966|ref|XP_002271606.1| PREDICTED: putative DNA-binding protein ESCAROLA [Vitis vinifera]
Length = 291
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 59/144 (40%), Positives = 84/144 (58%), Gaps = 4/144 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRPPGS + K + + HV+ V G D++ I F++ R VC+LS
Sbjct: 67 RRPRGRPPGSKNKPKPPIFVTRDSPNALRSHVMEVANGSDITESIAQFARRRQRGVCVLS 126
Query: 185 ANGAISNVTLRQAATSGGTV-TYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDG 243
A+G + NVTLRQ + GG V GRFEILSL+G+FL + + GL++ L+G
Sbjct: 127 ASGTVMNVTLRQPSAPGGAVMALHGRFEILSLTGAFLPGPAP---PGSTGLTIYLAGGQA 183
Query: 244 RVLGGSVAGLLTAATPVQVVVGSF 267
+V+GGSV G L AA PV V+ +F
Sbjct: 184 QVVGGSVVGSLIAAGPVMVIAATF 207
>gi|296087883|emb|CBI35166.3| unnamed protein product [Vitis vinifera]
Length = 275
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 59/144 (40%), Positives = 84/144 (58%), Gaps = 4/144 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRPPGS + K + + HV+ V G D++ I F++ R VC+LS
Sbjct: 67 RRPRGRPPGSKNKPKPPIFVTRDSPNALRSHVMEVANGSDITESIAQFARRRQRGVCVLS 126
Query: 185 ANGAISNVTLRQAATSGGTV-TYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDG 243
A+G + NVTLRQ + GG V GRFEILSL+G+FL + + GL++ L+G
Sbjct: 127 ASGTVMNVTLRQPSAPGGAVMALHGRFEILSLTGAFLPGPAP---PGSTGLTIYLAGGQA 183
Query: 244 RVLGGSVAGLLTAATPVQVVVGSF 267
+V+GGSV G L AA PV V+ +F
Sbjct: 184 QVVGGSVVGSLIAAGPVMVIAATF 207
>gi|147812096|emb|CAN61523.1| hypothetical protein VITISV_016751 [Vitis vinifera]
Length = 259
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 59/144 (40%), Positives = 84/144 (58%), Gaps = 4/144 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRPPGS + K + + HV+ V G D++ I F++ R VC+LS
Sbjct: 35 RRPRGRPPGSKNKPKPPIFVTRDSPNALRSHVMEVANGSDITESIAQFARRRQRGVCVLS 94
Query: 185 ANGAISNVTLRQAATSGGTV-TYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDG 243
A+G + NVTLRQ + GG V GRFEILSL+G+FL + + GL++ L+G
Sbjct: 95 ASGTVMNVTLRQPSAPGGAVMALHGRFEILSLTGAFLPGPAP---PGSTGLTIYLAGGQA 151
Query: 244 RVLGGSVAGLLTAATPVQVVVGSF 267
+V+GGSV G L AA PV V+ +F
Sbjct: 152 QVVGGSVVGSLIAAGPVMVIAATF 175
>gi|255545940|ref|XP_002514030.1| DNA binding protein, putative [Ricinus communis]
gi|223547116|gb|EEF48613.1| DNA binding protein, putative [Ricinus communis]
Length = 310
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 56/161 (34%), Positives = 92/161 (57%), Gaps = 9/161 (5%)
Query: 122 DSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVC 181
+ +++ RGRPPGS + K + +P+++ V G DV I F + +C
Sbjct: 92 EVVRRPRGRPPGSKNKPKPPVIITRDPEPAMSPYILEVCGGSDVVEAISRFCRRKNIGIC 151
Query: 182 ILSANGAISNVTLRQAATS-GGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSG 240
+L+ +G ++NVTLRQ +T+ G T+T+ GRF+ILS+S +F+ S T ++SL+G
Sbjct: 152 VLTGSGTVANVTLRQPSTTPGSTITFHGRFDILSISATFMPQTVSYPVPNT--FTISLAG 209
Query: 241 PDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRM 281
P G+++GG VAG L AA V ++ +F + S HR+
Sbjct: 210 PQGQIVGGLVAGSLIAAGTVYIMAATF------NNPSYHRL 244
>gi|356500760|ref|XP_003519199.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Glycine max]
Length = 271
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 61/167 (36%), Positives = 91/167 (54%), Gaps = 13/167 (7%)
Query: 122 DSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVC 181
+ +++ RGRPPGS + K L +P ++ + G DV + FS+ +C
Sbjct: 58 EVVRRPRGRPPGSKNRPKPPLIITREPEPAMSPFILEIPGGSDVVEALARFSRRKNTGLC 117
Query: 182 ILSANGAISNVTLRQ-----AATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSV 236
+L+ +G ++NVTLRQ A + TVT+ GRF+ILS+S +FL S + +V
Sbjct: 118 VLTGSGTVANVTLRQPSFSPAGATVATVTFHGRFDILSMSATFLHHASPA--AIPNAFAV 175
Query: 237 SLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMES 283
SLSGP G+++GG VAG L AA V V+ SF + S HR+ S
Sbjct: 176 SLSGPQGQIVGGFVAGRLLAAGTVFVIAASF------NNPSYHRLSS 216
>gi|297792253|ref|XP_002864011.1| hypothetical protein ARALYDRAFT_917968 [Arabidopsis lyrata subsp.
lyrata]
gi|297309846|gb|EFH40270.1| hypothetical protein ARALYDRAFT_917968 [Arabidopsis lyrata subsp.
lyrata]
Length = 270
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 58/164 (35%), Positives = 93/164 (56%), Gaps = 10/164 (6%)
Query: 122 DSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVC 181
+ +++ RGRPPGS + K + +P+++ V +G DV I F + VC
Sbjct: 47 EVVRRPRGRPPGSKNKPKPPVFVTRDTDPPMSPYILEVPSGNDVVEAINRFCRRKSIGVC 106
Query: 182 ILSANGAISNVTLRQ--AATSGGTVTYEGRFEILSLSGSFL--LSESSGQRSRTGGLSVS 237
+LS +G+++NVTLRQ A G T+T+ G+F++LS+S +FL +S + +VS
Sbjct: 107 VLSGSGSVANVTLRQPSPAAPGSTITFHGKFDLLSVSATFLPPPPRTSLSPPVSNFFTVS 166
Query: 238 LSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRM 281
L+GP G+++GG VAG L +A V V+ SF + S HR+
Sbjct: 167 LAGPQGQIIGGFVAGPLISAGTVYVIAASF------NNPSYHRL 204
>gi|15240535|ref|NP_199781.1| Predicted AT-hook DNA-binding family protein [Arabidopsis thaliana]
gi|8978267|dbj|BAA98158.1| unnamed protein product [Arabidopsis thaliana]
gi|119657378|tpd|FAA00288.1| TPA: AT-hook motif nuclear localized protein 17 [Arabidopsis
thaliana]
gi|225879102|dbj|BAH30621.1| hypothetical protein [Arabidopsis thaliana]
gi|332008463|gb|AED95846.1| Predicted AT-hook DNA-binding family protein [Arabidopsis thaliana]
Length = 276
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 58/164 (35%), Positives = 93/164 (56%), Gaps = 10/164 (6%)
Query: 122 DSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVC 181
+ +++ RGRPPGS + K + +P+++ V +G DV I F + VC
Sbjct: 53 EVVRRPRGRPPGSKNKPKPPVFVTRDTDPPMSPYILEVPSGNDVVEAINRFCRRKSIGVC 112
Query: 182 ILSANGAISNVTLRQ--AATSGGTVTYEGRFEILSLSGSFL--LSESSGQRSRTGGLSVS 237
+LS +G+++NVTLRQ A G T+T+ G+F++LS+S +FL +S + +VS
Sbjct: 113 VLSGSGSVANVTLRQPSPAALGSTITFHGKFDLLSVSATFLPPPPRTSLSPPVSNFFTVS 172
Query: 238 LSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRM 281
L+GP G+++GG VAG L +A V V+ SF + S HR+
Sbjct: 173 LAGPQGQIIGGFVAGPLISAGTVYVIAASF------NNPSYHRL 210
>gi|414589703|tpg|DAA40274.1| TPA: hypothetical protein ZEAMMB73_130445 [Zea mays]
Length = 344
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 55/149 (36%), Positives = 84/149 (56%), Gaps = 6/149 (4%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSA--GVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCI 182
KK RGRPPGS + K + A PHVI + G DV+ + F+ +C+
Sbjct: 90 KKRRGRPPGSKNKPKPPVVVTREAEPAAAMRPHVIEIPCGCDVADALARFAARRNLGICV 149
Query: 183 LSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGG----LSVSL 238
L+ GA++NV+LR + G V + G++E+LS+S +FL S + LS+SL
Sbjct: 150 LAGTGAVANVSLRHPSPGGPAVMFHGQYEVLSISATFLPPAMSAVAPQAAAAAACLSISL 209
Query: 239 SGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
+GP G+++GG+VAG L AA+ V +V +F
Sbjct: 210 AGPHGQIVGGAVAGPLYAASTVVLVAAAF 238
>gi|357467175|ref|XP_003603872.1| AT-hook protein [Medicago truncatula]
gi|355492920|gb|AES74123.1| AT-hook protein [Medicago truncatula]
Length = 332
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 79/237 (33%), Positives = 106/237 (44%), Gaps = 54/237 (22%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGE--------------------- 163
KK RGRP GS + + + A GS + H++ V E
Sbjct: 93 KKRRGRPLGSRNEIQSKKRASGSVRLA-NAHIMMVNVQEKERKKKFAKDQLLIFLVHICF 151
Query: 164 ---DVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
+V KI +FSQN +CILSA G S T+ G T TYEGRFEI+SL GS L
Sbjct: 152 GIQNVLEKINTFSQNLSENICILSAVGTTSKATI---CVDGKTKTYEGRFEIISLGGSLL 208
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKE------ 274
+ GL+VSLS DG V GG + +L AA+PVQ+V+GS+ ++E
Sbjct: 209 PDKKESHCKVFEGLNVSLSL-DGNVFGGRLVKILIAASPVQIVLGSYPVGSQEEVDYDPK 267
Query: 275 -----------SKSSHRMESLPVPPKLAPGGQPAGQCSPPSR--------GTLSESS 312
++S ++ES P PP + PS+ GTLS SS
Sbjct: 268 EPPKEDPSPPSAESQEKVESDPSPPSTESQEKTESHLKDPSKEDPNPSSEGTLSNSS 324
>gi|449456182|ref|XP_004145829.1| PREDICTED: uncharacterized protein LOC101216092 [Cucumis sativus]
Length = 213
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 60/150 (40%), Positives = 84/150 (56%), Gaps = 21/150 (14%)
Query: 129 GRPPGSGSGKKHQLEALGSAG--------VGFTPHVITVKAGEDVSSKIMSFSQNGPRAV 180
G PPG G +L+ L S G FTPH+I V GE++ ++I +FS R V
Sbjct: 65 GHPPGFG-----KLQVLASLGGYAWDTFSRDFTPHIILVAPGENIVNRISNFSVPRSRTV 119
Query: 181 CILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSG 240
CI+SA G +S++ + + T+ +EG FEIL LSG G R +++S S
Sbjct: 120 CIISAVGLVSSIIIHDPNSVASTLKFEGTFEILQLSG----WSHEGDDIRL--MTISFSK 173
Query: 241 PDGR--VLGGSVAGLLTAATPVQVVVGSFL 268
DGR V GG+VA L AATPVQ+++GSF+
Sbjct: 174 LDGRNQVFGGAVASSLIAATPVQIIMGSFI 203
>gi|302790596|ref|XP_002977065.1| hypothetical protein SELMODRAFT_58746 [Selaginella moellendorffii]
gi|300155041|gb|EFJ21674.1| hypothetical protein SELMODRAFT_58746 [Selaginella moellendorffii]
Length = 194
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 65/181 (35%), Positives = 98/181 (54%), Gaps = 36/181 (19%)
Query: 124 IKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCIL 183
++K RGRPPGS + K + G G PHV+ + D+ I +F++ RA+C+L
Sbjct: 1 VRKPRGRPPGSKNKPKPPIIITRDTGSGMRPHVLEIAPNTDIVDAIATFARKRQRALCVL 60
Query: 184 SANGAISNVTLRQ---------------------------------AATSGGTVTYEGRF 210
SA G +SN+TL + AA + TV+++GRF
Sbjct: 61 SARGTVSNLTLLRHSPASSAASAPPSSPPSSSAASTGATPSSSRAAAAAATSTVSFQGRF 120
Query: 211 EILSLSGSFLLSE--SSGQRSRTGGLSVSLS-GPDGRVLGGSVAGLLTAATPVQVVVGSF 267
E++SLSG+FL + S+G GL+VS++ GP G+VLGG+VAG L +A+PV V+ SF
Sbjct: 121 ELISLSGAFLQQQMPSAGILGAYSGLAVSVAGGPQGQVLGGNVAGPLVSASPVMVIAASF 180
Query: 268 L 268
+
Sbjct: 181 V 181
>gi|357481891|ref|XP_003611231.1| DNA binding protein [Medicago truncatula]
gi|355512566|gb|AES94189.1| DNA binding protein [Medicago truncatula]
Length = 192
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 69/173 (39%), Positives = 95/173 (54%), Gaps = 9/173 (5%)
Query: 163 EDVSSKIMSFSQNGPRA-VCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLL 221
++ +K+ S Q GP +CILSA G + + +Q SG VTYEGRFE++SLSG +
Sbjct: 10 RNIVAKLASCCQGGPNTEICILSAQGLVGIASFQQ---SGVIVTYEGRFELVSLSGMLEV 66
Query: 222 SESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRM 281
+++ R G VSL GPD R LGG VA L AA+ V+V VGSF D +K S ++ ++
Sbjct: 67 CDNNSGCKRMGNFKVSLVGPDLRPLGGVVANKLIAASSVKVTVGSFTLDVKKASSNNLKI 126
Query: 282 ESLPVP-PKLAPGGQPAGQCSPPSRGTLSESSG-GPGSPLNHSTGACNNNHLP 332
VP ++A G P G +G ESSG SP + G+ NN P
Sbjct: 127 GPSSVPSSQIAASGTPIG---ATLQGPSYESSGDNQNSPFSQRLGSYNNASQP 176
>gi|302763145|ref|XP_002964994.1| hypothetical protein SELMODRAFT_67842 [Selaginella moellendorffii]
gi|300167227|gb|EFJ33832.1| hypothetical protein SELMODRAFT_67842 [Selaginella moellendorffii]
Length = 192
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 65/179 (36%), Positives = 97/179 (54%), Gaps = 35/179 (19%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
+K RGRPPGS + K + G G PHV+ + D+ I +F++ RA+C+LS
Sbjct: 1 RKPRGRPPGSKNKPKPPIIITRDTGSGMRPHVLEIAPNTDIVDAIATFARKRQRALCVLS 60
Query: 185 ANGAISNVTLRQ--------------------------------AATSGGTVTYEGRFEI 212
A G +SN+TL + AA + TV+++GRFE+
Sbjct: 61 ARGTVSNLTLLRHSPASSTASAPPSSPPSSSAASTGATPSSSRAAAAATSTVSFQGRFEL 120
Query: 213 LSLSGSFLLSE--SSGQRSRTGGLSVSLS-GPDGRVLGGSVAGLLTAATPVQVVVGSFL 268
+SLSG+FL + S+G GL+VS++ GP G+VLGG+VAG L +A+PV V+ SF+
Sbjct: 121 ISLSGAFLQQQMPSAGILGAYSGLAVSVAGGPQGQVLGGNVAGPLVSASPVMVIAASFV 179
>gi|449454628|ref|XP_004145056.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Cucumis
sativus]
gi|449473475|ref|XP_004153892.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Cucumis
sativus]
gi|449531743|ref|XP_004172845.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Cucumis
sativus]
Length = 282
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 65/173 (37%), Positives = 96/173 (55%), Gaps = 3/173 (1%)
Query: 119 LSPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPR 178
LS S ++ RGRP GS + K + + H+I + D+ + +F++ R
Sbjct: 61 LSNSSSRRPRGRPAGSKNKPKPPIIITRDSANALRSHLIEISTASDIVDSLATFARRRQR 120
Query: 179 AVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSL 238
VCILSA G ++NVTLRQ ++ G +T GRFEILSLSGSFL + S GL+V L
Sbjct: 121 GVCILSATGTVANVTLRQPSSPGAVITLPGRFEILSLSGSFLPPPAPPAAS---GLTVYL 177
Query: 239 SGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVPPKLA 291
+G G+V+GG+V G L+A+ PV ++ SF + E+ P P ++A
Sbjct: 178 AGGQGQVVGGNVIGPLSASGPVIIMAASFGNAAYERLPIDDEDETSPAPDQMA 230
>gi|315259979|gb|ADT92186.1| DNA-binding protein [Zea mays]
Length = 228
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 57/136 (41%), Positives = 77/136 (56%), Gaps = 20/136 (14%)
Query: 146 GSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAA------- 198
GSAG FTPH+IT EDV++KI++F+ RAVC+LSA G++S LR A
Sbjct: 75 GSAGTSFTPHIITASPSEDVAAKIVAFATQSSRAVCVLSAMGSVSRAVLRHPADGSPMAR 134
Query: 199 -------TSGGTVTYEGRFEILSLSGSFLLSESSGQRSR------TGGLSVSLSGPDGRV 245
+ YEG +EI+SL+GS+ L+E S Q +GGLSV+L P+ V
Sbjct: 135 VHASPQPYNNSPAIYEGFYEIMSLTGSYNLAEGSQQEQCQGQGQPSGGLSVTLCSPERNV 194
Query: 246 LGGSVAGLLTAATPVQ 261
+GG + G L AA VQ
Sbjct: 195 IGGVLGGPLVAAGTVQ 210
>gi|356552959|ref|XP_003544827.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Glycine max]
Length = 256
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 65/180 (36%), Positives = 94/180 (52%), Gaps = 13/180 (7%)
Query: 109 SGLSSPGGGPLSPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSK 168
SGLS + + +++ RGRP GS + K L +P ++ + G V
Sbjct: 44 SGLSGDQNNETTSEIMRRPRGRPSGSKNRPKPPLIITCEPEPVMSPFILEIPGGSGVVEA 103
Query: 169 IMSFSQNGPRAVCILSANGAISNVTLRQ-----AATSGGTVTYEGRFEILSLSGSFLLSE 223
+ FS+ +C+L+ +G ++NVTLRQ A S TVT+ GRF ILS+S +FL
Sbjct: 104 LARFSRRKNTGLCVLTGSGTVANVTLRQPSFTPAGASVATVTFHGRFNILSMSATFLHHG 163
Query: 224 SSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMES 283
S + L+VSLSGP G+++GG VAG L AA V V+ SF + S HR+ S
Sbjct: 164 SPA--AIPNALAVSLSGPQGQIVGGLVAGRLLAAGTVFVIAASF------NNPSYHRLSS 215
>gi|302797082|ref|XP_002980302.1| hypothetical protein SELMODRAFT_420013 [Selaginella moellendorffii]
gi|300151918|gb|EFJ18562.1| hypothetical protein SELMODRAFT_420013 [Selaginella moellendorffii]
Length = 192
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 54/116 (46%), Positives = 77/116 (66%), Gaps = 4/116 (3%)
Query: 154 PHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEIL 213
PHV+ + G DV + +F++ R +C+L +G ++NVTLRQ A G TVT+ GRFEIL
Sbjct: 3 PHVLEIAGGCDVGETLAAFARRRARGLCVLGGSGTVANVTLRQLAAPGSTVTFHGRFEIL 62
Query: 214 SLSGSFLLSESSGQRSRTGGLSVSLSGP-DGRVLGGSVAGLLTAATPVQVVVGSFL 268
S+SG+FL + GL+V+L+G G+VLGGSV G+L AA+PV V+ SF+
Sbjct: 63 SISGAFLPPPAP---VAVAGLTVALAGAQQGQVLGGSVVGVLMAASPVLVIAASFV 115
>gi|242037267|ref|XP_002466028.1| hypothetical protein SORBIDRAFT_01g050300 [Sorghum bicolor]
gi|241919882|gb|EER93026.1| hypothetical protein SORBIDRAFT_01g050300 [Sorghum bicolor]
Length = 568
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 53/135 (39%), Positives = 77/135 (57%), Gaps = 7/135 (5%)
Query: 151 GFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRF 210
G PHV+ + AGED+ S+I+ S+ +AVC+LS GA+ + L +A + ++G
Sbjct: 145 GLQPHVLKIHAGEDIVSRIVQVSKIIGKAVCVLSVFGAVQDCYLLHSAV---ILNHKGPL 201
Query: 211 EILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLAD 270
EI+ + GS L S+S G G LSV+L+ D V+GG G L AATPVQ +VGSF D
Sbjct: 202 EIIHVFGSILTSDSPG----FGCLSVTLACGDCSVIGGVAVGPLIAATPVQAIVGSFHND 257
Query: 271 GRKESKSSHRMESLP 285
+ +K + P
Sbjct: 258 AFQANKKPKLIACYP 272
>gi|302759208|ref|XP_002963027.1| hypothetical protein SELMODRAFT_404546 [Selaginella moellendorffii]
gi|300169888|gb|EFJ36490.1| hypothetical protein SELMODRAFT_404546 [Selaginella moellendorffii]
Length = 192
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 54/116 (46%), Positives = 77/116 (66%), Gaps = 4/116 (3%)
Query: 154 PHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEIL 213
PHV+ + G DV + +F++ R +C+L +G ++NVTLRQ A G TVT+ GRFEIL
Sbjct: 3 PHVLEIAGGCDVGETLAAFARRRARGLCVLGGSGTVANVTLRQLAAPGSTVTFHGRFEIL 62
Query: 214 SLSGSFLLSESSGQRSRTGGLSVSLSGP-DGRVLGGSVAGLLTAATPVQVVVGSFL 268
S+SG+FL + GL+V+L+G G+VLGGSV G+L AA+PV V+ SF+
Sbjct: 63 SISGAFLPPPAP---VAVAGLTVALAGAQQGQVLGGSVVGVLMAASPVLVIAASFV 115
>gi|356569317|ref|XP_003552849.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Glycine max]
Length = 302
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 61/143 (42%), Positives = 86/143 (60%), Gaps = 3/143 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + HV+ + +G DV+ I +F+ R V +LS
Sbjct: 82 RRPRGRPAGSKNKPKPPIVITKESPNALRSHVLEIASGSDVAESIAAFANRRHRGVSVLS 141
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
+G ++NVTLRQ A G +T GRFEILSLSG+FL S S S GL+V L+G G+
Sbjct: 142 GSGIVANVTLRQPAAPAGVITLHGRFEILSLSGAFLPSPSP---SGATGLTVYLAGGQGQ 198
Query: 245 VLGGSVAGLLTAATPVQVVVGSF 267
V+GG+VAG L A+ PV V+ +F
Sbjct: 199 VVGGNVAGSLVASGPVMVIAATF 221
>gi|449437286|ref|XP_004136423.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Cucumis
sativus]
gi|449527047|ref|XP_004170524.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Cucumis
sativus]
Length = 285
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 55/146 (37%), Positives = 83/146 (56%), Gaps = 5/146 (3%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGS--AGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCI 182
++ RGRPPGS + K + + A +P+V+ V G D+ I F + +CI
Sbjct: 65 RRPRGRPPGSKNKPKPAAVVVANRDAEPPMSPYVLEVPGGSDIVEAISRFCRRRNTGLCI 124
Query: 183 LSANGAISNVTLRQAATSG-GTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGP 241
L+A G + +VTLRQ A+S GTVT+ GRF+ILS+ +F+ +S G +++L+GP
Sbjct: 125 LNAYGTVGDVTLRQPASSPVGTVTFHGRFDILSVCATFVPQTTSF--PIPNGFTITLAGP 182
Query: 242 DGRVLGGSVAGLLTAATPVQVVVGSF 267
G++ GG VAG L V V+ SF
Sbjct: 183 QGQIFGGLVAGSLIGVGTVYVIAASF 208
>gi|115445949|ref|NP_001046754.1| Os02g0448000 [Oryza sativa Japonica Group]
gi|50252749|dbj|BAD28974.1| putative DNA-binding protein AT-hook 2 [Oryza sativa Japonica
Group]
gi|113536285|dbj|BAF08668.1| Os02g0448000 [Oryza sativa Japonica Group]
gi|125539298|gb|EAY85693.1| hypothetical protein OsI_07061 [Oryza sativa Indica Group]
gi|125581960|gb|EAZ22891.1| hypothetical protein OsJ_06576 [Oryza sativa Japonica Group]
Length = 316
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 68/156 (43%), Positives = 92/156 (58%), Gaps = 6/156 (3%)
Query: 115 GGGPL---SPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMS 171
GGGP S + ++ RGRP GS + K + + HV+ V G D+S I +
Sbjct: 84 GGGPDGAGSESATRRPRGRPAGSKNKPKPPIIITRDSANTLRTHVMEVAGGCDISESITT 143
Query: 172 FSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRT 231
F++ R VC+LS G ++NVTLRQ A+ G V GRFEILSLSGSFL + + +
Sbjct: 144 FARRRQRGVCVLSGAGTVTNVTLRQPASQGAVVALHGRFEILSLSGSFLPPPAPPEAT-- 201
Query: 232 GGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
GL+V L+G G+V+GGSV G LTAA PV ++ SF
Sbjct: 202 -GLTVYLAGGQGQVVGGSVVGALTAAGPVVIMAASF 236
>gi|226492016|ref|NP_001141263.1| uncharacterized protein LOC100273351 [Zea mays]
gi|194703628|gb|ACF85898.1| unknown [Zea mays]
gi|194708066|gb|ACF88117.1| unknown [Zea mays]
gi|413936536|gb|AFW71087.1| hypothetical protein ZEAMMB73_730676 [Zea mays]
Length = 309
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 62/145 (42%), Positives = 89/145 (61%), Gaps = 3/145 (2%)
Query: 123 SIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCI 182
++++ +GRP GS + K + + HV+ V +G D+S I +F++ R VC+
Sbjct: 93 TLRRPKGRPAGSKNKPKPPIIITRDSANTLRTHVMEVASGCDISESITAFARRRQRGVCV 152
Query: 183 LSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPD 242
LS G ++NVTLRQ A+ G V GRFEILSLSGSFL + + + GL+V L+G
Sbjct: 153 LSGAGTVTNVTLRQPASQGAVVALHGRFEILSLSGSFLPPPAPPEAT---GLTVYLAGGQ 209
Query: 243 GRVLGGSVAGLLTAATPVQVVVGSF 267
G+V+GGSV G LTAA PV ++ SF
Sbjct: 210 GQVVGGSVVGALTAAGPVVIMAASF 234
>gi|125605994|gb|EAZ45030.1| hypothetical protein OsJ_29669 [Oryza sativa Japonica Group]
Length = 334
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 62/162 (38%), Positives = 89/162 (54%), Gaps = 13/162 (8%)
Query: 106 GTGSG-LSSPGGGP-LSPDSIKKSRGRPPGSGSGKKHQLEALGSA--GVGFTPHVITVKA 161
G+GSG L GGG S + KK RGRPPGS + K + A PHVI +
Sbjct: 61 GSGSGQLVVVGGGDGASIEVAKKRRGRPPGSKNKPKPPVVITREAEPAAAMRPHVIEIPG 120
Query: 162 GEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAA-----TSGGTVTYEGRFEILSLS 216
G DV+ + FS +C+L+ GA++NV+LR + ++ + + GR+EILSLS
Sbjct: 121 GRDVAEALARFSSRRNLGICVLAGTGAVANVSLRHPSPGVPGSAPAAIVFHGRYEILSLS 180
Query: 217 GSFL----LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLL 254
+FL S + GLS+SL+GP G+++GG+VAG L
Sbjct: 181 ATFLPPAMSSVAPQAAVAAAGLSISLAGPHGQIVGGAVAGPL 222
>gi|125564030|gb|EAZ09410.1| hypothetical protein OsI_31684 [Oryza sativa Indica Group]
Length = 334
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 62/162 (38%), Positives = 89/162 (54%), Gaps = 13/162 (8%)
Query: 106 GTGSG-LSSPGGGP-LSPDSIKKSRGRPPGSGSGKKHQLEALGSA--GVGFTPHVITVKA 161
G+GSG L GGG S + KK RGRPPGS + K + A PHVI +
Sbjct: 61 GSGSGQLVVVGGGDGASIEVAKKRRGRPPGSKNKPKPPVVITREAEPAAAMRPHVIEIPG 120
Query: 162 GEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAA-----TSGGTVTYEGRFEILSLS 216
G DV+ + FS +C+L+ GA++NV+LR + ++ + + GR+EILSLS
Sbjct: 121 GRDVAEALARFSSRRNLGICVLAGTGAVANVSLRHPSPGVPGSAPAAIVFHGRYEILSLS 180
Query: 217 GSFL----LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLL 254
+FL S + GLS+SL+GP G+++GG+VAG L
Sbjct: 181 ATFLPPAMSSVAPQAAVAAAGLSISLAGPHGQIVGGAVAGPL 222
>gi|18414224|ref|NP_567432.1| AT-hook motif nuclear-localized protein 20 [Arabidopsis thaliana]
gi|26452422|dbj|BAC43296.1| unknown protein [Arabidopsis thaliana]
gi|30102626|gb|AAP21231.1| At4g14465 [Arabidopsis thaliana]
gi|110735855|dbj|BAE99903.1| hypothetical protein [Arabidopsis thaliana]
gi|119657384|tpd|FAA00291.1| TPA: AT-hook motif nuclear localized protein 20 [Arabidopsis
thaliana]
gi|332658048|gb|AEE83448.1| AT-hook motif nuclear-localized protein 20 [Arabidopsis thaliana]
Length = 281
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 44/96 (45%), Positives = 61/96 (63%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRPPGS + K + + HV+ + G DV+ I FS+ R VC+LS
Sbjct: 67 RRPRGRPPGSKNKPKAPIFVTRDSPNALRSHVLEISDGSDVADTIAHFSRRRQRGVCVLS 126
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G+++NVTLRQAA GG V+ +GRFEILSL+G+FL
Sbjct: 127 GTGSVANVTLRQAAAPGGVVSLQGRFEILSLTGAFL 162
>gi|242095702|ref|XP_002438341.1| hypothetical protein SORBIDRAFT_10g012980 [Sorghum bicolor]
gi|241916564|gb|EER89708.1| hypothetical protein SORBIDRAFT_10g012980 [Sorghum bicolor]
Length = 310
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 70/178 (39%), Positives = 95/178 (53%), Gaps = 15/178 (8%)
Query: 95 PSPSSVTTATGGTGSGLSSPGG--GPLSPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGF 152
PSP +V GG L GG GP+ +K RGRPPGS + K + +
Sbjct: 47 PSPENVDP--GGDQPALEGSGGSGGPM-----RKPRGRPPGSKNKPKPPIIITRDSPNAL 99
Query: 153 TPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQ--AATSGGTV-TYEGR 209
HV+ V AG D+ + +++ R VC+LS GA+SN+ LRQ A G V T G+
Sbjct: 100 HSHVLEVAAGADIVECVSEYARRRCRGVCVLSGGGAVSNLALRQPGAEPPGSLVATLRGQ 159
Query: 210 FEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
FEILSL+G+ L + S LSV ++G G+V+GGSV G L AA PV ++ SF
Sbjct: 160 FEILSLTGTVLPPPAPPGAS---SLSVYVAGGQGQVMGGSVVGQLIAAGPVVLMAASF 214
>gi|224109476|ref|XP_002315208.1| predicted protein [Populus trichocarpa]
gi|222864248|gb|EEF01379.1| predicted protein [Populus trichocarpa]
Length = 157
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 62/143 (43%), Positives = 85/143 (59%), Gaps = 3/143 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + HV+ + +G D+ I +FS R V ILS
Sbjct: 2 RRPRGRPAGSKNKPKPPIVITKESPNSLHSHVLEISSGSDIVESIATFSHRRHRGVSILS 61
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
+G ++NVTLRQ A GG +T GRFEILSLSGSFL + S + GL+V L+G G+
Sbjct: 62 GSGIVNNVTLRQPAAPGGVITLHGRFEILSLSGSFLPAPSPPGAT---GLTVYLAGGQGQ 118
Query: 245 VLGGSVAGLLTAATPVQVVVGSF 267
V+GG+V G L AA PV V+ +F
Sbjct: 119 VVGGTVMGELIAAGPVMVIAATF 141
>gi|297804852|ref|XP_002870310.1| hypothetical protein ARALYDRAFT_493459 [Arabidopsis lyrata subsp.
lyrata]
gi|297316146|gb|EFH46569.1| hypothetical protein ARALYDRAFT_493459 [Arabidopsis lyrata subsp.
lyrata]
Length = 273
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 44/96 (45%), Positives = 61/96 (63%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRPPGS + K + + HV+ + G DV+ I FS+ R VC+LS
Sbjct: 67 RRPRGRPPGSKNKPKAPIFVTRDSPNALRSHVLEISDGSDVAETIAHFSRRRQRGVCVLS 126
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G+++NVTLRQAA GG V+ +GRFEILSL+G+FL
Sbjct: 127 GTGSVANVTLRQAAAPGGVVSLQGRFEILSLTGAFL 162
>gi|242061166|ref|XP_002451872.1| hypothetical protein SORBIDRAFT_04g009050 [Sorghum bicolor]
gi|241931703|gb|EES04848.1| hypothetical protein SORBIDRAFT_04g009050 [Sorghum bicolor]
Length = 327
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 86/143 (60%), Gaps = 3/143 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + HV+ V G D+S I +F++ R VC+LS
Sbjct: 107 RRPRGRPAGSKNKPKPPIIITRDSANTLRTHVMEVAGGCDISESITAFARRRQRGVCVLS 166
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
G ++NVTLRQ A+ G V GRFEILSLSGSFL + + + GL+V L+G G+
Sbjct: 167 GAGTVTNVTLRQPASQGAVVALHGRFEILSLSGSFLPPPAPPEAT---GLTVYLAGGQGQ 223
Query: 245 VLGGSVAGLLTAATPVQVVVGSF 267
V+GGSV G LTAA PV ++ SF
Sbjct: 224 VVGGSVVGALTAAGPVVIMAASF 246
>gi|357144188|ref|XP_003573204.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Brachypodium
distachyon]
Length = 312
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 86/143 (60%), Gaps = 3/143 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + HV+ V G D+S I +F++ R VC+LS
Sbjct: 97 RRPRGRPAGSKNKPKPPIIITRDSANTLRTHVMEVAGGCDISESITAFARRRQRGVCVLS 156
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
G ++NVTLRQ A+ G V GRFEILSLSGSFL + + + GL+V L+G G+
Sbjct: 157 GAGTVTNVTLRQPASQGAVVALHGRFEILSLSGSFLPPPAPPEAT---GLTVYLAGGQGQ 213
Query: 245 VLGGSVAGLLTAATPVQVVVGSF 267
V+GGSV G LTAA PV ++ SF
Sbjct: 214 VVGGSVVGALTAAGPVVIMAASF 236
>gi|226500036|ref|NP_001146992.1| DNA binding protein [Zea mays]
gi|195606236|gb|ACG24948.1| DNA binding protein [Zea mays]
gi|413925983|gb|AFW65915.1| DNA binding protein [Zea mays]
Length = 320
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 86/143 (60%), Gaps = 3/143 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + HV+ V G D+S + +F++ R VC+LS
Sbjct: 101 RRPRGRPAGSKNKPKPPIIITRDSANTLRTHVMEVAGGCDISESVTAFARRRQRGVCVLS 160
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
G ++NVTLRQ A+ G V GRFEILSLSGSFL + + + GL+V L+G G+
Sbjct: 161 GAGTVTNVTLRQPASQGAVVALHGRFEILSLSGSFLPPPAPPEAT---GLTVYLAGGQGQ 217
Query: 245 VLGGSVAGLLTAATPVQVVVGSF 267
V+GGSV G LTAA PV V+ SF
Sbjct: 218 VVGGSVVGALTAAGPVVVMAASF 240
>gi|326500592|dbj|BAJ94962.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 331
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 60/146 (41%), Positives = 81/146 (55%), Gaps = 6/146 (4%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + HV+ V AG D+ + +++ R VC+LS
Sbjct: 98 RRPRGRPAGSKNKPKPPIIVTRDSPNALHSHVLEVSAGADIVDCVAEYARRRGRGVCVLS 157
Query: 185 ANGAISNVTLRQAATS--GGTV-TYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGP 241
GA+ NV LRQ S G V T GRFEILSL+G+ L + S GL+V LSG
Sbjct: 158 GGGAVVNVALRQPGASPPGSVVATLRGRFEILSLTGTVLPPPAPPGAS---GLTVFLSGG 214
Query: 242 DGRVLGGSVAGLLTAATPVQVVVGSF 267
G+V+GGSV G L AA PV ++ SF
Sbjct: 215 QGQVIGGSVVGTLVAAGPVVLMAASF 240
>gi|50725207|dbj|BAD33958.1| DNA-binding protein-like [Oryza sativa Japonica Group]
Length = 363
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 62/162 (38%), Positives = 89/162 (54%), Gaps = 13/162 (8%)
Query: 106 GTGSG-LSSPGGGP-LSPDSIKKSRGRPPGSGSGKKHQLEALGSA--GVGFTPHVITVKA 161
G+GSG L GGG S + KK RGRPPGS + K + A PHVI +
Sbjct: 61 GSGSGQLVVVGGGDGASIEVAKKRRGRPPGSKNKPKPPVVITREAEPAAAMRPHVIEIPG 120
Query: 162 GEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAA-----TSGGTVTYEGRFEILSLS 216
G DV+ + FS +C+L+ GA++NV+LR + ++ + + GR+EILSLS
Sbjct: 121 GRDVAEALARFSSRRNLGICVLAGTGAVANVSLRHPSPGVPGSAPAAIVFHGRYEILSLS 180
Query: 217 GSFL----LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLL 254
+FL S + GLS+SL+GP G+++GG+VAG L
Sbjct: 181 ATFLPPAMSSVAPQAAVAAAGLSISLAGPHGQIVGGAVAGPL 222
>gi|440655803|gb|AGC22550.1| male sterility related AT-hook DNA binding protein [Brassica
oleracea]
Length = 260
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 59/145 (40%), Positives = 84/145 (57%), Gaps = 3/145 (2%)
Query: 124 IKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCIL 183
+K+ RGRP GS + K + + H + + +G D+ + FS+ R +CIL
Sbjct: 55 VKRPRGRPAGSKNKPKPPIIVTHDSPNSLRAHAVEISSGNDICEALSDFSRRKQRGLCIL 114
Query: 184 SANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDG 243
SANG ++NVTLRQ A+SG VT GRFEILSL GS L + GL++ L+G G
Sbjct: 115 SANGCVTNVTLRQPASSGAIVTLHGRFEILSLLGSILPPPAPLG---ITGLTIYLAGHQG 171
Query: 244 RVLGGSVAGLLTAATPVQVVVGSFL 268
+V+GG V G L A+ PV ++ SF+
Sbjct: 172 QVVGGGVVGGLIASGPVVIMAASFM 196
>gi|326507624|dbj|BAK03205.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 309
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 68/160 (42%), Positives = 93/160 (58%), Gaps = 6/160 (3%)
Query: 111 LSSPGGGPLS---PDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSS 167
L SP GG L + ++ RGRP GS + K + + HV+ V G D+S
Sbjct: 72 LISPSGGGLQDGGENGSRRPRGRPAGSKNKPKPPIIITRDSANTLRTHVMEVAGGCDISE 131
Query: 168 KIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQ 227
I +F++ R VC+LS G ++NVTLRQ A+ G V GRFEILSLSGSFL + +
Sbjct: 132 SITAFARRRQRGVCVLSGAGTVTNVTLRQPASQGAVVALHGRFEILSLSGSFLPPPAPPE 191
Query: 228 RSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
+ GL+V L+G G+V+GG+V G LTAA PV ++ SF
Sbjct: 192 AT---GLTVYLAGGKGQVVGGTVVGSLTAAGPVVIMAASF 228
>gi|357137273|ref|XP_003570225.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Brachypodium
distachyon]
Length = 337
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 60/146 (41%), Positives = 82/146 (56%), Gaps = 6/146 (4%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + HV+ V AG D+ + +++ R VC+LS
Sbjct: 95 RRPRGRPAGSKNKPKPPIIVTRDSPNALHSHVLEVAAGADIVDCVAEYARRRGRGVCVLS 154
Query: 185 ANGAISNVTLRQ--AATSGGTV-TYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGP 241
GA+ NV LRQ A+ G V T GRFEILSL+G+ L + S GL+V LSG
Sbjct: 155 GGGAVVNVALRQPGASPPGSVVATLRGRFEILSLTGTVLPPPAPPGAS---GLTVFLSGG 211
Query: 242 DGRVLGGSVAGLLTAATPVQVVVGSF 267
G+V+GGSV G L AA PV ++ SF
Sbjct: 212 QGQVIGGSVVGSLVAAGPVVLMAASF 237
>gi|255566448|ref|XP_002524209.1| ESC, putative [Ricinus communis]
gi|223536486|gb|EEF38133.1| ESC, putative [Ricinus communis]
Length = 342
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 61/143 (42%), Positives = 84/143 (58%), Gaps = 3/143 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRPPGS + K + + HV+ + +G D+ I +F+Q R V ILS
Sbjct: 111 RRPRGRPPGSKNKLKPPIVVTKESPNALRSHVLEISSGTDIVGSISNFAQRRHRGVSILS 170
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
+G ++NVTLRQ A GG +T GRFEILSL GSFL S + L+V L+G G+
Sbjct: 171 GSGIVTNVTLRQPAAPGGVITLHGRFEILSLLGSFLPPPSPPGAT---TLTVYLAGGQGQ 227
Query: 245 VLGGSVAGLLTAATPVQVVVGSF 267
V+GG+V G L AA PV V+ +F
Sbjct: 228 VVGGTVMGQLVAAGPVMVIAATF 250
>gi|449433267|ref|XP_004134419.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Cucumis
sativus]
Length = 300
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 61/143 (42%), Positives = 86/143 (60%), Gaps = 3/143 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRPPGS + K + + +V+ V AG DV+ I F++ R VC+LS
Sbjct: 81 RRPRGRPPGSKNKPKPPIFVTRDSPNALRSYVLEVAAGSDVADSIAQFARKRQRGVCVLS 140
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
A G ++NVTLRQ A G + +GRFEILSL+G+FL + + GL+V LSG G+
Sbjct: 141 ATGLVANVTLRQPAAPGSVMPLQGRFEILSLTGAFLPGPAPPGST---GLTVYLSGGQGQ 197
Query: 245 VLGGSVAGLLTAATPVQVVVGSF 267
V+GGSV G L AA P+ V+ +F
Sbjct: 198 VVGGSVVGSLVAAGPIMVIAATF 220
>gi|15223074|ref|NP_177776.1| AT-hook motif nuclear localized protein 29 [Arabidopsis thaliana]
gi|12323978|gb|AAG51949.1|AC015450_10 unknown protein; 41834-42742 [Arabidopsis thaliana]
gi|119657402|tpd|FAA00300.1| TPA: AT-hook motif nuclear localized protein 29 [Arabidopsis
thaliana]
gi|332197729|gb|AEE35850.1| AT-hook motif nuclear localized protein 29 [Arabidopsis thaliana]
Length = 302
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 63/165 (38%), Positives = 92/165 (55%), Gaps = 13/165 (7%)
Query: 113 SPGGGPLSPDSI-KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMS 171
PG P++ S K+ RGRPPGS + K + + HV+ V +G D+ + +
Sbjct: 59 DPGSDPVTSGSTGKRPRGRPPGSKNKPKPPVIVTRDSPNVLRSHVLEVSSGADIVESVTT 118
Query: 172 FSQNGPRAVCILSANGAISNVTLRQAAT---------SGGTVTYEGRFEILSLSGSFLLS 222
+++ R V ILS NG ++NV+LRQ AT +GG V GRFEILSL+G+ L
Sbjct: 119 YARRRGRGVSILSGNGTVANVSLRQPATTAAHGANGGTGGVVALHGRFEILSLTGTVLPP 178
Query: 223 ESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
+ +GGLS+ LSG G+V+GG+V L A+ PV ++ SF
Sbjct: 179 PAP---PGSGGLSIFLSGVQGQVIGGNVVAPLVASGPVILMAASF 220
>gi|21593180|gb|AAM65129.1| putative DNA-binding protein [Arabidopsis thaliana]
Length = 281
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 43/96 (44%), Positives = 60/96 (62%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRPPGS + K + + HV+ + G DV+ I FS+ R VC+LS
Sbjct: 67 RRPRGRPPGSKNKPKAPIFVTRDSPNALRSHVLEISDGSDVADTIAHFSRRRQRGVCVLS 126
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G+++NV LRQAA GG V+ +GRFEILSL+G+FL
Sbjct: 127 GTGSVANVXLRQAAAPGGVVSLQGRFEILSLTGAFL 162
>gi|255541340|ref|XP_002511734.1| ESC, putative [Ricinus communis]
gi|223548914|gb|EEF50403.1| ESC, putative [Ricinus communis]
Length = 299
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 64/164 (39%), Positives = 92/164 (56%), Gaps = 10/164 (6%)
Query: 111 LSSPGGGPLSP-------DSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGE 163
+++P G L P + ++ RGRP GS + K + + HV+ + G
Sbjct: 62 ITTPEGKELVPTTGGGDGEMTRRPRGRPAGSKNKPKPPIIITRDSANALRSHVMEIANGS 121
Query: 164 DVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSE 223
D+ + +F++ R VCILS G ++NVTLRQ A+ G VT GRFEILSLSGSFL
Sbjct: 122 DIMESVSTFARRRQRGVCILSGTGTVTNVTLRQPASPGAVVTLHGRFEILSLSGSFLPPP 181
Query: 224 SSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
+ S GL++ L+G G+V+GGSV G L A+ PV ++ SF
Sbjct: 182 APPAAS---GLTIYLAGGQGQVVGGSVVGPLLASGPVVIMAASF 222
>gi|225432991|ref|XP_002284519.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Vitis
vinifera]
Length = 260
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 60/143 (41%), Positives = 85/143 (59%), Gaps = 3/143 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRPPGS + K + + HV+ V G DV+ + F++ R VC+LS
Sbjct: 48 RRPRGRPPGSKNKPKPPIFVTRDSPNALRSHVMEVAGGHDVAESVAQFARRRQRGVCVLS 107
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
+G+++NVTLRQ A G V GRFEILSL+G+FL + + GL+V L+G G+
Sbjct: 108 GSGSVANVTLRQPAAPGAVVALHGRFEILSLTGAFLPGPAP---PGSTGLTVYLAGGQGQ 164
Query: 245 VLGGSVAGLLTAATPVQVVVGSF 267
V+GGSV G L AA PV V+ +F
Sbjct: 165 VVGGSVVGSLVAAGPVIVIAATF 187
>gi|297849858|ref|XP_002892810.1| hypothetical protein ARALYDRAFT_471623 [Arabidopsis lyrata subsp.
lyrata]
gi|297338652|gb|EFH69069.1| hypothetical protein ARALYDRAFT_471623 [Arabidopsis lyrata subsp.
lyrata]
Length = 207
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 53/150 (35%), Positives = 84/150 (56%), Gaps = 6/150 (4%)
Query: 122 DSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVC 181
+++ + RGRP GS K + + +P+++ V +G DV + F + C
Sbjct: 2 ETVGRPRGRP--QGSKNKPKAPIFVTIDPPMSPYILEVPSGNDVVEALNRFCRRKAIGFC 59
Query: 182 ILSANGAISNVTLRQA--ATSGGTVTYEGRFEILSLSGSFLLSESSGQRSR--TGGLSVS 237
+LS +G++++VTLRQ A G T+T+ G+F++LS+S +FL + +VS
Sbjct: 60 VLSGSGSVADVTLRQPSPAAPGSTITFHGKFDLLSVSATFLPPPPQTSLPPPFSNFFTVS 119
Query: 238 LSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
L+GP G+V+GG VAG L AA V VV SF
Sbjct: 120 LAGPQGQVIGGFVAGPLVAAGTVYVVATSF 149
>gi|119331586|gb|ABL63119.1| AT-hook DNA-binding protein [Catharanthus roseus]
Length = 256
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 61/143 (42%), Positives = 85/143 (59%), Gaps = 3/143 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRPPGS + K + + HV+ V G DV+ I F++ R VC+LS
Sbjct: 25 RRPRGRPPGSKNKPKPPIFVTRDSPNALRSHVLEVSGGSDVAESIAVFARKRQRGVCVLS 84
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
+G+++NVTLRQ A G V GRFEILSLSG+FL + + GL+V L+G G+
Sbjct: 85 GSGSVANVTLRQPAAPGAVVALHGRFEILSLSGAFLPGPAP---PGSTGLTVYLAGGQGQ 141
Query: 245 VLGGSVAGLLTAATPVQVVVGSF 267
V+GGSV G L AA PV ++ +F
Sbjct: 142 VVGGSVVGSLVAAGPVLIIAATF 164
>gi|297839523|ref|XP_002887643.1| hypothetical protein ARALYDRAFT_476807 [Arabidopsis lyrata subsp.
lyrata]
gi|297333484|gb|EFH63902.1| hypothetical protein ARALYDRAFT_476807 [Arabidopsis lyrata subsp.
lyrata]
Length = 289
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 62/164 (37%), Positives = 91/164 (55%), Gaps = 12/164 (7%)
Query: 113 SPGGGPLSPDSI--KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIM 170
PG P++ S K+ RGRPPGS + K + + HV+ V +G D+ +
Sbjct: 53 DPGSDPVTSGSTPGKRPRGRPPGSKNKPKPPVIVTRDSPNVLRSHVLEVSSGADIVESVT 112
Query: 171 SFSQNGPRAVCILSANGAISNVTLRQAAT-------SGGTVTYEGRFEILSLSGSFLLSE 223
++++ R V ILS NG ++NV+LRQ A +GG V GRFEILSL+G+ L
Sbjct: 113 TYARRRGRGVSILSGNGTVANVSLRQPAAAHGANGGTGGVVALHGRFEILSLTGTVLPPP 172
Query: 224 SSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
+ +GGLS+ LSG G+V+GG+V L A+ PV ++ SF
Sbjct: 173 AP---PGSGGLSIFLSGVQGQVIGGNVVAPLVASGPVILMAASF 213
>gi|383146753|gb|AFG55091.1| Pinus taeda anonymous locus 2_10133_02 genomic sequence
gi|383146754|gb|AFG55092.1| Pinus taeda anonymous locus 2_10133_02 genomic sequence
gi|383146755|gb|AFG55093.1| Pinus taeda anonymous locus 2_10133_02 genomic sequence
gi|383146756|gb|AFG55094.1| Pinus taeda anonymous locus 2_10133_02 genomic sequence
gi|383146757|gb|AFG55095.1| Pinus taeda anonymous locus 2_10133_02 genomic sequence
Length = 149
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 49/106 (46%), Positives = 64/106 (60%), Gaps = 4/106 (3%)
Query: 115 GGGPLSPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQ 174
GGG L ++SRGRPPGS + K + + G HV+ + G D+ + +F++
Sbjct: 45 GGGELG----RRSRGRPPGSKNKPKPPIIIHQDSPDGLAAHVLEIANGCDIGESLATFAR 100
Query: 175 NGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
R VC+LS +G +SNVTLRQ A G VT GRFEILSLSGSFL
Sbjct: 101 RRQRGVCVLSGSGTVSNVTLRQPAAPGAIVTLHGRFEILSLSGSFL 146
>gi|449529339|ref|XP_004171657.1| PREDICTED: putative DNA-binding protein ESCAROLA-like, partial
[Cucumis sativus]
Length = 297
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 61/143 (42%), Positives = 86/143 (60%), Gaps = 3/143 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRPPGS + K + + +V+ V AG DV+ I F++ R VC+LS
Sbjct: 78 RRPRGRPPGSKNKPKPPIFVTRDSPNALRSYVLEVAAGSDVADSIAQFARKRQRGVCVLS 137
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
A G ++NVTLRQ A G + +GRFEILSL+G+FL + + GL+V LSG G+
Sbjct: 138 ATGLVANVTLRQPAAPGSVMPLQGRFEILSLTGAFLPGPAPPGST---GLTVYLSGGQGQ 194
Query: 245 VLGGSVAGLLTAATPVQVVVGSF 267
V+GGSV G L AA P+ V+ +F
Sbjct: 195 VVGGSVVGSLVAAGPIMVIAATF 217
>gi|255576858|ref|XP_002529315.1| DNA binding protein, putative [Ricinus communis]
gi|223531239|gb|EEF33084.1| DNA binding protein, putative [Ricinus communis]
Length = 301
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 60/143 (41%), Positives = 85/143 (59%), Gaps = 3/143 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRPPGS + K + + HV+ V G DV+ + F++ R VC+LS
Sbjct: 81 RRPRGRPPGSKNKPKPPIFVTRDSPNALRSHVMEVVGGADVAECVAQFARRRQRGVCVLS 140
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
+G+++NVTLRQ A G V GRFEILSL+G+FL + + GL+V L+G G+
Sbjct: 141 GSGSVANVTLRQPAAPGAVVALHGRFEILSLTGAFLPGPAP---PGSTGLTVYLAGGQGQ 197
Query: 245 VLGGSVAGLLTAATPVQVVVGSF 267
V+GGSV G L AA PV V+ +F
Sbjct: 198 VVGGSVVGSLIAAGPVMVIAATF 220
>gi|224107887|ref|XP_002314642.1| predicted protein [Populus trichocarpa]
gi|222863682|gb|EEF00813.1| predicted protein [Populus trichocarpa]
Length = 207
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 59/143 (41%), Positives = 85/143 (59%), Gaps = 3/143 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRPPGS + K + + HV+ + G DV+ + F++ R VC+LS
Sbjct: 1 RRPRGRPPGSKNKPKPPIFVTRDSPNALRSHVMEIAGGADVAESVAQFARRRQRGVCVLS 60
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
+G+++NVTLRQ A G V GRFEILSL+G+FL + + GL+V L+G G+
Sbjct: 61 GSGSVANVTLRQPAAPGAVVALHGRFEILSLTGAFLPGPAP---PGSTGLTVYLAGGQGQ 117
Query: 245 VLGGSVAGLLTAATPVQVVVGSF 267
V+GGSV G L AA PV V+ +F
Sbjct: 118 VVGGSVVGSLIAAGPVMVIAATF 140
>gi|356563284|ref|XP_003549894.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Glycine max]
Length = 287
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 59/143 (41%), Positives = 84/143 (58%), Gaps = 3/143 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + H++ V G D+ + F++ R +CI+S
Sbjct: 73 RRPRGRPAGSKNKPKPPIIITRDSANAMRTHMMEVADGYDIVESVSEFARKRQRGICIMS 132
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
G ++NVTLRQ A+SG VT GRFEILSLSGSFL + S GL++ L+G G+
Sbjct: 133 GTGTVTNVTLRQPASSGSVVTLHGRFEILSLSGSFLPPPAPPAAS---GLTIYLAGGQGQ 189
Query: 245 VLGGSVAGLLTAATPVQVVVGSF 267
V+GGSV G L A+ PV ++ SF
Sbjct: 190 VVGGSVVGTLVASGPVVIMAASF 212
>gi|225453933|ref|XP_002279636.1| PREDICTED: putative DNA-binding protein ESCAROLA [Vitis vinifera]
gi|147867329|emb|CAN81187.1| hypothetical protein VITISV_029906 [Vitis vinifera]
gi|296089162|emb|CBI38865.3| unnamed protein product [Vitis vinifera]
Length = 300
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 59/143 (41%), Positives = 85/143 (59%), Gaps = 3/143 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + HV+ + G D+ + +F++ R +CILS
Sbjct: 82 RRPRGRPAGSKNKPKPPIIITRDSANALRSHVMEIATGCDIMDSLNTFARRRQRGICILS 141
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
+G ++NVTLRQ A+ G VT GRFEILSLSGSFL + S GL++ L+G G+
Sbjct: 142 GSGTVTNVTLRQPASPGAVVTLHGRFEILSLSGSFLPPPAPPAAS---GLTIYLAGGQGQ 198
Query: 245 VLGGSVAGLLTAATPVQVVVGSF 267
V+GGSV G L A+ PV ++ SF
Sbjct: 199 VVGGSVVGPLLASGPVVIMAASF 221
>gi|449497591|ref|XP_004160444.1| PREDICTED: LOW QUALITY PROTEIN: putative DNA-binding protein
ESCAROLA-like [Cucumis sativus]
Length = 276
Score = 87.4 bits (215), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 61/149 (40%), Positives = 88/149 (59%), Gaps = 3/149 (2%)
Query: 120 SPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRA 179
S ++++ RGRP GS + K + + H I V G DV+ + +F++ R
Sbjct: 63 SDQALRRPRGRPAGSKNKPKPPIIVTRDSANALRAHAIEVSTGCDVNESLSNFARRKQRG 122
Query: 180 VCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLS 239
VCILS +G ++NVTLRQAA+SG VT GRFEILS+ GS L + S GL++ LS
Sbjct: 123 VCILSGSGCVTNVTLRQAASSGAIVTLHGRFEILSMLGSILPPPAP---SGITGLTIYLS 179
Query: 240 GPDGRVLGGSVAGLLTAATPVQVVVGSFL 268
G G+V+GG V G L A+ PV ++ +F+
Sbjct: 180 GAQGQVVGGVVVGALIASGPVVIMAATFM 208
>gi|356497181|ref|XP_003517441.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Glycine max]
Length = 300
Score = 87.4 bits (215), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 60/143 (41%), Positives = 85/143 (59%), Gaps = 3/143 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + HV+ V G D+ + +F++ R VCI+S
Sbjct: 79 RRPRGRPAGSKNKPKPPIIITRDSANALKTHVMEVADGCDIVDSVSAFARRRQRGVCIMS 138
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
G ++NVTLRQ A+SG VT GRFEILSL+GSFL + S GL++ L+G G+
Sbjct: 139 GTGTVTNVTLRQPASSGAVVTLHGRFEILSLAGSFLPPPAPPAAS---GLTIYLAGGQGQ 195
Query: 245 VLGGSVAGLLTAATPVQVVVGSF 267
V+GGSV G L A+ PV ++ SF
Sbjct: 196 VVGGSVVGALIASGPVVIMSASF 218
>gi|224147184|ref|XP_002336424.1| predicted protein [Populus trichocarpa]
gi|222834973|gb|EEE73422.1| predicted protein [Populus trichocarpa]
Length = 56
Score = 87.4 bits (215), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 45/55 (81%), Positives = 50/55 (90%)
Query: 208 GRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQV 262
GRFEIL+LSGS+L SE+ GQRSR+GGLSV LSGPDGRVLGG+VAGLL AA PVQV
Sbjct: 1 GRFEILALSGSYLPSENGGQRSRSGGLSVCLSGPDGRVLGGTVAGLLVAAAPVQV 55
>gi|357121024|ref|XP_003562222.1| PREDICTED: uncharacterized protein LOC100834381 [Brachypodium
distachyon]
Length = 222
Score = 87.4 bits (215), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 48/121 (39%), Positives = 71/121 (58%), Gaps = 8/121 (6%)
Query: 151 GFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRF 210
G PHV+T+ AGED+ S++++ S+ +A+C+LSA GA+ L Q SG + ++G
Sbjct: 49 GLQPHVLTIAAGEDIISRVVAISRINAKAICVLSAFGAVKEAILLQP--SGAILNHKGPL 106
Query: 211 EILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLAD 270
EI+ L GS L S G L V+L+ D V+ G +AG L AAT +Q ++GSF D
Sbjct: 107 EIIRLVGSILTSND------LGCLRVTLASVDSSVISGIIAGPLIAATTIQAILGSFQND 160
Query: 271 G 271
Sbjct: 161 A 161
>gi|255572333|ref|XP_002527105.1| DNA binding protein, putative [Ricinus communis]
gi|223533528|gb|EEF35268.1| DNA binding protein, putative [Ricinus communis]
Length = 279
Score = 87.4 bits (215), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 62/172 (36%), Positives = 87/172 (50%), Gaps = 21/172 (12%)
Query: 50 GGDGAIPQAQGLNVMNMGSGSEPMKRKRGRPRKYGPDGT-MSLALVPSPSSVTTATGGTG 108
G D A+P N+M S EP RG R+ G + MS L S V++A G
Sbjct: 4 GADLAVPPIGSKNIME--SNQEP---NRGNYRRPGIEAILMSPKLPKSVPPVSSAVEG-- 56
Query: 109 SGLSSPGGGPLSPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSK 168
++I++ RGRP GS + K + + H + V +G DVS
Sbjct: 57 -------------ETIRRPRGRPAGSKNKPKPPIIVTRDSANALRAHAMEVSSGCDVSES 103
Query: 169 IMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
+ +F++ R +C+LS +G ++NVTLRQ A+SG VT GRFEILSL GS L
Sbjct: 104 LANFARRRQRGICVLSGSGCVTNVTLRQPASSGAIVTLHGRFEILSLLGSIL 155
>gi|357154744|ref|XP_003576887.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Brachypodium
distachyon]
Length = 262
Score = 87.4 bits (215), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 63/144 (43%), Positives = 86/144 (59%), Gaps = 4/144 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRPPGS + K + + HV+ V +G D++ I FS+ R VC+LS
Sbjct: 35 RRPRGRPPGSKNKPKPPIFVTRDSPNALRSHVMEVSSGADIADSIAHFSRRRQRGVCVLS 94
Query: 185 ANGAISNVTLRQAATSGG-TVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDG 243
GA+++V LRQ A GG V GRFEILSL+G+FL S + GL+V L+G G
Sbjct: 95 GAGAVADVALRQPAAPGGAVVALRGRFEILSLTGTFLPGPSP---PGSTGLTVYLAGGQG 151
Query: 244 RVLGGSVAGLLTAATPVQVVVGSF 267
+V+GGSV G LTAA PV V+ +F
Sbjct: 152 QVVGGSVVGTLTAAGPVMVIASTF 175
>gi|297742664|emb|CBI34813.3| unnamed protein product [Vitis vinifera]
Length = 240
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 58/143 (40%), Positives = 84/143 (58%), Gaps = 3/143 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + HV+ + G D+ + +F++ R VCI+S
Sbjct: 29 RRPRGRPAGSKNKPKPPIIITRDSANALRTHVMEIADGCDIVESVATFARRRQRGVCIMS 88
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
G ++NVTLRQ A+ G VT GRFEILSLSGSFL + + GL++ L+G G+
Sbjct: 89 GTGTVTNVTLRQPASPGAIVTLHGRFEILSLSGSFLPPPAPPAAT---GLTIYLAGGQGQ 145
Query: 245 VLGGSVAGLLTAATPVQVVVGSF 267
V+GGSV G L A+ PV ++ SF
Sbjct: 146 VVGGSVVGQLLASGPVVIMAASF 168
>gi|356514176|ref|XP_003525782.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Glycine max]
Length = 283
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 60/143 (41%), Positives = 84/143 (58%), Gaps = 3/143 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + H++ V G D+ + F++ R VCI+S
Sbjct: 70 RRPRGRPAGSKNKPKPPIIITRDSANAMRTHMMEVADGCDIVESVSEFARKRQRGVCIMS 129
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
G ++NVTLRQ A+SG VT GRFEILSLSGSFL + S GL++ L+G G+
Sbjct: 130 GTGTVNNVTLRQPASSGSVVTLHGRFEILSLSGSFLPPPAPPAAS---GLTIYLAGGQGQ 186
Query: 245 VLGGSVAGLLTAATPVQVVVGSF 267
V+GGSV G L A+ PV ++ SF
Sbjct: 187 VVGGSVVGTLVASGPVVIMAASF 209
>gi|224063913|ref|XP_002301300.1| predicted protein [Populus trichocarpa]
gi|222843026|gb|EEE80573.1| predicted protein [Populus trichocarpa]
Length = 298
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 60/143 (41%), Positives = 85/143 (59%), Gaps = 3/143 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + HV+ + G D+ + +F++ R VCILS
Sbjct: 80 RRPRGRPAGSKNKPKPPIIITRDSPNALRSHVMEIATGCDIMESVSTFARRRQRGVCILS 139
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
A G ++NVTL+Q A+ G VT GRFEILSLSGSFL + S GL++ L+G G+
Sbjct: 140 ATGTVTNVTLKQPASPGAVVTLHGRFEILSLSGSFLPPPAPPAAS---GLTIYLAGGQGQ 196
Query: 245 VLGGSVAGLLTAATPVQVVVGSF 267
V+GGSV G L A+ PV ++ SF
Sbjct: 197 VVGGSVVGPLLASGPVVIMAASF 219
>gi|224058649|ref|XP_002299584.1| predicted protein [Populus trichocarpa]
gi|222846842|gb|EEE84389.1| predicted protein [Populus trichocarpa]
Length = 302
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 58/143 (40%), Positives = 85/143 (59%), Gaps = 3/143 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + H++ V G D+ + +F++ R VCI+S
Sbjct: 86 RRPRGRPAGSKNKPKPPIIITRDSANALRTHLMEVADGCDIVESVATFARRRQRGVCIMS 145
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
G ++NVTLRQ A+ G VT GRFEILSL+GSFL + + GL++ L+G G+
Sbjct: 146 GTGTVTNVTLRQPASPGAIVTLHGRFEILSLAGSFLPPPAPPAAT---GLTIYLAGGQGQ 202
Query: 245 VLGGSVAGLLTAATPVQVVVGSF 267
V+GGSV G LTA+ PV ++ SF
Sbjct: 203 VVGGSVVGTLTASGPVVIMAASF 225
>gi|356495206|ref|XP_003516470.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Glycine max]
Length = 288
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 60/143 (41%), Positives = 85/143 (59%), Gaps = 3/143 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + HV+ + G D+ I +F++ R VC+LS
Sbjct: 67 RRPRGRPAGSKNKPKPPIIITRDSANALRSHVMEIANGCDIMESITAFARRRQRGVCVLS 126
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
+G ++NVTLRQ A+ G VT GRFEILSLSGSFL + S GL++ L+G G+
Sbjct: 127 GSGTVTNVTLRQPASPGAVVTLHGRFEILSLSGSFLPPPAPPAAS---GLAIYLAGGQGQ 183
Query: 245 VLGGSVAGLLTAATPVQVVVGSF 267
V+GGSV G L A+ PV ++ SF
Sbjct: 184 VVGGSVVGPLVASGPVVIMAASF 206
>gi|449503261|ref|XP_004161914.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Cucumis
sativus]
Length = 269
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 59/145 (40%), Positives = 86/145 (59%), Gaps = 3/145 (2%)
Query: 123 SIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCI 182
S ++ RGRPPGS + K + + HV+ + +G D+ I +F+Q R V +
Sbjct: 52 STRRPRGRPPGSKNKPKPPVVVTKESPDALRSHVLEIGSGSDIVESISNFAQRRQRGVSV 111
Query: 183 LSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPD 242
LS NG ++NVTLR SGG +T +GRF+ILSLSG+FL + + + GL+V L+G
Sbjct: 112 LSGNGVVANVTLRHPGASGGVITLQGRFDILSLSGAFLPAPAPPGAT---GLTVYLAGGQ 168
Query: 243 GRVLGGSVAGLLTAATPVQVVVGSF 267
G+V+GG V G L A PV V+ +F
Sbjct: 169 GQVVGGIVVGALVATGPVIVIAATF 193
>gi|449439125|ref|XP_004137338.1| PREDICTED: uncharacterized protein LOC101219306 [Cucumis sativus]
Length = 370
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 61/149 (40%), Positives = 88/149 (59%), Gaps = 3/149 (2%)
Query: 120 SPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRA 179
S ++++ RGRP GS + K + + H I V G DV+ + +F++ R
Sbjct: 63 SDQALRRPRGRPAGSKNKPKPPIIVTRDSANALRAHAIEVSTGCDVNESLSNFARRKQRG 122
Query: 180 VCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLS 239
VCILS +G ++NVTLRQAA+SG VT GRFEILS+ GS L + S GL++ LS
Sbjct: 123 VCILSGSGCVTNVTLRQAASSGAIVTLHGRFEILSMLGSILPPPAP---SGITGLTIYLS 179
Query: 240 GPDGRVLGGSVAGLLTAATPVQVVVGSFL 268
G G+V+GG V G L A+ PV ++ +F+
Sbjct: 180 GAQGQVVGGVVVGALIASGPVVIMAATFM 208
>gi|225426655|ref|XP_002281296.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Vitis
vinifera]
Length = 302
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 58/143 (40%), Positives = 84/143 (58%), Gaps = 3/143 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + HV+ + G D+ + +F++ R VCI+S
Sbjct: 82 RRPRGRPAGSKNKPKPPIIITRDSANALRTHVMEIADGCDIVESVATFARRRQRGVCIMS 141
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
G ++NVTLRQ A+ G VT GRFEILSLSGSFL + + GL++ L+G G+
Sbjct: 142 GTGTVTNVTLRQPASPGAIVTLHGRFEILSLSGSFLPPPAPPAAT---GLTIYLAGGQGQ 198
Query: 245 VLGGSVAGLLTAATPVQVVVGSF 267
V+GGSV G L A+ PV ++ SF
Sbjct: 199 VVGGSVVGQLLASGPVVIMAASF 221
>gi|15223782|ref|NP_172901.1| putative AT-hook DNA-binding protein [Arabidopsis thaliana]
gi|7262692|gb|AAF43950.1|AC012188_27 Contains similarity to an AT-hook protein 2 from Arabidopsis
thaliana gb|AJ224119.1 [Arabidopsis thaliana]
gi|119360061|gb|ABL66759.1| At1g14490 [Arabidopsis thaliana]
gi|119657400|tpd|FAA00299.1| TPA: AT-hook motif nuclear localized protein 28 [Arabidopsis
thaliana]
gi|225897926|dbj|BAH30295.1| hypothetical protein [Arabidopsis thaliana]
gi|332191050|gb|AEE29171.1| putative AT-hook DNA-binding protein [Arabidopsis thaliana]
Length = 206
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 63/189 (33%), Positives = 101/189 (53%), Gaps = 13/189 (6%)
Query: 122 DSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVC 181
+++ + RGRP GS K + + +P+++ V +G DV + F + C
Sbjct: 2 ETVGRPRGRP--RGSKNKPKAPIFVTIDPPMSPYILEVPSGNDVVEALNRFCRGKAIGFC 59
Query: 182 ILSANGAISNVTLRQ--AATSGGTVTYEGRFEILSLSGSFLLSESSGQRSR--TGGLSVS 237
+LS +G++++VTLRQ A G T+T+ G+F++LS+S +FL S + +VS
Sbjct: 60 VLSGSGSVADVTLRQPSPAAPGSTITFHGKFDLLSVSATFLPPLPPTSLSPPVSNFFTVS 119
Query: 238 LSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVPPKLAPGGQPA 297
L+GP G+V+GG VAG L AA V V SF ++ S HR+ + + + G+
Sbjct: 120 LAGPQGKVIGGFVAGPLVAAGTVYFVATSF------KNPSYHRLPATEEEQRNSAEGEEE 173
Query: 298 GQCSPPSRG 306
GQ SPP G
Sbjct: 174 GQ-SPPVSG 181
>gi|89257682|gb|ABD65169.1| hypothetical protein 40.t00056 [Brassica oleracea]
Length = 293
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 49/114 (42%), Positives = 74/114 (64%), Gaps = 5/114 (4%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILS 214
H++ V G DV + ++++ R +C+LS +G ++NV++RQ + +G VT +G FEILS
Sbjct: 112 HILEVTNGCDVFDCVATYARRRQRGICVLSGSGTVTNVSIRQPSAAGAVVTLQGTFEILS 171
Query: 215 LSGSFLLSES-SGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
LSGSFL + G S L++ ++G G+V+GGSV G LTAA PV V+ SF
Sbjct: 172 LSGSFLPPPAPPGATS----LTIFVAGGQGQVIGGSVVGELTAAGPVIVIAASF 221
>gi|242081755|ref|XP_002445646.1| hypothetical protein SORBIDRAFT_07g023325 [Sorghum bicolor]
gi|241941996|gb|EES15141.1| hypothetical protein SORBIDRAFT_07g023325 [Sorghum bicolor]
Length = 323
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 59/174 (33%), Positives = 94/174 (54%), Gaps = 28/174 (16%)
Query: 118 PLSPDSIKKSRGRPPGSGSGKK------HQLEALGSAGVGFTPHVITVKAGEDVSSKIMS 171
P+S ++ K+ RGRPPGS + K +E PHV+ + +G DV+ +
Sbjct: 76 PVSVETGKRRRGRPPGSKNKPKPPPVVTRDVEP----AAAMRPHVLEIPSGGDVARALAG 131
Query: 172 FSQNGPRAVCILSANGAISNVTLRQ--------------AATSGGTVTYEGRFEILSLSG 217
F++ +C+L+ GA+++V+LR AA + V + GR+EILS+S
Sbjct: 132 FARRRGLGICVLAGTGAVADVSLRHPAASSSADGGGGGAAAAAAAVVVFRGRYEILSISA 191
Query: 218 SFL---LSESSGQRSRTG-GLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
+FL +S + RS LS+SL+GP G+++GG+V G L AAT V V+ +F
Sbjct: 192 TFLAPSMSAAVPARSAVSRDLSISLAGPHGQIVGGAVVGPLVAATTVVVLAAAF 245
>gi|224071611|ref|XP_002303540.1| predicted protein [Populus trichocarpa]
gi|222840972|gb|EEE78519.1| predicted protein [Populus trichocarpa]
Length = 303
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 58/143 (40%), Positives = 85/143 (59%), Gaps = 3/143 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + H++ V G D+ + +F++ R VCI+S
Sbjct: 87 RRPRGRPSGSKNKPKPPIIITRDSANALRTHLMEVADGCDIVESVATFARRRQRGVCIMS 146
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
G ++NVTLRQ A+ G VT GRFEILSL+GSFL + + GL++ L+G G+
Sbjct: 147 GTGTVTNVTLRQPASPGAIVTLHGRFEILSLAGSFLPPPAPPAAT---GLTIYLAGGQGQ 203
Query: 245 VLGGSVAGLLTAATPVQVVVGSF 267
V+GGSV G LTA+ PV ++ SF
Sbjct: 204 VVGGSVVGTLTASGPVVIMAASF 226
>gi|119331582|gb|ABL63117.1| AT-hook DNA-binding protein, partial [Catharanthus roseus]
Length = 250
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 65/144 (45%), Positives = 87/144 (60%), Gaps = 5/144 (3%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRPPGS + K + + H++ V G+DV I ++++ R +CILS
Sbjct: 38 RRPRGRPPGSKNKAKPPVIITRESANTLRAHILEVGNGQDVFDCIATYARRRQRGICILS 97
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSES-SGQRSRTGGLSVSLSGPDG 243
+G ++NVTLRQ A GG VT GRFEILSLSGSFL + G S L++ L G G
Sbjct: 98 GSGIVTNVTLRQPAGGGGVVTLHGRFEILSLSGSFLPPPAPPGATS----LTIFLGGGQG 153
Query: 244 RVLGGSVAGLLTAATPVQVVVGSF 267
+V+GGSV G LTAA PV V+ SF
Sbjct: 154 QVVGGSVVGELTAAGPVIVIASSF 177
>gi|15241852|ref|NP_198211.1| DNA-binding family protein [Arabidopsis thaliana]
gi|332006432|gb|AED93815.1| DNA-binding family protein [Arabidopsis thaliana]
Length = 216
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 51/134 (38%), Positives = 74/134 (55%), Gaps = 24/134 (17%)
Query: 144 ALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGT 203
AL G FTPH++ + GEDV+ KI+ F+Q +C+LSA+G+ISN +L
Sbjct: 22 ALSKTGQCFTPHIVNITPGEDVAEKIVLFTQQSKHQLCVLSASGSISNASLSH------- 74
Query: 204 VTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVV 263
L+ + +TGGLSV LS DG++ GG V GLL AA PVQVV
Sbjct: 75 -----------------LASGTSHGGKTGGLSVCLSNSDGQIFGGGVGGLLKAAGPVQVV 117
Query: 264 VGSFLADGRKESKS 277
+G+F + +K+ ++
Sbjct: 118 LGTFQLEKKKDGRN 131
>gi|356540489|ref|XP_003538721.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Glycine max]
Length = 298
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 60/143 (41%), Positives = 86/143 (60%), Gaps = 3/143 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + HV+ V G D+ + +F++ R VCI+S
Sbjct: 78 RRPRGRPAGSKNKPKPPIIITRDSANALKTHVMEVADGCDIVESVSAFARRRQRGVCIMS 137
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
G ++NVTLRQ A+SG VT GRFEILSL+GSFL + + S GL++ L+G G+
Sbjct: 138 GTGTVTNVTLRQPASSGAVVTLHGRFEILSLAGSFLPPPAPPEAS---GLTIYLAGGQGQ 194
Query: 245 VLGGSVAGLLTAATPVQVVVGSF 267
V+GGSV G L A+ PV ++ SF
Sbjct: 195 VVGGSVVGALIASGPVVIMSASF 217
>gi|357481857|ref|XP_003611214.1| AT-hook DNA-binding protein [Medicago truncatula]
gi|355512549|gb|AES94172.1| AT-hook DNA-binding protein [Medicago truncatula]
Length = 325
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 60/143 (41%), Positives = 84/143 (58%), Gaps = 3/143 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + HV+ V G DV + +F++ R VCI+S
Sbjct: 100 RRPRGRPAGSKNKPKPPIIITRDSANALKTHVMEVADGCDVVESVNNFARRRQRGVCIMS 159
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
G ++NVTLRQ A+ G VT GRFEILSL+GSFL + S GL++ L+G G+
Sbjct: 160 GTGTVTNVTLRQPASPGAVVTLHGRFEILSLAGSFLPPPAPPAAS---GLTIYLAGGQGQ 216
Query: 245 VLGGSVAGLLTAATPVQVVVGSF 267
V+GGSV G L A+ PV ++ SF
Sbjct: 217 VVGGSVVGALIASGPVVIMSASF 239
>gi|167600640|gb|ABZ89182.1| putative protein [Coffea canephora]
gi|326367382|gb|ADZ55300.1| DNA-binding protein [Coffea arabica]
Length = 289
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 57/145 (39%), Positives = 86/145 (59%), Gaps = 3/145 (2%)
Query: 123 SIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCI 182
+ ++ RGRPPGS + K + + HV+ V G D++ I F++ R VC+
Sbjct: 69 ATRRPRGRPPGSKNKPKPPIFVTRDSPNALRSHVMEVANGSDIAESIAQFARRRQRGVCV 128
Query: 183 LSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPD 242
LSA+G ++NVTLRQ + G + GRFEILSL+G+FL + + GL++ L+G
Sbjct: 129 LSASGTVTNVTLRQPSAPGAVMALHGRFEILSLTGAFLPGPAPPGAT---GLTIYLAGGQ 185
Query: 243 GRVLGGSVAGLLTAATPVQVVVGSF 267
G+V+GGSV G L A+ PV V+ +F
Sbjct: 186 GQVVGGSVVGSLVASGPVMVIASTF 210
>gi|324388027|gb|ADY38789.1| DNA-binding protein [Coffea arabica]
Length = 289
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 57/143 (39%), Positives = 85/143 (59%), Gaps = 3/143 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRPPGS + K + + HV+ V G D++ I F++ R VC+LS
Sbjct: 71 RRPRGRPPGSKNKPKPPIFVTRDSPNALRSHVMEVANGSDIAESIAQFARRRQRGVCVLS 130
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
A+G ++NVTLRQ + G + GRFEILSL+G+FL + + GL++ L+G G+
Sbjct: 131 ASGTVTNVTLRQPSAPGAVMALHGRFEILSLTGAFLPGPAPPGAT---GLTIYLAGGQGQ 187
Query: 245 VLGGSVAGLLTAATPVQVVVGSF 267
V+GGSV G L A+ PV V+ +F
Sbjct: 188 VVGGSVVGSLVASGPVMVIASTF 210
>gi|414869998|tpg|DAA48555.1| TPA: hypothetical protein ZEAMMB73_420043 [Zea mays]
gi|414869999|tpg|DAA48556.1| TPA: hypothetical protein ZEAMMB73_420043 [Zea mays]
gi|414870000|tpg|DAA48557.1| TPA: hypothetical protein ZEAMMB73_420043 [Zea mays]
gi|414870001|tpg|DAA48558.1| TPA: hypothetical protein ZEAMMB73_420043 [Zea mays]
Length = 269
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 60/143 (41%), Positives = 83/143 (58%), Gaps = 3/143 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRPPGS + K + + HV+ V G DV+ I F++ R VC+LS
Sbjct: 40 RRPRGRPPGSKNKPKPPIFVTRDSPNALRSHVMEVAGGADVAESIAHFARRRQRGVCVLS 99
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
G +++V LRQ A G V GRFEILSL+G+FL + + GL+V L+G G+
Sbjct: 100 GAGTVADVALRQPAAPGAVVALRGRFEILSLTGTFLPGPAPPGST---GLTVYLAGGQGQ 156
Query: 245 VLGGSVAGLLTAATPVQVVVGSF 267
V+GGSV G LTAA PV V+ +F
Sbjct: 157 VVGGSVVGTLTAAGPVMVMASTF 179
>gi|297827997|ref|XP_002881881.1| DNA-binding family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327720|gb|EFH58140.1| DNA-binding family protein [Arabidopsis lyrata subsp. lyrata]
Length = 260
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 57/144 (39%), Positives = 84/144 (58%), Gaps = 3/144 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
K+ RGRP GS + K + + + + + +G D+ + F++ R +CILS
Sbjct: 56 KRPRGRPAGSKNKPKPPIIVTHDSPNSLRANAVEISSGCDICETLSDFARRKQRGLCILS 115
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
ANG ++NVTLRQ A+SG VT GR+EILSL GS L + GL++ L+GP G+
Sbjct: 116 ANGCVTNVTLRQPASSGAIVTLHGRYEILSLLGSILPPPAPLG---ITGLTIYLAGPQGQ 172
Query: 245 VLGGSVAGLLTAATPVQVVVGSFL 268
V+GG V G L A+ PV ++ SF+
Sbjct: 173 VVGGGVVGGLIASGPVVLMAASFM 196
>gi|15228036|ref|NP_181822.1| AT-hook DNA-binding-like protein [Arabidopsis thaliana]
gi|4512661|gb|AAD21715.1| putative DNA binding protein [Arabidopsis thaliana]
gi|20197862|gb|AAM15286.1| putative DNA binding protein [Arabidopsis thaliana]
gi|38454168|gb|AAR20778.1| At2g42940 [Arabidopsis thaliana]
gi|38604060|gb|AAR24773.1| At2g42940 [Arabidopsis thaliana]
gi|119657376|tpd|FAA00287.1| TPA: AT-hook motif nuclear localized protein 16 [Arabidopsis
thaliana]
gi|330255095|gb|AEC10189.1| AT-hook DNA-binding-like protein [Arabidopsis thaliana]
Length = 257
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 57/144 (39%), Positives = 84/144 (58%), Gaps = 3/144 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
K+ RGRP GS + K + + + + + +G D+ + F++ R +CILS
Sbjct: 53 KRPRGRPAGSKNKPKPPIIVTHDSPNSLRANAVEISSGCDICETLSDFARRKQRGLCILS 112
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
ANG ++NVTLRQ A+SG VT GR+EILSL GS L + GL++ L+GP G+
Sbjct: 113 ANGCVTNVTLRQPASSGAIVTLHGRYEILSLLGSILPPPAPLG---ITGLTIYLAGPQGQ 169
Query: 245 VLGGSVAGLLTAATPVQVVVGSFL 268
V+GG V G L A+ PV ++ SF+
Sbjct: 170 VVGGGVVGGLIASGPVVLMAASFM 193
>gi|356533801|ref|XP_003535447.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Glycine max]
Length = 338
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 58/143 (40%), Positives = 85/143 (59%), Gaps = 3/143 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRPPGS + K + + HV+ + G DV+ + F++ R VC+LS
Sbjct: 116 RRPRGRPPGSKNKPKPPIFVTRDSPNALRSHVMEITGGADVAESVAQFARRRQRGVCVLS 175
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
+G+++NVTLRQ + G V GRFEILSL+G+FL + + GL+V L+G G+
Sbjct: 176 GSGSVANVTLRQPSAPGAVVALHGRFEILSLTGTFLPGPAP---PGSTGLTVYLAGGQGQ 232
Query: 245 VLGGSVAGLLTAATPVQVVVGSF 267
V+GGSV G L AA PV V+ +F
Sbjct: 233 VVGGSVVGSLVAAGPVMVIAATF 255
>gi|357482403|ref|XP_003611487.1| hypothetical protein MTR_5g014450 [Medicago truncatula]
gi|355512822|gb|AES94445.1| hypothetical protein MTR_5g014450 [Medicago truncatula]
Length = 233
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 70/207 (33%), Positives = 100/207 (48%), Gaps = 24/207 (11%)
Query: 9 NVMTSQPPASIQSMRLAFSADGTAVYKPITAT---SPTYQPSGAGGDGAIPQAQGLNVMN 65
NV+ +P + + TA PI SP Y+P + +P + ++ +
Sbjct: 19 NVILVEPNPFTNTTQTTIMEPNTAQLSPIIMNANLSPNYEPIV---NNIVPSSLNPSI-S 74
Query: 66 MGSGSEPMKRKRGRPRKYGPDGTM--SLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDS 123
+ S +E +KRKRGRPRK+ P G + SL P P+ + AT SP +
Sbjct: 75 VSSDTESIKRKRGRPRKHFPIGNIASSLGSDPGPTLASIAT--------SPSSSTCKKST 126
Query: 124 IKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQ--NGPR-AV 180
K RGRP GS KKH +E G F+PHVI V GED+ +K+ +FSQ GP +
Sbjct: 127 SGKGRGRPRGSFK-KKHLVETHGVTESCFSPHVIFVNQGEDIIAKVTAFSQAVAGPNIEI 185
Query: 181 CILSANGAISNVTLRQAATSGGTVTYE 207
CILSA+G + V L G + Y+
Sbjct: 186 CILSAHGLVGTVALHHL---GSIINYK 209
>gi|449461505|ref|XP_004148482.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Cucumis
sativus]
gi|449522823|ref|XP_004168425.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Cucumis
sativus]
Length = 271
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 57/143 (39%), Positives = 85/143 (59%), Gaps = 3/143 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRPPGS + K + + H++ V +G DV I ++++ R +CILS
Sbjct: 62 RRPRGRPPGSKNKPKPPVIITRESANTLRAHILEVGSGCDVFDCIATYARRRQRGICILS 121
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
NG ++NV LRQ +G +T +GRFEILSLSGSFL + + L++ L+G G+
Sbjct: 122 GNGMVTNVNLRQPTATGSVLTLQGRFEILSLSGSFLPPPAPPGAT---SLTIYLAGGQGQ 178
Query: 245 VLGGSVAGLLTAATPVQVVVGSF 267
V+GG+V G L AA PV ++ SF
Sbjct: 179 VVGGNVVGELVAAGPVTIIAASF 201
>gi|24418033|gb|AAN60483.1| Hypothetical protein [Oryza sativa Japonica Group]
Length = 928
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 49/120 (40%), Positives = 69/120 (57%), Gaps = 8/120 (6%)
Query: 151 GFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRF 210
G PH++ + AGE++ KI + S++ R +C+LS GA+ TL +SG T ++G
Sbjct: 731 GLQPHLLQIDAGEEIIPKITALSKSNGRVICVLSVLGAVQEATL--LLSSGVTSYHKGPL 788
Query: 211 EILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLAD 270
EI+ L GS L G L V+L+ D V+GG + G L AATPVQVVV SF +D
Sbjct: 789 EIIRLFGSILTPNDQ------GCLRVTLASGDSSVIGGVITGPLKAATPVQVVVASFYSD 842
>gi|356495537|ref|XP_003516633.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Glycine max]
Length = 250
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 65/158 (41%), Positives = 93/158 (58%), Gaps = 6/158 (3%)
Query: 112 SSPGGGPLSPDSI-KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIM 170
+ P GP D + ++ RGRP GS + K + + H++ V +G DV +
Sbjct: 31 AKPQDGPQQGDVVGRRPRGRPAGSKNKPKPPVIITRESANALRAHILEVASGCDVFESVA 90
Query: 171 SFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSES-SGQRS 229
S+++ R +CILS +G ++NV+LRQ A++G T GRFEILSL+GSFL + G S
Sbjct: 91 SYARRRQRGICILSGSGTVTNVSLRQPASAGAVATLHGRFEILSLTGSFLPPPAPPGATS 150
Query: 230 RTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
LS+ L+G G+V+GGSV G LTAA PV V+ SF
Sbjct: 151 ----LSIYLAGGQGQVVGGSVVGELTAAGPVIVIAASF 184
>gi|226509474|ref|NP_001146327.1| uncharacterized protein LOC100279903 [Zea mays]
gi|219886651|gb|ACL53700.1| unknown [Zea mays]
gi|413957232|gb|AFW89881.1| hypothetical protein ZEAMMB73_930024 [Zea mays]
Length = 573
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 68/212 (32%), Positives = 99/212 (46%), Gaps = 36/212 (16%)
Query: 74 KRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPG 133
K++RGRPR L+P P G L+ PL +RG+P
Sbjct: 98 KQRRGRPRNCD-------RLLPPPP---------GFHLAPSARAPL------PARGQPSS 135
Query: 134 SGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVT 193
G + Q G HV+ + GED+ SKI+ S+ +AVC+LS GA+ +
Sbjct: 136 RGHPFRGQFG-------GLQLHVLKIHVGEDIVSKIVQVSKITGKAVCVLSVFGAVQDCY 188
Query: 194 LRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGL 253
L +A + ++G EI+ + GS L S+S G G LS +L+ D ++GG G
Sbjct: 189 LLHSAV---ILNHKGPLEIIHVFGSILTSDSPG----FGCLSATLACGDCSLVGGIAVGP 241
Query: 254 LTAATPVQVVVGSFLADGRKESKSSHRMESLP 285
L AATPVQ +VGSF D + +K + P
Sbjct: 242 LIAATPVQAIVGSFHNDAFQANKKPKLVACYP 273
>gi|297813091|ref|XP_002874429.1| hypothetical protein ARALYDRAFT_489653 [Arabidopsis lyrata subsp.
lyrata]
gi|297320266|gb|EFH50688.1| hypothetical protein ARALYDRAFT_489653 [Arabidopsis lyrata subsp.
lyrata]
Length = 217
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 51/129 (39%), Positives = 72/129 (55%), Gaps = 24/129 (18%)
Query: 149 GVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEG 208
G FTPH++ + GEDV+ KI+ F+Q +CILSA+G+ISN +L
Sbjct: 32 GQSFTPHIVNITPGEDVAQKIVLFAQQSKHELCILSASGSISNASLSH------------ 79
Query: 209 RFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFL 268
L+ + +TGGLSV LS DG++ GG V GLL AA PVQVV+G+F
Sbjct: 80 ------------LASGTSHGGKTGGLSVCLSSSDGQIFGGGVGGLLKAAGPVQVVLGTFQ 127
Query: 269 ADGRKESKS 277
+ RK+ ++
Sbjct: 128 LEKRKDGRN 136
>gi|356536653|ref|XP_003536851.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Glycine max]
Length = 350
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 59/143 (41%), Positives = 85/143 (59%), Gaps = 3/143 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRPPGS + K + + HV+ V G DV+ + F++ R VC+LS
Sbjct: 121 RRPRGRPPGSKNKPKPPIFVTRDSPNSLRSHVMEVAGGADVAESVAQFARRRQRGVCVLS 180
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
+G+++NVTLRQ + G V GRFEILSL+G+FL + + GL+V L+G G+
Sbjct: 181 GSGSVANVTLRQPSAPGAVVALHGRFEILSLTGAFLPGPAPPGAT---GLTVYLAGGQGQ 237
Query: 245 VLGGSVAGLLTAATPVQVVVGSF 267
V+GGSV G L AA PV V+ +F
Sbjct: 238 VVGGSVVGSLVAAGPVMVIAATF 260
>gi|449459890|ref|XP_004147679.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Cucumis
sativus]
Length = 269
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 58/145 (40%), Positives = 85/145 (58%), Gaps = 3/145 (2%)
Query: 123 SIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCI 182
S ++ RGRPPGS + K + + HV+ + +G D+ I +F+Q R V +
Sbjct: 52 STRRPRGRPPGSKNKPKPPVVVTKESPDALRSHVLEIGSGSDIVESISNFAQRRQRGVSV 111
Query: 183 LSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPD 242
L NG ++NVTLR SGG +T +GRF+ILSLSG+FL + + + GL+V L+G
Sbjct: 112 LGGNGVVANVTLRHPGASGGVITLQGRFDILSLSGAFLPAPAPPGAT---GLTVYLAGGQ 168
Query: 243 GRVLGGSVAGLLTAATPVQVVVGSF 267
G+V+GG V G L A PV V+ +F
Sbjct: 169 GQVVGGIVVGALVATGPVIVIAATF 193
>gi|357494309|ref|XP_003617443.1| hypothetical protein MTR_5g091630 [Medicago truncatula]
gi|355518778|gb|AET00402.1| hypothetical protein MTR_5g091630 [Medicago truncatula]
Length = 254
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 57/147 (38%), Positives = 88/147 (59%), Gaps = 3/147 (2%)
Query: 122 DSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVC 181
+++K+ RGRP GS + K + + H + V +G DV+ +++F++ R +C
Sbjct: 44 ETLKRPRGRPAGSKNKPKPPIIVTRDSANALKAHAMEVSSGCDVNESLLNFARRKQRGLC 103
Query: 182 ILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGP 241
IL+ G ++NVTLRQ A+SG VT GRFEILSL GS L + GL++ L+G
Sbjct: 104 ILNGTGCVTNVTLRQPASSGAIVTLHGRFEILSLLGSILPPPAP---PGITGLTIYLAGA 160
Query: 242 DGRVLGGSVAGLLTAATPVQVVVGSFL 268
G+V+GG+V G L A+ PV ++ SF+
Sbjct: 161 QGQVVGGAVVGALIASGPVVIMAASFM 187
>gi|338815363|gb|AEJ08744.1| RSI2 [Solanum tuberosum]
Length = 268
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 53/158 (33%), Positives = 85/158 (53%), Gaps = 15/158 (9%)
Query: 122 DSIKKSRGRPPGSGSGKKHQLEALGSAGVG----------FTPHVITVKAGEDVSSKIMS 171
+ I++ RGRPPGS + K + + + +P+++ + G D+ I
Sbjct: 53 EVIRRPRGRPPGSKNKSKPKPKPEPNFFTAARDDHVERPTMSPYILEIPIGIDIIDSIYR 112
Query: 172 FSQNGPRAVCILSANGAISNVTLRQAATS--GGTVTYEGRFEILSLSGSFLLSESSGQRS 229
F N +CIL+ +G ++NVTL+Q + T+T+ G F ILS+S + + SE S
Sbjct: 113 FCGNQNMGLCILNRSGTVTNVTLKQPPINPADSTITFHGSFNILSISATIIPSEFS---R 169
Query: 230 RTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
G S+SL+GP G+V+GG V G L AA PV ++ +F
Sbjct: 170 VANGFSISLAGPQGQVVGGPVIGPLLAAGPVYLIATTF 207
>gi|218191918|gb|EEC74345.1| hypothetical protein OsI_09643 [Oryza sativa Indica Group]
Length = 298
Score = 84.0 bits (206), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 49/120 (40%), Positives = 69/120 (57%), Gaps = 8/120 (6%)
Query: 151 GFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRF 210
G PH++ + AGE++ KI + S++ R +C+LS GA+ TL +SG T ++G
Sbjct: 101 GLQPHLLQIDAGEEIIPKITALSKSNGRVICVLSVLGAVQEATL--LLSSGVTSYHKGPL 158
Query: 211 EILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLAD 270
EI+ L GS L G L V+L+ D V+GG + G L AATPVQVVV SF +D
Sbjct: 159 EIIRLFGSILTPNDQ------GCLRVTLASGDSSVIGGVITGPLKAATPVQVVVASFYSD 212
>gi|115450159|ref|NP_001048680.1| Os03g0105700 [Oryza sativa Japonica Group]
gi|108705733|gb|ABF93528.1| DNA-binding family protein, putative, expressed [Oryza sativa
Japonica Group]
gi|113547151|dbj|BAF10594.1| Os03g0105700 [Oryza sativa Japonica Group]
gi|222624032|gb|EEE58164.1| hypothetical protein OsJ_09085 [Oryza sativa Japonica Group]
Length = 298
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 49/120 (40%), Positives = 69/120 (57%), Gaps = 8/120 (6%)
Query: 151 GFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRF 210
G PH++ + AGE++ KI + S++ R +C+LS GA+ TL +SG T ++G
Sbjct: 101 GLQPHLLQIDAGEEIIPKITALSKSNGRVICVLSVLGAVQEATL--LLSSGVTSYHKGPL 158
Query: 211 EILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLAD 270
EI+ L GS L G L V+L+ D V+GG + G L AATPVQVVV SF +D
Sbjct: 159 EIIRLFGSILTPNDQ------GCLRVTLASGDSSVIGGVITGPLKAATPVQVVVASFYSD 212
>gi|356505681|ref|XP_003521618.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Glycine max]
Length = 310
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 58/146 (39%), Positives = 85/146 (58%), Gaps = 3/146 (2%)
Query: 122 DSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVC 181
D ++ RGRP GS + K + + HV+ + G D+ + +F++ R +C
Sbjct: 83 DMGRRPRGRPAGSKNKPKPPIIITRDSANALRSHVMEITNGCDIMESVTAFARRRQRGIC 142
Query: 182 ILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGP 241
+LS +G ++NVTLRQ A+ VT GRFEILSLSGSFL + S GL++ L+G
Sbjct: 143 LLSGSGTVTNVTLRQPASPSAVVTLHGRFEILSLSGSFLPPPAPPAAS---GLAIYLAGG 199
Query: 242 DGRVLGGSVAGLLTAATPVQVVVGSF 267
G+V+GGSV G L A+ PV ++ SF
Sbjct: 200 QGQVVGGSVVGPLVASGPVVIMAASF 225
>gi|224125680|ref|XP_002319649.1| predicted protein [Populus trichocarpa]
gi|222858025|gb|EEE95572.1| predicted protein [Populus trichocarpa]
Length = 284
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/143 (37%), Positives = 86/143 (60%), Gaps = 3/143 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRPPGS + K + + HV+ + +G D++ + F++ R VC+LS
Sbjct: 72 RRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEIASGSDIAENLACFARKRQRGVCVLS 131
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
+G ++NVTL+Q + SG + GRFEILSL+G+FL + + GL++ L+G G+
Sbjct: 132 GSGMVTNVTLKQPSASGAVMALHGRFEILSLTGAFLPGPAPPGAT---GLTIYLAGGQGQ 188
Query: 245 VLGGSVAGLLTAATPVQVVVGSF 267
V+GGSV G L A+ PV V+ +F
Sbjct: 189 VVGGSVVGSLVASGPVMVIAATF 211
>gi|119331584|gb|ABL63118.1| AT-hook DNA-binding protein [Catharanthus roseus]
Length = 293
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 56/143 (39%), Positives = 84/143 (58%), Gaps = 3/143 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + HV+ + G D+ + +F++ R VCI+S
Sbjct: 64 RRPRGRPAGSKNKPKPPIIITRDSANALRTHVMEIADGCDIMESVATFARRRQRGVCIMS 123
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
+G ++NVTLRQ A+ G VT GRFEILSL+GSFL + + L++ L+G G+
Sbjct: 124 GSGTVTNVTLRQPASPGAVVTLHGRFEILSLAGSFLPPPAPPAAT---SLTIYLAGGQGQ 180
Query: 245 VLGGSVAGLLTAATPVQVVVGSF 267
V+GGSV G L A+ PV ++ SF
Sbjct: 181 VVGGSVVGALLASGPVVIMAASF 203
>gi|359485201|ref|XP_002279677.2| PREDICTED: putative DNA-binding protein ESCAROLA-like [Vitis
vinifera]
Length = 268
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 59/147 (40%), Positives = 87/147 (59%), Gaps = 3/147 (2%)
Query: 122 DSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVC 181
++ ++ RGRP GS + K + + H + V +G DVS + +F++ R +C
Sbjct: 60 EATRRPRGRPAGSKNKPKPPIIITRDSANALRAHAMEVSSGCDVSESLANFARRKQRGIC 119
Query: 182 ILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGP 241
ILS +G ++NVTLRQ A+SG VT GRFEILSL GS L + + GL++ L+G
Sbjct: 120 ILSGSGCVTNVTLRQPASSGAIVTLHGRFEILSLLGSILPPPAPPGIT---GLTIYLAGA 176
Query: 242 DGRVLGGSVAGLLTAATPVQVVVGSFL 268
G+V+GG V G L A+ PV V+ SF+
Sbjct: 177 QGQVVGGGVVGALIASGPVFVMAASFM 203
>gi|357489975|ref|XP_003615275.1| hypothetical protein MTR_5g066020 [Medicago truncatula]
gi|355516610|gb|AES98233.1| hypothetical protein MTR_5g066020 [Medicago truncatula]
Length = 252
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 51/131 (38%), Positives = 75/131 (57%), Gaps = 10/131 (7%)
Query: 153 TPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQ-AATSGGTVTYEGRFE 211
+PH++ + G DV I FS +C+L+ +G ++NVTLRQ + G TVT+ GRF
Sbjct: 81 SPHILEIPEGSDVVEAISRFSNRRKTGLCVLTGSGTVANVTLRQPSGPPGTTVTFHGRFN 140
Query: 212 ILSLSGSFLLS-ESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLAD 270
ILS+S +F ESS ++ S+SL+ P G+++GG V G L AA V V+ SF
Sbjct: 141 ILSISATFFSPLESSPPMNKE--FSISLAAPQGQIVGGFVVGPLLAAGTVFVIAASF--- 195
Query: 271 GRKESKSSHRM 281
+ S HR+
Sbjct: 196 ---NNPSYHRL 203
>gi|356539879|ref|XP_003538420.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Glycine max]
Length = 289
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 53/113 (46%), Positives = 74/113 (65%), Gaps = 3/113 (2%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILS 214
HV+ + +G DV+ I +F+ R V +LS +G ++NVTLRQ A G +T GRFEILS
Sbjct: 106 HVLEITSGSDVAESIAAFANRRHRGVSVLSGSGIVANVTLRQPAAPAGVITLHGRFEILS 165
Query: 215 LSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
LSG+FL S S + GL+V L+G G+V+GG+VAG L A+ PV V+ +F
Sbjct: 166 LSGAFLPSPSPPGAT---GLTVYLAGGQGQVVGGTVAGSLVASGPVMVIAATF 215
>gi|226528096|ref|NP_001152438.1| DNA-binding protein [Zea mays]
gi|195656315|gb|ACG47625.1| DNA-binding protein [Zea mays]
gi|342899431|gb|AEL78914.1| barren stalk fastigiate1-related-1 [Zea mays]
gi|414885815|tpg|DAA61829.1| TPA: DNA-binding protein isoform 1 [Zea mays]
gi|414885816|tpg|DAA61830.1| TPA: DNA-binding protein isoform 2 [Zea mays]
Length = 351
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 51/141 (36%), Positives = 75/141 (53%), Gaps = 11/141 (7%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSA--GVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCI 182
KK RGRPPGS + K + A PHVI + G DV+ + F+ +C+
Sbjct: 90 KKRRGRPPGSKNKPKPPVVITREAEPAAAMRPHVIEIPCGCDVADALARFAARRNLGICV 149
Query: 183 LSANGAISNVTLRQAATSGG-----TVTYEGRFEILSLSGSFLLSESSGQRSRTGG---- 233
L+ GA++NV+LR GG + G++EILS+S +FL S +
Sbjct: 150 LAGTGAVANVSLRHPMPCGGGGAPTAIMLHGQYEILSISATFLPPAISAVAPQAAAAAAC 209
Query: 234 LSVSLSGPDGRVLGGSVAGLL 254
LS+SL+GP G+++GG+VAG L
Sbjct: 210 LSISLAGPHGQIVGGAVAGPL 230
>gi|119331588|gb|ABL63120.1| AT-hook DNA-binding protein [Catharanthus roseus]
Length = 335
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 61/162 (37%), Positives = 95/162 (58%), Gaps = 5/162 (3%)
Query: 111 LSSPGGGPLSPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIM 170
LSS G S + +++ RGRPPGS + K + A +P+V+ + G D+ I
Sbjct: 86 LSSGGNDGASIEVVRRPRGRPPGSKNKPKPPVIITRDAEPSMSPYVLELPGGIDIVESIT 145
Query: 171 SFSQNGPRAVCILSANGAISNVTLRQAATS-GGTVTYEGRFEILSLSGSFL----LSESS 225
SF + +CIL+ +G ++NVTLRQ +T+ G +VT+ GRF+ILSLS + + LS +
Sbjct: 146 SFCRKRNMGLCILNGSGTVTNVTLRQPSTTPGASVTFHGRFDILSLSATVIPSNTLSAIA 205
Query: 226 GQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
G ++SL+GP G+V+GG+V G L +A V ++ SF
Sbjct: 206 LSNGIANGFTISLAGPQGQVVGGAVVGSLFSAGTVYLIAASF 247
>gi|225457666|ref|XP_002273442.1| PREDICTED: putative DNA-binding protein ESCAROLA [Vitis vinifera]
Length = 292
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 62/144 (43%), Positives = 86/144 (59%), Gaps = 4/144 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRPPGS + K + + HV+ V AG DV ++++++ R VC+LS
Sbjct: 72 RRPRGRPPGSKNKPKPPIIVTRDSPNALRSHVLEVAAGADVMESVLNYARRRGRGVCVLS 131
Query: 185 ANGAISNVTLRQAAT-SGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDG 243
G + NVTLRQ A+ +G VT GRFEILSLSG+ L + GGLS+ LSG G
Sbjct: 132 GGGTVMNVTLRQPASPAGSIVTLHGRFEILSLSGTVLPPPAPPS---AGGLSIFLSGGQG 188
Query: 244 RVLGGSVAGLLTAATPVQVVVGSF 267
+V+GGSV G L A+ PV ++ SF
Sbjct: 189 QVVGGSVVGPLMASGPVVLMAASF 212
>gi|413953880|gb|AFW86529.1| hypothetical protein ZEAMMB73_546585 [Zea mays]
Length = 309
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 57/146 (39%), Positives = 83/146 (56%), Gaps = 6/146 (4%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
+K RGRP GS + K + + H++ V AG D+ + +++ R VC+LS
Sbjct: 74 RKPRGRPLGSKNKPKPPIIITRDSPNALHSHLLEVAAGADIVECVSEYARRRCRGVCVLS 133
Query: 185 ANGAISNVTLRQ-AATSGGTV--TYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGP 241
GA+SN+ LRQ A G++ T G+FEILSL+G+ L + S LSV ++G
Sbjct: 134 GGGAVSNLALRQPGADPPGSLLATLRGQFEILSLTGTVLPPPAPPGASN---LSVYVAGG 190
Query: 242 DGRVLGGSVAGLLTAATPVQVVVGSF 267
G+V+GGSVAG L AA PV ++ SF
Sbjct: 191 QGQVMGGSVAGQLIAAGPVVLMAASF 216
>gi|224126485|ref|XP_002329566.1| predicted protein [Populus trichocarpa]
gi|222870275|gb|EEF07406.1| predicted protein [Populus trichocarpa]
Length = 298
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 61/144 (42%), Positives = 88/144 (61%), Gaps = 5/144 (3%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRPPGS + +K + + H++ V +G DV + ++++ R +CILS
Sbjct: 79 RRPRGRPPGSKNKEKPPIIITRESANTLRAHILEVGSGCDVFECVGNYARRRQRGICILS 138
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSES-SGQRSRTGGLSVSLSGPDG 243
G ++NV++RQ A +G VT GRFEILSLSGSFL + G S L++ L+G G
Sbjct: 139 GAGTVTNVSIRQPAAAGSIVTLHGRFEILSLSGSFLPPPAPPGATS----LTIFLAGGQG 194
Query: 244 RVLGGSVAGLLTAATPVQVVVGSF 267
+V+GGSV G LTAA PV V+ SF
Sbjct: 195 QVVGGSVVGELTAAGPVIVIAASF 218
>gi|414585689|tpg|DAA36260.1| TPA: hypothetical protein ZEAMMB73_652841 [Zea mays]
Length = 347
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 67/169 (39%), Positives = 97/169 (57%), Gaps = 7/169 (4%)
Query: 124 IKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCIL 183
+++ RGRPPGS + K + + H++ V +G DV + ++++ R VC+L
Sbjct: 83 VRRPRGRPPGSKNKPKPPVIITRESANTLRAHILEVASGCDVFESVSTYARRRQRGVCVL 142
Query: 184 SANGAISNVTLRQ-AATSGGTVTYEGRFEILSLSGSFLLSES-SGQRSRTGGLSVSLSGP 241
S +G ++NVTLRQ +A +G VT GRFEILSLSGSFL + G S L++ L+G
Sbjct: 143 SGSGVVTNVTLRQPSAPAGAVVTLHGRFEILSLSGSFLPPPAPPGATS----LTIFLAGG 198
Query: 242 DGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVPPKL 290
G+V+GG+V G L AA PV V+ SF A+ E E+ PP L
Sbjct: 199 QGQVVGGNVVGALYAAGPVIVIAASF-ANVAYERLPLEEEEAQAAPPGL 246
>gi|115466262|ref|NP_001056730.1| Os06g0136900 [Oryza sativa Japonica Group]
gi|55296989|dbj|BAD68464.1| putative AT-hook protein 2 [Oryza sativa Japonica Group]
gi|113594770|dbj|BAF18644.1| Os06g0136900 [Oryza sativa Japonica Group]
gi|125553962|gb|EAY99567.1| hypothetical protein OsI_21541 [Oryza sativa Indica Group]
gi|215741551|dbj|BAG98046.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 328
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 60/152 (39%), Positives = 88/152 (57%), Gaps = 4/152 (2%)
Query: 117 GPLSPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNG 176
G +P I++ RGRP GS + K + + HV+ V +G D+ + +F++
Sbjct: 103 GAAAPVVIRRPRGRPAGSKNKPKPPVIITRDSASALRAHVLEVASGCDLVDSVATFARRR 162
Query: 177 PRAVCILSANGAISNVTLRQ-AATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLS 235
VC+LSA GA++NV++RQ A G V GRF+ILSLSGSFL + + GL+
Sbjct: 163 QVGVCVLSATGAVTNVSVRQPGAGPGAVVNLTGRFDILSLSGSFLPPPAPPSAT---GLT 219
Query: 236 VSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
V +SG G+V+GG+VAG L A PV ++ SF
Sbjct: 220 VYVSGGQGQVVGGTVAGPLIAVGPVVIMAASF 251
>gi|326503874|dbj|BAK02723.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 312
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 81/219 (36%), Positives = 112/219 (51%), Gaps = 16/219 (7%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRPPGS + K + + H++ V +G DV I +++ R VC+LS
Sbjct: 87 RRPRGRPPGSKNKPKPPVIITRESANTLRAHILEVGSGCDVFECISTYACRRQRGVCVLS 146
Query: 185 ANGAISNVTLRQ-AATSGGTVTYEGRFEILSLSGSFLLSES-SGQRSRTGGLSVSLSGPD 242
+G ++NVTLRQ +A +G VT GRFEILSLSGSFL + G S L++ L+G
Sbjct: 147 GSGIVTNVTLRQPSAPAGAVVTLHGRFEILSLSGSFLPPPAPPGATS----LTIFLAGGQ 202
Query: 243 GRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVPPKLAPGGQPAGQCSP 302
G+V+GG+V G L AA PV V+ SF ++ E LP+ + AP Q
Sbjct: 203 GQVVGGNVVGALYAAGPVIVIAASF---------ANVAYERLPLEDEEAPPATAGMQMQQ 253
Query: 303 PSRGTLSESSGG-PGSPLNHSTGACNNNHLPQGMATGIP 340
PS + GG P P + G N LP TG P
Sbjct: 254 PSDADPAAGMGGVPFPPDPSAAGLPFFNQLPLNNMTGGP 292
>gi|242093622|ref|XP_002437301.1| hypothetical protein SORBIDRAFT_10g024540 [Sorghum bicolor]
gi|241915524|gb|EER88668.1| hypothetical protein SORBIDRAFT_10g024540 [Sorghum bicolor]
Length = 270
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 51/144 (35%), Positives = 79/144 (54%), Gaps = 8/144 (5%)
Query: 124 IKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCIL 183
+++ RGRP GS + K + + HV+ V G DVS+ + +++ R VC+L
Sbjct: 60 LRRPRGRPLGSKNKPKPPVIITRDSPDALHSHVLEVAPGADVSACVAEYARRRGRGVCVL 119
Query: 184 SANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDG 243
A+GA+ +V +R G T GRFE+LS++G+ L + + S GL+V +S G
Sbjct: 120 GASGAVGDVAVR-----GATAPLRGRFELLSVTGTVLPPPAPPEAS---GLAVLVSAGQG 171
Query: 244 RVLGGSVAGLLTAATPVQVVVGSF 267
+VLGG V G L AA PV + +F
Sbjct: 172 QVLGGCVVGPLVAAGPVTIFAATF 195
>gi|296086196|emb|CBI31637.3| unnamed protein product [Vitis vinifera]
Length = 300
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 54/113 (47%), Positives = 77/113 (68%), Gaps = 3/113 (2%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILS 214
HV+ + +G D++ I +F+Q R V +LSA+G ++NVTLRQ A GG +T +GRFEILS
Sbjct: 113 HVLEISSGSDIAESIANFAQRRHRGVSVLSASGIVNNVTLRQPAAPGGVITLQGRFEILS 172
Query: 215 LSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
LSG+FL + S + GL+V L+G G+V+GGSV G L A+ PV V+ +F
Sbjct: 173 LSGAFLPAPSPPGAT---GLTVYLAGGQGQVVGGSVVGALMASGPVIVIAATF 222
>gi|147861256|emb|CAN83987.1| hypothetical protein VITISV_032602 [Vitis vinifera]
Length = 282
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 59/144 (40%), Positives = 87/144 (60%), Gaps = 5/144 (3%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRPPGS + K + + H++ V G DV + ++++ R +C+LS
Sbjct: 70 RRPRGRPPGSKNKPKPPVIITRESANTLRAHILEVGNGCDVFDCVATYARRRQRGICVLS 129
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSES-SGQRSRTGGLSVSLSGPDG 243
+G ++NV++RQ A +G +T GRFEILSLSGSFL + G S L++ L+G G
Sbjct: 130 GSGTVTNVSIRQPAAAGAILTLHGRFEILSLSGSFLPPPAPPGATS----LTIFLAGGQG 185
Query: 244 RVLGGSVAGLLTAATPVQVVVGSF 267
+V+GGSV G LTAA PV V+ SF
Sbjct: 186 QVVGGSVVGELTAAGPVIVIAASF 209
>gi|225449426|ref|XP_002277930.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Vitis
vinifera]
Length = 327
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 54/113 (47%), Positives = 77/113 (68%), Gaps = 3/113 (2%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILS 214
HV+ + +G D++ I +F+Q R V +LSA+G ++NVTLRQ A GG +T +GRFEILS
Sbjct: 140 HVLEISSGSDIAESIANFAQRRHRGVSVLSASGIVNNVTLRQPAAPGGVITLQGRFEILS 199
Query: 215 LSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
LSG+FL + S + GL+V L+G G+V+GGSV G L A+ PV V+ +F
Sbjct: 200 LSGAFLPAPSPPGAT---GLTVYLAGGQGQVVGGSVVGALMASGPVIVIAATF 249
>gi|147780475|emb|CAN75757.1| hypothetical protein VITISV_028561 [Vitis vinifera]
Length = 293
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 54/113 (47%), Positives = 77/113 (68%), Gaps = 3/113 (2%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILS 214
HV+ + +G D++ I +F+Q R V +LSA+G ++NVTLRQ A GG +T +GRFEILS
Sbjct: 106 HVLEISSGSDIAESIANFAQRRHRGVSVLSASGIVNNVTLRQPAAPGGVITLQGRFEILS 165
Query: 215 LSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
LSG+FL + S + GL+V L+G G+V+GGSV G L A+ PV V+ +F
Sbjct: 166 LSGAFLPAPSPPGAT---GLTVYLAGGQGQVVGGSVVGALMASGPVIVIAATF 215
>gi|255557593|ref|XP_002519826.1| ESC, putative [Ricinus communis]
gi|223540872|gb|EEF42430.1| ESC, putative [Ricinus communis]
Length = 289
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 62/144 (43%), Positives = 86/144 (59%), Gaps = 5/144 (3%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRPPGS + K + + H++ V G DV I ++++ R +CILS
Sbjct: 73 RRPRGRPPGSRNKPKPPVIITRESANTLRAHILEVGNGCDVFECISNYARRRQRGICILS 132
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSES-SGQRSRTGGLSVSLSGPDG 243
G ++NV++RQ A +G VT GRFEILSLSGSFL + G S L++ L+G G
Sbjct: 133 GAGTVTNVSIRQPAAAGAVVTLHGRFEILSLSGSFLPPPAPPGATS----LTIFLAGGQG 188
Query: 244 RVLGGSVAGLLTAATPVQVVVGSF 267
+V+GGSV G LTAA PV V+ SF
Sbjct: 189 QVVGGSVVGELTAAGPVIVIAASF 212
>gi|242094584|ref|XP_002437782.1| hypothetical protein SORBIDRAFT_10g002490 [Sorghum bicolor]
gi|241916005|gb|EER89149.1| hypothetical protein SORBIDRAFT_10g002490 [Sorghum bicolor]
Length = 349
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 59/145 (40%), Positives = 84/145 (57%), Gaps = 4/145 (2%)
Query: 124 IKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCIL 183
+++ RGRP GS + K + + HV+ V AG DV I F++ VC+L
Sbjct: 122 MRRPRGRPAGSKNKPKPPVIITRDSASALRAHVLEVAAGCDVVDSIAGFARRRQVGVCVL 181
Query: 184 SANGAISNVTLRQA-ATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPD 242
SA+G+++NV +R + A G VT G F+ILSLSGSFL + + GL+V LSG
Sbjct: 182 SASGSVANVCIRHSGAAPGAVVTMAGCFDILSLSGSFLPPPAPPAAT---GLTVYLSGGQ 238
Query: 243 GRVLGGSVAGLLTAATPVQVVVGSF 267
G+V+GG+VAG L A+ PV +V F
Sbjct: 239 GQVVGGTVAGPLLASGPVVIVAACF 263
>gi|242076974|ref|XP_002448423.1| hypothetical protein SORBIDRAFT_06g026940 [Sorghum bicolor]
gi|241939606|gb|EES12751.1| hypothetical protein SORBIDRAFT_06g026940 [Sorghum bicolor]
Length = 312
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 67/168 (39%), Positives = 96/168 (57%), Gaps = 7/168 (4%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRPPGS + K + + H++ V +G DV + ++++ R VC+LS
Sbjct: 87 RRPRGRPPGSKNKPKPPVIITRESANTLRAHILEVGSGCDVFESVSTYARRRQRGVCVLS 146
Query: 185 ANGAISNVTLRQ-AATSGGTVTYEGRFEILSLSGSFLLSES-SGQRSRTGGLSVSLSGPD 242
+G ++NVTLRQ +A +G VT GRFEILSLSGSFL + G S L++ L+G
Sbjct: 147 GSGVVTNVTLRQPSAPTGAVVTLHGRFEILSLSGSFLPPPAPPGATS----LTIFLAGGQ 202
Query: 243 GRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVPPKL 290
G+V+GG+V G L AA PV V+ SF A+ E E+ PP L
Sbjct: 203 GQVVGGNVVGALYAAGPVIVIAASF-ANVAYERLPLEEEEAQAAPPGL 249
>gi|225427274|ref|XP_002281411.1| PREDICTED: putative DNA-binding protein ESCAROLA [Vitis vinifera]
Length = 282
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 59/144 (40%), Positives = 87/144 (60%), Gaps = 5/144 (3%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRPPGS + K + + H++ V G DV + ++++ R +C+LS
Sbjct: 70 RRPRGRPPGSKNRPKPPVIITRESANTLRAHILEVGNGCDVFDCVATYARRRQRGICVLS 129
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSES-SGQRSRTGGLSVSLSGPDG 243
+G ++NV++RQ A +G +T GRFEILSLSGSFL + G S L++ L+G G
Sbjct: 130 GSGTVTNVSIRQPAAAGAILTLHGRFEILSLSGSFLPPPAPPGATS----LTIFLAGGQG 185
Query: 244 RVLGGSVAGLLTAATPVQVVVGSF 267
+V+GGSV G LTAA PV V+ SF
Sbjct: 186 QVVGGSVVGELTAAGPVIVIAASF 209
>gi|357498723|ref|XP_003619650.1| hypothetical protein MTR_6g060670 [Medicago truncatula]
gi|355494665|gb|AES75868.1| hypothetical protein MTR_6g060670 [Medicago truncatula]
Length = 305
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 77/228 (33%), Positives = 114/228 (50%), Gaps = 30/228 (13%)
Query: 44 YQPSGAGGDGAIPQAQGLNVMNMGSGSEP----MKRKRGRPRKYGPDGTMSLALVPSPSS 99
+ A D A+GL + + S P ++KRGRP +Y +L SP
Sbjct: 86 FDSKQAAKDACSGPAKGLPMQTLESAPAPNCTKERKKRGRPLQYELGSKAAL----SPMP 141
Query: 100 VTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITV 159
V+ A TG +S G RG G + S G F+ H V
Sbjct: 142 VSFAFPMTGEFSASNRG-----------RGLNDFKDDGPSN------SIGSHFSHHAFIV 184
Query: 160 KAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSF 219
+GEDV+S+I + + +A+ +LS +G+IS+VT+ + + T+ YEG F++LSL+GSF
Sbjct: 185 NSGEDVASRISLLALDF-QAISVLSGSGSISSVTIDMSDSGIETLKYEGIFDLLSLTGSF 243
Query: 220 LLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
E + +G L+VSL+ GRV+ G +AG L AA PV+VVV SF
Sbjct: 244 ---EPNKDGLVSGKLTVSLA-IGGRVIQGPLAGSLVAAGPVKVVVASF 287
>gi|15225475|ref|NP_182067.1| AT-hook motif nuclear-localized protein 22 [Arabidopsis thaliana]
gi|2583112|gb|AAB82621.1| putative AT-hook DNA-binding protein [Arabidopsis thaliana]
gi|50198795|gb|AAT70431.1| At2g45430 [Arabidopsis thaliana]
gi|56121926|gb|AAV74244.1| At2g45430 [Arabidopsis thaliana]
gi|119657388|tpd|FAA00293.1| TPA: AT-hook motif nuclear localized protein 22 [Arabidopsis
thaliana]
gi|225898599|dbj|BAH30430.1| hypothetical protein [Arabidopsis thaliana]
gi|330255458|gb|AEC10552.1| AT-hook motif nuclear-localized protein 22 [Arabidopsis thaliana]
Length = 317
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 60/147 (40%), Positives = 84/147 (57%), Gaps = 7/147 (4%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + HV+ V G DV + F++ R +C+LS
Sbjct: 89 RRPRGRPAGSKNKPKPPIIITRDSANALKSHVMEVANGCDVMESVTVFARRRQRGICVLS 148
Query: 185 ANGAISNVTLRQAATSGG----TVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSG 240
NGA++NVT+RQ A+ G V GRFEILSLSGSFL + S GL++ L+G
Sbjct: 149 GNGAVTNVTIRQPASVPGGGSSVVNLHGRFEILSLSGSFLPPPAPPAAS---GLTIYLAG 205
Query: 241 PDGRVLGGSVAGLLTAATPVQVVVGSF 267
G+V+GGSV G L A+ PV ++ SF
Sbjct: 206 GQGQVVGGSVVGPLMASGPVVIMAASF 232
>gi|297824593|ref|XP_002880179.1| hypothetical protein ARALYDRAFT_903987 [Arabidopsis lyrata subsp.
lyrata]
gi|297326018|gb|EFH56438.1| hypothetical protein ARALYDRAFT_903987 [Arabidopsis lyrata subsp.
lyrata]
Length = 318
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 60/147 (40%), Positives = 84/147 (57%), Gaps = 7/147 (4%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + HV+ V G DV + F++ R +C+LS
Sbjct: 88 RRPRGRPAGSKNKPKPPIIITRDSANALKSHVMEVANGCDVMESVTVFARRRQRGICVLS 147
Query: 185 ANGAISNVTLRQAATSGG----TVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSG 240
NGA++NVT+RQ A+ G V GRFEILSLSGSFL + S GL++ L+G
Sbjct: 148 GNGAVTNVTIRQPASVPGGGSSVVNLHGRFEILSLSGSFLPPPAPPAAS---GLTIYLAG 204
Query: 241 PDGRVLGGSVAGLLTAATPVQVVVGSF 267
G+V+GGSV G L A+ PV ++ SF
Sbjct: 205 GQGQVVGGSVVGPLMASGPVVIMAASF 231
>gi|226502634|ref|NP_001151240.1| DNA-binding protein [Zea mays]
gi|195645262|gb|ACG42099.1| DNA-binding protein [Zea mays]
gi|413921737|gb|AFW61669.1| DNA-binding protein [Zea mays]
Length = 265
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 57/143 (39%), Positives = 81/143 (56%), Gaps = 3/143 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRPPGS + K + + HV+ V G DV+ I F++ R VC+LS
Sbjct: 39 RRPRGRPPGSKNKPKPPIFVTRDSPNALRSHVMEVAGGADVAESIAHFARRRQRGVCVLS 98
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
G +++V LRQ G V GRFEILS++G+FL + + GL+V L+G G+
Sbjct: 99 GAGTVTDVALRQPTAPGAVVALRGRFEILSITGTFLPGPAPPGST---GLTVYLAGGQGQ 155
Query: 245 VLGGSVAGLLTAATPVQVVVGSF 267
V+GGSV G L AA PV V+ +F
Sbjct: 156 VVGGSVVGTLIAAGPVMVMASTF 178
>gi|93212583|gb|ABF01666.1| AT-hook1 protein [Capsicum annuum]
Length = 257
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 48/154 (31%), Positives = 82/154 (53%), Gaps = 10/154 (6%)
Query: 122 DSIKKSRGRPPGSGSGKKHQLEALGSAGVG------FTPHVITVKAGEDVSSKIMSFSQN 175
+ +++ RGRPPGS + K + + +P+++ + G D+ + F +
Sbjct: 67 EVVRRPRGRPPGSKNKPKPAPNYITTTRDDHMEKSTMSPYILEIPLGVDIIDSVYRFCRK 126
Query: 176 GPRAVCILSANGAISNVTLRQAATSG--GTVTYEGRFEILSLSGSFLLSESSGQRSRTGG 233
+CI++ +G ++NVTLRQ T+ T+T+ G F ILS+S + + S G
Sbjct: 127 HNTGLCIINGSGTVTNVTLRQPFTNNPDSTITFHGNFNILSISATII--PQSIFSKVLNG 184
Query: 234 LSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
S+SL+GP G+V+GG V L +A PV ++ SF
Sbjct: 185 FSISLAGPQGQVVGGPVIRPLLSAGPVYLIAASF 218
>gi|255580141|ref|XP_002530902.1| DNA binding protein, putative [Ricinus communis]
gi|223529524|gb|EEF31478.1| DNA binding protein, putative [Ricinus communis]
Length = 251
Score = 80.9 bits (198), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 74/135 (54%), Gaps = 7/135 (5%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
+K RGRPPGS + K + + P ++ + AG D+ I++F++ + ++S
Sbjct: 30 RKPRGRPPGSKNKPKPPIVITKDSDSAMKPVILEISAGSDIIDSIINFARRNHSGISVIS 89
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQR-------SRTGGLSVS 237
A G++SNVTLR + +++ G F ILSLSG+FL S + Q S + +S
Sbjct: 90 ATGSVSNVTLRHPLSHAPSLSLHGPFNILSLSGTFLGSFTPKQSAGSSSVGSPSCCFGIS 149
Query: 238 LSGPDGRVLGGSVAG 252
L+G G+V GG VAG
Sbjct: 150 LAGAQGQVFGGIVAG 164
>gi|356541471|ref|XP_003539199.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Glycine max]
Length = 250
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 40/96 (41%), Positives = 59/96 (61%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + H++ V G DV + S+++ R +CILS
Sbjct: 48 RRPRGRPAGSKNKPKPPVIITRESANTLRAHILEVANGCDVFESVASYARRRQRGICILS 107
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
+G ++NV+LRQ A++G VT GRFEILSL+GSFL
Sbjct: 108 GSGTVTNVSLRQPASAGAVVTLHGRFEILSLTGSFL 143
>gi|449531705|ref|XP_004172826.1| PREDICTED: LOW QUALITY PROTEIN: putative DNA-binding protein
ESCAROLA-like [Cucumis sativus]
Length = 303
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 40/95 (42%), Positives = 55/95 (57%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + HVI V G D+ + +F++ R VCI+S
Sbjct: 69 RRPRGRPAGSKNKPKPPIIITRDSANALRTHVIEVTDGCDIVDSVATFARRRQRGVCIMS 128
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSF 219
G ++NVTLRQ A+ G V GRFEILSL+GSF
Sbjct: 129 GTGTVTNVTLRQPASPGAIVNLHGRFEILSLAGSF 163
>gi|449459662|ref|XP_004147565.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Cucumis
sativus]
Length = 303
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 40/95 (42%), Positives = 55/95 (57%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + HVI V G D+ + +F++ R VCI+S
Sbjct: 69 RRPRGRPAGSKNKPKPPIIITRDSANALRTHVIEVTDGCDIVDSVATFARRRQRGVCIMS 128
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSF 219
G ++NVTLRQ A+ G V GRFEILSL+GSF
Sbjct: 129 GTGTVTNVTLRQPASPGAIVNLHGRFEILSLAGSF 163
>gi|357117633|ref|XP_003560568.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Brachypodium
distachyon]
Length = 291
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 53/147 (36%), Positives = 83/147 (56%), Gaps = 6/147 (4%)
Query: 124 IKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCIL 183
++K RGRP GS + K + + HV+ V G DVS+ + +++ R VC+L
Sbjct: 73 LRKPRGRPLGSKNKPKPPVIITRDSPDALHSHVLEVSPGADVSACVAQYARARGRGVCVL 132
Query: 184 SANGAISNVTLRQ--AATSGGT-VTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSG 240
A+G +++V +R A +G +T GRFE+LS++G+ L + + S GL+V L+
Sbjct: 133 GASGTVADVAVRVPGAPAAGALPLTLPGRFELLSVTGTVLPPPAPAEAS---GLAVLLAA 189
Query: 241 PDGRVLGGSVAGLLTAATPVQVVVGSF 267
G+VLGG V G L AATPV + +F
Sbjct: 190 GQGQVLGGRVVGPLVAATPVTLFAATF 216
>gi|255561901|ref|XP_002521959.1| DNA binding protein, putative [Ricinus communis]
gi|223538763|gb|EEF40363.1| DNA binding protein, putative [Ricinus communis]
Length = 299
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 55/156 (35%), Positives = 87/156 (55%), Gaps = 3/156 (1%)
Query: 112 SSPGGGPLSPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMS 171
P G + + ++ RGRP GS + K + + HV+ + G D++ +
Sbjct: 65 DEPKEGAIEVATHRRPRGRPAGSKNKPKPPIFVTRDSPNALKSHVMEIANGSDIAESLAC 124
Query: 172 FSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRT 231
F++ R VC+LS +G ++NVTL+Q + G + GRFEILSL+G+FL + +
Sbjct: 125 FARKKQRGVCVLSGSGMVTNVTLKQPSAPGAVMALHGRFEILSLTGAFLPGPAPPGAT-- 182
Query: 232 GGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
GL++ L+G G+V+GGSV G LTA PV V+ +F
Sbjct: 183 -GLTIYLAGGQGQVVGGSVVGSLTATGPVMVIAATF 217
>gi|356512004|ref|XP_003524711.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Glycine max]
Length = 276
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 56/143 (39%), Positives = 85/143 (59%), Gaps = 3/143 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + H++ V +G DV + ++++ R +C+LS
Sbjct: 74 RRPRGRPSGSKNKPKPPVIITRESANTLRAHILEVGSGSDVFDCVTAYARRRQRGICVLS 133
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
+G ++NV+LRQ A +G V GRFEILSLSGSFL + + L++ L+G G+
Sbjct: 134 GSGTVTNVSLRQPAAAGAVVRLHGRFEILSLSGSFLPPPAPPGAT---SLTIYLAGGQGQ 190
Query: 245 VLGGSVAGLLTAATPVQVVVGSF 267
V+GG+V G LTAA PV V+ SF
Sbjct: 191 VVGGNVVGELTAAGPVIVIAASF 213
>gi|413942786|gb|AFW75435.1| hypothetical protein ZEAMMB73_958269 [Zea mays]
Length = 485
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 47/113 (41%), Positives = 67/113 (59%), Gaps = 6/113 (5%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILS 214
HV+ V AG DV + +F++ VC+LS G+++NV +R + T T GRFE+LS
Sbjct: 298 HVLEVAAGCDVVGSVAAFARRRQVGVCVLSGAGSVANVRIRNQPGAVVTTTLAGRFEVLS 357
Query: 215 LSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
L GSFL ++ GL+V LS G+V+GG+VAG L A+ PV +V F
Sbjct: 358 LCGSFLPPLAA------TGLTVYLSAGQGQVVGGAVAGPLVASGPVVIVAACF 404
>gi|413954758|gb|AFW87407.1| hypothetical protein ZEAMMB73_125178 [Zea mays]
Length = 271
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 50/144 (34%), Positives = 78/144 (54%), Gaps = 8/144 (5%)
Query: 124 IKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCIL 183
+++ RGRP GS + K + + HV+ V G DV + + +++ R VC+L
Sbjct: 69 LRRPRGRPLGSKNKPKPPVIITRDSPDALHSHVLEVSPGADVCACVAEYARRRGRGVCVL 128
Query: 184 SANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDG 243
A+GA+ +V +R G GRFE+LS++G+ L + + S GL+V +S G
Sbjct: 129 GASGAVGDVAVR-----GAAAPLRGRFELLSVTGTVLPPPAPPEAS---GLAVLVSAGQG 180
Query: 244 RVLGGSVAGLLTAATPVQVVVGSF 267
+VLGGSV G L AA PV + +F
Sbjct: 181 QVLGGSVVGPLVAAGPVTIFAATF 204
>gi|255537141|ref|XP_002509637.1| ESC, putative [Ricinus communis]
gi|223549536|gb|EEF51024.1| ESC, putative [Ricinus communis]
Length = 298
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 56/143 (39%), Positives = 83/143 (58%), Gaps = 3/143 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + H++ V G D+ + +F++ R V I+S
Sbjct: 82 RRPRGRPAGSKNKPKPPIIITRDSANALRTHLMEVADGCDIVESVATFARRRQRGVSIMS 141
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
G ++NVTLRQ A+ G VT GRFEILSL+GSFL + + GL++ L+G G+
Sbjct: 142 GTGTVTNVTLRQPASPGAVVTLHGRFEILSLAGSFLPPPAPPAAT---GLTIYLAGGQGQ 198
Query: 245 VLGGSVAGLLTAATPVQVVVGSF 267
V+GGSV G L A+ PV ++ SF
Sbjct: 199 VVGGSVVGTLIASGPVVIMAASF 221
>gi|356565443|ref|XP_003550949.1| PREDICTED: LOW QUALITY PROTEIN: putative DNA-binding protein
ESCAROLA-like [Glycine max]
Length = 246
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 52/114 (45%), Positives = 75/114 (65%), Gaps = 5/114 (4%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILS 214
H++ V +G DV + ++++ R +C+LS +G ++NV+LRQ A +G VT GRFEILS
Sbjct: 107 HILEVGSGSDVFDCVTAYARRRQRGICVLSGSGTVTNVSLRQPAAAGAVVTLHGRFEILS 166
Query: 215 LSGSFLLSES-SGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
LSGSFL + G S L++ L+G G+V+GG+V G LTAA PV V+ SF
Sbjct: 167 LSGSFLPPPAPPGATS----LTIYLAGGQGQVVGGNVIGELTAAGPVIVIAASF 216
>gi|115460208|ref|NP_001053704.1| Os04g0590200 [Oryza sativa Japonica Group]
gi|38346718|emb|CAE04868.2| OSJNBa0086O06.16 [Oryza sativa Japonica Group]
gi|113565275|dbj|BAF15618.1| Os04g0590200 [Oryza sativa Japonica Group]
gi|125549530|gb|EAY95352.1| hypothetical protein OsI_17183 [Oryza sativa Indica Group]
gi|215769296|dbj|BAH01525.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 305
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 60/145 (41%), Positives = 88/145 (60%), Gaps = 6/145 (4%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRPPGS + K + + H++ V +G DV + ++++ R VC+LS
Sbjct: 82 RRPRGRPPGSKNKPKPPVIITRESANTLRAHILEVGSGCDVFECVSTYARRRQRGVCVLS 141
Query: 185 ANGAISNVTLRQ-AATSGGTVTYEGRFEILSLSGSFLLSES-SGQRSRTGGLSVSLSGPD 242
+G ++NVTLRQ +A +G V+ GRFEILSLSGSFL + G S L++ L+G
Sbjct: 142 GSGVVTNVTLRQPSAPAGAVVSLHGRFEILSLSGSFLPPPAPPGATS----LTIFLAGGQ 197
Query: 243 GRVLGGSVAGLLTAATPVQVVVGSF 267
G+V+GG+V G L AA PV V+ SF
Sbjct: 198 GQVVGGNVVGALYAAGPVIVIAASF 222
>gi|357168310|ref|XP_003581586.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Brachypodium
distachyon]
Length = 325
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 62/145 (42%), Positives = 88/145 (60%), Gaps = 6/145 (4%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRPPGS + K + + H++ V +G DV + +++ R VC+LS
Sbjct: 90 RRPRGRPPGSKNKPKPPVIITRESANTLRAHILEVGSGCDVFECVSTYACRRQRGVCVLS 149
Query: 185 ANGAISNVTLRQ-AATSGGTVTYEGRFEILSLSGSFLLSES-SGQRSRTGGLSVSLSGPD 242
+G ++NVTLRQ +A +G VT +GRFEILSLSGSFL + G S L+V L+G
Sbjct: 150 GSGVVTNVTLRQPSAPAGAVVTLQGRFEILSLSGSFLPPPAPPGATS----LTVFLAGGQ 205
Query: 243 GRVLGGSVAGLLTAATPVQVVVGSF 267
G+V+GG+V G L AA PV V+ SF
Sbjct: 206 GQVVGGNVVGALYAAGPVIVIAASF 230
>gi|326504130|dbj|BAK02851.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 287
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 58/135 (42%), Positives = 80/135 (59%), Gaps = 10/135 (7%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILS 214
HV+ V G DV+ I FS+ R VC+LS G +++V LRQ A G V GRFEILS
Sbjct: 102 HVMEVAGGADVAESIAHFSRRRQRGVCVLSGAGTVADVALRQPAAPGAVVALRGRFEILS 161
Query: 215 LSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKE 274
L+G+FL S + GL+V L+G G+V+GGSV G LTAA PV V+ +F A+ E
Sbjct: 162 LTGTFLPGPSP---PGSTGLTVYLAGGQGQVVGGSVVGALTAAGPVMVIASTF-ANATYE 217
Query: 275 ------SKSSHRMES 283
++ H++E+
Sbjct: 218 RLPLDDAEEDHQLEA 232
>gi|259490392|ref|NP_001159201.1| uncharacterized protein LOC100304287 [Zea mays]
gi|223942597|gb|ACN25382.1| unknown [Zea mays]
gi|342899429|gb|AEL78913.1| barren stalk fastigiate 1 [Zea mays]
gi|413953311|gb|AFW85960.1| hypothetical protein ZEAMMB73_663755 [Zea mays]
Length = 341
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 59/144 (40%), Positives = 81/144 (56%), Gaps = 3/144 (2%)
Query: 124 IKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCIL 183
+++ RGRP GS + K + + HV+ V AG DV + F++ VC+L
Sbjct: 109 MRRPRGRPAGSKNKPKPPVIITRDSASALRAHVLEVAAGCDVVDSVAGFARRRQVGVCVL 168
Query: 184 SANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDG 243
S G+++NV +RQ G VT GRFEILSL GSFL + + GL+V LSG G
Sbjct: 169 SGAGSVANVCVRQPGAGAGAVTLPGRFEILSLCGSFLPPPAPPAAT---GLTVYLSGGQG 225
Query: 244 RVLGGSVAGLLTAATPVQVVVGSF 267
+V+GGSVAG L A+ PV +V F
Sbjct: 226 QVVGGSVAGPLLASGPVVIVAACF 249
>gi|413919176|gb|AFW59108.1| hypothetical protein ZEAMMB73_282218 [Zea mays]
Length = 310
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 58/144 (40%), Positives = 86/144 (59%), Gaps = 4/144 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + H++ V +G DV + ++++ R VC+LS
Sbjct: 84 RRPRGRPAGSKNKPKPPVIITRESANTLRAHILEVASGCDVFESVSTYARRRQRGVCVLS 143
Query: 185 ANGAISNVTLRQ-AATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDG 243
+G ++NVTLRQ +A +G VT GRFEILSLSGSFL + + L++ L+G G
Sbjct: 144 GSGEVTNVTLRQPSAPTGAVVTLHGRFEILSLSGSFLPPPAPPGAT---SLTIFLAGGQG 200
Query: 244 RVLGGSVAGLLTAATPVQVVVGSF 267
+V+GG+V G L AA PV V+ SF
Sbjct: 201 QVVGGNVVGALYAAGPVIVIAASF 224
>gi|383145923|gb|AFG54575.1| Pinus taeda anonymous locus 2_4619_01 genomic sequence
gi|383145927|gb|AFG54577.1| Pinus taeda anonymous locus 2_4619_01 genomic sequence
Length = 132
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 58/136 (42%), Positives = 81/136 (59%), Gaps = 5/136 (3%)
Query: 133 GSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNV 192
GS + KH + + H++ + G DV+ + +F++ RAVCILS +G + NV
Sbjct: 1 GSKNKVKHPMIVQKESASCLKAHILEIANGCDVAESLATFARRRQRAVCILSGSGTVHNV 60
Query: 193 TLRQAATSGGTVTYEGRFEILSLSGSFL-LSESSGQRSRTGGLSVSLSGPDGRVLGGSVA 251
TLRQ T+G V EGRFE+LSLSGSFL E SG + GL++ L G G+V+GGSV
Sbjct: 61 TLRQPGTAGTIVNLEGRFEMLSLSGSFLPTVEPSG----STGLTIYLVGGQGQVVGGSVV 116
Query: 252 GLLTAATPVQVVVGSF 267
G L A+ P+ V+ F
Sbjct: 117 GALMASGPIVVIAAIF 132
>gi|361067911|gb|AEW08267.1| Pinus taeda anonymous locus 2_4619_01 genomic sequence
gi|383145909|gb|AFG54568.1| Pinus taeda anonymous locus 2_4619_01 genomic sequence
gi|383145911|gb|AFG54569.1| Pinus taeda anonymous locus 2_4619_01 genomic sequence
gi|383145913|gb|AFG54570.1| Pinus taeda anonymous locus 2_4619_01 genomic sequence
gi|383145915|gb|AFG54571.1| Pinus taeda anonymous locus 2_4619_01 genomic sequence
gi|383145917|gb|AFG54572.1| Pinus taeda anonymous locus 2_4619_01 genomic sequence
gi|383145919|gb|AFG54573.1| Pinus taeda anonymous locus 2_4619_01 genomic sequence
gi|383145921|gb|AFG54574.1| Pinus taeda anonymous locus 2_4619_01 genomic sequence
gi|383145925|gb|AFG54576.1| Pinus taeda anonymous locus 2_4619_01 genomic sequence
gi|383145929|gb|AFG54578.1| Pinus taeda anonymous locus 2_4619_01 genomic sequence
gi|383145931|gb|AFG54579.1| Pinus taeda anonymous locus 2_4619_01 genomic sequence
gi|383145933|gb|AFG54580.1| Pinus taeda anonymous locus 2_4619_01 genomic sequence
Length = 132
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 58/136 (42%), Positives = 81/136 (59%), Gaps = 5/136 (3%)
Query: 133 GSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNV 192
GS + KH + + H++ + G DV+ + +F++ RAVCILS +G + NV
Sbjct: 1 GSKNKVKHPMIVHKESASCLKAHILEIANGCDVAESLATFARRRQRAVCILSGSGTVHNV 60
Query: 193 TLRQAATSGGTVTYEGRFEILSLSGSFL-LSESSGQRSRTGGLSVSLSGPDGRVLGGSVA 251
TLRQ T+G V EGRFE+LSLSGSFL E SG + GL++ L G G+V+GGSV
Sbjct: 61 TLRQPGTAGTIVNLEGRFEMLSLSGSFLPTVEPSG----STGLTIYLVGGQGQVVGGSVV 116
Query: 252 GLLTAATPVQVVVGSF 267
G L A+ P+ V+ F
Sbjct: 117 GALMASGPIVVIAAIF 132
>gi|147840658|emb|CAN68541.1| hypothetical protein VITISV_020444 [Vitis vinifera]
Length = 275
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 54/161 (33%), Positives = 81/161 (50%), Gaps = 9/161 (5%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
+K RGRPPGS + K + G P VI V G D+ ++ F++ + IL
Sbjct: 67 RKPRGRPPGSKNKPKPPIVITRECESGMKPIVIEVAPGNDLFETVVQFARRRRVGITILH 126
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL--LSESSGQRSRTGGLSVSLSGPD 242
G ISNVT RQ T + G I+ +SG +L + ++ SR SVS++G
Sbjct: 127 GFGTISNVTFRQPVPHAPTYSLHGPLCIIYISGWYLGCPTPATPATSR-ASFSVSVAGTQ 185
Query: 243 GRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMES 283
G++ GG VAG +TA+ PV ++ +F + S HR+ S
Sbjct: 186 GQIYGGQVAGKVTASGPVTLIASTF------TNPSVHRLPS 220
>gi|224138108|ref|XP_002326520.1| predicted protein [Populus trichocarpa]
gi|222833842|gb|EEE72319.1| predicted protein [Populus trichocarpa]
Length = 300
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 60/144 (41%), Positives = 85/144 (59%), Gaps = 5/144 (3%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + H++ V G DV + ++++ R +CILS
Sbjct: 81 RRPRGRPAGSKNKPKPPVIITRESANTLRAHILEVGNGCDVFECVANYARRRQRGICILS 140
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSES-SGQRSRTGGLSVSLSGPDG 243
G ++NV++RQ A +G VT GRFEILSLSGSFL + G S L++ L+G G
Sbjct: 141 GAGTVTNVSIRQPAAAGAIVTLHGRFEILSLSGSFLPPPAPPGATS----LTIFLAGGQG 196
Query: 244 RVLGGSVAGLLTAATPVQVVVGSF 267
+V+GGSV G LTAA PV V+ SF
Sbjct: 197 QVVGGSVVGELTAAGPVIVIAASF 220
>gi|225436640|ref|XP_002276021.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Vitis
vinifera]
Length = 275
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 54/161 (33%), Positives = 81/161 (50%), Gaps = 9/161 (5%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
+K RGRPPGS + K + G P VI V G D+ ++ F++ + IL
Sbjct: 67 RKPRGRPPGSKNKPKPPIVITRECESGMKPIVIEVAPGNDLFETVVQFARRRRVGITILH 126
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL--LSESSGQRSRTGGLSVSLSGPD 242
G ISNVT RQ T + G I+ +SG +L + ++ SR SVS++G
Sbjct: 127 GFGTISNVTFRQPVPHAPTYSLHGPLCIIYISGWYLGCPTPATPATSR-ASFSVSVAGTQ 185
Query: 243 GRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMES 283
G++ GG VAG +TA+ PV ++ +F + S HR+ S
Sbjct: 186 GQIYGGQVAGKVTASGPVTLIASTF------TNPSVHRLPS 220
>gi|224085352|ref|XP_002307550.1| predicted protein [Populus trichocarpa]
gi|222856999|gb|EEE94546.1| predicted protein [Populus trichocarpa]
Length = 232
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 55/147 (37%), Positives = 85/147 (57%), Gaps = 3/147 (2%)
Query: 122 DSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVC 181
+++++ RGRP GS + K + + H + V +G DV + +F++ R +
Sbjct: 24 ETLRRPRGRPAGSKNKPKPPIIVTRDSANALRAHAMEVSSGCDVCESLANFARRKQRGIS 83
Query: 182 ILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGP 241
+LS +G ++NVTLRQ A+SG VT GRFEILSL GS L + GL++ L+G
Sbjct: 84 VLSGSGCVTNVTLRQPASSGAIVTLHGRFEILSLLGSVLPPPAP---QGITGLTIYLAGA 140
Query: 242 DGRVLGGSVAGLLTAATPVQVVVGSFL 268
G+V+GG V G L A+ PV ++ SF+
Sbjct: 141 QGQVVGGVVVGALIASGPVVIMAASFM 167
>gi|357441305|ref|XP_003590930.1| DNA-binding protein [Medicago truncatula]
gi|355479978|gb|AES61181.1| DNA-binding protein [Medicago truncatula]
Length = 305
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 63/164 (38%), Positives = 88/164 (53%), Gaps = 12/164 (7%)
Query: 123 SIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCI 182
S ++ RGRP GS + K + + HV+ V G D+S I+ F++ R +CI
Sbjct: 72 STRRPRGRPSGSKNKPKPPIFITRDSPNALRSHVMEVATGTDISDSIVQFARKRQRGICI 131
Query: 183 LSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPD 242
LSA+G + NV+LRQ G V GRF+ILSL+GS L S + GL++ LSG
Sbjct: 132 LSASGTVVNVSLRQPTGPGAVVALPGRFDILSLTGSVLPGPSPPGAT---GLTIYLSGGQ 188
Query: 243 GRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPV 286
G+V+GG V G L AA PV ++ +F S+ E LPV
Sbjct: 189 GQVVGGGVVGPLVAAGPVMLMAATF---------SNATYERLPV 223
>gi|242080659|ref|XP_002445098.1| hypothetical protein SORBIDRAFT_07g004070 [Sorghum bicolor]
gi|241941448|gb|EES14593.1| hypothetical protein SORBIDRAFT_07g004070 [Sorghum bicolor]
Length = 298
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 42/102 (41%), Positives = 58/102 (56%), Gaps = 6/102 (5%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRPPGS + K + + H++ V AG DV + ++++ R VC+LS
Sbjct: 74 RRPRGRPPGSKNKPKPPVIITRESANALRAHILEVAAGCDVFEALTAYARRRQRGVCVLS 133
Query: 185 ANGAISNVTLRQAA------TSGGTVTYEGRFEILSLSGSFL 220
A G ++NVTLRQ TS T GRFEILSL+GSFL
Sbjct: 134 AAGTVANVTLRQPQSSQTGPTSPAVATLHGRFEILSLAGSFL 175
>gi|388507706|gb|AFK41919.1| unknown [Medicago truncatula]
Length = 305
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 63/164 (38%), Positives = 88/164 (53%), Gaps = 12/164 (7%)
Query: 123 SIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCI 182
S ++ RGRP GS + K + + HV+ V G D+S I+ F++ R +CI
Sbjct: 72 STRRPRGRPSGSKNKPKPPIFITRDSPNALRSHVMEVATGTDISDSIVQFARKRQRGICI 131
Query: 183 LSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPD 242
LSA+G + NV+LRQ G V GRF+ILSL+GS L S + GL++ LSG
Sbjct: 132 LSASGTVVNVSLRQPTGPGAVVALPGRFDILSLTGSVLPGPSPPGAT---GLTIYLSGGQ 188
Query: 243 GRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPV 286
G+V+GG V G L AA PV ++ +F S+ E LPV
Sbjct: 189 GQVVGGGVVGPLVAAGPVMLMAATF---------SNATYERLPV 223
>gi|225428348|ref|XP_002280017.1| PREDICTED: putative DNA-binding protein ESCAROLA [Vitis vinifera]
Length = 289
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 42/97 (43%), Positives = 60/97 (61%), Gaps = 1/97 (1%)
Query: 123 SIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCI 182
S ++ RGRPPGS + K + + HV+ + AG D+ + ++++ R VCI
Sbjct: 61 SSRRPRGRPPGSKNKAKPPIIITRDSPNALRSHVLEISAGADIVESVSNYARRRGRGVCI 120
Query: 183 LSANGAISNVTLRQ-AATSGGTVTYEGRFEILSLSGS 218
LS GA+++VTLRQ AA SG VT GRFEILSL+G+
Sbjct: 121 LSGGGAVTDVTLRQPAAPSGSVVTLHGRFEILSLTGT 157
>gi|356553603|ref|XP_003545144.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Glycine max]
Length = 249
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 38/99 (38%), Positives = 59/99 (59%)
Query: 122 DSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVC 181
D++++ RGRP GS + K + + H + V +G DV+ +++F++ R +
Sbjct: 41 DTLRRPRGRPAGSKNKPKPPIIVTRDSANALKAHAMEVSSGCDVNESLLNFARRKQRGLY 100
Query: 182 ILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
IL+ G ++NVTLRQ ++G VT GRFEILSL GS L
Sbjct: 101 ILNGTGCVTNVTLRQPGSAGAIVTLHGRFEILSLLGSIL 139
>gi|449473795|ref|XP_004153985.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Cucumis
sativus]
gi|449499020|ref|XP_004160698.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Cucumis
sativus]
Length = 253
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 38/97 (39%), Positives = 59/97 (60%)
Query: 124 IKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCIL 183
+++ RGRP GS + K + + H++ V G DV + +++ R +C+L
Sbjct: 51 VRRPRGRPAGSKNKPKPPVIITRESANTLRAHILEVGGGCDVFEAVAGYARRRQRGICVL 110
Query: 184 SANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
S +G ++NV+LRQ A +G +T +GRFEILSLSGSFL
Sbjct: 111 SGSGIVNNVSLRQPAAAGSVLTLQGRFEILSLSGSFL 147
>gi|449454656|ref|XP_004145070.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Cucumis
sativus]
Length = 253
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 38/97 (39%), Positives = 59/97 (60%)
Query: 124 IKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCIL 183
+++ RGRP GS + K + + H++ V G DV + +++ R +C+L
Sbjct: 51 VRRPRGRPAGSKNKPKPPVIITRESANTLRAHILEVGGGCDVFEAVAGYARRRQRGICVL 110
Query: 184 SANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
S +G ++NV+LRQ A +G +T +GRFEILSLSGSFL
Sbjct: 111 SGSGIVNNVSLRQPAAAGSVLTLQGRFEILSLSGSFL 147
>gi|326491631|dbj|BAJ94293.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 322
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 52/114 (45%), Positives = 72/114 (63%), Gaps = 4/114 (3%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQ-AATSGGTVTYEGRFEIL 213
HV+ + +G D+ + +F++ R V +LS +G + NVTLRQ AA G VT GRFEIL
Sbjct: 123 HVLEIASGADIMEAVATFARRRQRGVSVLSGSGVVGNVTLRQPAAPPGSVVTLHGRFEIL 182
Query: 214 SLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
SLSG+FL S + GL+V L+G G+V+GG+V G L A+ PV VV +F
Sbjct: 183 SLSGAFLPSPCPPGAT---GLAVYLAGGQGQVVGGTVIGELVASGPVMVVAATF 233
>gi|326501302|dbj|BAJ98882.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326505696|dbj|BAJ95519.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 256
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 52/114 (45%), Positives = 72/114 (63%), Gaps = 4/114 (3%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQ-AATSGGTVTYEGRFEIL 213
HV+ + +G D+ I +FS+ R V +LS +GA++NVTLRQ A T V GRFEIL
Sbjct: 74 HVLEIASGADIVEAIAAFSRRRQRGVSVLSGSGAVTNVTLRQPAGTGAAAVALRGRFEIL 133
Query: 214 SLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
SLSG+FL + + + GL+V L+G G+V+GGSV G L A PV V+ +F
Sbjct: 134 SLSGAFLPAPAPPGAT---GLAVYLAGGQGQVVGGSVMGELLACGPVMVIAATF 184
>gi|115449761|ref|NP_001048546.1| Os02g0820800 [Oryza sativa Japonica Group]
gi|48716263|dbj|BAD22878.1| DNA-binding protein-like [Oryza sativa Japonica Group]
gi|48716505|dbj|BAD23110.1| DNA-binding protein-like [Oryza sativa Japonica Group]
gi|113538077|dbj|BAF10460.1| Os02g0820800 [Oryza sativa Japonica Group]
gi|125541659|gb|EAY88054.1| hypothetical protein OsI_09483 [Oryza sativa Indica Group]
Length = 266
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 52/113 (46%), Positives = 69/113 (61%), Gaps = 3/113 (2%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILS 214
HV+ V G DV+ I FS+ R VC+LS G ++NV LRQ + G V GRFEILS
Sbjct: 88 HVMEVAGGADVADAIAQFSRRRQRGVCVLSGAGTVANVALRQPSAPGAVVALHGRFEILS 147
Query: 215 LSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
L+G+FL + + GL+V L+G G+V+GGSV G L AA PV V+ +F
Sbjct: 148 LTGTFLPGPAPPGST---GLTVYLAGGQGQVVGGSVVGSLIAAGPVMVIASTF 197
>gi|449432239|ref|XP_004133907.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Cucumis
sativus]
Length = 263
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 57/147 (38%), Positives = 84/147 (57%), Gaps = 4/147 (2%)
Query: 122 DSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVC 181
+ +++ RGRP GS + K + HVI + DV + F++ R +C
Sbjct: 52 EVLRRPRGRPAGSKNKPKPPTIITRDSANALRCHVIEIANANDVIETLTIFARQRQRGIC 111
Query: 182 ILSANGAISNVTLRQ-AATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSG 240
+L+ GA++NVTL+Q +T+G ++ GRFEILSLSGSFL + S GL+V LSG
Sbjct: 112 VLTGAGAVTNVTLKQPVSTAGAVISLPGRFEILSLSGSFLPPPAPAAAS---GLTVYLSG 168
Query: 241 PDGRVLGGSVAGLLTAATPVQVVVGSF 267
G+V+GGSV G L ++ PV + SF
Sbjct: 169 GQGQVVGGSVVGPLMSSGPVVITAASF 195
>gi|226491364|ref|NP_001150826.1| DNA-binding protein [Zea mays]
gi|195642210|gb|ACG40573.1| DNA-binding protein [Zea mays]
Length = 245
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 58/140 (41%), Positives = 78/140 (55%), Gaps = 4/140 (2%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILS 214
HV+ V G DV+ I FS+ R VC+LS G ++NV LRQ + V GRFEILS
Sbjct: 65 HVMEVAGGADVADAIAQFSRRRQRGVCVLSGAGTVANVALRQPSAPTAVVALRGRFEILS 124
Query: 215 LSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKE 274
L+G+FL + + GL+V L+G G+V+GGSV G L AA PV V+ +F A+ E
Sbjct: 125 LTGTFLPGPAPXGST---GLTVYLAGGQGQVVGGSVVGTLIAAGPVMVIASTF-ANATYE 180
Query: 275 SKSSHRMESLPVPPKLAPGG 294
+ P PP + GG
Sbjct: 181 RLPLEEEDEGPAPPMASGGG 200
>gi|242055603|ref|XP_002456947.1| hypothetical protein SORBIDRAFT_03g046120 [Sorghum bicolor]
gi|241928922|gb|EES02067.1| hypothetical protein SORBIDRAFT_03g046120 [Sorghum bicolor]
Length = 250
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 59/153 (38%), Positives = 84/153 (54%), Gaps = 8/153 (5%)
Query: 108 GSGLSSPGGGP---LSPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGED 164
G+GL P P L P + +K RGRP GS + K + + P V+ + AG D
Sbjct: 9 GAGLMEPAPAPARALMPVTARKPRGRPLGSKNKPKPPVVVTRESDAAMRPVVLELAAGCD 68
Query: 165 VSSKIMSFSQNGPRAVCILSANGAISNVTLRQAAT--SGGTVTYEGRFEILSLSGSFLLS 222
V S + +F++ V +L GA++ VTLR AA + VT GRFE+L+LSG+ L S
Sbjct: 69 VVSAVAAFARRRRVGVSVLCGRGAVAAVTLRLAAAEDTASAVTLHGRFEVLALSGTVLPS 128
Query: 223 ESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLT 255
S S SVSL+G G+V+GG++AG +T
Sbjct: 129 YSP---SLAPAFSVSLAGLGGQVIGGTLAGEMT 158
>gi|115474893|ref|NP_001061043.1| Os08g0159700 [Oryza sativa Japonica Group]
gi|29467557|dbj|BAC66727.1| DNA-binding protein-like [Oryza sativa Japonica Group]
gi|37806155|dbj|BAC99660.1| DNA-binding protein-like [Oryza sativa Japonica Group]
gi|113623012|dbj|BAF22957.1| Os08g0159700 [Oryza sativa Japonica Group]
Length = 289
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 58/102 (56%), Gaps = 6/102 (5%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRPPGS + K + + H++ V AG DV + ++++ R VC+LS
Sbjct: 63 RRPRGRPPGSKNKPKPPVIITRESANALRAHILEVAAGCDVFEALTAYARRRQRGVCVLS 122
Query: 185 ANGAISNVTLRQAAT------SGGTVTYEGRFEILSLSGSFL 220
A G ++NVTLRQ + S T GRFEILSL+GSFL
Sbjct: 123 AAGTVANVTLRQPQSAQPGPASPAVATLHGRFEILSLAGSFL 164
>gi|125560222|gb|EAZ05670.1| hypothetical protein OsI_27898 [Oryza sativa Indica Group]
Length = 289
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 58/102 (56%), Gaps = 6/102 (5%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRPPGS + K + + H++ V AG DV + ++++ R VC+LS
Sbjct: 63 RRPRGRPPGSKNKPKPPVIITRESANALRAHILEVAAGCDVFEALTAYARRRQRGVCVLS 122
Query: 185 ANGAISNVTLRQAAT------SGGTVTYEGRFEILSLSGSFL 220
A G ++NVTLRQ + S T GRFEILSL+GSFL
Sbjct: 123 AAGTVANVTLRQPQSAQPGPASPAVATLHGRFEILSLAGSFL 164
>gi|356499122|ref|XP_003518392.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Glycine max]
Length = 255
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 53/113 (46%), Positives = 76/113 (67%), Gaps = 3/113 (2%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILS 214
H++ + G DV+ I +F+ R V +LS +G ++NVTLRQ A GG +T +GRFEILS
Sbjct: 88 HILEISGGSDVAECIATFATRRHRGVSVLSGSGVVTNVTLRQPAAPGGVITLQGRFEILS 147
Query: 215 LSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
LSG+FL + S + + GL+V L+G +G+V+GGSV G L A+ PV VV +F
Sbjct: 148 LSGAFLPAPSPPEAT---GLTVYLAGGEGQVVGGSVVGPLVASGPVMVVAATF 197
>gi|356576664|ref|XP_003556450.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Glycine max]
Length = 259
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 56/156 (35%), Positives = 84/156 (53%), Gaps = 3/156 (1%)
Query: 112 SSPGGGPLSPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMS 171
P G + + ++ RGRPPGS + K + + HV+ + AG D++ +
Sbjct: 30 DEPREGAIDVSTTRRPRGRPPGSKNKPKPPIFVTRDSPNALRSHVMEIAAGADIADCVAQ 89
Query: 172 FSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRT 231
F++ R V ILS +G + NVT+RQ G + GRF+ILSL+GSFL S +
Sbjct: 90 FARRLQRGVSILSGSGTVVNVTIRQPTAPGAVMALHGRFDILSLTGSFLPGPSPPGAT-- 147
Query: 232 GGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
GL++ L+G G V+GG V G L AA PV ++ +F
Sbjct: 148 -GLTIYLAGGQGHVVGGGVVGPLLAAGPVLLMAATF 182
>gi|224062723|ref|XP_002300879.1| predicted protein [Populus trichocarpa]
gi|222842605|gb|EEE80152.1| predicted protein [Populus trichocarpa]
Length = 266
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 55/145 (37%), Positives = 82/145 (56%), Gaps = 3/145 (2%)
Query: 124 IKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCIL 183
I++ RGRP GS + K + + H + V +G DV + +F++ R + +L
Sbjct: 60 IRRPRGRPAGSKNKPKPPIIVTRDSANALRAHAMEVSSGCDVCESLANFARRKQRGISVL 119
Query: 184 SANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDG 243
S +G ++NVTLRQ +SG VT GRFEILSL GS L + GL++ L+G G
Sbjct: 120 SGSGCVTNVTLRQPTSSGAIVTLHGRFEILSLLGSVLPPPAP---QGITGLTIYLAGAQG 176
Query: 244 RVLGGSVAGLLTAATPVQVVVGSFL 268
+V+GG V G L A+ PV ++ SF+
Sbjct: 177 QVVGGGVVGALIASGPVVIMAASFM 201
>gi|225454068|ref|XP_002265280.1| PREDICTED: putative DNA-binding protein ESCAROLA [Vitis vinifera]
gi|147822229|emb|CAN61959.1| hypothetical protein VITISV_013618 [Vitis vinifera]
Length = 246
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 60/145 (41%), Positives = 86/145 (59%), Gaps = 6/145 (4%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRPPGS + K + + H++ V G DV + +++ +CILS
Sbjct: 51 RRPRGRPPGSKNKPKPPVVISRESTNTLRAHILEVGHGCDVFHSVAEYTEKRRCGICILS 110
Query: 185 ANGAISNVTLRQAATSGGTVTY-EGRFEILSLSGSFLLSES-SGQRSRTGGLSVSLSGPD 242
+G +++V+LRQ A +GG V + +GRFEILSLSGSFL + G S L+V L+G
Sbjct: 111 GSGMVTDVSLRQPAAAGGAVAFLQGRFEILSLSGSFLPRPAPPGATS----LTVFLAGSQ 166
Query: 243 GRVLGGSVAGLLTAATPVQVVVGSF 267
G+V+GGSV G LTA PV V+ SF
Sbjct: 167 GQVVGGSVVGGLTACGPVVVIAASF 191
>gi|413939532|gb|AFW74083.1| DNA-binding protein [Zea mays]
Length = 245
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 58/140 (41%), Positives = 78/140 (55%), Gaps = 4/140 (2%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILS 214
HV+ V G DV+ I FS+ R VC+LS G ++NV LRQ + V GRFEILS
Sbjct: 65 HVMEVAGGADVADAIAQFSRRRQRGVCVLSGAGTVANVALRQPSAPTAVVALRGRFEILS 124
Query: 215 LSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKE 274
L+G+FL + + GL+V L+G G+V+GGSV G L AA PV V+ +F A+ E
Sbjct: 125 LTGTFLPGPAPPGST---GLTVYLAGGQGQVVGGSVVGTLIAAGPVMVIASTF-ANATYE 180
Query: 275 SKSSHRMESLPVPPKLAPGG 294
+ P PP + GG
Sbjct: 181 RLPLEEEDEGPAPPMASGGG 200
>gi|449533526|ref|XP_004173725.1| PREDICTED: LOW QUALITY PROTEIN: putative DNA-binding protein
ESCAROLA-like [Cucumis sativus]
Length = 255
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 57/147 (38%), Positives = 84/147 (57%), Gaps = 4/147 (2%)
Query: 122 DSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVC 181
+ +++ RGRP GS + K + HVI + DV + F++ R +C
Sbjct: 44 EVLRRPRGRPAGSKNKPKPPTIITRDSANALRCHVIEIANANDVIETLTIFARQRQRGIC 103
Query: 182 ILSANGAISNVTLRQ-AATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSG 240
+L+ GA++NVTL+Q +T+G ++ GRFEILSLSGSFL + S GL+V LSG
Sbjct: 104 VLTGAGAVTNVTLKQPVSTAGAVISLPGRFEILSLSGSFLPPPAPAAAS---GLTVYLSG 160
Query: 241 PDGRVLGGSVAGLLTAATPVQVVVGSF 267
G+V+GGSV G L ++ PV + SF
Sbjct: 161 GQGQVVGGSVVGPLMSSGPVVITAASF 187
>gi|297820312|ref|XP_002878039.1| hypothetical protein ARALYDRAFT_906980 [Arabidopsis lyrata subsp.
lyrata]
gi|297323877|gb|EFH54298.1| hypothetical protein ARALYDRAFT_906980 [Arabidopsis lyrata subsp.
lyrata]
Length = 308
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 50/113 (44%), Positives = 75/113 (66%), Gaps = 2/113 (1%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILS 214
HV+ + G DV+ + +F++ R V +LS +G ++NVTLRQ A SGG V+ G+FEILS
Sbjct: 116 HVLEIATGADVAESLNAFARRRGRGVSVLSGSGLVTNVTLRQPAASGGVVSLRGQFEILS 175
Query: 215 LSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
+ G+FL +SG + GL++ L+G G+V+GG VAG L A+ PV V+ +F
Sbjct: 176 MCGAFL--PTSGSPAAAAGLTIYLAGAQGQVVGGGVAGPLIASGPVIVIAATF 226
>gi|15233302|ref|NP_191115.1| AT-hook protein of GA feedback 2 [Arabidopsis thaliana]
gi|7076799|emb|CAB75914.1| putative protein [Arabidopsis thaliana]
gi|21554159|gb|AAM63238.1| unknown [Arabidopsis thaliana]
gi|89001051|gb|ABD59115.1| At3g55560 [Arabidopsis thaliana]
gi|119657374|tpd|FAA00286.1| TPA: AT-hook motif nuclear localized protein 15 [Arabidopsis
thaliana]
gi|332645879|gb|AEE79400.1| AT-hook protein of GA feedback 2 [Arabidopsis thaliana]
Length = 310
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 50/113 (44%), Positives = 75/113 (66%), Gaps = 2/113 (1%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILS 214
HV+ + G DV+ + +F++ R V +LS +G ++NVTLRQ A SGG V+ G+FEILS
Sbjct: 118 HVLEIATGADVAESLNAFARRRGRGVSVLSGSGLVTNVTLRQPAASGGVVSLRGQFEILS 177
Query: 215 LSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
+ G+FL +SG + GL++ L+G G+V+GG VAG L A+ PV V+ +F
Sbjct: 178 MCGAFL--PTSGSPAAAAGLTIYLAGAQGQVVGGGVAGPLIASGPVIVIAATF 228
>gi|357153953|ref|XP_003576620.1| PREDICTED: uncharacterized protein LOC100834433 [Brachypodium
distachyon]
Length = 371
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 49/154 (31%), Positives = 75/154 (48%), Gaps = 24/154 (15%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSA--GVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCI 182
KK RGRPPGS + K + A PHVI + G D++ + F+ +C+
Sbjct: 103 KKRRGRPPGSKNKPKPPVVITREAEPAAAMRPHVIEIPGGRDIAEALSRFAGRRGLGICV 162
Query: 183 LSANGAISNVTLRQAATSG-------------GTVTYEGRFEILSLSGSFLLSESSGQRS 229
L+ GA++NV+LR + V +GR+EILS+S +FL +
Sbjct: 163 LAGTGAVANVSLRHPCSPATAALAPPGLAAPAAVVVVQGRYEILSISATFLPPAMAAAMD 222
Query: 230 RTG---------GLSVSLSGPDGRVLGGSVAGLL 254
G+S+SL+GP G+++GG+VAG L
Sbjct: 223 MAPQAAAAMAAAGISISLAGPHGQIVGGAVAGPL 256
>gi|297803842|ref|XP_002869805.1| hypothetical protein ARALYDRAFT_492588 [Arabidopsis lyrata subsp.
lyrata]
gi|297315641|gb|EFH46064.1| hypothetical protein ARALYDRAFT_492588 [Arabidopsis lyrata subsp.
lyrata]
Length = 319
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 57/146 (39%), Positives = 81/146 (55%), Gaps = 6/146 (4%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + HV+ + G D+ + +F++ R VC++S
Sbjct: 98 RRPRGRPAGSKNKPKPPIIVTRDSANALRTHVMEIGDGCDLVESVATFARRRQRGVCVMS 157
Query: 185 ANGAISNVTLRQAATS---GGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGP 241
G ++NVT+RQ + G V+ GRFEILSLSGSFL + GLSV L+G
Sbjct: 158 GTGNVTNVTIRQPGSHPSPGSVVSLHGRFEILSLSGSFLPPPAP---PTATGLSVYLAGG 214
Query: 242 DGRVLGGSVAGLLTAATPVQVVVGSF 267
G+V+GGSV G L A PV V+ SF
Sbjct: 215 QGQVVGGSVVGPLLCAGPVVVMAASF 240
>gi|15234404|ref|NP_192942.1| putative AT-hook DNA-binding family protein [Arabidopsis thaliana]
gi|4586110|emb|CAB40946.1| putative DNA-binding protein [Arabidopsis thaliana]
gi|7267906|emb|CAB78248.1| putative DNA-binding protein [Arabidopsis thaliana]
gi|32815961|gb|AAP88365.1| At4g12050 [Arabidopsis thaliana]
gi|110736316|dbj|BAF00128.1| putative DNA-binding protein [Arabidopsis thaliana]
gi|119657396|tpd|FAA00297.1| TPA: AT-hook motif nuclear localized protein 26 [Arabidopsis
thaliana]
gi|225898773|dbj|BAH30517.1| hypothetical protein [Arabidopsis thaliana]
gi|332657691|gb|AEE83091.1| putative AT-hook DNA-binding family protein [Arabidopsis thaliana]
Length = 339
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 56/144 (38%), Positives = 83/144 (57%), Gaps = 4/144 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + HV+ + G D+ + +F++ R VC++S
Sbjct: 118 RRPRGRPAGSKNKPKAPIIITRDSANALRTHVMEIGDGCDIVDCMATFARRRQRGVCVMS 177
Query: 185 ANGAISNVTLRQAAT-SGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDG 243
G+++NVT+RQ + G V+ GRFEILSLSGSFL + + GLSV L+G G
Sbjct: 178 GTGSVTNVTIRQPGSPPGSVVSLHGRFEILSLSGSFLPPPAPPAAT---GLSVYLAGGQG 234
Query: 244 RVLGGSVAGLLTAATPVQVVVGSF 267
+V+GGSV G L + PV V+ SF
Sbjct: 235 QVVGGSVVGPLLCSGPVVVMAASF 258
>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
Length = 2819
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 44/98 (44%), Positives = 59/98 (60%), Gaps = 6/98 (6%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILS 214
HV + D++ I +F+Q R V ILSA G ++++TLRQ G +T RFEILS
Sbjct: 622 HVFEIATATDIADSIFTFTQRRRRGVSILSATGLVTDITLRQPP---GVITLHQRFEILS 678
Query: 215 LSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAG 252
LSG+FL + S T L+V L+G GRV+GG VAG
Sbjct: 679 LSGAFLPTPSP---HGTSALTVYLAGDQGRVVGGLVAG 713
>gi|356535220|ref|XP_003536146.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Glycine max]
Length = 280
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 55/156 (35%), Positives = 83/156 (53%), Gaps = 3/156 (1%)
Query: 112 SSPGGGPLSPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMS 171
P G + + ++ RGRPPGS + K + + HV+ + G D++ +
Sbjct: 49 DEPREGAIDVATTRRPRGRPPGSRNKPKPPIFVTRDSPNALRSHVMEIAVGADIADCVAQ 108
Query: 172 FSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRT 231
F++ R V ILS +G + NV LRQ G + GRF+ILSL+GSFL S +
Sbjct: 109 FARRRQRGVSILSGSGTVVNVNLRQPTAPGAVMALHGRFDILSLTGSFLPGPSPPGAT-- 166
Query: 232 GGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
GL++ L+G G+++GG V G L AA PV V+ +F
Sbjct: 167 -GLTIYLAGGQGQIVGGGVVGPLVAAGPVLVMAATF 201
>gi|297813721|ref|XP_002874744.1| hypothetical protein ARALYDRAFT_490024 [Arabidopsis lyrata subsp.
lyrata]
gi|297320581|gb|EFH51003.1| hypothetical protein ARALYDRAFT_490024 [Arabidopsis lyrata subsp.
lyrata]
Length = 331
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 56/144 (38%), Positives = 82/144 (56%), Gaps = 4/144 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + HV+ + G D+ + +F++ R VC++S
Sbjct: 110 RRPRGRPAGSKNKPKAPIIITRDSANALRTHVMEIGDGCDIVDCMATFARRRQRGVCVMS 169
Query: 185 ANGAISNVTLRQAAT-SGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDG 243
G ++NVT+RQ + G V+ GRFEILSLSGSFL + + GLSV L+G G
Sbjct: 170 GTGNVTNVTIRQPGSPPGSVVSLHGRFEILSLSGSFLPPPAPPAAT---GLSVYLAGGQG 226
Query: 244 RVLGGSVAGLLTAATPVQVVVGSF 267
+V+GGSV G L + PV V+ SF
Sbjct: 227 QVVGGSVVGPLLCSGPVVVMAASF 250
>gi|356568547|ref|XP_003552472.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Glycine max]
Length = 268
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 33/66 (50%), Positives = 47/66 (71%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILS 214
H++ V +G DV + ++++ R +C+LS +G ++NVTLRQ A +G VT GRFEILS
Sbjct: 86 HILEVSSGCDVFESVATYARKRQRGICVLSGSGTVTNVTLRQPAAAGAVVTLHGRFEILS 145
Query: 215 LSGSFL 220
LSGSFL
Sbjct: 146 LSGSFL 151
>gi|255647630|gb|ACU24278.1| unknown [Glycine max]
Length = 268
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 33/66 (50%), Positives = 47/66 (71%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILS 214
H++ V +G DV + ++++ R +C+LS +G ++NVTLRQ A +G VT GRFEILS
Sbjct: 86 HILEVSSGCDVFESVATYARKRQRGICVLSGSGTVTNVTLRQPAAAGAVVTLHGRFEILS 145
Query: 215 LSGSFL 220
LSGSFL
Sbjct: 146 LSGSFL 151
>gi|388500788|gb|AFK38460.1| unknown [Medicago truncatula]
Length = 269
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 33/66 (50%), Positives = 47/66 (71%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILS 214
H++ V +G DV + ++++ R +C+LS +G ++NVTLRQ A +G VT GRFEILS
Sbjct: 87 HILEVSSGCDVFDSVATYARKRQRGICVLSGSGTVTNVTLRQPAAAGSVVTLHGRFEILS 146
Query: 215 LSGSFL 220
LSGSFL
Sbjct: 147 LSGSFL 152
>gi|15235815|ref|NP_194012.1| putative AT-hook DNA-binding family protein [Arabidopsis thaliana]
gi|2827558|emb|CAA16566.1| putative DNA binding protein [Arabidopsis thaliana]
gi|7269128|emb|CAB79236.1| putative DNA binding protein [Arabidopsis thaliana]
gi|110738517|dbj|BAF01184.1| putative DNA binding protein [Arabidopsis thaliana]
gi|119657392|tpd|FAA00295.1| TPA: AT-hook motif nuclear localized protein 24 [Arabidopsis
thaliana]
gi|225898801|dbj|BAH30531.1| hypothetical protein [Arabidopsis thaliana]
gi|332659260|gb|AEE84660.1| putative AT-hook DNA-binding family protein [Arabidopsis thaliana]
Length = 324
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 57/146 (39%), Positives = 81/146 (55%), Gaps = 6/146 (4%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + HV+ + G D+ + +F++ R VC++S
Sbjct: 105 RRPRGRPAGSKNKPKPPIIITRDSANALRTHVMEIGDGCDLVESVATFARRRQRGVCVMS 164
Query: 185 ANGAISNVTLRQAATS---GGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGP 241
G ++NVT+RQ + G V+ GRFEILSLSGSFL + GLSV L+G
Sbjct: 165 GTGNVTNVTIRQPGSHPSPGSVVSLHGRFEILSLSGSFLPPPAP---PTATGLSVYLAGG 221
Query: 242 DGRVLGGSVAGLLTAATPVQVVVGSF 267
G+V+GGSV G L A PV V+ SF
Sbjct: 222 QGQVVGGSVVGPLLCAGPVVVMAASF 247
>gi|356531844|ref|XP_003534486.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Glycine max]
Length = 270
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 33/66 (50%), Positives = 46/66 (69%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILS 214
H++ V G DV + ++++ R +C+LS +G ++NVTLRQ A +G VT GRFEILS
Sbjct: 86 HILEVSTGCDVFESVATYARKRQRGICVLSGSGTVTNVTLRQPAAAGAVVTLHGRFEILS 145
Query: 215 LSGSFL 220
LSGSFL
Sbjct: 146 LSGSFL 151
>gi|326532560|dbj|BAK05209.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 285
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 48/144 (33%), Positives = 79/144 (54%), Gaps = 6/144 (4%)
Query: 124 IKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCIL 183
+++ RGRP GS + K + + H++ V +G DV++ + +++ R VC+L
Sbjct: 77 LRRPRGRPMGSKNKPKPPIIITRDSPDALHSHILEVASGADVAACVAEYARRRGRGVCVL 136
Query: 184 SANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDG 243
A+G++ +V +R AA GRFE+LS++G+ L + + S GL+V LS G
Sbjct: 137 GASGSVVDVVVRGAAAPA---PLPGRFELLSMTGTVLPPPAPSEAS---GLAVMLSAGQG 190
Query: 244 RVLGGSVAGLLTAATPVQVVVGSF 267
+VLGG V G L AA V + +F
Sbjct: 191 QVLGGCVVGPLVAAGTVTLFAATF 214
>gi|255647626|gb|ACU24276.1| unknown [Glycine max]
Length = 254
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 33/66 (50%), Positives = 46/66 (69%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILS 214
H++ V G DV + ++++ R +C+LS +G ++NVTLRQ A +G VT GRFEILS
Sbjct: 86 HILEVSTGCDVFESVATYARKRQRGICVLSGSGTVTNVTLRQPAAAGAVVTLHGRFEILS 145
Query: 215 LSGSFL 220
LSGSFL
Sbjct: 146 LSGSFL 151
>gi|110740456|dbj|BAF02122.1| putative DNA binding protein [Arabidopsis thaliana]
Length = 324
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 57/146 (39%), Positives = 81/146 (55%), Gaps = 6/146 (4%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + HV+ + G D+ + +F++ R VC++S
Sbjct: 105 RRPRGRPAGSKNKPKPPIIITRDSANALRTHVMEIGDGCDLVESVATFARRRQRGVCVMS 164
Query: 185 ANGAISNVTLRQAATS---GGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGP 241
G ++NVT+RQ + G V+ GRFEILSLSGSFL + GLSV L+G
Sbjct: 165 GTGNVTNVTIRQPGSHPSPGSVVSLHGRFEILSLSGSFLPPPAP---PTATGLSVYLAGG 221
Query: 242 DGRVLGGSVAGLLTAATPVQVVVGSF 267
G+V+GGSV G L A PV V+ SF
Sbjct: 222 QGQVVGGSVVGPLLCAGPVVVMAASF 247
>gi|115452163|ref|NP_001049682.1| Os03g0270000 [Oryza sativa Japonica Group]
gi|29893608|gb|AAP06862.1| hypothetical protein [Oryza sativa Japonica Group]
gi|29893674|gb|AAP06928.1| unknown protein [Oryza sativa Japonica Group]
gi|108707407|gb|ABF95202.1| DNA-binding protein, putative, expressed [Oryza sativa Japonica
Group]
gi|113548153|dbj|BAF11596.1| Os03g0270000 [Oryza sativa Japonica Group]
gi|125543266|gb|EAY89405.1| hypothetical protein OsI_10910 [Oryza sativa Indica Group]
gi|215692598|dbj|BAG88018.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215741049|dbj|BAG97544.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 258
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 51/114 (44%), Positives = 72/114 (63%), Gaps = 4/114 (3%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQ-AATSGGTVTYEGRFEIL 213
HV+ + +G D+ I FS+ R V +LS +GA++NVTLRQ A T V GRFEIL
Sbjct: 75 HVLEIASGADIVEAIAGFSRRRQRGVSVLSGSGAVTNVTLRQPAGTGAAAVALRGRFEIL 134
Query: 214 SLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
S+SG+FL + + + GL+V L+G G+V+GGSV G L A+ PV V+ +F
Sbjct: 135 SMSGAFLPAPAPPGAT---GLAVYLAGGQGQVVGGSVMGELIASGPVMVIAATF 185
>gi|297823323|ref|XP_002879544.1| hypothetical protein ARALYDRAFT_482492 [Arabidopsis lyrata subsp.
lyrata]
gi|297325383|gb|EFH55803.1| hypothetical protein ARALYDRAFT_482492 [Arabidopsis lyrata subsp.
lyrata]
Length = 287
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 59/164 (35%), Positives = 88/164 (53%), Gaps = 5/164 (3%)
Query: 124 IKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCIL 183
+++ RGRP GS + K + + H++ V +G DV I ++++ R +C+L
Sbjct: 77 MRRPRGRPAGSKNKPKPPVIVTRESANTLRAHILEVGSGCDVFECISTYARRRQRGICVL 136
Query: 184 SANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSES-SGQRSRTGGLSVSLSGPD 242
S G ++NV++RQ +G VT G FEILSLSGSFL + G S L++ L+G
Sbjct: 137 SGTGTVTNVSIRQPTAAGAVVTLRGTFEILSLSGSFLPPPAPPGATS----LTIFLAGAQ 192
Query: 243 GRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPV 286
G+V+GG+V G L AA PV V+ SF + E L V
Sbjct: 193 GQVVGGNVVGELMAAGPVMVMAASFTNVAYERLPLDEHEEHLQV 236
>gi|15226945|ref|NP_181070.1| putative AT-hook DNA-binding protein [Arabidopsis thaliana]
gi|3668079|gb|AAC61811.1| putative AT-hook DNA-binding protein [Arabidopsis thaliana]
gi|119657386|tpd|FAA00292.1| TPA: AT-hook motif nuclear localized protein 21 [Arabidopsis
thaliana]
gi|330253994|gb|AEC09088.1| putative AT-hook DNA-binding protein [Arabidopsis thaliana]
Length = 285
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 56/145 (38%), Positives = 84/145 (57%), Gaps = 5/145 (3%)
Query: 124 IKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCIL 183
+++ RGRP GS + K + + H++ V +G DV I ++++ R +C+L
Sbjct: 77 MRRPRGRPAGSKNKPKPPVIVTRESANTLRAHILEVGSGCDVFECISTYARRRQRGICVL 136
Query: 184 SANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSES-SGQRSRTGGLSVSLSGPD 242
S G ++NV++RQ +G VT G FEILSLSGSFL + G S L++ L+G
Sbjct: 137 SGTGTVTNVSIRQPTAAGAVVTLRGTFEILSLSGSFLPPPAPPGATS----LTIFLAGAQ 192
Query: 243 GRVLGGSVAGLLTAATPVQVVVGSF 267
G+V+GG+V G L AA PV V+ SF
Sbjct: 193 GQVVGGNVVGELMAAGPVMVMAASF 217
>gi|296084128|emb|CBI24516.3| unnamed protein product [Vitis vinifera]
Length = 970
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 40/67 (59%), Positives = 48/67 (71%)
Query: 179 AVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSL 238
A+ ILSANGAI NV L Q +S GT+T EG FEI SGS + +ES GQR G+S+SL
Sbjct: 857 AIFILSANGAILNVNLHQPNSSVGTLTNEGHFEIFPWSGSCMPTESRGQRGDLAGMSISL 916
Query: 239 SGPDGRV 245
+GPDGRV
Sbjct: 917 AGPDGRV 923
>gi|449496318|ref|XP_004160103.1| PREDICTED: uncharacterized LOC101216092 [Cucumis sativus]
Length = 155
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 52/149 (34%), Positives = 79/149 (53%), Gaps = 20/149 (13%)
Query: 122 DSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVC 181
D + K + + SG+ G A G++ V G ++ ++I +FS R VC
Sbjct: 15 DDVDKEKRKERRIFSGR-------GRAIAGYSRRV-----GLNIVNRISNFSVPRSRTVC 62
Query: 182 ILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGP 241
I+SA G +S++ + + T+ +EG FEIL LSG G R +++S S
Sbjct: 63 IISAVGLVSSIIIHDPNSVASTLKFEGTFEILQLSG----WSHEGDDIRL--MTISFSKL 116
Query: 242 DGR--VLGGSVAGLLTAATPVQVVVGSFL 268
DGR V GG+VA L AATPVQ+++GSF+
Sbjct: 117 DGRNQVFGGAVASSLIAATPVQIIMGSFI 145
>gi|357507933|ref|XP_003624255.1| AT-hook DNA-binding protein [Medicago truncatula]
gi|355499270|gb|AES80473.1| AT-hook DNA-binding protein [Medicago truncatula]
Length = 316
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 33/66 (50%), Positives = 47/66 (71%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILS 214
H++ V +G DV + ++++ R +C+LS +G ++NVTLRQ A +G VT GRFEILS
Sbjct: 134 HILEVSSGCDVFDSVATYARKRQRGICVLSGSGTVTNVTLRQPAAAGSVVTLHGRFEILS 193
Query: 215 LSGSFL 220
LSGSFL
Sbjct: 194 LSGSFL 199
>gi|125561386|gb|EAZ06834.1| hypothetical protein OsI_29071 [Oryza sativa Indica Group]
Length = 236
Score = 73.9 bits (180), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 51/113 (45%), Positives = 69/113 (61%), Gaps = 3/113 (2%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILS 214
HV+ V G DV+ I F++ R VC+LS G +++V LRQ A V GRFEILS
Sbjct: 67 HVMEVAGGADVAESIAHFARRRQRGVCVLSGAGTVTDVALRQPAAPSAVVALRGRFEILS 126
Query: 215 LSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
L+G+FL + + GL+V L+G G+V+GGSV G LTAA PV V+ +F
Sbjct: 127 LTGTFLPGPAPPGST---GLTVYLAGGQGQVVGGSVVGTLTAAGPVMVIASTF 176
>gi|357476665|ref|XP_003608618.1| AT-hook DNA-binding protein [Medicago truncatula]
gi|355509673|gb|AES90815.1| AT-hook DNA-binding protein [Medicago truncatula]
Length = 285
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 47/66 (71%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILS 214
H++ V G DV + ++++ R +C+LS +G ++NV++RQ A +GG VT GRFEILS
Sbjct: 95 HILEVAGGSDVFECVSTYARRRQRGICVLSGSGTVTNVSIRQPAAAGGVVTLHGRFEILS 154
Query: 215 LSGSFL 220
LSGSFL
Sbjct: 155 LSGSFL 160
>gi|255539338|ref|XP_002510734.1| DNA binding protein, putative [Ricinus communis]
gi|223551435|gb|EEF52921.1| DNA binding protein, putative [Ricinus communis]
Length = 289
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 40/97 (41%), Positives = 57/97 (58%), Gaps = 1/97 (1%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + HV+ V G D+ + +++ R VC+LS
Sbjct: 71 RRPRGRPAGSKNKPKPPIIVTRDSPNALRSHVLEVSTGSDIMESVSIYARKRGRGVCVLS 130
Query: 185 ANGAISNVTLRQAAT-SGGTVTYEGRFEILSLSGSFL 220
NG ++NVTLRQ A+ +G VT GRFEILSLSG+ L
Sbjct: 131 GNGTVANVTLRQPASPAGSVVTLHGRFEILSLSGTVL 167
>gi|449462059|ref|XP_004148759.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Cucumis
sativus]
Length = 248
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 53/114 (46%), Positives = 76/114 (66%), Gaps = 6/114 (5%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILS 214
H++ V +G DV + S+++ R +CILS +G ++NV LRQ A + G +T +GRFEILS
Sbjct: 77 HILEVGSGCDVFDCVASYARRRQRGICILSGSGNVTNVGLRQPA-AAGVLTLQGRFEILS 135
Query: 215 LSGSFLLSES-SGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
LSGSFL + G S L++ L+G G+V+GG+VAG LTAA PV ++ SF
Sbjct: 136 LSGSFLPPPAPPGATS----LTIFLAGGQGQVVGGTVAGELTAAGPVILIAASF 185
>gi|449511147|ref|XP_004163877.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Cucumis
sativus]
Length = 248
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 53/114 (46%), Positives = 76/114 (66%), Gaps = 6/114 (5%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILS 214
H++ V +G DV + S+++ R +CILS +G ++NV LRQ A + G +T +GRFEILS
Sbjct: 77 HILEVGSGCDVFDCVASYARRRQRGICILSGSGNVTNVGLRQPA-AAGVLTLQGRFEILS 135
Query: 215 LSGSFLLSES-SGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
LSGSFL + G S L++ L+G G+V+GG+VAG LTAA PV ++ SF
Sbjct: 136 LSGSFLPPPAPPGATS----LTIFLAGGQGQVVGGTVAGELTAAGPVILIAASF 185
>gi|357131729|ref|XP_003567487.1| PREDICTED: uncharacterized protein LOC100822741 [Brachypodium
distachyon]
Length = 283
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 53/168 (31%), Positives = 87/168 (51%), Gaps = 13/168 (7%)
Query: 126 KSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSA 185
++RGRPPGS + K + + P V+ + G DV+ + +F++ V +L
Sbjct: 30 RARGRPPGSRNKPKPPVIVTRESAAAMRPVVLELAPGCDVAGAVAAFARRRGLGVSVLCG 89
Query: 186 NGAISNVTLR------QAATSGGTVTYEGRFEILSLSGSFL--LSESSGQRSRTGGLSVS 237
GA+ + LR +AA +G V +GR E+L++SG+ L S SS + V+
Sbjct: 90 RGAVCAIALRLASAAPEAAGNGHVVRLQGRLEVLTMSGTVLPSSSSSSAPAAPPPPFVVT 149
Query: 238 LSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLP 285
+G +GRV+GG++AG +TAA VVV + D +HR+ + P
Sbjct: 150 FAGENGRVIGGTLAGEMTAAEDGVVVVAATFKD-----PETHRLPAAP 192
>gi|18396925|ref|NP_566232.1| AT-hook motif nuclear-localized protein 19 [Arabidopsis thaliana]
gi|6175162|gb|AAF04888.1|AC011437_3 hypothetical protein [Arabidopsis thaliana]
gi|21553701|gb|AAM62794.1| putative DNA-binding protein [Arabidopsis thaliana]
gi|29028876|gb|AAO64817.1| At3g04570 [Arabidopsis thaliana]
gi|110736382|dbj|BAF00160.1| hypothetical protein [Arabidopsis thaliana]
gi|119657382|tpd|FAA00290.1| TPA: AT-hook motif nuclear localized protein 19 [Arabidopsis
thaliana]
gi|332640577|gb|AEE74098.1| AT-hook motif nuclear-localized protein 19 [Arabidopsis thaliana]
Length = 315
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 56/151 (37%), Positives = 85/151 (56%), Gaps = 11/151 (7%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + HV+ + +G DV + +F++ R +CILS
Sbjct: 80 RRPRGRPAGSKNKPKPPIFVTRDSPNALKSHVMEIASGTDVIETLATFARRRQRGICILS 139
Query: 185 ANGAISNVTLRQAAT--------SGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSV 236
NG ++NVTLRQ +T + +GRFEILSL+GSFL + + GL++
Sbjct: 140 GNGTVANVTLRQPSTAAVAAAPGGAAVLALQGRFEILSLTGSFLPGPAPPGST---GLTI 196
Query: 237 SLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
L+G G+V+GGSV G L AA PV ++ +F
Sbjct: 197 YLAGGQGQVVGGSVVGPLMAAGPVMLIAATF 227
>gi|297828962|ref|XP_002882363.1| hypothetical protein ARALYDRAFT_477713 [Arabidopsis lyrata subsp.
lyrata]
gi|297328203|gb|EFH58622.1| hypothetical protein ARALYDRAFT_477713 [Arabidopsis lyrata subsp.
lyrata]
Length = 314
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 56/151 (37%), Positives = 85/151 (56%), Gaps = 11/151 (7%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + HV+ + +G DV + +F++ R +CILS
Sbjct: 80 RRPRGRPAGSKNKPKPPIFVTRDSPNALKSHVMEIASGTDVIETLATFARRRQRGICILS 139
Query: 185 ANGAISNVTLRQAAT--------SGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSV 236
NG ++NVTLRQ +T + +GRFEILSL+GSFL + + GL++
Sbjct: 140 GNGTVANVTLRQPSTAAVAAAPGGAAVLALQGRFEILSLTGSFLPGPAPPGST---GLTI 196
Query: 237 SLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
L+G G+V+GGSV G L AA PV ++ +F
Sbjct: 197 YLAGGQGQVVGGSVVGPLMAAGPVMLIAATF 227
>gi|297726533|ref|NP_001175630.1| Os08g0478466 [Oryza sativa Japonica Group]
gi|42407866|dbj|BAD09008.1| DNA-binding protein-like [Oryza sativa Japonica Group]
gi|255678532|dbj|BAH94358.1| Os08g0478466 [Oryza sativa Japonica Group]
Length = 324
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 50/150 (33%), Positives = 77/150 (51%), Gaps = 20/150 (13%)
Query: 125 KKSRGRPPGSGSGKK-------HQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGP 177
K+ RGRPPGS + K A +A HV+ + G DV+ + +++
Sbjct: 67 KRRRGRPPGSKNKPKPPVVVTREAAAAEPAAAAAMRSHVLEIPGGGDVAGALAGYARRRG 126
Query: 178 RAVCILSANGAISNVTLRQ-----------AATSGGTVTYEGRFEILSLSGSFLLSESSG 226
+C+L+ GA++NV+LR + V + GR+EILS+S +FL +
Sbjct: 127 LGICVLAGTGAVANVSLRHPLPSGAAAEIGGGAAAAVVVFHGRYEILSISATFLPPAMAA 186
Query: 227 QRSRT--GGLSVSLSGPDGRVLGGSVAGLL 254
R GGLS+SL+GP G++ GG+VAG L
Sbjct: 187 AAPRAALGGLSISLAGPHGQIFGGAVAGPL 216
>gi|115477857|ref|NP_001062524.1| Os08g0563200 [Oryza sativa Japonica Group]
gi|42408442|dbj|BAD09624.1| putative SAP1 protein [Oryza sativa Japonica Group]
gi|113624493|dbj|BAF24438.1| Os08g0563200 [Oryza sativa Japonica Group]
gi|215766739|dbj|BAG98967.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 235
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/113 (45%), Positives = 69/113 (61%), Gaps = 3/113 (2%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILS 214
HV+ V G DV+ I F++ R VC+LS G +++V LRQ A V GRFEILS
Sbjct: 65 HVMEVAGGADVAESIAHFARRRQRGVCVLSGAGTVTDVALRQPAAPSAVVALRGRFEILS 124
Query: 215 LSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
L+G+FL + + GL+V L+G G+V+GGSV G LTAA PV V+ +F
Sbjct: 125 LTGTFLPGPAPPGST---GLTVYLAGGQGQVVGGSVVGTLTAAGPVMVIASTF 174
>gi|224083372|ref|XP_002307001.1| predicted protein [Populus trichocarpa]
gi|222856450|gb|EEE93997.1| predicted protein [Populus trichocarpa]
Length = 157
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 59/144 (40%), Positives = 85/144 (59%), Gaps = 4/144 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + HVI + G D+ + ++++ R VC+LS
Sbjct: 1 RRPRGRPAGSKNKPKPPIIVTRDSPNALRSHVIEISNGADIVESVSTYARKRGRGVCVLS 60
Query: 185 ANGAISNVTLRQAATSGGTV-TYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDG 243
+G ++NVTLRQ A+ G+V T GRFEILSLSG+ L + GGLS+ LSG G
Sbjct: 61 GSGTVANVTLRQPASPAGSVLTLHGRFEILSLSGTVLPPPAPPG---AGGLSIFLSGGQG 117
Query: 244 RVLGGSVAGLLTAATPVQVVVGSF 267
+V+GG+V G L AA PV ++ SF
Sbjct: 118 QVVGGNVVGPLMAAGPVVLMAASF 141
>gi|119331590|gb|ABL63121.1| AT-hook DNA-binding protein [Catharanthus roseus]
Length = 302
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 53/113 (46%), Positives = 72/113 (63%), Gaps = 3/113 (2%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILS 214
HV+ + G DV I +F+ R V +LS +G ++NV+LRQ A GG VT GRFEILS
Sbjct: 117 HVLEISNGSDVVECISTFALRRHRGVSVLSGSGIVNNVSLRQPAAPGGVVTLHGRFEILS 176
Query: 215 LSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
LSGSFL + S + GL+V L+G G+V+GG+V G L A+ PV V+ +F
Sbjct: 177 LSGSFLPAPSPPGAT---GLTVYLAGGQGQVVGGTVVGSLVASGPVMVIAATF 226
>gi|388500298|gb|AFK38215.1| unknown [Lotus japonicus]
Length = 138
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 38/62 (61%), Positives = 47/62 (75%), Gaps = 1/62 (1%)
Query: 228 RSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKS-SHRMESLPV 286
+SR+GG+SVSL+GPDGRV+GG +AGLL AA PVQVVVGSFL E K+ HR+E +
Sbjct: 9 KSRSGGMSVSLAGPDGRVMGGGLAGLLIAAGPVQVVVGSFLPGHHLEHKAKKHRVEHVST 68
Query: 287 PP 288
P
Sbjct: 69 IP 70
>gi|253761229|ref|XP_002489068.1| hypothetical protein SORBIDRAFT_0169s002010 [Sorghum bicolor]
gi|241947183|gb|EES20328.1| hypothetical protein SORBIDRAFT_0169s002010 [Sorghum bicolor]
Length = 199
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/113 (45%), Positives = 68/113 (60%), Gaps = 3/113 (2%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILS 214
HV+ V G DV+ I FS+ R VC+LS G ++NV LRQ + V GRFEILS
Sbjct: 20 HVMEVAGGADVADAIAQFSRRRQRGVCVLSGAGTVANVALRQPSAPTAVVALHGRFEILS 79
Query: 215 LSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
L+G+FL + + GL+V L+G G+V+GGSV G L AA PV V+ +F
Sbjct: 80 LTGTFLPGPAPPGST---GLTVYLAGGQGQVVGGSVVGTLIAAGPVMVIASTF 129
>gi|413917337|gb|AFW57269.1| hypothetical protein ZEAMMB73_059217, partial [Zea mays]
Length = 130
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 40/98 (40%), Positives = 55/98 (56%), Gaps = 6/98 (6%)
Query: 129 GRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGA 188
GRPPGS + K + + H++ V AG DV + ++++ R VC+LSA G
Sbjct: 28 GRPPGSKNKPKPPVIITRESANALRAHILEVAAGCDVFEALTAYARRRQRGVCVLSAAGT 87
Query: 189 ISNVTLRQAATSG------GTVTYEGRFEILSLSGSFL 220
++NVTLRQ +S T GRFEILSL+GSFL
Sbjct: 88 VANVTLRQPQSSQAGPASPAVATLHGRFEILSLAGSFL 125
>gi|326494838|dbj|BAJ94538.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 285
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 49/105 (46%), Positives = 65/105 (61%), Gaps = 11/105 (10%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLR--QAATSG----GTVTYEG 208
H++ V AG DV + ++++ R VC+LSA GA++NVTLR Q+A SG T G
Sbjct: 88 HILEVAAGCDVFEALTAYARRRQRGVCVLSAAGAVTNVTLRQPQSAQSGPGSPAVATLHG 147
Query: 209 RFEILSLSGSFLLSES-SGQRSRTGGLSVSLSGPDGRVLGGSVAG 252
RFEILSL+GSFL + G S LS L+ G+V+GGSVAG
Sbjct: 148 RFEILSLAGSFLPPPAPPGATS----LSAFLARGQGQVVGGSVAG 188
>gi|224102185|ref|XP_002312579.1| predicted protein [Populus trichocarpa]
gi|222852399|gb|EEE89946.1| predicted protein [Populus trichocarpa]
Length = 167
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 49/111 (44%), Positives = 70/111 (63%), Gaps = 3/111 (2%)
Query: 157 ITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLS 216
+ + G DV+ + F++ R VC+LS +G+++NVTLRQ A G V GRFEILSL+
Sbjct: 1 MEIAGGADVAESVAQFARRRQRGVCVLSGSGSVANVTLRQPAAPGAVVALHGRFEILSLT 60
Query: 217 GSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
G+FL + + GL+V L+G G+V+GGSV G L AA PV V+ +F
Sbjct: 61 GAFLPGPAPPGST---GLTVYLAGGQGQVVGGSVVGSLVAAGPVMVIAATF 108
>gi|68160564|gb|AAY86771.1| putative DNA-binding protein [Noccaea caerulescens]
Length = 312
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 56/151 (37%), Positives = 84/151 (55%), Gaps = 11/151 (7%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + HV+ + +G DV + +F++ R +CILS
Sbjct: 81 RRPRGRPAGSKNKPKPPIFVTRDSPNALKSHVMEIASGTDVIETLATFARRRQRGICILS 140
Query: 185 ANGAISNVTLRQ--------AATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSV 236
NG ++NVTLRQ A + +GRFEILSL+GSFL + + GL++
Sbjct: 141 GNGTVANVTLRQPSSAAVAAAPGGAAVLALQGRFEILSLTGSFLPGPAP---PGSTGLTI 197
Query: 237 SLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
L+G G+V+GGSV G L AA PV ++ +F
Sbjct: 198 YLAGGQGQVVGGSVVGPLMAAGPVMLIAATF 228
>gi|413938537|gb|AFW73088.1| hypothetical protein ZEAMMB73_437326 [Zea mays]
Length = 324
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 64/176 (36%), Positives = 86/176 (48%), Gaps = 13/176 (7%)
Query: 95 PSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTP 154
PS S++ GG + S+ +P + AL S
Sbjct: 64 PSSSAMVPVEGGGSGSGTGGTPTRRPRGRPPGSKNKPKPPIIVTRDSPNALHS------- 116
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQ--AATSGGTV-TYEGRFE 211
HV+ V AG DV + +++ R VC+LS GA+ NV LRQ A+ G V T GRFE
Sbjct: 117 HVLEVAAGADVVDCVAEYARRRGRGVCVLSGGGAVVNVALRQPGASPPGSMVATLRGRFE 176
Query: 212 ILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
ILSL+G+ L + S GL+V LSG G+V+GGSV G L AA PV ++ SF
Sbjct: 177 ILSLTGTVLPPPAPPGAS---GLTVFLSGGQGQVIGGSVVGPLVAAGPVVLMAASF 229
>gi|357134112|ref|XP_003568662.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Brachypodium
distachyon]
Length = 321
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 51/114 (44%), Positives = 72/114 (63%), Gaps = 4/114 (3%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQ-AATSGGTVTYEGRFEIL 213
HV+ + +G D+ + +F++ R V +LS +G + NVTLRQ AA G VT GRFEIL
Sbjct: 119 HVLEIASGADIMDAVATFARRRQRGVSVLSGSGVVGNVTLRQPAAPPGAVVTLHGRFEIL 178
Query: 214 SLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
SLSG+FL S + GL+V L+G G+V+GG+V G L A+ P+ VV +F
Sbjct: 179 SLSGAFLPSPCPPGAT---GLAVYLAGGQGQVVGGTVVGELVASGPIMVVAATF 229
>gi|224065637|ref|XP_002301896.1| predicted protein [Populus trichocarpa]
gi|222843622|gb|EEE81169.1| predicted protein [Populus trichocarpa]
Length = 158
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 59/143 (41%), Positives = 86/143 (60%), Gaps = 4/143 (2%)
Query: 126 KSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSA 185
+ RGRP GS + K + + HV+ V +G D+ + ++++ VC+LS
Sbjct: 3 RPRGRPAGSKNKPKPPIIVTRDSPNALRSHVLEVSSGADIVESVSNYARKRGIGVCVLSG 62
Query: 186 NGAISNVTLRQAATSGGTV-TYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
+G+++NVTLRQ A+ G+V T GRFEILSLSG+ L + GGLS+ LSG G+
Sbjct: 63 SGSVANVTLRQPASPAGSVLTLHGRFEILSLSGTVLPPPAPPG---AGGLSIFLSGGQGQ 119
Query: 245 VLGGSVAGLLTAATPVQVVVGSF 267
V+GG+V GLL AA PV ++ SF
Sbjct: 120 VVGGNVVGLLMAAGPVVLMAASF 142
>gi|414869929|tpg|DAA48486.1| TPA: hypothetical protein ZEAMMB73_759309 [Zea mays]
Length = 294
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 46/136 (33%), Positives = 74/136 (54%), Gaps = 15/136 (11%)
Query: 123 SIKKSRGRPPGSGSGKKHQL--------EALGSAGVGFTPHVITVKAGEDVSSKIMSFSQ 174
+ K+ RGRPPGS + K Q + A PHV+ V +G DV+ + F++
Sbjct: 52 AWKRRRGRPPGSKNKPKPQAAAAAAAVARDVEPASSAMRPHVLEVPSGGDVARALAGFAR 111
Query: 175 NGPRAVCILSANGAISNVTLRQAATS-----GGTVTYEGRFEILSLSGSFLL--SESSGQ 227
+C+L+ GA+++V+LR ++S G + GR+EILS+S +FL + ++
Sbjct: 112 RRGLGICVLAGTGAVADVSLRHPSSSADGAGGSAAVFRGRYEILSISATFLAPSTPAAVA 171
Query: 228 RSRTGGLSVSLSGPDG 243
R+ LSVSL+GP G
Sbjct: 172 RATVRDLSVSLAGPHG 187
>gi|356499354|ref|XP_003518506.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Glycine max]
Length = 248
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 37/99 (37%), Positives = 56/99 (56%)
Query: 122 DSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVC 181
D++++ RGRP GS + K + + H + V +G DV+ + +F++ R +
Sbjct: 40 DTLRRPRGRPAGSKNKPKPPIIVTRDSANALKAHAMEVSSGCDVNESLSNFARRKQRGLY 99
Query: 182 ILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
I + G ++NVTL Q +SG VT GRFEILSL GS L
Sbjct: 100 IFNGTGCVTNVTLCQPGSSGAIVTLHGRFEILSLLGSIL 138
>gi|449443241|ref|XP_004139388.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Cucumis
sativus]
gi|449483112|ref|XP_004156496.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Cucumis
sativus]
Length = 293
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 47/113 (41%), Positives = 72/113 (63%), Gaps = 3/113 (2%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILS 214
HV+ + G D++ + F++ R V +LS +G ++NVTLRQ + G + +GRFEILS
Sbjct: 100 HVMEISNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILS 159
Query: 215 LSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
L+G+FL + + GL++ L+G G+V+GGSV G LTAA PV V+ +F
Sbjct: 160 LTGTFLPGPAPPGST---GLTIYLAGGQGQVVGGSVVGPLTAAGPVMVIAATF 209
>gi|297724797|ref|NP_001174762.1| Os06g0326900 [Oryza sativa Japonica Group]
gi|50725742|dbj|BAD33253.1| DNA-binding protein-like [Oryza sativa Japonica Group]
gi|50725981|dbj|BAD33507.1| DNA-binding protein-like [Oryza sativa Japonica Group]
gi|215768965|dbj|BAH01194.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255677005|dbj|BAH93490.1| Os06g0326900 [Oryza sativa Japonica Group]
Length = 322
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 60/147 (40%), Positives = 84/147 (57%), Gaps = 6/147 (4%)
Query: 124 IKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCIL 183
+++ RGRP GS + K + + F HV+ V AG D+ + F++ R V +L
Sbjct: 82 MRRPRGRPLGSKNKPKPPIIVTRDSPNAFHSHVLEVAAGTDIVECVCEFARRRGRGVSVL 141
Query: 184 SANGAISNVTLRQ--AATSGGTV-TYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSG 240
S GA++NV LRQ A+ G V T G+FEILSL+G+ L + GL+V LSG
Sbjct: 142 SGGGAVANVALRQPGASPPGSLVATMRGQFEILSLTGTVLPPPAP---PSASGLTVFLSG 198
Query: 241 PDGRVLGGSVAGLLTAATPVQVVVGSF 267
G+V+GGSVAG L AA PV ++ SF
Sbjct: 199 GQGQVVGGSVAGQLIAAGPVFLMAASF 225
>gi|125555146|gb|EAZ00752.1| hypothetical protein OsI_22779 [Oryza sativa Indica Group]
Length = 324
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 60/147 (40%), Positives = 84/147 (57%), Gaps = 6/147 (4%)
Query: 124 IKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCIL 183
+++ RGRP GS + K + + F HV+ V AG D+ + F++ R V +L
Sbjct: 84 MRRPRGRPLGSKNKPKPPIIVTRDSPNAFHSHVLEVAAGTDIVECVCEFARRRGRGVSVL 143
Query: 184 SANGAISNVTLRQ--AATSGGTV-TYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSG 240
S GA++NV LRQ A+ G V T G+FEILSL+G+ L + GL+V LSG
Sbjct: 144 SGGGAVANVALRQPGASPPGSLVATMRGQFEILSLTGTVLPPPAP---PSASGLTVFLSG 200
Query: 241 PDGRVLGGSVAGLLTAATPVQVVVGSF 267
G+V+GGSVAG L AA PV ++ SF
Sbjct: 201 GQGQVVGGSVAGQLIAAGPVFLMAASF 227
>gi|224059721|ref|XP_002299979.1| predicted protein [Populus trichocarpa]
gi|222847237|gb|EEE84784.1| predicted protein [Populus trichocarpa]
Length = 172
Score = 70.9 bits (172), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 43/140 (30%), Positives = 71/140 (50%), Gaps = 12/140 (8%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
+K RGRPPGS + K + P ++ + AG D+ I++F++ + ++S
Sbjct: 4 RKPRGRPPGSKNRPKPPIIITKDCESSMKPVILEISAGSDIIETIINFARRNHAGISVMS 63
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQ------------RSRTG 232
ANG++SNVTL + +++ G F +L+L GSF+ S +S +
Sbjct: 64 ANGSVSNVTLSHPVSHAPSLSLHGPFNLLALFGSFVGSFASNKVPCASSSSSPGSVYSCS 123
Query: 233 GLSVSLSGPDGRVLGGSVAG 252
+SL+G G+V GG VAG
Sbjct: 124 SFGISLAGAQGQVFGGIVAG 143
>gi|449442723|ref|XP_004139130.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Cucumis
sativus]
gi|449530311|ref|XP_004172139.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Cucumis
sativus]
Length = 277
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 46/141 (32%), Positives = 72/141 (51%), Gaps = 11/141 (7%)
Query: 123 SIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCI 182
++KK RGRPPGS + K + P VI + AG DV ++ F++ + +
Sbjct: 51 TMKKPRGRPPGSKNKPKPPIVITKENESSMKPVVIEISAGNDVVDTLLHFARKRHVGLTV 110
Query: 183 LSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSES-----------SGQRSRT 231
LS +G++SNVTLR + +++ G F ++SLSGSFL + + S S +
Sbjct: 111 LSGSGSVSNVTLRHPMSHSTSLSLHGPFSLVSLSGSFLANTTPFSSKPHSLSPSPSPSPS 170
Query: 232 GGLSVSLSGPDGRVLGGSVAG 252
+ L+G G+V GG V G
Sbjct: 171 SSFGICLAGAQGQVFGGIVGG 191
>gi|413920023|gb|AFW59955.1| hypothetical protein ZEAMMB73_895910 [Zea mays]
gi|413920024|gb|AFW59956.1| hypothetical protein ZEAMMB73_895910 [Zea mays]
Length = 297
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 46/100 (46%), Positives = 55/100 (55%), Gaps = 13/100 (13%)
Query: 70 SEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRG 129
+EP+KRKRGRPRKYGPDGTM + + G +G + S G + S KK RG
Sbjct: 105 TEPVKRKRGRPRKYGPDGTMKQQQL---VAAQPRIGPSGPNMISSAG--IEDSSQKKRRG 159
Query: 130 RPPGSGSGKKHQLEA------LGSAGVGFTPHVITVKAGE 163
RPP G+ KKHQ GSAG FTPH+IT E
Sbjct: 160 RPP--GTAKKHQPSPSQGNAFAGSAGTSFTPHIITASPSE 197
>gi|413919173|gb|AFW59105.1| hypothetical protein ZEAMMB73_384381 [Zea mays]
Length = 230
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 43/89 (48%), Positives = 57/89 (64%), Gaps = 8/89 (8%)
Query: 68 SGSEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKS 127
SG + +K+KRGRPRKYGPDG++ L L + + VT ATG + GGG +P+ K
Sbjct: 119 SGGDLVKKKRGRPRKYGPDGSIGLGLKTAAAGVTEATG------AQSGGGGSTPNPDGKR 172
Query: 128 RGRPPGSGSGKKHQLEALGSAGVGFTPHV 156
RGRPP GSGKK QL+ALG+ P++
Sbjct: 173 RGRPP--GSGKKKQLDALGNIACSPAPYL 199
>gi|89274231|gb|ABD65635.1| hypothetical protein 23.t00073 [Brassica oleracea]
Length = 292
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 30/66 (45%), Positives = 47/66 (71%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILS 214
H++ V +G DV + ++++ R +C+LS +G ++NVT+RQ + +G VT +G FEILS
Sbjct: 111 HILEVTSGCDVFDCVATYARRRQRGICVLSGSGTVTNVTIRQPSAAGAVVTLQGTFEILS 170
Query: 215 LSGSFL 220
LSGSFL
Sbjct: 171 LSGSFL 176
>gi|224103955|ref|XP_002313259.1| predicted protein [Populus trichocarpa]
gi|222849667|gb|EEE87214.1| predicted protein [Populus trichocarpa]
Length = 303
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 50/155 (32%), Positives = 81/155 (52%), Gaps = 11/155 (7%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
+K RGRPPGS + K + P ++ + AG DV I++F++ + ++S
Sbjct: 72 RKPRGRPPGSKNRPKPPIIITKDCESSMKPAILEISAGSDVIETIVNFARRNHAGISVIS 131
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRS-----------RTGG 233
A G+++NVTLR + +++ G F +L+L GS + S ++ + S
Sbjct: 132 ATGSVANVTLRHPVSHTPSLSLHGPFNLLALFGSVVGSLATNKASCASSPPGSAVHSCSS 191
Query: 234 LSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFL 268
+SL+G G+V GG VAG + AAT V VV +FL
Sbjct: 192 FGISLAGAQGQVFGGIVAGKVIAATQVVVVAATFL 226
>gi|413923671|gb|AFW63603.1| hypothetical protein ZEAMMB73_729481 [Zea mays]
Length = 434
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 61/146 (41%), Positives = 82/146 (56%), Gaps = 6/146 (4%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + HV+ V AG DV + F++ R VC+LS
Sbjct: 202 RRPRGRPAGSKNKPKPPIIVTRDSPNALHSHVLEVAAGADVVDCVAEFARRRGRGVCVLS 261
Query: 185 ANGAISNVTLRQ--AATSGGTV-TYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGP 241
GA++NV LRQ A+ G V T GR EILSL+G+ L + S GL+V LSG
Sbjct: 262 GGGAVANVALRQPGASPPGSMVATLRGRLEILSLTGTVLPPPAPPGAS---GLTVFLSGG 318
Query: 242 DGRVLGGSVAGLLTAATPVQVVVGSF 267
G+V+GGSV G L AA PV ++ SF
Sbjct: 319 QGQVVGGSVVGPLVAAGPVVLMAASF 344
>gi|218201321|gb|EEC83748.1| hypothetical protein OsI_29612 [Oryza sativa Indica Group]
Length = 223
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 39/113 (34%), Positives = 64/113 (56%), Gaps = 13/113 (11%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQ-----------AATSGGT 203
HV+ + G DV+ + +++ +C+L+ GA++NV+LR +
Sbjct: 4 HVLEIPGGGDVAGALAGYARRRGLGICVLAGTGAVANVSLRHPLPSGAAAEIGGGGAAAV 63
Query: 204 VTYEGRFEILSLSGSFLLSESSGQRSR--TGGLSVSLSGPDGRVLGGSVAGLL 254
V + GR+EILS+S +FL + R GGLS+SL+GP G+++GG+VAG L
Sbjct: 64 VVFHGRYEILSISATFLPPAMAAAAPRAALGGLSISLAGPHGQIVGGAVAGPL 116
>gi|357139394|ref|XP_003571267.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Brachypodium
distachyon]
Length = 285
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 58/143 (40%), Positives = 84/143 (58%), Gaps = 4/143 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + + HV+ V G D++ I +F++ R VC+LS
Sbjct: 60 RRPRGRPAGSKNKPKPPIFVTRDSPNALRSHVMEVAGGADIADAIAAFARRRQRGVCVLS 119
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
G +++V LRQ A +G V GRFEILSL+G+FL + + GL+V L+G G+
Sbjct: 120 GAGTVADVALRQPA-AGSVVALRGRFEILSLTGTFLPGPAPPGST---GLTVYLAGGQGQ 175
Query: 245 VLGGSVAGLLTAATPVQVVVGSF 267
V+GGSV G LTAA PV V+ +F
Sbjct: 176 VVGGSVVGALTAAGPVMVIASTF 198
>gi|242049524|ref|XP_002462506.1| hypothetical protein SORBIDRAFT_02g026970 [Sorghum bicolor]
gi|241925883|gb|EER99027.1| hypothetical protein SORBIDRAFT_02g026970 [Sorghum bicolor]
Length = 354
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 51/158 (32%), Positives = 78/158 (49%), Gaps = 19/158 (12%)
Query: 103 ATGGTGSGLSSPGGGPLSPDSI-KKSRGRPPGSGSGKKHQLEALGSA--GVGFTPHVITV 159
A+GG G+ + S GGG + + KK RGRPPGS + K + A PHVI +
Sbjct: 58 ASGGAGALVVSGGGGDEASMELSKKRRGRPPGSKNKPKPPVVITREAEPAAAMRPHVIEI 117
Query: 160 KAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGG------------TVTYE 207
G DV+ + F+ +C+L+ GA++NV+LR + G + +
Sbjct: 118 PCGCDVADALARFAARRNLGICVLAGTGAVANVSLRHPMSGGVAVGGGGGGAPTTAIVFH 177
Query: 208 GRFEILSLSGSFLLSESSGQRSRTGG----LSVSLSGP 241
G++EILS+S +FL S + LS+SL+GP
Sbjct: 178 GQYEILSISATFLPPAMSAVAPQAAAAAACLSISLAGP 215
>gi|224131940|ref|XP_002328145.1| predicted protein [Populus trichocarpa]
gi|222837660|gb|EEE76025.1| predicted protein [Populus trichocarpa]
Length = 169
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 55/145 (37%), Positives = 83/145 (57%), Gaps = 5/145 (3%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + H++ + G D+ I ++++ VCILS
Sbjct: 9 RRPRGRPAGSKNKPKPPIIIARDTPNALRSHLLEISPGSDIVESISNYARRRAHGVCILS 68
Query: 185 ANGAISNVTLRQ--AATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPD 242
+GA++NVTLRQ S +T GRFEILSL+G+ L S + + GGLS+SL+G
Sbjct: 69 GSGAVTNVTLRQPGGGGSSAVMTLHGRFEILSLTGTSLPSPAPPE---AGGLSISLAGGQ 125
Query: 243 GRVLGGSVAGLLTAATPVQVVVGSF 267
G+V+GG V G L A++ V ++ SF
Sbjct: 126 GQVVGGRVVGPLMASSLVVLMAASF 150
>gi|15236657|ref|NP_193515.1| putative AT-hook DNA-binding family protein [Arabidopsis thaliana]
gi|17933299|gb|AAL48232.1|AF446359_1 AT4g17800/dl4935c [Arabidopsis thaliana]
gi|2245139|emb|CAB10560.1| hypothetical protein [Arabidopsis thaliana]
gi|7268533|emb|CAB78783.1| hypothetical protein [Arabidopsis thaliana]
gi|20453387|gb|AAM19932.1| AT4g17800/dl4935c [Arabidopsis thaliana]
gi|119657390|tpd|FAA00294.1| TPA: AT-hook motif nuclear localized protein 23 [Arabidopsis
thaliana]
gi|332658552|gb|AEE83952.1| putative AT-hook DNA-binding family protein [Arabidopsis thaliana]
Length = 292
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 50/114 (43%), Positives = 74/114 (64%), Gaps = 5/114 (4%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILS 214
H++ V G DV + ++++ R +C+LS +G ++NV++RQ + +G VT +G FEILS
Sbjct: 112 HILEVTNGCDVFDCVATYARRRQRGICVLSGSGTVTNVSIRQPSAAGAVVTLQGTFEILS 171
Query: 215 LSGSFLLSES-SGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
LSGSFL + G S L++ L+G G+V+GGSV G LTAA PV V+ SF
Sbjct: 172 LSGSFLPPPAPPGATS----LTIFLAGGQGQVVGGSVVGELTAAGPVIVIAASF 221
>gi|297800302|ref|XP_002868035.1| hypothetical protein ARALYDRAFT_493093 [Arabidopsis lyrata subsp.
lyrata]
gi|297313871|gb|EFH44294.1| hypothetical protein ARALYDRAFT_493093 [Arabidopsis lyrata subsp.
lyrata]
Length = 294
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 50/114 (43%), Positives = 74/114 (64%), Gaps = 5/114 (4%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILS 214
H++ V G DV + ++++ R +C+LS +G ++NV++RQ + +G VT +G FEILS
Sbjct: 114 HILEVTNGCDVFDCVATYARRRQRGICVLSGSGTVTNVSIRQPSAAGAVVTLQGTFEILS 173
Query: 215 LSGSFLLSES-SGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
LSGSFL + G S L++ L+G G+V+GGSV G LTAA PV V+ SF
Sbjct: 174 LSGSFLPPPAPPGATS----LTIFLAGGQGQVVGGSVVGELTAAGPVIVIAASF 223
>gi|242041443|ref|XP_002468116.1| hypothetical protein SORBIDRAFT_01g039840 [Sorghum bicolor]
gi|241921970|gb|EER95114.1| hypothetical protein SORBIDRAFT_01g039840 [Sorghum bicolor]
Length = 272
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 49/114 (42%), Positives = 70/114 (61%), Gaps = 4/114 (3%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQ-AATSGGTVTYEGRFEIL 213
HV+ + +G D+ I FS+ R V +LS GA++NVTLRQ A + GRFEIL
Sbjct: 85 HVLEIASGADIVDAIAGFSRRRQRGVSVLSGTGAVTNVTLRQPAGAGAAAIALRGRFEIL 144
Query: 214 SLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
S+SG+FL + + + GL+V L+G G+V+GGSV G L A+ PV V+ +F
Sbjct: 145 SMSGAFLPAPAPPGAT---GLAVYLAGGQGQVVGGSVMGELIASGPVMVIAATF 195
>gi|242079595|ref|XP_002444566.1| hypothetical protein SORBIDRAFT_07g023830 [Sorghum bicolor]
gi|241940916|gb|EES14061.1| hypothetical protein SORBIDRAFT_07g023830 [Sorghum bicolor]
Length = 165
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 50/111 (45%), Positives = 68/111 (61%), Gaps = 3/111 (2%)
Query: 157 ITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLS 216
+ V G DV+ I F++ R VC+LS G +++V LRQ A G V GRFEILSL+
Sbjct: 1 MEVAGGADVAESIAHFARRRQRGVCVLSGAGTVTDVALRQPAAPGAVVALRGRFEILSLT 60
Query: 217 GSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
G+FL + + GL+V L+G G+V+GGSV G LTAA PV V+ +F
Sbjct: 61 GTFLPGPAPPGST---GLTVYLAGGQGQVVGGSVVGTLTAAGPVMVMASTF 108
>gi|297802408|ref|XP_002869088.1| hypothetical protein ARALYDRAFT_491108 [Arabidopsis lyrata subsp.
lyrata]
gi|297314924|gb|EFH45347.1| hypothetical protein ARALYDRAFT_491108 [Arabidopsis lyrata subsp.
lyrata]
Length = 292
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 56/145 (38%), Positives = 83/145 (57%), Gaps = 5/145 (3%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + HV+ V +G D+S + +++ VCI+S
Sbjct: 56 RRPRGRPAGSKNKPKPPTIITRDSPNVLRSHVLEVTSGSDISEAVSTYATRRGCGVCIIS 115
Query: 185 ANGAISNVTLRQ--AATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPD 242
GA++NVT+RQ A GG +T GRFEILSL+G+ + GGL+V L+G
Sbjct: 116 GTGAVTNVTIRQPAAPAGGGVITLHGRFEILSLTGT---ALPPPAPPGAGGLTVYLAGGQ 172
Query: 243 GRVLGGSVAGLLTAATPVQVVVGSF 267
G+V+GG+VAG L A+ PV ++ SF
Sbjct: 173 GQVVGGNVAGSLIASGPVVLMAASF 197
>gi|30690333|ref|NP_195265.2| AT-hook protein of GA feedback 1 [Arabidopsis thaliana]
gi|50198777|gb|AAT70422.1| At4g35390 [Arabidopsis thaliana]
gi|53828597|gb|AAU94408.1| At4g35390 [Arabidopsis thaliana]
gi|119657394|tpd|FAA00296.1| TPA: AT-hook motif nuclear localized protein 25 [Arabidopsis
thaliana]
gi|332661106|gb|AEE86506.1| AT-hook protein of GA feedback 1 [Arabidopsis thaliana]
Length = 299
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 55/145 (37%), Positives = 83/145 (57%), Gaps = 5/145 (3%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + HV+ V +G D+S + +++ VCI+S
Sbjct: 63 RRPRGRPAGSKNKPKPPTIITRDSPNVLRSHVLEVTSGSDISEAVSTYATRRGCGVCIIS 122
Query: 185 ANGAISNVTLRQ--AATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPD 242
GA++NVT+RQ A GG +T GRF+ILSL+G+ + GGL+V L+G
Sbjct: 123 GTGAVTNVTIRQPAAPAGGGVITLHGRFDILSLTGT---ALPPPAPPGAGGLTVYLAGGQ 179
Query: 243 GRVLGGSVAGLLTAATPVQVVVGSF 267
G+V+GG+VAG L A+ PV ++ SF
Sbjct: 180 GQVVGGNVAGSLIASGPVVLMAASF 204
>gi|195650785|gb|ACG44860.1| hypothetical protein [Zea mays]
Length = 166
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 49/133 (36%), Positives = 66/133 (49%), Gaps = 22/133 (16%)
Query: 180 VCILSANGAISNVTLRQAAT-------------SGGTVTYEGRFEILSLSGSFLLSESSG 226
+C+LSA G++S LR A YEG +EILSL+GS+ L+
Sbjct: 1 MCVLSAMGSVSRAVLRHPADGSPMARVHASPQPYKNPAVYEGFYEILSLTGSYNLAHG-- 58
Query: 227 QRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADG--RKESKSSHRMESL 284
GGLSV+L P+ V+GG + G L AA VQVV+GSF G K K+ + ++
Sbjct: 59 -----GGLSVTLCSPERNVIGGVLGGPLVAAGTVQVVLGSFHQGGSRSKSKKAGKQQQAA 113
Query: 285 PVPPKLAPGGQPA 297
P GGQ A
Sbjct: 114 AFSPDSLTGGQEA 126
>gi|218202028|gb|EEC84455.1| hypothetical protein OsI_31079 [Oryza sativa Indica Group]
Length = 264
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 42/66 (63%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILS 214
HV+ + GED+ + +F++ R VC+LS +G ++N TLRQ V GRFEILS
Sbjct: 101 HVLEIAGGEDIIEAVAAFARRCQRKVCVLSGSGVVANPTLRQPGEPRSIVALHGRFEILS 160
Query: 215 LSGSFL 220
LSG+F+
Sbjct: 161 LSGAFV 166
>gi|115471287|ref|NP_001059242.1| Os07g0235200 [Oryza sativa Japonica Group]
gi|113610778|dbj|BAF21156.1| Os07g0235200, partial [Oryza sativa Japonica Group]
Length = 189
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 41/101 (40%), Positives = 58/101 (57%), Gaps = 8/101 (7%)
Query: 167 SKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSG 226
+ I F++ R +C+LS A+++V LRQ A G V GRFEILSL+G+FL
Sbjct: 31 TSIAHFARRQRRGICVLSRADAVTDVALRQPAAPGAVVALRGRFEILSLTGTFLPGPGPP 90
Query: 227 QRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
+R L+V L+G G+V+ G LTAA PV V+ +F
Sbjct: 91 GSTR---LTVYLAGGQGQVV-----GTLTAAGPVMVIASTF 123
>gi|297720769|ref|NP_001172746.1| Os01g0953801 [Oryza sativa Japonica Group]
gi|15528814|dbj|BAB64709.1| DNA-binding protein-like [Oryza sativa Japonica Group]
gi|222619887|gb|EEE56019.1| hypothetical protein OsJ_04794 [Oryza sativa Japonica Group]
gi|255674081|dbj|BAH91476.1| Os01g0953801 [Oryza sativa Japonica Group]
Length = 265
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 50/147 (34%), Positives = 79/147 (53%)
Query: 121 PDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAV 180
P +K RGRP GS + K + + P V+ + AG +V++ + +F++ V
Sbjct: 23 PSPPRKPRGRPLGSKNKPKPPVVVTRESEAAMRPVVLELGAGCEVAAAVAAFARRRRVGV 82
Query: 181 CILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSG 240
+L G ++ VTLR + V GRFE+LSLSG+ L S + + SVSL+G
Sbjct: 83 SVLCGRGTVAAVTLRLPTSPPAAVKLHGRFEVLSLSGTVLPSAAGEGAAPPPPFSVSLAG 142
Query: 241 PDGRVLGGSVAGLLTAATPVQVVVGSF 267
G+V+GG++AG +T A + VV +F
Sbjct: 143 AGGQVIGGTLAGEMTTADGLVVVAATF 169
>gi|15218067|ref|NP_173514.1| putative DNA-binding protein ESCAROLA [Arabidopsis thaliana]
gi|20532086|sp|Q9S7C9.1|ESCA_ARATH RecName: Full=Putative DNA-binding protein ESCAROLA
gi|4836899|gb|AAD30602.1|AC007369_12 Unknown protein [Arabidopsis thaliana]
gi|6319180|gb|AAF07197.1|AF194974_1 ESCAROLA [Arabidopsis thaliana]
gi|30102700|gb|AAP21268.1| At1g20900 [Arabidopsis thaliana]
gi|110736548|dbj|BAF00240.1| putative DNA-binding protein [Arabidopsis thaliana]
gi|119657398|tpd|FAA00298.1| TPA: AT-hook motif nuclear localized protein 27 [Arabidopsis
thaliana]
gi|225897950|dbj|BAH30307.1| hypothetical protein [Arabidopsis thaliana]
gi|332191917|gb|AEE30038.1| putative DNA-binding protein ESCAROLA [Arabidopsis thaliana]
Length = 311
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 59/152 (38%), Positives = 82/152 (53%), Gaps = 12/152 (7%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
K+ RGRPPGS + K + + HV+ V G D+ + ++++ R V +L
Sbjct: 86 KRPRGRPPGSKNKAKPPIIVTRDSPNALRSHVLEVSPGADIVESVSTYARRRGRGVSVLG 145
Query: 185 ANGAISNVTLRQAAT---------SGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLS 235
NG +SNVTLRQ T GG VT GRFEILSL+G+ L + GGLS
Sbjct: 146 GNGTVSNVTLRQPVTPGNGGGVSGGGGVVTLHGRFEILSLTGTVLPPPAP---PGAGGLS 202
Query: 236 VSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
+ L+G G+V+GGSV L A+ PV ++ SF
Sbjct: 203 IFLAGGQGQVVGGSVVAPLIASAPVILMAASF 234
>gi|3080411|emb|CAA18730.1| putative protein [Arabidopsis thaliana]
gi|7270491|emb|CAB80256.1| putative protein [Arabidopsis thaliana]
Length = 270
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 55/145 (37%), Positives = 83/145 (57%), Gaps = 5/145 (3%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRP GS + K + HV+ V +G D+S + +++ VCI+S
Sbjct: 34 RRPRGRPAGSKNKPKPPTIITRDSPNVLRSHVLEVTSGSDISEAVSTYATRRGCGVCIIS 93
Query: 185 ANGAISNVTLRQ--AATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPD 242
GA++NVT+RQ A GG +T GRF+ILSL+G+ + GGL+V L+G
Sbjct: 94 GTGAVTNVTIRQPAAPAGGGVITLHGRFDILSLTGT---ALPPPAPPGAGGLTVYLAGGQ 150
Query: 243 GRVLGGSVAGLLTAATPVQVVVGSF 267
G+V+GG+VAG L A+ PV ++ SF
Sbjct: 151 GQVVGGNVAGSLIASGPVVLMAASF 175
>gi|357144916|ref|XP_003573459.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Brachypodium
distachyon]
Length = 291
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/72 (44%), Positives = 47/72 (65%), Gaps = 6/72 (8%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGG------TVTYEG 208
H++ V AG DV + ++++ R VC+LSA GA++NVT+RQ ++ T +G
Sbjct: 95 HILEVAAGCDVFEALTAYARRRQRGVCVLSAAGAVANVTIRQQPSNSSSSSSPVVATLQG 154
Query: 209 RFEILSLSGSFL 220
RFEILSL+GSFL
Sbjct: 155 RFEILSLAGSFL 166
>gi|24059979|dbj|BAC21441.1| DNA-binding protein-like [Oryza sativa Japonica Group]
gi|24060074|dbj|BAC21527.1| DNA-binding protein-like [Oryza sativa Japonica Group]
Length = 206
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 40/96 (41%), Positives = 56/96 (58%), Gaps = 8/96 (8%)
Query: 172 FSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRT 231
F++ R +C+LS A+++V LRQ A G V GRFEILSL+G+FL +R
Sbjct: 53 FARRQRRGICVLSRADAVTDVALRQPAAPGAVVALRGRFEILSLTGTFLPGPGPPGSTR- 111
Query: 232 GGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
L+V L+G G+V+ G LTAA PV V+ +F
Sbjct: 112 --LTVYLAGGQGQVV-----GTLTAAGPVMVIASTF 140
>gi|356519866|ref|XP_003528590.1| PREDICTED: uncharacterized protein LOC100818645 [Glycine max]
Length = 297
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 48/158 (30%), Positives = 76/158 (48%), Gaps = 10/158 (6%)
Query: 118 PLSPDSIKKSRGRPPGSGSGKKHQLEALGS-AGVGFTPHVITVKAGEDVSSKIMSFSQNG 176
P++ KK RGRPPGS + K +G A ++ V G D+ I+ ++ G
Sbjct: 52 PIATPPTKKPRGRPPGSKNKPKTTSFPVGQPAEPSMKLVIVNVTPGSDIIESILDVARRG 111
Query: 177 PRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGG--- 233
++ ILSA+G IS VTL + +T G F +LSL+GS+L ++ G
Sbjct: 112 HVSLTILSASGTISKVTLHNSIHGVAALTLRGPFTLLSLNGSYL--HNNHYTLHPGATPP 169
Query: 234 ----LSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
+S S G+V GG++ G + A V + + +F
Sbjct: 170 PPLSFGISFSTSQGQVFGGAIGGRVIAGDDVSLTISTF 207
>gi|297845066|ref|XP_002890414.1| hypothetical protein ARALYDRAFT_472326 [Arabidopsis lyrata subsp.
lyrata]
gi|297336256|gb|EFH66673.1| hypothetical protein ARALYDRAFT_472326 [Arabidopsis lyrata subsp.
lyrata]
Length = 314
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 58/152 (38%), Positives = 81/152 (53%), Gaps = 12/152 (7%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
K+ RGRPPGS + K + + HV+ V G D+ + ++++ R V +L
Sbjct: 89 KRPRGRPPGSKNKAKPPIIVTRDSPNALRSHVLEVSPGADIVESVSTYARRRGRGVSVLG 148
Query: 185 ANGAISNVTLRQAAT---------SGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLS 235
NG +SNVTLRQ GG VT GRFEILSL+G+ L + GGLS
Sbjct: 149 GNGTVSNVTLRQPVNPGNGGGVSGGGGVVTLHGRFEILSLTGTVLPPPAP---PGAGGLS 205
Query: 236 VSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
+ L+G G+V+GGSV L A+ PV ++ SF
Sbjct: 206 IFLAGGQGQVVGGSVVAPLIASAPVILMAASF 237
>gi|242062730|ref|XP_002452654.1| hypothetical protein SORBIDRAFT_04g030040 [Sorghum bicolor]
gi|241932485|gb|EES05630.1| hypothetical protein SORBIDRAFT_04g030040 [Sorghum bicolor]
Length = 328
Score = 63.9 bits (154), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 50/121 (41%), Positives = 66/121 (54%), Gaps = 16/121 (13%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQ--AATSGGTV-TYEGRFE 211
HV+ V AG DV + +++ R VC+LS GA+ NV LRQ A+ G V T GRFE
Sbjct: 122 HVLEVAAGADVVDCVAEYARRRGRGVCVLSGGGAVVNVALRQPGASPPGSMVATLRGRFE 181
Query: 212 ILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLG-----GSVAGLLTAATPVQVVVGS 266
ILSL+G+ L + S GL+V LSG G+V+G L AA PV ++ S
Sbjct: 182 ILSLTGTVLPPPAPPGAS---GLTVFLSGGQGQVIGGSVVGP-----LVAAGPVVLMAAS 233
Query: 267 F 267
F
Sbjct: 234 F 234
>gi|357481893|ref|XP_003611232.1| hypothetical protein MTR_5g011720 [Medicago truncatula]
gi|355512567|gb|AES94190.1| hypothetical protein MTR_5g011720 [Medicago truncatula]
Length = 282
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 47/129 (36%), Positives = 58/129 (44%), Gaps = 35/129 (27%)
Query: 64 MNMGSGSEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDS 123
+N GS K+KRGRPRKY PDG ++L SSV T + + SP L S
Sbjct: 54 VNASFGSSSFKKKRGRPRKYFPDGNITLG----SSSVPTQ----NAAIISPSS--LGSCS 103
Query: 124 IKKSRGRP-------------------------PGSGSGKKHQLEALGSAGVGFTPHVIT 158
IKK RGRP P S K Q+E LG G F+ H+IT
Sbjct: 104 IKKKRGRPRKYFLNGNITLGSSSVPTQNAAIISPSSTMKKNQQVEVLGDNGTDFSAHLIT 163
Query: 159 VKAGEDVSS 167
V GE +S+
Sbjct: 164 VNHGEPLSN 172
>gi|356517911|ref|XP_003527629.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Glycine max]
Length = 254
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 60/173 (34%), Positives = 92/173 (53%), Gaps = 16/173 (9%)
Query: 115 GGGPLSPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQ 174
G GP S + ++ RGRP GS + K + + HV+ V +G DV + ++++
Sbjct: 36 GEGPFS--TQRRPRGRPMGSKNKPKPPVIVTRDSPNVLRSHVLEVSSGADVVESLSNYAR 93
Query: 175 NGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGL 234
R V +LS +G ++NV LRQ A G +T GRFEI+S++G+ L + + GL
Sbjct: 94 RRGRGVSVLSGSGTVANVVLRQPA--GSVLTLHGRFEIVSMTGTVLPPPAP---PGSDGL 148
Query: 235 SVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRMESLPVP 287
SV LSG G+V+GG V L A++ V +V SF ++ E LP+P
Sbjct: 149 SVYLSGAQGQVVGGVVVAPLVASSHVVLVAASF---------ANAMFERLPLP 192
>gi|218191457|gb|EEC73884.1| hypothetical protein OsI_08674 [Oryza sativa Indica Group]
Length = 415
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 49/121 (40%), Positives = 65/121 (53%), Gaps = 16/121 (13%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQ--AATSGGTV-TYEGRFE 211
HV+ V G DV + +++ R VC+LS GA+ NV LRQ A+ G V T GRFE
Sbjct: 208 HVLEVAGGADVVDCVAEYARRRGRGVCVLSGGGAVVNVALRQPGASPPGSMVATLRGRFE 267
Query: 212 ILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLG-----GSVAGLLTAATPVQVVVGS 266
ILSL+G+ L + S GL+V LSG G+V+G L AA PV ++ S
Sbjct: 268 ILSLTGTVLPPPAPPGAS---GLTVFLSGGQGQVIGGSVVGP-----LVAAGPVVLMAAS 319
Query: 267 F 267
F
Sbjct: 320 F 320
>gi|218198574|gb|EEC81001.1| hypothetical protein OsI_23753 [Oryza sativa Indica Group]
Length = 391
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 49/144 (34%), Positives = 78/144 (54%), Gaps = 8/144 (5%)
Query: 124 IKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCIL 183
+++ RGRP GS + K + + H+I V G DV++ + +++ R VC++
Sbjct: 182 LRRPRGRPLGSKNKPKPPVIITRDSPDALHSHIIEVAPGADVAACVAEYARRRGRGVCLM 241
Query: 184 SANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDG 243
A+GA+++V +R G GRFE+LS++G+ L + S GLSV LS G
Sbjct: 242 GASGAVADVAVR-----GAAAPLPGRFELLSVTGTVLPPPAPPGAS---GLSVLLSAGQG 293
Query: 244 RVLGGSVAGLLTAATPVQVVVGSF 267
+V+GG V G L AA PV + +F
Sbjct: 294 QVVGGCVVGPLVAAGPVTLFAATF 317
>gi|449453768|ref|XP_004144628.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Cucumis
sativus]
gi|449526622|ref|XP_004170312.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Cucumis
sativus]
Length = 254
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 49/148 (33%), Positives = 74/148 (50%), Gaps = 9/148 (6%)
Query: 123 SIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCI 182
S ++ RGRP GS + K + + HV+ V G DV I ++ VCI
Sbjct: 51 SSRRPRGRPAGSKNKPKPPVIVTRDSPNSLRSHVLEVSPGSDVVESISTYVTRRRYGVCI 110
Query: 183 LSANGAISNVTLRQAAT-SGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGP 241
L GA++NV LRQ + SG +T G FEI+SL+G+ L S GGL++ L+
Sbjct: 111 LGGTGAVTNVNLRQPMSPSGSVMTLHGTFEIVSLTGTAL------PPSGAGGLTIYLADR 164
Query: 242 DGRVLGGSVAGL--LTAATPVQVVVGSF 267
+ + + L A++PV ++V SF
Sbjct: 165 QRQGHVVGGSVVGPLRASSPVTLMVASF 192
>gi|376337577|gb|AFB33353.1| hypothetical protein 2_3947_01, partial [Pinus mugo]
gi|376337579|gb|AFB33354.1| hypothetical protein 2_3947_01, partial [Pinus mugo]
Length = 137
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 54/121 (44%), Positives = 72/121 (59%), Gaps = 15/121 (12%)
Query: 222 SESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRK-ESKSSHR 280
+E++G RSRTGGLS+SL+GPDGRV+GG VAG+L AA+PVQV+ GSF+ DG+K + K +
Sbjct: 1 TENNGARSRTGGLSISLAGPDGRVVGGVVAGMLMAASPVQVIAGSFILDGKKVQGKPENP 60
Query: 281 MESLPVPPKLAPGGQPAGQCSPPSRGTLSESSGGPGS-PLNHSTGACNNNHLPQGMATGI 339
+ SL G + G L GGPG P N S+GA N + Q +
Sbjct: 61 LSSL-------------GLQHVAASGHLGAKHGGPGGPPFNSSSGASGINSVGQQSTQNM 107
Query: 340 P 340
P
Sbjct: 108 P 108
>gi|115448269|ref|NP_001047914.1| Os02g0713700 [Oryza sativa Japonica Group]
gi|41052877|dbj|BAD07790.1| DNA-binding protein-like [Oryza sativa Japonica Group]
gi|113537445|dbj|BAF09828.1| Os02g0713700 [Oryza sativa Japonica Group]
gi|215768749|dbj|BAH00978.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 336
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 49/121 (40%), Positives = 65/121 (53%), Gaps = 16/121 (13%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQ--AATSGGTV-TYEGRFE 211
HV+ V G DV + +++ R VC+LS GA+ NV LRQ A+ G V T GRFE
Sbjct: 129 HVLEVAGGADVVDCVAEYARRRGRGVCVLSGGGAVVNVALRQPGASPPGSMVATLRGRFE 188
Query: 212 ILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLG-----GSVAGLLTAATPVQVVVGS 266
ILSL+G+ L + S GL+V LSG G+V+G L AA PV ++ S
Sbjct: 189 ILSLTGTVLPPPAPPGAS---GLTVFLSGGQGQVIGGSVVGP-----LVAAGPVVLMAAS 240
Query: 267 F 267
F
Sbjct: 241 F 241
>gi|51091035|dbj|BAD35677.1| DNA-binding protein-like [Oryza sativa Japonica Group]
Length = 258
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 49/144 (34%), Positives = 78/144 (54%), Gaps = 8/144 (5%)
Query: 124 IKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCIL 183
+++ RGRP GS + K + + H+I V G DV++ + +++ R VC++
Sbjct: 49 LRRPRGRPLGSKNKPKPPVIITRDSPDALHSHIIEVAPGADVAACVAEYARRRGRGVCLM 108
Query: 184 SANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDG 243
A+GA+++V +R G GRFE+LS++G+ L + S GLSV LS G
Sbjct: 109 GASGAVADVAVR-----GAAAPLPGRFELLSVTGTVLPPPAPPGAS---GLSVLLSAGQG 160
Query: 244 RVLGGSVAGLLTAATPVQVVVGSF 267
+V+GG V G L AA PV + +F
Sbjct: 161 QVVGGCVVGPLVAAGPVTLFAATF 184
>gi|226494155|ref|NP_001152652.1| DNA-binding protein [Zea mays]
gi|195658581|gb|ACG48758.1| DNA-binding protein [Zea mays]
Length = 273
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 50/132 (37%), Positives = 75/132 (56%), Gaps = 12/132 (9%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQ--AATSGGTVTYEGRFEI 212
HV+ + +G D+ I FS+ R V +LS GA++NVTLR+ A V GRFEI
Sbjct: 84 HVLEIASGADIVDAIAGFSRRRQRGVSVLSGTGAVTNVTLREPAGAGGAAAVALRGRFEI 143
Query: 213 LSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF----- 267
LS+SG+FL + + + GL+V L+G G+V+GGSV G L A+ PV V+ +F
Sbjct: 144 LSMSGAFLPAPAPPGAT---GLTVYLAGGQGQVVGGSVMGELIASGPVMVIAATFGNATY 200
Query: 268 --LADGRKESKS 277
L + +++
Sbjct: 201 ERLPLDQADAEE 212
>gi|414866047|tpg|DAA44604.1| TPA: DNA-binding protein [Zea mays]
Length = 273
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 50/131 (38%), Positives = 75/131 (57%), Gaps = 12/131 (9%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQ--AATSGGTVTYEGRFEI 212
HV+ + +G D+ I FS+ R V +LS GA++NVTLR+ A V GRFEI
Sbjct: 84 HVLEIASGADIVDAIAGFSRRRQRGVSVLSGTGAVTNVTLREPAGAGGAAAVALRGRFEI 143
Query: 213 LSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF----- 267
LS+SG+FL + + + GL+V L+G G+V+GGSV G L A+ PV V+ +F
Sbjct: 144 LSMSGAFLPAPAPPGAT---GLTVYLAGGQGQVVGGSVMGELIASGPVMVIAATFGNATY 200
Query: 268 --LADGRKESK 276
L + +++
Sbjct: 201 ERLPLDQADAE 211
>gi|110738434|dbj|BAF01143.1| putative DNA binding protein [Arabidopsis thaliana]
Length = 166
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/111 (38%), Positives = 60/111 (54%), Gaps = 18/111 (16%)
Query: 67 GSGSEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKK 126
G S P+K++RGRPRKYG DG A+ SP+ +++A T + S S K+
Sbjct: 65 GFSSGPIKKRRGRPRKYGHDG---AAVTLSPNPISSAAPTTSHVID------FSTTSEKR 115
Query: 127 SRGRP----PGSGSGKKHQLEALG-----SAGVGFTPHVITVKAGEDVSSK 168
+ +P P S K+Q+E LG SA FTPH+ITV AGE + +K
Sbjct: 116 GKMKPATPTPSSFIRPKYQVENLGEWSPSSAAANFTPHIITVNAGEVIMTK 166
>gi|357137663|ref|XP_003570419.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Brachypodium
distachyon]
Length = 261
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 65/193 (33%), Positives = 86/193 (44%), Gaps = 30/193 (15%)
Query: 95 PSPSSVTTAT-------------------GGTGSGLSSPGGGPLSPDSIKKSRGRPPGSG 135
P P A GG + P S+ +P
Sbjct: 5 PEPGDSNNADSGSGSGGGNGTTNNGAEPRGGDPGTVVLPAPNRRPRGRPPGSKNKPKPPI 64
Query: 136 SGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLR 195
+ AL S HV+ V G DV+ I FS+ R VC+LS G ++NV LR
Sbjct: 65 FVTRDSPNALRS-------HVMEVAGGADVADAIAHFSRRRQRGVCVLSGAGTVANVALR 117
Query: 196 QAATSGG-TVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLL 254
Q + GG V GRFEILSL+G+FL + + GL+V L+G G+V+GGSV G L
Sbjct: 118 QPSAPGGAVVALHGRFEILSLTGTFLPGPAPPGST---GLTVYLAGGQGQVVGGSVVGAL 174
Query: 255 TAATPVQVVVGSF 267
TAA PV V+ +F
Sbjct: 175 TAAGPVMVIASTF 187
>gi|405952395|gb|EKC20212.1| WD repeat-containing protein 27 [Crassostrea gigas]
Length = 983
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 59/106 (55%), Gaps = 11/106 (10%)
Query: 162 GEDVSSKIMSFSQ-NGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G D+ ++ F++ NG A +++ G+++ TLR A S TYEG FEI+SL G+
Sbjct: 863 GADLQKGLLKFTEDNGLSAAFVITCVGSVTKATLRMA-NSTTIKTYEGHFEIVSLVGTL- 920
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGS 266
SSG G L +S+S +G V GG V G + T +V+VG+
Sbjct: 921 ---SSG-----GHLHMSISDAEGNVFGGHVFGDVIVYTTAEVIVGN 958
>gi|357112928|ref|XP_003558257.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Brachypodium
distachyon]
Length = 283
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 50/120 (41%), Positives = 70/120 (58%), Gaps = 13/120 (10%)
Query: 155 HVITVKAGEDVSSKIMS---FSQNGPRAVCILSANGAISNVTLRQAATSGGT----VTYE 207
HV+ + +G D+ I + Q R V +LS +GA++ VTLRQ A G V
Sbjct: 87 HVLEIASGADIVEAIAAFSRRRQ---RGVSVLSGSGAVTGVTLRQPAGMAGNGAPAVALR 143
Query: 208 GRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
GRFEILSLSG+FL + + + GL+V L+G G+V+GGSV G L A+ PV V+ +F
Sbjct: 144 GRFEILSLSGAFLPAPAPPGAT---GLAVYLAGGQGQVVGGSVMGELLASGPVMVIAATF 200
>gi|414878647|tpg|DAA55778.1| TPA: hypothetical protein ZEAMMB73_584155 [Zea mays]
Length = 294
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 51/131 (38%), Positives = 76/131 (58%), Gaps = 2/131 (1%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
KK RGRP GS + K + + P V+ + AG DV + +F++ V +L
Sbjct: 26 KKPRGRPLGSKNKPKPPVVVTRESEAAMRPVVLELAAGCDVVGAVAAFARRRRVGVSVLC 85
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
GA++ VTLR AA+S VT GRFE+L+LSG+ ++ SS + SVSL+G G+
Sbjct: 86 GRGAVAAVTLRLAASSAA-VTLHGRFEVLALSGT-VVPSSSSASASAPAFSVSLAGEGGQ 143
Query: 245 VLGGSVAGLLT 255
V+GG++AG +T
Sbjct: 144 VIGGTLAGEMT 154
>gi|255645805|gb|ACU23393.1| unknown [Glycine max]
Length = 141
Score = 57.8 bits (138), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 35/72 (48%), Positives = 43/72 (59%), Gaps = 16/72 (22%)
Query: 71 EPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGR 130
EP+KRKRGRPRKYG DG++SLAL P+P+S + T S K+ RGR
Sbjct: 82 EPVKRKRGRPRKYGTDGSVSLALTPTPTSSSYPGALT--------------QSQKRGRGR 127
Query: 131 PPGSGSGKKHQL 142
PP G+GKK Q
Sbjct: 128 PP--GTGKKQQF 137
>gi|357492341|ref|XP_003616459.1| AT-hook DNA-binding protein [Medicago truncatula]
gi|355517794|gb|AES99417.1| AT-hook DNA-binding protein [Medicago truncatula]
Length = 328
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 63/187 (33%), Positives = 93/187 (49%), Gaps = 24/187 (12%)
Query: 96 SPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGR---------------PPGSGSGKKH 140
+P+ T+ GTG+ G ++ D ++ S GR PPGS + K
Sbjct: 51 TPTRSNTSNTGTGN-----SNGHVN-DELENSNGRSGDQTARSGRRPRGRPPGSKNKPKP 104
Query: 141 QLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATS 200
L + ++ V G D++ I S++ R V +LS G ++NVTLRQ
Sbjct: 105 PLMITKETPNALSSVILEVANGADIAHSISSYANRRHRGVSVLSGTGYVTNVTLRQDNAP 164
Query: 201 GGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPV 260
GG ++ +GR ILSLSG+FL S + GL+V L+G G+V+GG V G L A+ PV
Sbjct: 165 GGMISLQGRCHILSLSGAFLPPPSPPDAT---GLTVYLAGGQGQVVGGLVIGSLIASGPV 221
Query: 261 QVVVGSF 267
VV +F
Sbjct: 222 MVVAATF 228
>gi|357482199|ref|XP_003611385.1| AT-hook DNA-binding protein [Medicago truncatula]
gi|355512720|gb|AES94343.1| AT-hook DNA-binding protein [Medicago truncatula]
Length = 205
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 37/96 (38%), Positives = 48/96 (50%), Gaps = 11/96 (11%)
Query: 68 SGSEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKS 127
SG +K+KRGRPRKY D ++L+L P T T + S + G
Sbjct: 29 SGYGSIKKKRGRPRKYFLDHDITLSLGSGPMHDATITYPSHSIVKKSTRG---------- 78
Query: 128 RGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGE 163
RGRP GS KK ++E LG F+PH+I V GE
Sbjct: 79 RGRPRGSFK-KKQEVEVLGVTNTSFSPHLIVVNYGE 113
>gi|414588595|tpg|DAA39166.1| TPA: hypothetical protein ZEAMMB73_847336 [Zea mays]
Length = 153
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 37/60 (61%), Positives = 48/60 (80%), Gaps = 2/60 (3%)
Query: 217 GSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESK 276
GSF ++E R RTGGLSVSL+GPDGRV+GG VAG+L AA+P+QV+VGSFL + K+ +
Sbjct: 2 GSFTMAEEG--RKRTGGLSVSLAGPDGRVVGGVVAGMLRAASPIQVIVGSFLPNSLKQHQ 59
>gi|356577269|ref|XP_003556750.1| PREDICTED: uncharacterized protein LOC100777794 [Glycine max]
Length = 236
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 43/156 (27%), Positives = 72/156 (46%), Gaps = 6/156 (3%)
Query: 118 PLSPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVIT-VKAGEDVSSKIMSFSQNG 176
PL+ S KK GRP GS + K L + + +I V D+ I+ ++ G
Sbjct: 52 PLTNPSTKKPCGRPVGSKNKPKTTLFLVAQPVEPYMKVIIVNVTPSSDIIESILDVARRG 111
Query: 177 PRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQR-----SRT 231
++ +LSA+G I+ VTL + +T G F +LSL+GS+L + +
Sbjct: 112 HVSLTVLSASGTITGVTLNNSLHGVDALTLHGPFTLLSLNGSYLYNNHYTLHPGATPAPP 171
Query: 232 GGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
+S S G+V GG++ + A V + + +F
Sbjct: 172 LSFGISFSTSQGQVFGGAIGSRVIAGNDVSLTICTF 207
>gi|357481875|ref|XP_003611223.1| DNA-binding protein [Medicago truncatula]
gi|355512558|gb|AES94181.1| DNA-binding protein [Medicago truncatula]
Length = 118
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 42/115 (36%), Positives = 60/115 (52%), Gaps = 14/115 (12%)
Query: 204 VTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVV 263
+ + G ++I SLSGSF+ R+ G++VS DG V+GG VAG L A+P V+
Sbjct: 3 IEFHGIYQIQSLSGSFM--------RRSSGMNVSFVDLDGNVVGGRVAGPLVVASPAAVM 54
Query: 264 VGSFLADGRKESK-SSHRMESL-PVPPKLAPGGQPAGQCSPPSRGTLSESSGGPG 316
V +FLA + E K ++ + E + V P +A AG P LS SS G
Sbjct: 55 VVTFLASEQHEQKLNTQKNEVISTVTPTVAARMSSAG----PMLNNLSSSSCFHG 105
>gi|388519107|gb|AFK47615.1| unknown [Lotus japonicus]
Length = 144
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 37/55 (67%), Positives = 45/55 (81%)
Query: 222 SESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESK 276
+ES G RSR+GG+SVSLS PDGRV+GG VAGLL AA+PVQVVV SFL +++ K
Sbjct: 3 TESQGTRSRSGGMSVSLSSPDGRVVGGGVAGLLVAASPVQVVVASFLPSNQQDQK 57
>gi|242051431|ref|XP_002454861.1| hypothetical protein SORBIDRAFT_03g000250 [Sorghum bicolor]
gi|241926836|gb|EER99980.1| hypothetical protein SORBIDRAFT_03g000250 [Sorghum bicolor]
Length = 211
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 64/116 (55%), Gaps = 8/116 (6%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILS 214
HV+ V AG DV S + +F++ G R +L A G +++V LR+ A + G EILS
Sbjct: 54 HVVEVPAGRDVLSCVSAFARRGRRGALVLGAAGHVTDVVLREPA-----LVLRGTMEILS 108
Query: 215 LSGSFLLSESSGQRSRTGGLSVSLSGPDG-RVLGGSVAGLLTAATPVQVVVGSFLA 269
L+G F G S G +V L+GP G + GG G L AA PV V+V +F+A
Sbjct: 109 LAGCFFPFPGPG--SAATGTAVFLAGPRGSVLGGGVALGGLVAAGPVVVMVATFVA 162
>gi|413944405|gb|AFW77054.1| hypothetical protein ZEAMMB73_369732 [Zea mays]
Length = 184
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 57/151 (37%), Positives = 69/151 (45%), Gaps = 32/151 (21%)
Query: 24 LAFSADGTAVYKPITATSPTYQPSGAGGDGAIPQAQGLNVMNMGSGSEPMKRKRGRPRKY 83
L ++ DG AVY+ P + AG + +P A G + + SEP KRKRGRPRKY
Sbjct: 34 LFYTHDGVAVYR--NPVMPAFYQQPAGSNVVVPAAPGP--AHSPASSEPFKRKRGRPRKY 89
Query: 84 GP-DGTMSLALVPSPSSVTTATGGTGSGLS---SPGGGPLSPDS---------------- 123
P DG + LA+VP PS TA S S PG P SP S
Sbjct: 90 APADGAVPLAIVP-PSQPPTARAPATSEASPTVPPGFSP-SPQSGGVVSRQASPAPAPAS 147
Query: 124 ----IKKSRGRPPGSGSGKKHQLEALGSAGV 150
+KK RGRP G S KK Q +A V
Sbjct: 148 GAPDVKK-RGRPSGP-SSKKQQPQAAAPGNV 176
>gi|297745610|emb|CBI40775.3| unnamed protein product [Vitis vinifera]
Length = 227
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 42/75 (56%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRPPGS + K + + HV+ V AG DV ++++++ R VC+LS
Sbjct: 137 RRPRGRPPGSKNKPKPPIIVTRDSPNALRSHVLEVAAGADVMESVLNYARRRGRGVCVLS 196
Query: 185 ANGAISNVTLRQAAT 199
G + NVTLRQ A
Sbjct: 197 GGGTVMNVTLRQPAV 211
>gi|226502550|ref|NP_001150963.1| DNA binding protein [Zea mays]
gi|195643242|gb|ACG41089.1| DNA binding protein [Zea mays]
gi|413947876|gb|AFW80525.1| DNA binding protein [Zea mays]
Length = 203
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 48/137 (35%), Positives = 69/137 (50%), Gaps = 17/137 (12%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILS 214
HV+ V AG DV S + +F++ G R +L A G +++V LR+ A + G EILS
Sbjct: 46 HVVEVPAGRDVLSCVSAFARRGRRGALVLGAAGQVTDVVLREPA----ALVLRGTMEILS 101
Query: 215 LSGSFLLSESSGQRSRTGGLSVSLSGPDG-RVLGGSVAGLLTAATPVQVVVGSFLAD--- 270
L+G F + G +V L+GP G + GG G L AA PV V+V +F+A
Sbjct: 102 LAGCFFPFPAPAT-----GTAVFLAGPRGSVLGGGVALGGLVAAGPVVVMVATFVAAALD 156
Query: 271 ----GRKESKSSHRMES 283
G K H M++
Sbjct: 157 RLPLGNKGCDDVHAMDT 173
>gi|356505773|ref|XP_003521664.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Glycine max]
Length = 170
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 65/144 (45%), Gaps = 8/144 (5%)
Query: 121 PDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAV 180
P S KSRGRP GS + K L + + P I V DV ++ F++ ++
Sbjct: 29 PPSSNKSRGRPLGSKNKPKIPLVINQDSDLALKPIFIQVPKNSDVIEAVVQFARQCQVSI 88
Query: 181 CILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSG 240
+ SA+G+I TL Q T G F ++SL+G+++ + S + S L
Sbjct: 89 TVQSASGSILEATLCQTLPDTSTFVVFGPFTLISLTGTYINNNCSFRISFCSNL------ 142
Query: 241 PDGRVLGGSVAGLLTAATPVQVVV 264
G+ G V G + A V VVV
Sbjct: 143 --GQSFTGIVGGKIIAGDDVNVVV 164
>gi|357497481|ref|XP_003619029.1| AT-hook protein, partial [Medicago truncatula]
gi|355494044|gb|AES75247.1| AT-hook protein, partial [Medicago truncatula]
Length = 157
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 48/154 (31%), Positives = 65/154 (42%), Gaps = 43/154 (27%)
Query: 170 MSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYE---------------------- 207
M+FS+N + ILSA G T+ G T TYE
Sbjct: 1 MTFSKNLSGNISILSAIGTTFKATI---CVDGKTQTYECIIILVKNTLTCLCGYMITSEI 57
Query: 208 ------------GRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLT 255
G+FEI+++ GSF + Q GL+VSL DG GG + +L
Sbjct: 58 DSNFLLFILFCHGKFEIITIGGSFFPVKKESQCEVFEGLNVSLIS-DGNAFGGKLIDILI 116
Query: 256 AATPVQVVVGSFLADGRKESKSSHRMESLPVPPK 289
AA+PVQVV+GS+ A +E K + PPK
Sbjct: 117 AASPVQVVLGSYPAGSNEEVKFDTKE-----PPK 145
>gi|367066222|gb|AEX12482.1| hypothetical protein 2_3808_01 [Pinus taeda]
gi|367066224|gb|AEX12483.1| hypothetical protein 2_3808_01 [Pinus taeda]
gi|367066226|gb|AEX12484.1| hypothetical protein 2_3808_01 [Pinus taeda]
gi|367066228|gb|AEX12485.1| hypothetical protein 2_3808_01 [Pinus taeda]
gi|367066230|gb|AEX12486.1| hypothetical protein 2_3808_01 [Pinus taeda]
gi|367066232|gb|AEX12487.1| hypothetical protein 2_3808_01 [Pinus taeda]
gi|367066234|gb|AEX12488.1| hypothetical protein 2_3808_01 [Pinus taeda]
gi|367066236|gb|AEX12489.1| hypothetical protein 2_3808_01 [Pinus taeda]
Length = 138
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 40/82 (48%), Positives = 51/82 (62%), Gaps = 5/82 (6%)
Query: 187 GAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVL 246
G ++NVTLRQ A VT GRFEILSLSGSFL + GL++ LS G+V+
Sbjct: 1 GTVTNVTLRQPAAPNAVVTLHGRFEILSLSGSFLPPPAPH-----TGLTIYLSSGQGQVV 55
Query: 247 GGSVAGLLTAATPVQVVVGSFL 268
GG+V G L A+ PV ++ SFL
Sbjct: 56 GGNVVGPLIASGPVIIMAASFL 77
>gi|125585739|gb|EAZ26403.1| hypothetical protein OsJ_10287 [Oryza sativa Japonica Group]
Length = 259
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 43/95 (45%), Positives = 60/95 (63%), Gaps = 5/95 (5%)
Query: 174 QNGPRAVCILSANGAISNVTLRQ-AATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTG 232
Q P + C + +GA++NVTLRQ A T V GRFEILS+SG+FL + + +
Sbjct: 96 QAAPASPCS-AGSGAVTNVTLRQPAGTGAAAVALRGRFEILSMSGAFLPAPAPPGAT--- 151
Query: 233 GLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
GL+V L+G G+V+GGSV G L A+ PV V+ +F
Sbjct: 152 GLAVYLAGGQGQVVGGSVMGELIASGPVMVIAATF 186
>gi|419217219|ref|ZP_13760215.1| putative DNA-binding protein [Escherichia coli DEC8D]
gi|378059808|gb|EHW22007.1| putative DNA-binding protein [Escherichia coli DEC8D]
Length = 143
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/115 (33%), Positives = 64/115 (55%), Gaps = 12/115 (10%)
Query: 162 GEDVSSKIMSFSQNGPR-AVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F+Q R A I G++++V LR A G T+ G FE++SL+G+
Sbjct: 22 GQEVFSQLHAFAQQHQRHAAWIAGCTGSLTDVALRYAGQEGTTLL-NGTFEVISLNGTL- 79
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E SG+ L + +S P G +LGG + T T +++V+GS LA R+
Sbjct: 80 --EQSGEH-----LHLCVSDPHGTMLGGHMMPGCTVRTTLELVIGSLEELAFSRQ 127
>gi|443696366|gb|ELT97084.1| hypothetical protein CAPTEDRAFT_151507 [Capitella teleta]
Length = 149
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/109 (33%), Positives = 58/109 (53%), Gaps = 12/109 (11%)
Query: 162 GEDVSSKIMSFSQNGP-RAVCILSANGAISNVTLRQAA---TSGGTVTYEGRFEILSLSG 217
GED+ + + F+Q R+ +LS G+++ TLR A + T+ FEIL+LSG
Sbjct: 22 GEDLITTLQEFAQKQQLRSAFVLSCCGSVTKATLRFAQKDDSENEIRTFNEHFEILALSG 81
Query: 218 SFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGS 266
+ S+G+ G L V+L +G+V+GG V G + T +VV+
Sbjct: 82 TL----SAGE----GHLHVALGDKEGKVIGGHVIGDMPIFTTAEVVIAE 122
>gi|419927348|ref|ZP_14445085.1| putative DNA-binding protein [Escherichia coli 541-1]
gi|388407577|gb|EIL67942.1| putative DNA-binding protein [Escherichia coli 541-1]
Length = 143
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/115 (33%), Positives = 65/115 (56%), Gaps = 12/115 (10%)
Query: 162 GEDVSSKIMSFSQ-NGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F+Q + AV I G++++V LR A G T+ G FE++SL+G+
Sbjct: 22 GQEVFSQLHAFAQQHQLHAVWIAGCTGSLTDVALRYAGQEGTTLL-NGTFEVISLNGTL- 79
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E SG+ L + +S P G +LGG + T T +++V+GS LA R+
Sbjct: 80 --EQSGEH-----LHLCVSDPHGTMLGGHMMPGCTVRTTLELVIGSLEELAFSRQ 127
>gi|416272080|ref|ZP_11643105.1| hypothetical protein SDB_03393 [Shigella dysenteriae CDC 74-1112]
gi|320174085|gb|EFW49253.1| hypothetical protein SDB_03393 [Shigella dysenteriae CDC 74-1112]
Length = 143
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 65/115 (56%), Gaps = 12/115 (10%)
Query: 162 GEDVSSKIMSFSQ-NGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F+Q + A I G++++V LR A G T+ G+FE++SL+G+
Sbjct: 22 GQEVFSQLHAFAQQHQLHAAWIAGCTGSLTDVALRYARQEGTTLL-NGKFEVISLNGTL- 79
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E SG+ L + +S P G +LGG + T T +++V+GS LA R+
Sbjct: 80 --EQSGEH-----LHLCVSDPHGTMLGGHMMPGCTVRTTLELVIGSLEELAFSRQ 127
>gi|357493845|ref|XP_003617211.1| DNA-binding protein [Medicago truncatula]
gi|355518546|gb|AET00170.1| DNA-binding protein [Medicago truncatula]
Length = 230
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/141 (27%), Positives = 66/141 (46%), Gaps = 3/141 (2%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
KK RGRPPGS + K + + I + +G+D+ +I++ + ++ +
Sbjct: 50 KKPRGRPPGSKNKPKPPVNIEENMDNNMKMIYIEIPSGKDIVGEIINCAHRYQASITVSR 109
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL---LSESSGQRSRTGGLSVSLSGP 241
G ++NVTL T T G FE+ SL G+++ ++ S+ LSG
Sbjct: 110 GYGLVTNVTLLNPKTHFPTPPMIGPFEMTSLLGTYVNINCRRNTLNHPPCSCFSILLSGH 169
Query: 242 DGRVLGGSVAGLLTAATPVQV 262
V GG+V G + AA+ V +
Sbjct: 170 GAVVYGGTVGGTIIAASNVWI 190
>gi|390342605|ref|XP_003725695.1| PREDICTED: bifunctional protein GlmU-like [Strongylocentrotus
purpuratus]
Length = 161
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 60/117 (51%), Gaps = 10/117 (8%)
Query: 152 FTPHVITVKAGEDVSSKIMSFSQ-NGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRF 210
T H + ++ GE++ +K++ + Q +G +A ILS G++ ++R A S + +
Sbjct: 10 MTCHALRLRPGEELKTKLLEYVQEHGLKAAFILSCVGSLRKASVRMA-DSVSVINVDKNH 68
Query: 211 EILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
EI+SL G+ S G G L +SLS G+V GG + G T +VV+G
Sbjct: 69 EIVSLVGTL----SGGH----GHLHISLSDEKGKVFGGHLLGSAEVFTTAEVVLGEL 117
>gi|357493957|ref|XP_003617267.1| hypothetical protein MTR_5g089700 [Medicago truncatula]
gi|355518602|gb|AET00226.1| hypothetical protein MTR_5g089700 [Medicago truncatula]
Length = 232
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 43/158 (27%), Positives = 70/158 (44%), Gaps = 7/158 (4%)
Query: 111 LSSPGGGPLSPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIM 170
L P P +P S K+SRGRP GS + K + I + AG DV I+
Sbjct: 62 LVMPTSPPRAPSS-KRSRGRPKGSKNKPKTPAVVMVEPQTLMKQIFIEIPAGYDVLESII 120
Query: 171 SFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSR 230
+ + +L G +S++T+ + + +T EG ++ SLSG+++ S
Sbjct: 121 KMAWRHEADITVLRGFGIVSDITIHSSLSHTPPLTIEGPVQMTSLSGTYVNPNVDNVPSE 180
Query: 231 T------GGLSVSLSGPDGRVLGGSVAGLLTAATPVQV 262
S+ LSG G+V GG V G + ++ V +
Sbjct: 181 VIANPACSSFSIFLSGSHGQVYGGIVVGKVMTSSVVMI 218
>gi|74313484|ref|YP_311903.1| hypothetical protein SSON_3080 [Shigella sonnei Ss046]
gi|383180088|ref|YP_005458093.1| hypothetical protein SSON53_17945 [Shigella sonnei 53G]
gi|414577688|ref|ZP_11434863.1| putative DNA-binding protein [Shigella sonnei 3233-85]
gi|415845494|ref|ZP_11525031.1| hypothetical protein SS53G_1742 [Shigella sonnei 53G]
gi|418268240|ref|ZP_12887039.1| putative DNA-binding protein [Shigella sonnei str. Moseley]
gi|420360246|ref|ZP_14861204.1| putative DNA-binding protein [Shigella sonnei 3226-85]
gi|420364911|ref|ZP_14865782.1| putative DNA-binding protein [Shigella sonnei 4822-66]
gi|73856961|gb|AAZ89668.1| conserved hypothetical protein [Shigella sonnei Ss046]
gi|323168026|gb|EFZ53715.1| hypothetical protein SS53G_1742 [Shigella sonnei 53G]
gi|391279386|gb|EIQ38074.1| putative DNA-binding protein [Shigella sonnei 3226-85]
gi|391283221|gb|EIQ41844.1| putative DNA-binding protein [Shigella sonnei 3233-85]
gi|391292844|gb|EIQ51155.1| putative DNA-binding protein [Shigella sonnei 4822-66]
gi|397897222|gb|EJL13632.1| putative DNA-binding protein [Shigella sonnei str. Moseley]
Length = 143
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 64/115 (55%), Gaps = 12/115 (10%)
Query: 162 GEDVSSKIMSFSQ-NGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F+Q + A I G++++V LR A G T+ G FE++SL+G+
Sbjct: 22 GQEVFSQLHAFAQQHQLHAAWIAGCTGSLTDVALRYAGQEGTTLL-NGTFEVISLNGTL- 79
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E SG+ L + +S P G +LGG + T T +++V+GS LA R+
Sbjct: 80 --EQSGEH-----LHLCVSDPHGTMLGGHMMPGCTVRTTLELVIGSLEELAFSRQ 127
>gi|157157292|ref|YP_001464267.1| hypothetical protein EcE24377A_3258 [Escherichia coli E24377A]
gi|191168199|ref|ZP_03029994.1| conserved hypothetical protein [Escherichia coli B7A]
gi|193067286|ref|ZP_03048254.1| conserved hypothetical protein [Escherichia coli E110019]
gi|209920383|ref|YP_002294467.1| hypothetical protein ECSE_3192 [Escherichia coli SE11]
gi|218555476|ref|YP_002388389.1| hypothetical protein ECIAI1_3048 [Escherichia coli IAI1]
gi|218696521|ref|YP_002404188.1| DNA-binding protein [Escherichia coli 55989]
gi|260845594|ref|YP_003223372.1| DNA-binding protein [Escherichia coli O103:H2 str. 12009]
gi|260869603|ref|YP_003236005.1| putative DNA-binding protein [Escherichia coli O111:H- str. 11128]
gi|300824822|ref|ZP_07104925.1| conserved hypothetical protein [Escherichia coli MS 119-7]
gi|309794002|ref|ZP_07688427.1| conserved hypothetical protein [Escherichia coli MS 145-7]
gi|331669665|ref|ZP_08370511.1| conserved hypothetical protein [Escherichia coli TA271]
gi|331678916|ref|ZP_08379590.1| conserved hypothetical protein [Escherichia coli H591]
gi|332280384|ref|ZP_08392797.1| conserved hypothetical protein [Shigella sp. D9]
gi|407470802|ref|YP_006782755.1| DNA-binding protein [Escherichia coli O104:H4 str. 2009EL-2071]
gi|407480537|ref|YP_006777686.1| DNA-binding protein [Escherichia coli O104:H4 str. 2011C-3493]
gi|410481103|ref|YP_006768649.1| DNA-binding protein [Escherichia coli O104:H4 str. 2009EL-2050]
gi|415787084|ref|ZP_11493817.1| hypothetical protein ECEPECA14_3423 [Escherichia coli EPECa14]
gi|415811488|ref|ZP_11503838.1| hypothetical protein ECLT68_2182 [Escherichia coli LT-68]
gi|415818629|ref|ZP_11508351.1| hypothetical protein ECOK1180_1057 [Escherichia coli OK1180]
gi|416340353|ref|ZP_11675368.1| hypothetical protein ECoL_00252 [Escherichia coli EC4100B]
gi|417123213|ref|ZP_11972123.1| PF03479 domain protein [Escherichia coli 97.0246]
gi|417151069|ref|ZP_11990808.1| PF03479 domain protein [Escherichia coli 1.2264]
gi|417175563|ref|ZP_12005359.1| PF03479 domain protein [Escherichia coli 3.2608]
gi|417186348|ref|ZP_12011491.1| PF03479 domain protein [Escherichia coli 93.0624]
gi|417199964|ref|ZP_12017201.1| PF03479 domain protein [Escherichia coli 4.0522]
gi|417211543|ref|ZP_12021842.1| PF03479 domain protein [Escherichia coli JB1-95]
gi|417222679|ref|ZP_12026119.1| PF03479 domain protein [Escherichia coli 96.154]
gi|417237339|ref|ZP_12035306.1| PF03479 domain protein [Escherichia coli 9.0111]
gi|417269028|ref|ZP_12056388.1| PF03479 domain protein [Escherichia coli 3.3884]
gi|417296699|ref|ZP_12083946.1| PF03479 domain protein [Escherichia coli 900105 (10e)]
gi|417593277|ref|ZP_12243970.1| hypothetical protein EC253486_3901 [Escherichia coli 2534-86]
gi|417603621|ref|ZP_12254188.1| hypothetical protein ECSTEC94C_3443 [Escherichia coli STEC_94C]
gi|417718964|ref|ZP_12367856.1| hypothetical protein SFK227_3716 [Shigella flexneri K-227]
gi|417806466|ref|ZP_12453407.1| hypothetical protein HUSEC_16308 [Escherichia coli O104:H4 str.
LB226692]
gi|417834215|ref|ZP_12480661.1| hypothetical protein HUSEC41_15953 [Escherichia coli O104:H4 str.
01-09591]
gi|417867396|ref|ZP_12512433.1| hypothetical protein C22711_4323 [Escherichia coli O104:H4 str.
C227-11]
gi|419198566|ref|ZP_13741863.1| putative DNA-binding protein [Escherichia coli DEC8A]
gi|419204994|ref|ZP_13748167.1| putative DNA-binding protein [Escherichia coli DEC8B]
gi|419211340|ref|ZP_13754409.1| putative DNA-binding protein [Escherichia coli DEC8C]
gi|419222959|ref|ZP_13765875.1| putative DNA-binding protein [Escherichia coli DEC8E]
gi|419228373|ref|ZP_13771220.1| putative DNA-binding protein [Escherichia coli DEC9A]
gi|419233743|ref|ZP_13776515.1| putative DNA-binding protein [Escherichia coli DEC9B]
gi|419239360|ref|ZP_13782071.1| putative DNA-binding protein [Escherichia coli DEC9C]
gi|419244878|ref|ZP_13787513.1| putative DNA-binding protein [Escherichia coli DEC9D]
gi|419250693|ref|ZP_13793265.1| putative DNA-binding protein [Escherichia coli DEC9E]
gi|419256490|ref|ZP_13798996.1| putative DNA-binding protein [Escherichia coli DEC10A]
gi|419262791|ref|ZP_13805202.1| putative DNA-binding protein [Escherichia coli DEC10B]
gi|419268932|ref|ZP_13811277.1| putative DNA-binding protein [Escherichia coli DEC10C]
gi|419274238|ref|ZP_13816529.1| putative DNA-binding protein [Escherichia coli DEC10D]
gi|419279453|ref|ZP_13821697.1| putative DNA-binding protein [Escherichia coli DEC10E]
gi|419285632|ref|ZP_13827801.1| putative DNA-binding protein [Escherichia coli DEC10F]
gi|419301729|ref|ZP_13843726.1| putative DNA-binding protein [Escherichia coli DEC11C]
gi|419346609|ref|ZP_13887980.1| putative DNA-binding protein [Escherichia coli DEC13A]
gi|419351073|ref|ZP_13892406.1| putative DNA-binding protein [Escherichia coli DEC13B]
gi|419356476|ref|ZP_13897728.1| putative DNA-binding protein [Escherichia coli DEC13C]
gi|419361547|ref|ZP_13902760.1| putative DNA-binding protein [Escherichia coli DEC13D]
gi|419366672|ref|ZP_13907827.1| putative DNA-binding protein [Escherichia coli DEC13E]
gi|419371415|ref|ZP_13912528.1| putative DNA-binding protein [Escherichia coli DEC14A]
gi|419376917|ref|ZP_13917940.1| putative DNA-binding protein [Escherichia coli DEC14B]
gi|419382224|ref|ZP_13923170.1| putative DNA-binding protein [Escherichia coli DEC14C]
gi|419387563|ref|ZP_13928435.1| putative DNA-binding protein [Escherichia coli DEC14D]
gi|419393051|ref|ZP_13933854.1| putative DNA-binding protein [Escherichia coli DEC15A]
gi|419398157|ref|ZP_13938920.1| putative DNA-binding protein [Escherichia coli DEC15B]
gi|419403440|ref|ZP_13944160.1| putative DNA-binding protein [Escherichia coli DEC15C]
gi|419408598|ref|ZP_13949284.1| putative DNA-binding protein [Escherichia coli DEC15D]
gi|419414139|ref|ZP_13954779.1| putative DNA-binding protein [Escherichia coli DEC15E]
gi|419807197|ref|ZP_14332269.1| hypothetical protein ECAI27_39120 [Escherichia coli AI27]
gi|419864626|ref|ZP_14387054.1| putative DNA-binding protein [Escherichia coli O103:H25 str.
CVM9340]
gi|419867792|ref|ZP_14390107.1| putative DNA-binding protein [Escherichia coli O103:H2 str.
CVM9450]
gi|419874204|ref|ZP_14396151.1| putative DNA-binding protein [Escherichia coli O111:H11 str.
CVM9534]
gi|419879878|ref|ZP_14401298.1| putative DNA-binding protein [Escherichia coli O111:H11 str.
CVM9545]
gi|419886437|ref|ZP_14407078.1| putative DNA-binding protein [Escherichia coli O111:H8 str.
CVM9570]
gi|419892758|ref|ZP_14412765.1| putative DNA-binding protein [Escherichia coli O111:H8 str.
CVM9574]
gi|419899136|ref|ZP_14418661.1| putative DNA-binding protein [Escherichia coli O26:H11 str.
CVM9942]
gi|419910196|ref|ZP_14428723.1| putative DNA-binding protein [Escherichia coli O26:H11 str.
CVM10026]
gi|419924090|ref|ZP_14441988.1| hypothetical protein EC54115_13688 [Escherichia coli 541-15]
gi|419948237|ref|ZP_14464537.1| hypothetical protein ECMT8_02961 [Escherichia coli CUMT8]
gi|420089563|ref|ZP_14601346.1| putative DNA-binding protein [Escherichia coli O111:H8 str.
CVM9602]
gi|420094419|ref|ZP_14606010.1| putative DNA-binding protein [Escherichia coli O111:H8 str.
CVM9634]
gi|420112040|ref|ZP_14621851.1| putative DNA-binding protein [Escherichia coli O111:H11 str.
CVM9553]
gi|420112953|ref|ZP_14622729.1| putative DNA-binding protein [Escherichia coli O26:H11 str.
CVM10021]
gi|420120573|ref|ZP_14629771.1| putative DNA-binding protein [Escherichia coli O26:H11 str.
CVM10030]
gi|420129289|ref|ZP_14637826.1| putative DNA-binding protein [Escherichia coli O26:H11 str.
CVM10224]
gi|420132313|ref|ZP_14640682.1| putative DNA-binding protein [Escherichia coli O26:H11 str.
CVM9952]
gi|422010534|ref|ZP_16357492.1| putative DNA-binding protein [Escherichia coli O111:H11 str.
CVM9455]
gi|422354784|ref|ZP_16435509.1| hypothetical protein HMPREF9542_04104 [Escherichia coli MS 117-3]
gi|422760384|ref|ZP_16814144.1| hypothetical protein ERBG_00308 [Escherichia coli E1167]
gi|422775852|ref|ZP_16829507.1| hypothetical protein EREG_01829 [Escherichia coli H120]
gi|422989039|ref|ZP_16979812.1| hypothetical protein EUAG_04154 [Escherichia coli O104:H4 str.
C227-11]
gi|422995931|ref|ZP_16986695.1| hypothetical protein EUBG_03582 [Escherichia coli O104:H4 str.
C236-11]
gi|423001077|ref|ZP_16991831.1| hypothetical protein EUEG_03494 [Escherichia coli O104:H4 str.
09-7901]
gi|423004745|ref|ZP_16995491.1| hypothetical protein EUDG_02229 [Escherichia coli O104:H4 str.
04-8351]
gi|423011248|ref|ZP_17001982.1| hypothetical protein EUFG_03574 [Escherichia coli O104:H4 str.
11-3677]
gi|423020476|ref|ZP_17011185.1| hypothetical protein EUHG_03586 [Escherichia coli O104:H4 str.
11-4404]
gi|423025642|ref|ZP_17016339.1| hypothetical protein EUIG_03587 [Escherichia coli O104:H4 str.
11-4522]
gi|423031463|ref|ZP_17022150.1| hypothetical protein EUJG_04905 [Escherichia coli O104:H4 str.
11-4623]
gi|423039288|ref|ZP_17029962.1| hypothetical protein EUKG_03565 [Escherichia coli O104:H4 str.
11-4632 C1]
gi|423044408|ref|ZP_17035075.1| hypothetical protein EULG_03583 [Escherichia coli O104:H4 str.
11-4632 C2]
gi|423046137|ref|ZP_17036797.1| hypothetical protein EUMG_03155 [Escherichia coli O104:H4 str.
11-4632 C3]
gi|423054675|ref|ZP_17043482.1| hypothetical protein EUNG_04392 [Escherichia coli O104:H4 str.
11-4632 C4]
gi|423061650|ref|ZP_17050446.1| hypothetical protein EUOG_03590 [Escherichia coli O104:H4 str.
11-4632 C5]
gi|423707086|ref|ZP_17681469.1| hypothetical protein ESTG_01562 [Escherichia coli B799]
gi|424748294|ref|ZP_18176441.1| putative DNA-binding protein [Escherichia coli O26:H11 str.
CFSAN001629]
gi|424758234|ref|ZP_18185950.1| putative DNA-binding protein [Escherichia coli O111:H11 str.
CFSAN001630]
gi|424773886|ref|ZP_18200937.1| putative DNA-binding protein [Escherichia coli O111:H8 str.
CFSAN001632]
gi|425381140|ref|ZP_18765148.1| bifunctional protein glmU [Escherichia coli EC1865]
gi|425423774|ref|ZP_18804937.1| bifunctional protein glmU [Escherichia coli 0.1288]
gi|429720507|ref|ZP_19255432.1| hypothetical protein MO3_03217 [Escherichia coli O104:H4 str.
Ec11-9450]
gi|429772405|ref|ZP_19304425.1| hypothetical protein C212_02188 [Escherichia coli O104:H4 str.
11-02030]
gi|429777352|ref|ZP_19309326.1| hypothetical protein C213_02186 [Escherichia coli O104:H4 str.
11-02033-1]
gi|429786077|ref|ZP_19317972.1| hypothetical protein C214_02184 [Escherichia coli O104:H4 str.
11-02092]
gi|429791967|ref|ZP_19323821.1| hypothetical protein C215_02185 [Escherichia coli O104:H4 str.
11-02093]
gi|429792816|ref|ZP_19324664.1| hypothetical protein C216_02187 [Escherichia coli O104:H4 str.
11-02281]
gi|429799391|ref|ZP_19331189.1| hypothetical protein C217_02184 [Escherichia coli O104:H4 str.
11-02318]
gi|429803008|ref|ZP_19334768.1| hypothetical protein C218_02184 [Escherichia coli O104:H4 str.
11-02913]
gi|429812804|ref|ZP_19344487.1| hypothetical protein C219_02184 [Escherichia coli O104:H4 str.
11-03439]
gi|429813352|ref|ZP_19345031.1| hypothetical protein C220_02185 [Escherichia coli O104:H4 str.
11-04080]
gi|429818560|ref|ZP_19350194.1| hypothetical protein C221_02184 [Escherichia coli O104:H4 str.
11-03943]
gi|429904911|ref|ZP_19370890.1| hypothetical protein MO5_01836 [Escherichia coli O104:H4 str.
Ec11-9990]
gi|429909047|ref|ZP_19375011.1| hypothetical protein MO7_01816 [Escherichia coli O104:H4 str.
Ec11-9941]
gi|429914921|ref|ZP_19380868.1| hypothetical protein O7C_01839 [Escherichia coli O104:H4 str.
Ec11-4984]
gi|429919951|ref|ZP_19385882.1| hypothetical protein O7E_01841 [Escherichia coli O104:H4 str.
Ec11-5604]
gi|429925771|ref|ZP_19391684.1| hypothetical protein O7G_02660 [Escherichia coli O104:H4 str.
Ec11-4986]
gi|429929707|ref|ZP_19395609.1| hypothetical protein O7I_01532 [Escherichia coli O104:H4 str.
Ec11-4987]
gi|429936246|ref|ZP_19402132.1| hypothetical protein O7K_03083 [Escherichia coli O104:H4 str.
Ec11-4988]
gi|429941926|ref|ZP_19407800.1| hypothetical protein O7M_03659 [Escherichia coli O104:H4 str.
Ec11-5603]
gi|429944607|ref|ZP_19410469.1| hypothetical protein O7O_01154 [Escherichia coli O104:H4 str.
Ec11-6006]
gi|429952165|ref|ZP_19418011.1| hypothetical protein S7Y_03615 [Escherichia coli O104:H4 str.
Ec12-0465]
gi|429955514|ref|ZP_19421346.1| hypothetical protein S91_01917 [Escherichia coli O104:H4 str.
Ec12-0466]
gi|432378107|ref|ZP_19621093.1| hypothetical protein WCQ_02996 [Escherichia coli KTE12]
gi|432482247|ref|ZP_19724198.1| hypothetical protein A15U_03379 [Escherichia coli KTE210]
gi|432676033|ref|ZP_19911487.1| hypothetical protein A1YU_02584 [Escherichia coli KTE142]
gi|432751395|ref|ZP_19985978.1| hypothetical protein WEQ_02813 [Escherichia coli KTE29]
gi|432766287|ref|ZP_20000704.1| hypothetical protein A1S5_03850 [Escherichia coli KTE48]
gi|432810620|ref|ZP_20044498.1| hypothetical protein A1WM_01783 [Escherichia coli KTE101]
gi|432828557|ref|ZP_20062175.1| hypothetical protein A1YM_00324 [Escherichia coli KTE135]
gi|432968990|ref|ZP_20157902.1| hypothetical protein A15G_04110 [Escherichia coli KTE203]
gi|433093309|ref|ZP_20279567.1| hypothetical protein WK1_02953 [Escherichia coli KTE138]
gi|157079322|gb|ABV19030.1| conserved hypothetical protein [Escherichia coli E24377A]
gi|190901741|gb|EDV61495.1| conserved hypothetical protein [Escherichia coli B7A]
gi|192959243|gb|EDV89678.1| conserved hypothetical protein [Escherichia coli E110019]
gi|209913642|dbj|BAG78716.1| conserved hypothetical protein [Escherichia coli SE11]
gi|218353253|emb|CAU99195.1| conserved hypothetical protein with PD1-like DNA-binding motif
[Escherichia coli 55989]
gi|218362244|emb|CAQ99863.1| conserved hypothetical protein with PD1-like DNA-binding motif
[Escherichia coli IAI1]
gi|257760741|dbj|BAI32238.1| putative DNA-binding protein [Escherichia coli O103:H2 str. 12009]
gi|257765959|dbj|BAI37454.1| putative DNA-binding protein [Escherichia coli O111:H- str. 11128]
gi|300522660|gb|EFK43729.1| conserved hypothetical protein [Escherichia coli MS 119-7]
gi|308122409|gb|EFO59671.1| conserved hypothetical protein [Escherichia coli MS 145-7]
gi|320202590|gb|EFW77160.1| hypothetical protein ECoL_00252 [Escherichia coli EC4100B]
gi|323154623|gb|EFZ40822.1| hypothetical protein ECEPECA14_3423 [Escherichia coli EPECa14]
gi|323173863|gb|EFZ59492.1| hypothetical protein ECLT68_2182 [Escherichia coli LT-68]
gi|323180375|gb|EFZ65927.1| hypothetical protein ECOK1180_1057 [Escherichia coli OK1180]
gi|323946587|gb|EGB42610.1| hypothetical protein EREG_01829 [Escherichia coli H120]
gi|324017248|gb|EGB86467.1| hypothetical protein HMPREF9542_04104 [Escherichia coli MS 117-3]
gi|324119720|gb|EGC13600.1| hypothetical protein ERBG_00308 [Escherichia coli E1167]
gi|331063333|gb|EGI35246.1| conserved hypothetical protein [Escherichia coli TA271]
gi|331073746|gb|EGI45067.1| conserved hypothetical protein [Escherichia coli H591]
gi|332102736|gb|EGJ06082.1| conserved hypothetical protein [Shigella sp. D9]
gi|333015260|gb|EGK34602.1| hypothetical protein SFK227_3716 [Shigella flexneri K-227]
gi|340733211|gb|EGR62343.1| hypothetical protein HUSEC41_15953 [Escherichia coli O104:H4 str.
01-09591]
gi|340738928|gb|EGR73168.1| hypothetical protein HUSEC_16308 [Escherichia coli O104:H4 str.
LB226692]
gi|341920685|gb|EGT70291.1| hypothetical protein C22711_4323 [Escherichia coli O104:H4 str.
C227-11]
gi|345335369|gb|EGW67808.1| hypothetical protein EC253486_3901 [Escherichia coli 2534-86]
gi|345349143|gb|EGW81434.1| hypothetical protein ECSTEC94C_3443 [Escherichia coli STEC_94C]
gi|354862766|gb|EHF23204.1| hypothetical protein EUBG_03582 [Escherichia coli O104:H4 str.
C236-11]
gi|354868050|gb|EHF28472.1| hypothetical protein EUAG_04154 [Escherichia coli O104:H4 str.
C227-11]
gi|354868445|gb|EHF28863.1| hypothetical protein EUDG_02229 [Escherichia coli O104:H4 str.
04-8351]
gi|354874048|gb|EHF34425.1| hypothetical protein EUEG_03494 [Escherichia coli O104:H4 str.
09-7901]
gi|354880731|gb|EHF41067.1| hypothetical protein EUFG_03574 [Escherichia coli O104:H4 str.
11-3677]
gi|354887885|gb|EHF48150.1| hypothetical protein EUHG_03586 [Escherichia coli O104:H4 str.
11-4404]
gi|354892473|gb|EHF52682.1| hypothetical protein EUIG_03587 [Escherichia coli O104:H4 str.
11-4522]
gi|354893679|gb|EHF53882.1| hypothetical protein EUKG_03565 [Escherichia coli O104:H4 str.
11-4632 C1]
gi|354896482|gb|EHF56653.1| hypothetical protein EUJG_04905 [Escherichia coli O104:H4 str.
11-4623]
gi|354897859|gb|EHF58016.1| hypothetical protein EULG_03583 [Escherichia coli O104:H4 str.
11-4632 C2]
gi|354911711|gb|EHF71715.1| hypothetical protein EUOG_03590 [Escherichia coli O104:H4 str.
11-4632 C5]
gi|354913660|gb|EHF73650.1| hypothetical protein EUMG_03155 [Escherichia coli O104:H4 str.
11-4632 C3]
gi|354916617|gb|EHF76589.1| hypothetical protein EUNG_04392 [Escherichia coli O104:H4 str.
11-4632 C4]
gi|378045111|gb|EHW07517.1| putative DNA-binding protein [Escherichia coli DEC8A]
gi|378046189|gb|EHW08569.1| putative DNA-binding protein [Escherichia coli DEC8B]
gi|378050535|gb|EHW12862.1| putative DNA-binding protein [Escherichia coli DEC8C]
gi|378063768|gb|EHW25932.1| putative DNA-binding protein [Escherichia coli DEC8E]
gi|378071618|gb|EHW33687.1| putative DNA-binding protein [Escherichia coli DEC9A]
gi|378075550|gb|EHW37564.1| putative DNA-binding protein [Escherichia coli DEC9B]
gi|378082554|gb|EHW44499.1| putative DNA-binding protein [Escherichia coli DEC9C]
gi|378088840|gb|EHW50690.1| putative DNA-binding protein [Escherichia coli DEC9D]
gi|378092562|gb|EHW54384.1| putative DNA-binding protein [Escherichia coli DEC9E]
gi|378098727|gb|EHW60459.1| putative DNA-binding protein [Escherichia coli DEC10A]
gi|378104753|gb|EHW66411.1| putative DNA-binding protein [Escherichia coli DEC10B]
gi|378109438|gb|EHW71049.1| putative DNA-binding protein [Escherichia coli DEC10C]
gi|378114944|gb|EHW76495.1| putative DNA-binding protein [Escherichia coli DEC10D]
gi|378126732|gb|EHW88126.1| putative DNA-binding protein [Escherichia coli DEC10E]
gi|378129662|gb|EHW91033.1| putative DNA-binding protein [Escherichia coli DEC10F]
gi|378149328|gb|EHX10455.1| putative DNA-binding protein [Escherichia coli DEC11C]
gi|378184556|gb|EHX45192.1| putative DNA-binding protein [Escherichia coli DEC13A]
gi|378198301|gb|EHX58772.1| putative DNA-binding protein [Escherichia coli DEC13C]
gi|378198660|gb|EHX59130.1| putative DNA-binding protein [Escherichia coli DEC13B]
gi|378201750|gb|EHX62193.1| putative DNA-binding protein [Escherichia coli DEC13D]
gi|378211146|gb|EHX71490.1| putative DNA-binding protein [Escherichia coli DEC13E]
gi|378215552|gb|EHX75849.1| putative DNA-binding protein [Escherichia coli DEC14A]
gi|378218464|gb|EHX78736.1| putative DNA-binding protein [Escherichia coli DEC14B]
gi|378226720|gb|EHX86906.1| putative DNA-binding protein [Escherichia coli DEC14C]
gi|378229948|gb|EHX90079.1| putative DNA-binding protein [Escherichia coli DEC14D]
gi|378236019|gb|EHX96074.1| putative DNA-binding protein [Escherichia coli DEC15A]
gi|378241091|gb|EHY01058.1| putative DNA-binding protein [Escherichia coli DEC15B]
gi|378245695|gb|EHY05632.1| putative DNA-binding protein [Escherichia coli DEC15C]
gi|378253159|gb|EHY13037.1| putative DNA-binding protein [Escherichia coli DEC15D]
gi|378258122|gb|EHY17953.1| putative DNA-binding protein [Escherichia coli DEC15E]
gi|384469812|gb|EIE53951.1| hypothetical protein ECAI27_39120 [Escherichia coli AI27]
gi|385710637|gb|EIG47614.1| hypothetical protein ESTG_01562 [Escherichia coli B799]
gi|386146604|gb|EIG93049.1| PF03479 domain protein [Escherichia coli 97.0246]
gi|386160563|gb|EIH22374.1| PF03479 domain protein [Escherichia coli 1.2264]
gi|386178255|gb|EIH55734.1| PF03479 domain protein [Escherichia coli 3.2608]
gi|386182340|gb|EIH65098.1| PF03479 domain protein [Escherichia coli 93.0624]
gi|386187767|gb|EIH76580.1| PF03479 domain protein [Escherichia coli 4.0522]
gi|386195117|gb|EIH89353.1| PF03479 domain protein [Escherichia coli JB1-95]
gi|386202481|gb|EII01472.1| PF03479 domain protein [Escherichia coli 96.154]
gi|386214424|gb|EII24847.1| PF03479 domain protein [Escherichia coli 9.0111]
gi|386227833|gb|EII55189.1| PF03479 domain protein [Escherichia coli 3.3884]
gi|386260143|gb|EIJ15617.1| PF03479 domain protein [Escherichia coli 900105 (10e)]
gi|388339607|gb|EIL05960.1| putative DNA-binding protein [Escherichia coli O103:H25 str.
CVM9340]
gi|388346865|gb|EIL12575.1| putative DNA-binding protein [Escherichia coli O103:H2 str.
CVM9450]
gi|388351357|gb|EIL16598.1| putative DNA-binding protein [Escherichia coli O111:H11 str.
CVM9534]
gi|388365642|gb|EIL29425.1| putative DNA-binding protein [Escherichia coli O111:H8 str.
CVM9570]
gi|388368919|gb|EIL32539.1| putative DNA-binding protein [Escherichia coli O111:H8 str.
CVM9574]
gi|388370360|gb|EIL33890.1| putative DNA-binding protein [Escherichia coli O111:H11 str.
CVM9545]
gi|388372031|gb|EIL35481.1| putative DNA-binding protein [Escherichia coli O26:H11 str.
CVM10026]
gi|388380473|gb|EIL43076.1| putative DNA-binding protein [Escherichia coli O26:H11 str.
CVM9942]
gi|388391094|gb|EIL52568.1| hypothetical protein EC54115_13688 [Escherichia coli 541-15]
gi|388421658|gb|EIL81263.1| hypothetical protein ECMT8_02961 [Escherichia coli CUMT8]
gi|394383215|gb|EJE60821.1| putative DNA-binding protein [Escherichia coli O26:H11 str.
CVM10224]
gi|394387300|gb|EJE64758.1| putative DNA-binding protein [Escherichia coli O111:H8 str.
CVM9602]
gi|394394081|gb|EJE70710.1| putative DNA-binding protein [Escherichia coli O111:H11 str.
CVM9455]
gi|394396269|gb|EJE72645.1| putative DNA-binding protein [Escherichia coli O111:H8 str.
CVM9634]
gi|394397366|gb|EJE73639.1| putative DNA-binding protein [Escherichia coli O111:H11 str.
CVM9553]
gi|394413479|gb|EJE87518.1| putative DNA-binding protein [Escherichia coli O26:H11 str.
CVM10021]
gi|394428870|gb|EJF01355.1| putative DNA-binding protein [Escherichia coli O26:H11 str.
CVM10030]
gi|394429972|gb|EJF02355.1| putative DNA-binding protein [Escherichia coli O26:H11 str.
CVM9952]
gi|406776265|gb|AFS55689.1| putative DNA-binding protein [Escherichia coli O104:H4 str.
2009EL-2050]
gi|407052834|gb|AFS72885.1| putative DNA-binding protein [Escherichia coli O104:H4 str.
2011C-3493]
gi|407066837|gb|AFS87884.1| putative DNA-binding protein [Escherichia coli O104:H4 str.
2009EL-2071]
gi|408295074|gb|EKJ13416.1| bifunctional protein glmU [Escherichia coli EC1865]
gi|408342637|gb|EKJ57064.1| bifunctional protein glmU [Escherichia coli 0.1288]
gi|421935384|gb|EKT93076.1| putative DNA-binding protein [Escherichia coli O111:H8 str.
CFSAN001632]
gi|421944924|gb|EKU02163.1| putative DNA-binding protein [Escherichia coli O26:H11 str.
CFSAN001629]
gi|421948747|gb|EKU05751.1| putative DNA-binding protein [Escherichia coli O111:H11 str.
CFSAN001630]
gi|429347607|gb|EKY84380.1| hypothetical protein C214_02184 [Escherichia coli O104:H4 str.
11-02092]
gi|429358643|gb|EKY95312.1| hypothetical protein C212_02188 [Escherichia coli O104:H4 str.
11-02030]
gi|429360388|gb|EKY97047.1| hypothetical protein C213_02186 [Escherichia coli O104:H4 str.
11-02033-1]
gi|429360699|gb|EKY97357.1| hypothetical protein C215_02185 [Escherichia coli O104:H4 str.
11-02093]
gi|429364067|gb|EKZ00692.1| hypothetical protein C217_02184 [Escherichia coli O104:H4 str.
11-02318]
gi|429375622|gb|EKZ12156.1| hypothetical protein C216_02187 [Escherichia coli O104:H4 str.
11-02281]
gi|429378030|gb|EKZ14545.1| hypothetical protein C219_02184 [Escherichia coli O104:H4 str.
11-03439]
gi|429389675|gb|EKZ26095.1| hypothetical protein C218_02184 [Escherichia coli O104:H4 str.
11-02913]
gi|429393509|gb|EKZ29904.1| hypothetical protein C221_02184 [Escherichia coli O104:H4 str.
11-03943]
gi|429403513|gb|EKZ39797.1| hypothetical protein C220_02185 [Escherichia coli O104:H4 str.
11-04080]
gi|429404698|gb|EKZ40969.1| hypothetical protein MO5_01836 [Escherichia coli O104:H4 str.
Ec11-9990]
gi|429408213|gb|EKZ44453.1| hypothetical protein MO3_03217 [Escherichia coli O104:H4 str.
Ec11-9450]
gi|429413317|gb|EKZ49506.1| hypothetical protein O7I_01532 [Escherichia coli O104:H4 str.
Ec11-4987]
gi|429416046|gb|EKZ52204.1| hypothetical protein O7C_01839 [Escherichia coli O104:H4 str.
Ec11-4984]
gi|429419727|gb|EKZ55862.1| hypothetical protein O7G_02660 [Escherichia coli O104:H4 str.
Ec11-4986]
gi|429430566|gb|EKZ66627.1| hypothetical protein O7K_03083 [Escherichia coli O104:H4 str.
Ec11-4988]
gi|429434932|gb|EKZ70953.1| hypothetical protein O7M_03659 [Escherichia coli O104:H4 str.
Ec11-5603]
gi|429437065|gb|EKZ73077.1| hypothetical protein O7O_01154 [Escherichia coli O104:H4 str.
Ec11-6006]
gi|429442014|gb|EKZ77977.1| hypothetical protein O7E_01841 [Escherichia coli O104:H4 str.
Ec11-5604]
gi|429446735|gb|EKZ82663.1| hypothetical protein S7Y_03615 [Escherichia coli O104:H4 str.
Ec12-0465]
gi|429450347|gb|EKZ86243.1| hypothetical protein MO7_01816 [Escherichia coli O104:H4 str.
Ec11-9941]
gi|429456104|gb|EKZ91951.1| hypothetical protein S91_01917 [Escherichia coli O104:H4 str.
Ec12-0466]
gi|430897359|gb|ELC19569.1| hypothetical protein WCQ_02996 [Escherichia coli KTE12]
gi|431004749|gb|ELD19958.1| hypothetical protein A15U_03379 [Escherichia coli KTE210]
gi|431212738|gb|ELF10664.1| hypothetical protein A1YU_02584 [Escherichia coli KTE142]
gi|431294571|gb|ELF84750.1| hypothetical protein WEQ_02813 [Escherichia coli KTE29]
gi|431308341|gb|ELF96621.1| hypothetical protein A1S5_03850 [Escherichia coli KTE48]
gi|431360971|gb|ELG47570.1| hypothetical protein A1WM_01783 [Escherichia coli KTE101]
gi|431383411|gb|ELG67535.1| hypothetical protein A1YM_00324 [Escherichia coli KTE135]
gi|431468700|gb|ELH48633.1| hypothetical protein A15G_04110 [Escherichia coli KTE203]
gi|431608590|gb|ELI77932.1| hypothetical protein WK1_02953 [Escherichia coli KTE138]
Length = 143
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 64/115 (55%), Gaps = 12/115 (10%)
Query: 162 GEDVSSKIMSFSQ-NGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F+Q + A I G++++V LR A G T+ G FE++SL+G+
Sbjct: 22 GQEVFSQLHAFAQQHQLHAAWIAGCTGSLTDVALRYAGQEGTTLL-NGTFEVISLNGTL- 79
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E SG+ L + +S P G +LGG + T T +++V+GS LA R+
Sbjct: 80 --EQSGEH-----LHLCVSDPHGTMLGGHMMPGCTVRTTLELVIGSLEELAFSRQ 127
>gi|301327334|ref|ZP_07220587.1| conserved hypothetical protein [Escherichia coli MS 78-1]
gi|300846066|gb|EFK73826.1| conserved hypothetical protein [Escherichia coli MS 78-1]
Length = 143
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 64/115 (55%), Gaps = 12/115 (10%)
Query: 162 GEDVSSKIMSFSQ-NGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F+Q + A I G++++V LR A G T+ G FE++SL+G+
Sbjct: 22 GQEVFSQLHAFAQQHQLHAAWIAGCTGSLTDVALRYAGQEGTTLL-NGTFEVISLNGTL- 79
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E SG+ L + +S P G +LGG + T T +++V+GS LA R+
Sbjct: 80 --EQSGEH-----LHLCVSDPHGTMLGGHMMPGCTVRTTLELVIGSLEELAFSRQ 127
>gi|260857049|ref|YP_003230940.1| DNA-binding protein [Escherichia coli O26:H11 str. 11368]
gi|257755698|dbj|BAI27200.1| putative DNA-binding protein [Escherichia coli O26:H11 str. 11368]
Length = 142
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 64/115 (55%), Gaps = 12/115 (10%)
Query: 162 GEDVSSKIMSFSQ-NGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F+Q + A I G++++V LR A G T+ G FE++SL+G+
Sbjct: 21 GQEVFSQLHAFAQQHQLHAAWIAGCTGSLTDVALRYAGQEGTTLL-NGTFEVISLNGTL- 78
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E SG+ L + +S P G +LGG + T T +++V+GS LA R+
Sbjct: 79 --EQSGEH-----LHLCVSDPHGTMLGGHMMPGCTVRTTLELVIGSLEELAFSRQ 126
>gi|82545452|ref|YP_409399.1| hypothetical protein SBO_3065 [Shigella boydii Sb227]
gi|187733710|ref|YP_001881697.1| hypothetical protein SbBS512_E3353 [Shigella boydii CDC 3083-94]
gi|417683734|ref|ZP_12333078.1| hypothetical protein SB359474_3559 [Shigella boydii 3594-74]
gi|420337579|ref|ZP_14839141.1| putative DNA-binding protein [Shigella flexneri K-315]
gi|81246863|gb|ABB67571.1| conserved hypothetical protein [Shigella boydii Sb227]
gi|187430702|gb|ACD09976.1| conserved hypothetical protein [Shigella boydii CDC 3083-94]
gi|332091326|gb|EGI96414.1| hypothetical protein SB359474_3559 [Shigella boydii 3594-74]
gi|391259453|gb|EIQ18527.1| putative DNA-binding protein [Shigella flexneri K-315]
Length = 143
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 64/115 (55%), Gaps = 12/115 (10%)
Query: 162 GEDVSSKIMSFSQ-NGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F+Q + A I G++++V LR A G T+ G FE++SL+G+
Sbjct: 22 GQEVFSQLHAFAQQHQLHAAWIAGCTGSLTDVALRYARQEGTTLL-NGTFEVISLNGTL- 79
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E SG+ L + +S P G +LGG + T T +++V+GS LA R+
Sbjct: 80 --EQSGEH-----LHLCVSDPHGTMLGGHMMPGCTVRTTLELVIGSLEELAFSRQ 127
>gi|417229191|ref|ZP_12030949.1| PF03479 domain protein [Escherichia coli 5.0959]
gi|386208526|gb|EII13031.1| PF03479 domain protein [Escherichia coli 5.0959]
Length = 143
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 64/115 (55%), Gaps = 12/115 (10%)
Query: 162 GEDVSSKIMSFSQ-NGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F+Q + A I G++++V LR A G T+ G FE++SL+G+
Sbjct: 22 GQEVFSQLHAFAQLHQLHAAWIAGCTGSLTDVALRYAGQEGTTLL-NGTFEVISLNGTL- 79
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E SG+ L + +S P G +LGG + T T +++V+GS LA R+
Sbjct: 80 --EQSGEH-----LHLCVSDPHGTMLGGHMMPGCTVRTTLELVIGSLEELAFSRQ 127
>gi|110806828|ref|YP_690348.1| hypothetical protein SFV_2974 [Shigella flexneri 5 str. 8401]
gi|424839214|ref|ZP_18263851.1| hypothetical protein SF5M90T_2900 [Shigella flexneri 5a str. M90T]
gi|110616376|gb|ABF05043.1| conserved hypothetical protein [Shigella flexneri 5 str. 8401]
gi|383468266|gb|EID63287.1| hypothetical protein SF5M90T_2900 [Shigella flexneri 5a str. M90T]
Length = 143
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 64/115 (55%), Gaps = 12/115 (10%)
Query: 162 GEDVSSKIMSFSQ-NGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F+Q + A I G++++V LR A G T+ G FE++SL+G+
Sbjct: 22 GQEVFSQLHAFAQQHQLHAAWIAGCTGSLTDVALRYAGQEGTTLL-NGTFEVISLNGTL- 79
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E SG+ L + +S P G +LGG + T T +++V+GS LA R+
Sbjct: 80 --EQSGEH-----LHLCVSDPHGTMLGGHMMPGCTVRTTLELVIGSLEELAFSRQ 127
>gi|417708948|ref|ZP_12357976.1| hypothetical protein SFVA6_3782 [Shigella flexneri VA-6]
gi|420332751|ref|ZP_14834400.1| putative DNA-binding protein [Shigella flexneri K-1770]
gi|332999635|gb|EGK19220.1| hypothetical protein SFVA6_3782 [Shigella flexneri VA-6]
gi|391248829|gb|EIQ08067.1| putative DNA-binding protein [Shigella flexneri K-1770]
Length = 143
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 35/107 (32%), Positives = 60/107 (56%), Gaps = 10/107 (9%)
Query: 162 GEDVSSKIMSFSQ-NGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F+Q + A I G++++V LR A G T+ G FE++SL+G+
Sbjct: 22 GQEVFSQLHAFAQQHQLHAAWIAGCTGSLTDVALRYAGQEGTTLL-NGTFEVISLNGTL- 79
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
E SG+ L + +S P G +LGG + T T +++V+GS
Sbjct: 80 --EQSGEH-----LHLCVSDPHGTMLGGHMMPGCTVRTTLELVIGSL 119
>gi|357118952|ref|XP_003561211.1| PREDICTED: uncharacterized protein LOC100829454 [Brachypodium
distachyon]
Length = 337
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 58/109 (53%), Gaps = 16/109 (14%)
Query: 124 IKKSRGRPPGSGSGKKHQ----LEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRA 179
+++ RGRP +G K++ + + HV+ V G DV + F++
Sbjct: 115 MRRPRGRP----AGSKNKPKPPVIITRDSASALRAHVLEVAPGCDVVDAVADFARRRQVG 170
Query: 180 VCILSANGAISNVTLRQ--------AATSGGTVTYEGRFEILSLSGSFL 220
VC+LSA G+++ +++RQ +GG V+ GRF+IL+LSGSFL
Sbjct: 171 VCVLSATGSVAGISVRQPGGGGGSNGNGNGGVVSIAGRFDILTLSGSFL 219
>gi|293449250|ref|ZP_06663671.1| hypothetical protein ECCG_02280 [Escherichia coli B088]
gi|300815618|ref|ZP_07095842.1| conserved hypothetical protein [Escherichia coli MS 107-1]
gi|300906518|ref|ZP_07124211.1| hypothetical protein HMPREF9536_04478 [Escherichia coli MS 84-1]
gi|301306365|ref|ZP_07212434.1| hypothetical protein HMPREF9347_04978 [Escherichia coli MS 124-1]
gi|307310454|ref|ZP_07590102.1| protein of unknown function DUF296 [Escherichia coli W]
gi|378711624|ref|YP_005276517.1| hypothetical protein [Escherichia coli KO11FL]
gi|386610314|ref|YP_006125800.1| putative DNA-binding protein [Escherichia coli W]
gi|386700122|ref|YP_006163959.1| hypothetical protein KO11_08130 [Escherichia coli KO11FL]
gi|386710821|ref|YP_006174542.1| hypothetical protein WFL_15540 [Escherichia coli W]
gi|415830387|ref|ZP_11516289.1| hypothetical protein ECOK1357_3265 [Escherichia coli OK1357]
gi|415862174|ref|ZP_11535706.1| UDP-N-acetylglucosamine diphosphorylase [Escherichia coli MS 85-1]
gi|415874162|ref|ZP_11541259.1| UDP-N-acetylglucosamine diphosphorylase [Escherichia coli MS 79-10]
gi|417156898|ref|ZP_11994522.1| PF03479 domain protein [Escherichia coli 96.0497]
gi|417582424|ref|ZP_12233225.1| hypothetical protein ECSTECB2F1_3111 [Escherichia coli STEC_B2F1]
gi|417598277|ref|ZP_12248909.1| hypothetical protein EC30301_3426 [Escherichia coli 3030-1]
gi|417609545|ref|ZP_12260045.1| hypothetical protein ECSTECDG1313_3963 [Escherichia coli
STEC_DG131-3]
gi|417640737|ref|ZP_12290875.1| hypothetical protein ECTX1999_3462 [Escherichia coli TX1999]
gi|417668333|ref|ZP_12317875.1| hypothetical protein ECSTECO31_3165 [Escherichia coli STEC_O31]
gi|419171731|ref|ZP_13715612.1| putative DNA-binding protein [Escherichia coli DEC7A]
gi|419182286|ref|ZP_13725897.1| putative DNA-binding protein [Escherichia coli DEC7C]
gi|419187913|ref|ZP_13731420.1| putative DNA-binding protein [Escherichia coli DEC7D]
gi|419193033|ref|ZP_13736482.1| putative DNA-binding protein [Escherichia coli DEC7E]
gi|420387063|ref|ZP_14886407.1| putative DNA-binding protein [Escherichia coli EPECa12]
gi|427806103|ref|ZP_18973170.1| Putative uncharacterized protein [Escherichia coli chi7122]
gi|427810696|ref|ZP_18977761.1| Putative uncharacterized protein [Escherichia coli]
gi|432807101|ref|ZP_20041016.1| hypothetical protein A1WA_03005 [Escherichia coli KTE91]
gi|432935894|ref|ZP_20135162.1| hypothetical protein A13E_04337 [Escherichia coli KTE184]
gi|433131462|ref|ZP_20316893.1| hypothetical protein WKG_03207 [Escherichia coli KTE163]
gi|433136124|ref|ZP_20321461.1| hypothetical protein WKI_03069 [Escherichia coli KTE166]
gi|433194968|ref|ZP_20378949.1| hypothetical protein WGU_03290 [Escherichia coli KTE90]
gi|443618978|ref|YP_007382834.1| hypothetical protein APECO78_18355 [Escherichia coli APEC O78]
gi|291322340|gb|EFE61769.1| hypothetical protein ECCG_02280 [Escherichia coli B088]
gi|300401694|gb|EFJ85232.1| hypothetical protein HMPREF9536_04478 [Escherichia coli MS 84-1]
gi|300531547|gb|EFK52609.1| conserved hypothetical protein [Escherichia coli MS 107-1]
gi|300838360|gb|EFK66120.1| hypothetical protein HMPREF9347_04978 [Escherichia coli MS 124-1]
gi|306909349|gb|EFN39844.1| protein of unknown function DUF296 [Escherichia coli W]
gi|315062231|gb|ADT76558.1| putative DNA-binding protein [Escherichia coli W]
gi|315256813|gb|EFU36781.1| UDP-N-acetylglucosamine diphosphorylase [Escherichia coli MS 85-1]
gi|323183486|gb|EFZ68883.1| hypothetical protein ECOK1357_3265 [Escherichia coli OK1357]
gi|323377185|gb|ADX49453.1| protein of unknown function DUF296 [Escherichia coli KO11FL]
gi|342930280|gb|EGU99002.1| UDP-N-acetylglucosamine diphosphorylase [Escherichia coli MS 79-10]
gi|345335881|gb|EGW68318.1| hypothetical protein ECSTECB2F1_3111 [Escherichia coli STEC_B2F1]
gi|345351499|gb|EGW83760.1| hypothetical protein EC30301_3426 [Escherichia coli 3030-1]
gi|345356756|gb|EGW88957.1| hypothetical protein ECSTECDG1313_3963 [Escherichia coli
STEC_DG131-3]
gi|345392520|gb|EGX22301.1| hypothetical protein ECTX1999_3462 [Escherichia coli TX1999]
gi|378013518|gb|EHV76435.1| putative DNA-binding protein [Escherichia coli DEC7A]
gi|378022406|gb|EHV85093.1| putative DNA-binding protein [Escherichia coli DEC7C]
gi|378025662|gb|EHV88302.1| putative DNA-binding protein [Escherichia coli DEC7D]
gi|378036880|gb|EHV99416.1| putative DNA-binding protein [Escherichia coli DEC7E]
gi|383391649|gb|AFH16607.1| hypothetical protein KO11_08130 [Escherichia coli KO11FL]
gi|383406513|gb|AFH12756.1| hypothetical protein WFL_15540 [Escherichia coli W]
gi|386165648|gb|EIH32168.1| PF03479 domain protein [Escherichia coli 96.0497]
gi|391303943|gb|EIQ61769.1| putative DNA-binding protein [Escherichia coli EPECa12]
gi|397784299|gb|EJK95155.1| hypothetical protein ECSTECO31_3165 [Escherichia coli STEC_O31]
gi|412964285|emb|CCK48213.1| Putative uncharacterized protein [Escherichia coli chi7122]
gi|412970875|emb|CCJ45527.1| Putative uncharacterized protein [Escherichia coli]
gi|431353543|gb|ELG40296.1| hypothetical protein A1WA_03005 [Escherichia coli KTE91]
gi|431451786|gb|ELH32257.1| hypothetical protein A13E_04337 [Escherichia coli KTE184]
gi|431644825|gb|ELJ12479.1| hypothetical protein WKG_03207 [Escherichia coli KTE163]
gi|431654783|gb|ELJ21830.1| hypothetical protein WKI_03069 [Escherichia coli KTE166]
gi|431714353|gb|ELJ78545.1| hypothetical protein WGU_03290 [Escherichia coli KTE90]
gi|443423486|gb|AGC88390.1| hypothetical protein APECO78_18355 [Escherichia coli APEC O78]
Length = 143
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 63/115 (54%), Gaps = 12/115 (10%)
Query: 162 GEDVSSKIMSFSQNGP-RAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F+Q A I G++++V LR A G T+ G FE++SL+G+
Sbjct: 22 GQEVFSQLHAFAQQQQLHAAWIAGCTGSLTDVALRYAGQEGTTLL-NGTFEVISLNGTL- 79
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E SG+ L + +S P G +LGG + T T +++V+GS LA R+
Sbjct: 80 --EQSGEH-----LHLCVSDPHGTMLGGHMMPGCTVRTTLELVIGSLEELAFSRQ 127
>gi|417163145|ref|ZP_11998475.1| PF03479 domain protein [Escherichia coli 99.0741]
gi|386173636|gb|EIH45648.1| PF03479 domain protein [Escherichia coli 99.0741]
Length = 143
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 64/115 (55%), Gaps = 12/115 (10%)
Query: 162 GEDVSSKIMSFSQ-NGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F+Q + A I G++++V LR A G T+ G FE++SL+G+
Sbjct: 22 GQEVFSQLHAFAQQHQLHAAWIAGCTGSLTDVALRYAGQEGTTLL-NGTFEVISLNGTL- 79
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E SG+ L + +S P G +LGG + T T +++V+GS LA R+
Sbjct: 80 --EQSGEH-----LHLCVSDPHGTMLGGHMMPGCTVRTTLELVLGSLEELAFSRQ 127
>gi|417133369|ref|ZP_11978154.1| PF03479 domain protein [Escherichia coli 5.0588]
gi|386151223|gb|EIH02512.1| PF03479 domain protein [Escherichia coli 5.0588]
Length = 143
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 64/115 (55%), Gaps = 12/115 (10%)
Query: 162 GEDVSSKIMSFSQ-NGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F+Q + A I G++++V LR A G T+ G FE++SL+G+
Sbjct: 22 GQEVFSQLHAFAQQHQLHAAWIAGCTGSLTDVALRYAGQEGTTLL-NGTFEVISLNGTL- 79
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E SG+ L + +S P G +LGG + T T +++V+GS LA R+
Sbjct: 80 --ELSGEH-----LHLCVSDPHGTMLGGHMMPGCTVRTTLELVIGSLEELAFSRQ 127
>gi|424533882|ref|ZP_17977230.1| bifunctional protein glmU [Escherichia coli EC4422]
gi|424577095|ref|ZP_18017153.1| bifunctional protein glmU [Escherichia coli EC1845]
gi|424582915|ref|ZP_18022562.1| bifunctional protein glmU [Escherichia coli EC1863]
gi|425111699|ref|ZP_18513620.1| bifunctional protein glmU [Escherichia coli 6.0172]
gi|425207726|ref|ZP_18603523.1| bifunctional protein glmU [Escherichia coli FRIK2001]
gi|428948647|ref|ZP_19020927.1| putative DNA-binding protein [Escherichia coli 88.1467]
gi|444926471|ref|ZP_21245753.1| hypothetical protein EC09BKT78844_4105 [Escherichia coli
09BKT078844]
gi|390859939|gb|EIP22267.1| bifunctional protein glmU [Escherichia coli EC4422]
gi|390918041|gb|EIP76457.1| bifunctional protein glmU [Escherichia coli EC1863]
gi|390919041|gb|EIP77415.1| bifunctional protein glmU [Escherichia coli EC1845]
gi|408120077|gb|EKH51107.1| bifunctional protein glmU [Escherichia coli FRIK2001]
gi|408549688|gb|EKK27048.1| bifunctional protein glmU [Escherichia coli 6.0172]
gi|427207204|gb|EKV77382.1| putative DNA-binding protein [Escherichia coli 88.1467]
gi|444538346|gb|ELV18214.1| hypothetical protein EC09BKT78844_4105 [Escherichia coli
09BKT078844]
Length = 142
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 37/115 (32%), Positives = 63/115 (54%), Gaps = 12/115 (10%)
Query: 162 GEDVSSKIMSFSQNGP-RAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F+Q A I G+++++ LR A G T+ G FE++SL+G+
Sbjct: 21 GQEVFSQLHAFAQQQQLHAAWIAGCTGSLTDIALRYAGQEGTTLL-NGTFEVISLNGTL- 78
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E SG+ L + +S P G +LGG + T T +++V+GS LA R+
Sbjct: 79 --EQSGEH-----LHLCVSDPHGTMLGGHMMPGCTVRTTLELVIGSLEELAFSRQ 126
>gi|15803462|ref|NP_289495.1| hypothetical protein Z4267 [Escherichia coli O157:H7 str. EDL933]
gi|15833053|ref|NP_311826.1| hypothetical protein ECs3799 [Escherichia coli O157:H7 str. Sakai]
gi|82778309|ref|YP_404658.1| hypothetical protein SDY_3154 [Shigella dysenteriae Sd197]
gi|168747587|ref|ZP_02772609.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4113]
gi|168753872|ref|ZP_02778879.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4401]
gi|168760062|ref|ZP_02785069.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4501]
gi|168766927|ref|ZP_02791934.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4486]
gi|168773440|ref|ZP_02798447.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4196]
gi|168781779|ref|ZP_02806786.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4076]
gi|168785778|ref|ZP_02810785.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC869]
gi|168797495|ref|ZP_02822502.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC508]
gi|195936545|ref|ZP_03081927.1| hypothetical protein EscherichcoliO157_08797 [Escherichia coli
O157:H7 str. EC4024]
gi|208805842|ref|ZP_03248179.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4206]
gi|208813766|ref|ZP_03255095.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4045]
gi|208820426|ref|ZP_03260746.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4042]
gi|209400091|ref|YP_002272402.1| hypothetical protein ECH74115_4225 [Escherichia coli O157:H7 str.
EC4115]
gi|217327137|ref|ZP_03443220.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
TW14588]
gi|254794875|ref|YP_003079712.1| hypothetical protein ECSP_3894 [Escherichia coli O157:H7 str.
TW14359]
gi|261226237|ref|ZP_05940518.1| hypothetical protein EscherichiacoliO157_16823 [Escherichia coli
O157:H7 str. FRIK2000]
gi|261256506|ref|ZP_05949039.1| hypothetical protein EscherichiacoliO157EcO_11801 [Escherichia coli
O157:H7 str. FRIK966]
gi|291284244|ref|YP_003501062.1| hypothetical protein G2583_3582 [Escherichia coli O55:H7 str.
CB9615]
gi|293416185|ref|ZP_06658825.1| hypothetical protein ECDG_03788 [Escherichia coli B185]
gi|331654432|ref|ZP_08355432.1| conserved hypothetical protein [Escherichia coli M718]
gi|387508277|ref|YP_006160533.1| hypothetical protein ECO55CA74_17075 [Escherichia coli O55:H7 str.
RM12579]
gi|387884114|ref|YP_006314416.1| hypothetical protein CDCO157_3550 [Escherichia coli Xuzhou21]
gi|416314451|ref|ZP_11658686.1| hypothetical protein ECoA_04528 [Escherichia coli O157:H7 str.
1044]
gi|416322093|ref|ZP_11663941.1| hypothetical protein ECoD_04276 [Escherichia coli O157:H7 str.
EC1212]
gi|416327835|ref|ZP_11667755.1| hypothetical protein ECF_02644 [Escherichia coli O157:H7 str. 1125]
gi|416777031|ref|ZP_11875065.1| hypothetical protein ECO5101_04119 [Escherichia coli O157:H7 str.
G5101]
gi|416788491|ref|ZP_11879990.1| hypothetical protein ECO9389_23606 [Escherichia coli O157:H- str.
493-89]
gi|416800478|ref|ZP_11884902.1| hypothetical protein ECO2687_11593 [Escherichia coli O157:H- str. H
2687]
gi|416811041|ref|ZP_11889666.1| hypothetical protein ECO7815_01750 [Escherichia coli O55:H7 str.
3256-97]
gi|416821731|ref|ZP_11894316.1| hypothetical protein ECO5905_09688 [Escherichia coli O55:H7 str.
USDA 5905]
gi|416832123|ref|ZP_11899413.1| hypothetical protein ECOSU61_08769 [Escherichia coli O157:H7 str.
LSU-61]
gi|417630270|ref|ZP_12280506.1| hypothetical protein ECSTECMHI813_3213 [Escherichia coli
STEC_MHI813]
gi|419046913|ref|ZP_13593848.1| putative DNA-binding protein [Escherichia coli DEC3A]
gi|419052684|ref|ZP_13599551.1| putative DNA-binding protein [Escherichia coli DEC3B]
gi|419058679|ref|ZP_13605482.1| putative DNA-binding protein [Escherichia coli DEC3C]
gi|419064176|ref|ZP_13610899.1| putative DNA-binding protein [Escherichia coli DEC3D]
gi|419071122|ref|ZP_13616737.1| putative DNA-binding protein [Escherichia coli DEC3E]
gi|419077174|ref|ZP_13622677.1| putative DNA-binding protein [Escherichia coli DEC3F]
gi|419082146|ref|ZP_13627593.1| putative DNA-binding protein [Escherichia coli DEC4A]
gi|419087985|ref|ZP_13633338.1| putative DNA-binding protein [Escherichia coli DEC4B]
gi|419093994|ref|ZP_13639276.1| putative DNA-binding protein [Escherichia coli DEC4C]
gi|419099841|ref|ZP_13645034.1| putative DNA-binding protein [Escherichia coli DEC4D]
gi|419105491|ref|ZP_13650618.1| putative DNA-binding protein [Escherichia coli DEC4E]
gi|419110955|ref|ZP_13656009.1| putative DNA-binding protein [Escherichia coli DEC4F]
gi|419116318|ref|ZP_13661333.1| putative DNA-binding protein [Escherichia coli DEC5A]
gi|419122010|ref|ZP_13666956.1| putative DNA-binding protein [Escherichia coli DEC5B]
gi|419127469|ref|ZP_13672346.1| putative DNA-binding protein [Escherichia coli DEC5C]
gi|419132946|ref|ZP_13677780.1| putative DNA-binding protein [Escherichia coli DEC5D]
gi|419138095|ref|ZP_13682886.1| putative DNA-binding protein [Escherichia coli DEC5E]
gi|420271196|ref|ZP_14773550.1| bifunctional protein glmU [Escherichia coli PA22]
gi|420276962|ref|ZP_14779244.1| bifunctional protein glmU [Escherichia coli PA40]
gi|420282210|ref|ZP_14784443.1| bifunctional protein glmU [Escherichia coli TW06591]
gi|420288457|ref|ZP_14790641.1| bifunctional protein glmU [Escherichia coli TW10246]
gi|420293964|ref|ZP_14796079.1| bifunctional protein glmU [Escherichia coli TW11039]
gi|420299881|ref|ZP_14801927.1| bifunctional protein glmU [Escherichia coli TW09109]
gi|420305534|ref|ZP_14807524.1| bifunctional protein glmU [Escherichia coli TW10119]
gi|420310998|ref|ZP_14812928.1| bifunctional protein glmU [Escherichia coli EC1738]
gi|420316839|ref|ZP_14818712.1| bifunctional protein glmU [Escherichia coli EC1734]
gi|421813948|ref|ZP_16249660.1| bifunctional protein glmU [Escherichia coli 8.0416]
gi|421819768|ref|ZP_16255259.1| putative DNA-binding protein [Escherichia coli 10.0821]
gi|421825774|ref|ZP_16261129.1| bifunctional protein glmU [Escherichia coli FRIK920]
gi|421832472|ref|ZP_16267755.1| bifunctional protein glmU [Escherichia coli PA7]
gi|422834119|ref|ZP_16882182.1| hypothetical protein ESOG_01783 [Escherichia coli E101]
gi|423726811|ref|ZP_17700772.1| bifunctional protein glmU [Escherichia coli PA31]
gi|424079069|ref|ZP_17816043.1| bifunctional protein glmU [Escherichia coli FDA505]
gi|424085522|ref|ZP_17822017.1| bifunctional protein glmU [Escherichia coli FDA517]
gi|424091936|ref|ZP_17827869.1| bifunctional protein glmU [Escherichia coli FRIK1996]
gi|424098582|ref|ZP_17833871.1| bifunctional protein glmU [Escherichia coli FRIK1985]
gi|424104808|ref|ZP_17839559.1| bifunctional protein glmU [Escherichia coli FRIK1990]
gi|424111459|ref|ZP_17845695.1| bifunctional protein glmU [Escherichia coli 93-001]
gi|424117397|ref|ZP_17851235.1| bifunctional protein glmU [Escherichia coli PA3]
gi|424123584|ref|ZP_17856900.1| bifunctional protein glmU [Escherichia coli PA5]
gi|424129737|ref|ZP_17862644.1| bifunctional protein glmU [Escherichia coli PA9]
gi|424136056|ref|ZP_17868511.1| bifunctional protein glmU [Escherichia coli PA10]
gi|424142603|ref|ZP_17874480.1| bifunctional protein glmU [Escherichia coli PA14]
gi|424149011|ref|ZP_17880387.1| bifunctional protein glmU [Escherichia coli PA15]
gi|424154844|ref|ZP_17885784.1| bifunctional protein glmU [Escherichia coli PA24]
gi|424252679|ref|ZP_17891345.1| bifunctional protein glmU [Escherichia coli PA25]
gi|424331033|ref|ZP_17897252.1| bifunctional protein glmU [Escherichia coli PA28]
gi|424451286|ref|ZP_17902968.1| bifunctional protein glmU [Escherichia coli PA32]
gi|424457477|ref|ZP_17908597.1| bifunctional protein glmU [Escherichia coli PA33]
gi|424463930|ref|ZP_17914329.1| bifunctional protein glmU [Escherichia coli PA39]
gi|424470245|ref|ZP_17920064.1| bifunctional protein glmU [Escherichia coli PA41]
gi|424476758|ref|ZP_17926076.1| bifunctional protein glmU [Escherichia coli PA42]
gi|424482520|ref|ZP_17931499.1| bifunctional protein glmU [Escherichia coli TW07945]
gi|424488689|ref|ZP_17937244.1| bifunctional protein glmU [Escherichia coli TW09098]
gi|424495303|ref|ZP_17942962.1| bifunctional protein glmU [Escherichia coli TW09195]
gi|424502050|ref|ZP_17948941.1| bifunctional protein glmU [Escherichia coli EC4203]
gi|424508296|ref|ZP_17954690.1| bifunctional protein glmU [Escherichia coli EC4196]
gi|424515642|ref|ZP_17960292.1| bifunctional protein glmU [Escherichia coli TW14313]
gi|424521850|ref|ZP_17965970.1| bifunctional protein glmU [Escherichia coli TW14301]
gi|424527730|ref|ZP_17971447.1| bifunctional protein glmU [Escherichia coli EC4421]
gi|424539934|ref|ZP_17982878.1| bifunctional protein glmU [Escherichia coli EC4013]
gi|424546048|ref|ZP_17988428.1| bifunctional protein glmU [Escherichia coli EC4402]
gi|424552277|ref|ZP_17994126.1| bifunctional protein glmU [Escherichia coli EC4439]
gi|424558457|ref|ZP_17999870.1| bifunctional protein glmU [Escherichia coli EC4436]
gi|424564795|ref|ZP_18005799.1| bifunctional protein glmU [Escherichia coli EC4437]
gi|424570937|ref|ZP_18011487.1| bifunctional protein glmU [Escherichia coli EC4448]
gi|425099588|ref|ZP_18502320.1| putative DNA-binding protein [Escherichia coli 3.4870]
gi|425105684|ref|ZP_18508003.1| putative DNA-binding protein [Escherichia coli 5.2239]
gi|425127619|ref|ZP_18528788.1| putative DNA-binding protein [Escherichia coli 8.0586]
gi|425133355|ref|ZP_18534205.1| putative DNA-binding protein [Escherichia coli 8.2524]
gi|425139940|ref|ZP_18540321.1| bifunctional protein glmU [Escherichia coli 10.0833]
gi|425145649|ref|ZP_18545646.1| putative DNA-binding protein [Escherichia coli 10.0869]
gi|425151763|ref|ZP_18551378.1| putative DNA-binding protein [Escherichia coli 88.0221]
gi|425157638|ref|ZP_18556902.1| bifunctional protein glmU [Escherichia coli PA34]
gi|425163987|ref|ZP_18562874.1| bifunctional protein glmU [Escherichia coli FDA506]
gi|425169730|ref|ZP_18568204.1| bifunctional protein glmU [Escherichia coli FDA507]
gi|425175793|ref|ZP_18573913.1| bifunctional protein glmU [Escherichia coli FDA504]
gi|425181832|ref|ZP_18579528.1| bifunctional protein glmU [Escherichia coli FRIK1999]
gi|425188095|ref|ZP_18585370.1| bifunctional protein glmU [Escherichia coli FRIK1997]
gi|425194866|ref|ZP_18591635.1| bifunctional protein glmU [Escherichia coli NE1487]
gi|425201336|ref|ZP_18597545.1| bifunctional protein glmU [Escherichia coli NE037]
gi|425213480|ref|ZP_18608882.1| bifunctional protein glmU [Escherichia coli PA4]
gi|425219603|ref|ZP_18614567.1| bifunctional protein glmU [Escherichia coli PA23]
gi|425226153|ref|ZP_18620621.1| bifunctional protein glmU [Escherichia coli PA49]
gi|425232412|ref|ZP_18626453.1| bifunctional protein glmU [Escherichia coli PA45]
gi|425238336|ref|ZP_18632056.1| bifunctional protein glmU [Escherichia coli TT12B]
gi|425244574|ref|ZP_18637880.1| bifunctional protein glmU [Escherichia coli MA6]
gi|425250710|ref|ZP_18643652.1| bifunctional protein glmU [Escherichia coli 5905]
gi|425256545|ref|ZP_18649060.1| bifunctional protein glmU [Escherichia coli CB7326]
gi|425262800|ref|ZP_18654804.1| bifunctional protein glmU [Escherichia coli EC96038]
gi|425268800|ref|ZP_18660430.1| bifunctional protein glmU [Escherichia coli 5412]
gi|425296248|ref|ZP_18686425.1| bifunctional protein glmU [Escherichia coli PA38]
gi|425312939|ref|ZP_18702120.1| bifunctional protein glmU [Escherichia coli EC1735]
gi|425318925|ref|ZP_18707715.1| bifunctional protein glmU [Escherichia coli EC1736]
gi|425325010|ref|ZP_18713372.1| bifunctional protein glmU [Escherichia coli EC1737]
gi|425331377|ref|ZP_18719219.1| bifunctional protein glmU [Escherichia coli EC1846]
gi|425337555|ref|ZP_18724915.1| bifunctional protein glmU [Escherichia coli EC1847]
gi|425343877|ref|ZP_18730768.1| bifunctional protein glmU [Escherichia coli EC1848]
gi|425349682|ref|ZP_18736151.1| bifunctional protein glmU [Escherichia coli EC1849]
gi|425355982|ref|ZP_18742050.1| bifunctional protein glmU [Escherichia coli EC1850]
gi|425361944|ref|ZP_18747592.1| bifunctional protein glmU [Escherichia coli EC1856]
gi|425368148|ref|ZP_18753282.1| bifunctional protein glmU [Escherichia coli EC1862]
gi|425374473|ref|ZP_18759117.1| bifunctional protein glmU [Escherichia coli EC1864]
gi|425387367|ref|ZP_18770926.1| bifunctional protein glmU [Escherichia coli EC1866]
gi|425394020|ref|ZP_18777129.1| bifunctional protein glmU [Escherichia coli EC1868]
gi|425400155|ref|ZP_18782862.1| bifunctional protein glmU [Escherichia coli EC1869]
gi|425406244|ref|ZP_18788467.1| bifunctional protein glmU [Escherichia coli EC1870]
gi|425412629|ref|ZP_18794393.1| bifunctional protein glmU [Escherichia coli NE098]
gi|425418954|ref|ZP_18800225.1| bifunctional protein glmU [Escherichia coli FRIK523]
gi|425430216|ref|ZP_18810828.1| bifunctional protein glmU [Escherichia coli 0.1304]
gi|428954729|ref|ZP_19026527.1| putative DNA-binding protein [Escherichia coli 88.1042]
gi|428960718|ref|ZP_19032014.1| putative DNA-binding protein [Escherichia coli 89.0511]
gi|428967332|ref|ZP_19038045.1| putative DNA-binding protein [Escherichia coli 90.0091]
gi|428973017|ref|ZP_19043342.1| putative DNA-binding protein [Escherichia coli 90.0039]
gi|428979313|ref|ZP_19049136.1| putative DNA-binding protein [Escherichia coli 90.2281]
gi|428985313|ref|ZP_19054708.1| putative DNA-binding protein [Escherichia coli 93.0055]
gi|428991443|ref|ZP_19060434.1| putative DNA-binding protein [Escherichia coli 93.0056]
gi|428997324|ref|ZP_19065921.1| putative DNA-binding protein [Escherichia coli 94.0618]
gi|429003606|ref|ZP_19071708.1| putative DNA-binding protein [Escherichia coli 95.0183]
gi|429009688|ref|ZP_19077160.1| putative DNA-binding protein [Escherichia coli 95.1288]
gi|429016222|ref|ZP_19083107.1| putative DNA-binding protein [Escherichia coli 95.0943]
gi|429022047|ref|ZP_19088571.1| putative DNA-binding protein [Escherichia coli 96.0428]
gi|429028111|ref|ZP_19094110.1| putative DNA-binding protein [Escherichia coli 96.0427]
gi|429034297|ref|ZP_19099821.1| putative DNA-binding protein [Escherichia coli 96.0939]
gi|429040379|ref|ZP_19105482.1| putative DNA-binding protein [Escherichia coli 96.0932]
gi|429045979|ref|ZP_19110693.1| putative DNA-binding protein [Escherichia coli 96.0107]
gi|429051657|ref|ZP_19116224.1| putative DNA-binding protein [Escherichia coli 97.0003]
gi|429057078|ref|ZP_19121382.1| putative DNA-binding protein [Escherichia coli 97.1742]
gi|429062581|ref|ZP_19126579.1| putative DNA-binding protein [Escherichia coli 97.0007]
gi|429068839|ref|ZP_19132298.1| putative DNA-binding protein [Escherichia coli 99.0672]
gi|429074757|ref|ZP_19138009.1| bifunctional protein glmU [Escherichia coli 99.0678]
gi|429079989|ref|ZP_19143124.1| putative DNA-binding protein [Escherichia coli 99.0713]
gi|429828011|ref|ZP_19359040.1| putative DNA-binding protein [Escherichia coli 96.0109]
gi|429834381|ref|ZP_19364699.1| putative DNA-binding protein [Escherichia coli 97.0010]
gi|432451100|ref|ZP_19693358.1| hypothetical protein A13W_02059 [Escherichia coli KTE193]
gi|432948988|ref|ZP_20143911.1| hypothetical protein A153_03691 [Escherichia coli KTE196]
gi|433034783|ref|ZP_20222484.1| hypothetical protein WIC_03350 [Escherichia coli KTE112]
gi|433044466|ref|ZP_20231953.1| hypothetical protein WIG_03004 [Escherichia coli KTE117]
gi|444932231|ref|ZP_21251259.1| hypothetical protein EC990814_3608 [Escherichia coli 99.0814]
gi|444937653|ref|ZP_21256421.1| hypothetical protein EC990815_3602 [Escherichia coli 99.0815]
gi|444944671|ref|ZP_21263137.1| hypothetical protein EC990816_5056 [Escherichia coli 99.0816]
gi|444949923|ref|ZP_21268199.1| hypothetical protein EC990839_4904 [Escherichia coli 99.0839]
gi|444954326|ref|ZP_21272411.1| hypothetical protein EC990848_3603 [Escherichia coli 99.0848]
gi|444959835|ref|ZP_21277678.1| hypothetical protein EC991753_3667 [Escherichia coli 99.1753]
gi|444964991|ref|ZP_21282583.1| hypothetical protein EC991775_3489 [Escherichia coli 99.1775]
gi|444970989|ref|ZP_21288345.1| hypothetical protein EC991793_3909 [Escherichia coli 99.1793]
gi|444976259|ref|ZP_21293369.1| hypothetical protein EC991805_3478 [Escherichia coli 99.1805]
gi|444981664|ref|ZP_21298574.1| hypothetical protein ECATCC700728_3496 [Escherichia coli ATCC
700728]
gi|444987054|ref|ZP_21303833.1| hypothetical protein ECPA11_3667 [Escherichia coli PA11]
gi|444992365|ref|ZP_21309007.1| hypothetical protein ECPA19_3631 [Escherichia coli PA19]
gi|444997672|ref|ZP_21314169.1| hypothetical protein ECPA13_3462 [Escherichia coli PA13]
gi|445003246|ref|ZP_21319635.1| hypothetical protein ECPA2_3807 [Escherichia coli PA2]
gi|445009891|ref|ZP_21326102.1| hypothetical protein ECPA47_4803 [Escherichia coli PA47]
gi|445013782|ref|ZP_21329888.1| hypothetical protein ECPA48_3489 [Escherichia coli PA48]
gi|445019681|ref|ZP_21335644.1| hypothetical protein ECPA8_3820 [Escherichia coli PA8]
gi|445025065|ref|ZP_21340887.1| hypothetical protein EC71982_3731 [Escherichia coli 7.1982]
gi|445030486|ref|ZP_21346157.1| hypothetical protein EC991781_3888 [Escherichia coli 99.1781]
gi|445035908|ref|ZP_21351438.1| hypothetical protein EC991762_3857 [Escherichia coli 99.1762]
gi|445042939|ref|ZP_21358293.1| hypothetical protein ECPA35_5247 [Escherichia coli PA35]
gi|445046764|ref|ZP_21362014.1| hypothetical protein EC34880_3716 [Escherichia coli 3.4880]
gi|445052304|ref|ZP_21367342.1| hypothetical protein EC950083_3601 [Escherichia coli 95.0083]
gi|445058036|ref|ZP_21372894.1| hypothetical protein EC990670_3846 [Escherichia coli 99.0670]
gi|452970746|ref|ZP_21968973.1| DNA-binding protein [Escherichia coli O157:H7 str. EC4009]
gi|12517463|gb|AAG58054.1|AE005523_3 orf; hypothetical protein [Escherichia coli O157:H7 str. EDL933]
gi|13363271|dbj|BAB37222.1| hypothetical protein [Escherichia coli O157:H7 str. Sakai]
gi|81242457|gb|ABB63167.1| conserved hypothetical protein [Shigella dysenteriae Sd197]
gi|187770780|gb|EDU34624.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4196]
gi|188017816|gb|EDU55938.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4113]
gi|189000516|gb|EDU69502.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4076]
gi|189358679|gb|EDU77098.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4401]
gi|189363888|gb|EDU82307.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4486]
gi|189369308|gb|EDU87724.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4501]
gi|189374000|gb|EDU92416.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC869]
gi|189379830|gb|EDU98246.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC508]
gi|208725643|gb|EDZ75244.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4206]
gi|208735043|gb|EDZ83730.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4045]
gi|208740549|gb|EDZ88231.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4042]
gi|209161491|gb|ACI38924.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4115]
gi|217319504|gb|EEC27929.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
TW14588]
gi|254594275|gb|ACT73636.1| hypothetical protein ECSP_3894 [Escherichia coli O157:H7 str.
TW14359]
gi|290764117|gb|ADD58078.1| hypothetical protein G2583_3582 [Escherichia coli O55:H7 str.
CB9615]
gi|291432374|gb|EFF05356.1| hypothetical protein ECDG_03788 [Escherichia coli B185]
gi|320189273|gb|EFW63932.1| hypothetical protein ECoD_04276 [Escherichia coli O157:H7 str.
EC1212]
gi|320640570|gb|EFX10109.1| hypothetical protein ECO5101_04119 [Escherichia coli O157:H7 str.
G5101]
gi|320645817|gb|EFX14802.1| hypothetical protein ECO9389_23606 [Escherichia coli O157:H- str.
493-89]
gi|320651117|gb|EFX19557.1| hypothetical protein ECO2687_11593 [Escherichia coli O157:H- str. H
2687]
gi|320656613|gb|EFX24509.1| hypothetical protein ECO7815_01750 [Escherichia coli O55:H7 str.
3256-97 TW 07815]
gi|320662132|gb|EFX29533.1| hypothetical protein ECO5905_09688 [Escherichia coli O55:H7 str.
USDA 5905]
gi|320667208|gb|EFX34171.1| hypothetical protein ECOSU61_08769 [Escherichia coli O157:H7 str.
LSU-61]
gi|326338986|gb|EGD62801.1| hypothetical protein ECoA_04528 [Escherichia coli O157:H7 str.
1044]
gi|326343132|gb|EGD66900.1| hypothetical protein ECF_02644 [Escherichia coli O157:H7 str. 1125]
gi|331047814|gb|EGI19891.1| conserved hypothetical protein [Escherichia coli M718]
gi|345371841|gb|EGX03810.1| hypothetical protein ECSTECMHI813_3213 [Escherichia coli
STEC_MHI813]
gi|371602654|gb|EHN91342.1| hypothetical protein ESOG_01783 [Escherichia coli E101]
gi|374360271|gb|AEZ41978.1| hypothetical protein ECO55CA74_17075 [Escherichia coli O55:H7 str.
RM12579]
gi|377891531|gb|EHU55983.1| putative DNA-binding protein [Escherichia coli DEC3B]
gi|377892516|gb|EHU56962.1| putative DNA-binding protein [Escherichia coli DEC3A]
gi|377904273|gb|EHU68560.1| putative DNA-binding protein [Escherichia coli DEC3C]
gi|377908205|gb|EHU72423.1| putative DNA-binding protein [Escherichia coli DEC3D]
gi|377910579|gb|EHU74767.1| putative DNA-binding protein [Escherichia coli DEC3E]
gi|377919252|gb|EHU83295.1| putative DNA-binding protein [Escherichia coli DEC3F]
gi|377925117|gb|EHU89058.1| putative DNA-binding protein [Escherichia coli DEC4A]
gi|377929259|gb|EHU93159.1| putative DNA-binding protein [Escherichia coli DEC4B]
gi|377939797|gb|EHV03551.1| putative DNA-binding protein [Escherichia coli DEC4D]
gi|377941107|gb|EHV04853.1| putative DNA-binding protein [Escherichia coli DEC4C]
gi|377946671|gb|EHV10351.1| putative DNA-binding protein [Escherichia coli DEC4E]
gi|377956524|gb|EHV20074.1| putative DNA-binding protein [Escherichia coli DEC4F]
gi|377959670|gb|EHV23166.1| putative DNA-binding protein [Escherichia coli DEC5A]
gi|377964268|gb|EHV27705.1| putative DNA-binding protein [Escherichia coli DEC5B]
gi|377972609|gb|EHV35957.1| putative DNA-binding protein [Escherichia coli DEC5C]
gi|377974371|gb|EHV37699.1| putative DNA-binding protein [Escherichia coli DEC5D]
gi|377982515|gb|EHV45767.1| putative DNA-binding protein [Escherichia coli DEC5E]
gi|386797572|gb|AFJ30606.1| hypothetical protein CDCO157_3550 [Escherichia coli Xuzhou21]
gi|390639681|gb|EIN19151.1| bifunctional protein glmU [Escherichia coli FRIK1996]
gi|390641542|gb|EIN20967.1| bifunctional protein glmU [Escherichia coli FDA517]
gi|390641954|gb|EIN21377.1| bifunctional protein glmU [Escherichia coli FDA505]
gi|390659377|gb|EIN37144.1| bifunctional protein glmU [Escherichia coli 93-001]
gi|390659645|gb|EIN37400.1| bifunctional protein glmU [Escherichia coli FRIK1985]
gi|390662085|gb|EIN39712.1| bifunctional protein glmU [Escherichia coli FRIK1990]
gi|390675828|gb|EIN51951.1| bifunctional protein glmU [Escherichia coli PA3]
gi|390679334|gb|EIN55246.1| bifunctional protein glmU [Escherichia coli PA5]
gi|390682837|gb|EIN58580.1| bifunctional protein glmU [Escherichia coli PA9]
gi|390694558|gb|EIN69130.1| bifunctional protein glmU [Escherichia coli PA10]
gi|390699381|gb|EIN73731.1| bifunctional protein glmU [Escherichia coli PA14]
gi|390699655|gb|EIN73998.1| bifunctional protein glmU [Escherichia coli PA15]
gi|390713532|gb|EIN86470.1| bifunctional protein glmU [Escherichia coli PA22]
gi|390721083|gb|EIN93784.1| bifunctional protein glmU [Escherichia coli PA25]
gi|390722485|gb|EIN95156.1| bifunctional protein glmU [Escherichia coli PA24]
gi|390726059|gb|EIN98536.1| bifunctional protein glmU [Escherichia coli PA28]
gi|390739955|gb|EIO11113.1| bifunctional protein glmU [Escherichia coli PA31]
gi|390740653|gb|EIO11773.1| bifunctional protein glmU [Escherichia coli PA32]
gi|390743950|gb|EIO14895.1| bifunctional protein glmU [Escherichia coli PA33]
gi|390757310|gb|EIO26799.1| bifunctional protein glmU [Escherichia coli PA40]
gi|390765462|gb|EIO34628.1| bifunctional protein glmU [Escherichia coli PA39]
gi|390765612|gb|EIO34775.1| bifunctional protein glmU [Escherichia coli PA41]
gi|390767566|gb|EIO36649.1| bifunctional protein glmU [Escherichia coli PA42]
gi|390780371|gb|EIO48071.1| bifunctional protein glmU [Escherichia coli TW06591]
gi|390788162|gb|EIO55631.1| bifunctional protein glmU [Escherichia coli TW07945]
gi|390789019|gb|EIO56484.1| bifunctional protein glmU [Escherichia coli TW10246]
gi|390795578|gb|EIO62862.1| bifunctional protein glmU [Escherichia coli TW11039]
gi|390803447|gb|EIO70453.1| bifunctional protein glmU [Escherichia coli TW09098]
gi|390806289|gb|EIO73211.1| bifunctional protein glmU [Escherichia coli TW09109]
gi|390814799|gb|EIO81348.1| bifunctional protein glmU [Escherichia coli TW10119]
gi|390824391|gb|EIO90372.1| bifunctional protein glmU [Escherichia coli EC4203]
gi|390827059|gb|EIO92846.1| bifunctional protein glmU [Escherichia coli TW09195]
gi|390829395|gb|EIO94996.1| bifunctional protein glmU [Escherichia coli EC4196]
gi|390844204|gb|EIP07956.1| bifunctional protein glmU [Escherichia coli TW14313]
gi|390844719|gb|EIP08418.1| bifunctional protein glmU [Escherichia coli TW14301]
gi|390849620|gb|EIP13042.1| bifunctional protein glmU [Escherichia coli EC4421]
gi|390864572|gb|EIP26680.1| bifunctional protein glmU [Escherichia coli EC4013]
gi|390868873|gb|EIP30581.1| bifunctional protein glmU [Escherichia coli EC4402]
gi|390877153|gb|EIP38104.1| bifunctional protein glmU [Escherichia coli EC4439]
gi|390882648|gb|EIP43149.1| bifunctional protein glmU [Escherichia coli EC4436]
gi|390892240|gb|EIP51828.1| bifunctional protein glmU [Escherichia coli EC4437]
gi|390894487|gb|EIP54004.1| bifunctional protein glmU [Escherichia coli EC4448]
gi|390899193|gb|EIP58441.1| bifunctional protein glmU [Escherichia coli EC1738]
gi|390907096|gb|EIP65965.1| bifunctional protein glmU [Escherichia coli EC1734]
gi|408063435|gb|EKG97927.1| bifunctional protein glmU [Escherichia coli PA7]
gi|408065867|gb|EKH00337.1| bifunctional protein glmU [Escherichia coli FRIK920]
gi|408069066|gb|EKH03480.1| bifunctional protein glmU [Escherichia coli PA34]
gi|408078326|gb|EKH12499.1| bifunctional protein glmU [Escherichia coli FDA506]
gi|408081708|gb|EKH15715.1| bifunctional protein glmU [Escherichia coli FDA507]
gi|408090388|gb|EKH23665.1| bifunctional protein glmU [Escherichia coli FDA504]
gi|408096451|gb|EKH29391.1| bifunctional protein glmU [Escherichia coli FRIK1999]
gi|408103212|gb|EKH35597.1| bifunctional protein glmU [Escherichia coli FRIK1997]
gi|408107613|gb|EKH39689.1| bifunctional protein glmU [Escherichia coli NE1487]
gi|408114115|gb|EKH45677.1| bifunctional protein glmU [Escherichia coli NE037]
gi|408126361|gb|EKH56921.1| bifunctional protein glmU [Escherichia coli PA4]
gi|408136374|gb|EKH66121.1| bifunctional protein glmU [Escherichia coli PA23]
gi|408139002|gb|EKH68636.1| bifunctional protein glmU [Escherichia coli PA49]
gi|408145488|gb|EKH74666.1| bifunctional protein glmU [Escherichia coli PA45]
gi|408154085|gb|EKH82455.1| bifunctional protein glmU [Escherichia coli TT12B]
gi|408159050|gb|EKH87153.1| bifunctional protein glmU [Escherichia coli MA6]
gi|408162939|gb|EKH90826.1| bifunctional protein glmU [Escherichia coli 5905]
gi|408172121|gb|EKH99208.1| bifunctional protein glmU [Escherichia coli CB7326]
gi|408178701|gb|EKI05398.1| bifunctional protein glmU [Escherichia coli EC96038]
gi|408181867|gb|EKI08409.1| bifunctional protein glmU [Escherichia coli 5412]
gi|408215704|gb|EKI40076.1| bifunctional protein glmU [Escherichia coli PA38]
gi|408225732|gb|EKI49398.1| bifunctional protein glmU [Escherichia coli EC1735]
gi|408237136|gb|EKI60003.1| bifunctional protein glmU [Escherichia coli EC1736]
gi|408240543|gb|EKI63218.1| bifunctional protein glmU [Escherichia coli EC1737]
gi|408245311|gb|EKI67703.1| bifunctional protein glmU [Escherichia coli EC1846]
gi|408254045|gb|EKI75605.1| bifunctional protein glmU [Escherichia coli EC1847]
gi|408257807|gb|EKI79104.1| bifunctional protein glmU [Escherichia coli EC1848]
gi|408264348|gb|EKI85148.1| bifunctional protein glmU [Escherichia coli EC1849]
gi|408273046|gb|EKI93112.1| bifunctional protein glmU [Escherichia coli EC1850]
gi|408276295|gb|EKI96228.1| bifunctional protein glmU [Escherichia coli EC1856]
gi|408284651|gb|EKJ03743.1| bifunctional protein glmU [Escherichia coli EC1862]
gi|408290247|gb|EKJ08984.1| bifunctional protein glmU [Escherichia coli EC1864]
gi|408306502|gb|EKJ23868.1| bifunctional protein glmU [Escherichia coli EC1868]
gi|408307097|gb|EKJ24459.1| bifunctional protein glmU [Escherichia coli EC1866]
gi|408317883|gb|EKJ34113.1| bifunctional protein glmU [Escherichia coli EC1869]
gi|408323942|gb|EKJ39903.1| bifunctional protein glmU [Escherichia coli EC1870]
gi|408325388|gb|EKJ41272.1| bifunctional protein glmU [Escherichia coli NE098]
gi|408335544|gb|EKJ50382.1| bifunctional protein glmU [Escherichia coli FRIK523]
gi|408345454|gb|EKJ59796.1| bifunctional protein glmU [Escherichia coli 0.1304]
gi|408548213|gb|EKK25598.1| putative DNA-binding protein [Escherichia coli 3.4870]
gi|408548360|gb|EKK25744.1| putative DNA-binding protein [Escherichia coli 5.2239]
gi|408567310|gb|EKK43370.1| putative DNA-binding protein [Escherichia coli 8.0586]
gi|408577663|gb|EKK53222.1| bifunctional protein glmU [Escherichia coli 10.0833]
gi|408580231|gb|EKK55649.1| putative DNA-binding protein [Escherichia coli 8.2524]
gi|408590308|gb|EKK64790.1| putative DNA-binding protein [Escherichia coli 10.0869]
gi|408595553|gb|EKK69788.1| putative DNA-binding protein [Escherichia coli 88.0221]
gi|408600315|gb|EKK74174.1| bifunctional protein glmU [Escherichia coli 8.0416]
gi|408611763|gb|EKK85123.1| putative DNA-binding protein [Escherichia coli 10.0821]
gi|427203476|gb|EKV73781.1| putative DNA-binding protein [Escherichia coli 88.1042]
gi|427204612|gb|EKV74887.1| putative DNA-binding protein [Escherichia coli 89.0511]
gi|427219672|gb|EKV88633.1| putative DNA-binding protein [Escherichia coli 90.0091]
gi|427223123|gb|EKV91882.1| putative DNA-binding protein [Escherichia coli 90.2281]
gi|427226019|gb|EKV94627.1| putative DNA-binding protein [Escherichia coli 90.0039]
gi|427240608|gb|EKW08061.1| putative DNA-binding protein [Escherichia coli 93.0056]
gi|427240776|gb|EKW08228.1| putative DNA-binding protein [Escherichia coli 93.0055]
gi|427244489|gb|EKW11808.1| putative DNA-binding protein [Escherichia coli 94.0618]
gi|427258849|gb|EKW24925.1| putative DNA-binding protein [Escherichia coli 95.0183]
gi|427259929|gb|EKW25949.1| putative DNA-binding protein [Escherichia coli 95.0943]
gi|427262844|gb|EKW28702.1| putative DNA-binding protein [Escherichia coli 95.1288]
gi|427275166|gb|EKW39789.1| putative DNA-binding protein [Escherichia coli 96.0428]
gi|427277856|gb|EKW42366.1| putative DNA-binding protein [Escherichia coli 96.0427]
gi|427282041|gb|EKW46321.1| putative DNA-binding protein [Escherichia coli 96.0939]
gi|427290525|gb|EKW53996.1| putative DNA-binding protein [Escherichia coli 96.0932]
gi|427297720|gb|EKW60744.1| putative DNA-binding protein [Escherichia coli 96.0107]
gi|427299409|gb|EKW62383.1| putative DNA-binding protein [Escherichia coli 97.0003]
gi|427310621|gb|EKW72861.1| putative DNA-binding protein [Escherichia coli 97.1742]
gi|427313501|gb|EKW75608.1| putative DNA-binding protein [Escherichia coli 97.0007]
gi|427318059|gb|EKW79942.1| putative DNA-binding protein [Escherichia coli 99.0672]
gi|427326791|gb|EKW88198.1| bifunctional protein glmU [Escherichia coli 99.0678]
gi|427328287|gb|EKW89655.1| putative DNA-binding protein [Escherichia coli 99.0713]
gi|429252414|gb|EKY36952.1| putative DNA-binding protein [Escherichia coli 96.0109]
gi|429253974|gb|EKY38425.1| putative DNA-binding protein [Escherichia coli 97.0010]
gi|430978381|gb|ELC95192.1| hypothetical protein A13W_02059 [Escherichia coli KTE193]
gi|431455620|gb|ELH35975.1| hypothetical protein A153_03691 [Escherichia coli KTE196]
gi|431548322|gb|ELI22604.1| hypothetical protein WIC_03350 [Escherichia coli KTE112]
gi|431554211|gb|ELI28092.1| hypothetical protein WIG_03004 [Escherichia coli KTE117]
gi|444536788|gb|ELV16781.1| hypothetical protein EC990814_3608 [Escherichia coli 99.0814]
gi|444546711|gb|ELV25408.1| hypothetical protein EC990815_3602 [Escherichia coli 99.0815]
gi|444553566|gb|ELV31182.1| hypothetical protein EC990816_5056 [Escherichia coli 99.0816]
gi|444553909|gb|ELV31498.1| hypothetical protein EC990839_4904 [Escherichia coli 99.0839]
gi|444561895|gb|ELV38997.1| hypothetical protein EC990848_3603 [Escherichia coli 99.0848]
gi|444571236|gb|ELV47724.1| hypothetical protein EC991753_3667 [Escherichia coli 99.1753]
gi|444574891|gb|ELV51152.1| hypothetical protein EC991775_3489 [Escherichia coli 99.1775]
gi|444578153|gb|ELV54241.1| hypothetical protein EC991793_3909 [Escherichia coli 99.1793]
gi|444591690|gb|ELV66961.1| hypothetical protein ECPA11_3667 [Escherichia coli PA11]
gi|444592503|gb|ELV67762.1| hypothetical protein ECATCC700728_3496 [Escherichia coli ATCC
700728]
gi|444593095|gb|ELV68327.1| hypothetical protein EC991805_3478 [Escherichia coli 99.1805]
gi|444605409|gb|ELV80051.1| hypothetical protein ECPA13_3462 [Escherichia coli PA13]
gi|444606191|gb|ELV80817.1| hypothetical protein ECPA19_3631 [Escherichia coli PA19]
gi|444614764|gb|ELV88990.1| hypothetical protein ECPA2_3807 [Escherichia coli PA2]
gi|444617947|gb|ELV92046.1| hypothetical protein ECPA47_4803 [Escherichia coli PA47]
gi|444622680|gb|ELV96625.1| hypothetical protein ECPA48_3489 [Escherichia coli PA48]
gi|444628880|gb|ELW02617.1| hypothetical protein ECPA8_3820 [Escherichia coli PA8]
gi|444637444|gb|ELW10818.1| hypothetical protein EC71982_3731 [Escherichia coli 7.1982]
gi|444639937|gb|ELW13234.1| hypothetical protein EC991781_3888 [Escherichia coli 99.1781]
gi|444644004|gb|ELW17130.1| hypothetical protein EC991762_3857 [Escherichia coli 99.1762]
gi|444650621|gb|ELW23449.1| hypothetical protein ECPA35_5247 [Escherichia coli PA35]
gi|444659070|gb|ELW31507.1| hypothetical protein EC34880_3716 [Escherichia coli 3.4880]
gi|444662236|gb|ELW34498.1| hypothetical protein EC950083_3601 [Escherichia coli 95.0083]
gi|444669191|gb|ELW41189.1| hypothetical protein EC990670_3846 [Escherichia coli 99.0670]
Length = 143
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 37/115 (32%), Positives = 63/115 (54%), Gaps = 12/115 (10%)
Query: 162 GEDVSSKIMSFSQNGP-RAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F+Q A I G+++++ LR A G T+ G FE++SL+G+
Sbjct: 22 GQEVFSQLHAFAQQQQLHAAWIAGCTGSLTDIALRYAGQEGTTLL-NGTFEVISLNGTL- 79
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E SG+ L + +S P G +LGG + T T +++V+GS LA R+
Sbjct: 80 --EQSGEH-----LHLCVSDPHGTMLGGHMMPGCTVRTTLELVIGSLEELAFSRQ 127
>gi|170765469|ref|ZP_02900280.1| conserved hypothetical protein [Escherichia albertii TW07627]
gi|170124615|gb|EDS93546.1| conserved hypothetical protein [Escherichia albertii TW07627]
Length = 142
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 61/114 (53%), Gaps = 11/114 (9%)
Query: 162 GEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLL 221
G++V S++ +F+Q A I G++++V LR A T G+FE+++L+G+
Sbjct: 22 GQEVLSQLRAFAQQQLHAAWIAGCTGSLTDVALRYAGQEN-TALLSGKFEVIALNGTL-- 78
Query: 222 SESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E SG+ L + +S P G +LGG + T T +++V+G LA R+
Sbjct: 79 -EQSGEH-----LHLCVSDPHGTMLGGHMMPGCTVRTTLELVIGCLEELAFSRQ 126
>gi|356577361|ref|XP_003556795.1| PREDICTED: uncharacterized protein LOC100790942 [Glycine max]
Length = 201
Score = 50.8 bits (120), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 44/146 (30%), Positives = 70/146 (47%), Gaps = 8/146 (5%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGF-TPHVITVKAGEDVSSKIMSFSQNGPRAVCIL 183
KK GRP GS + K + A V P I V DV ++ F+ + ++ +L
Sbjct: 25 KKKVGRPLGSKNKPKLS-HVISQANVQVQKPIYIEVPNNLDVIEAMVQFAHHHKVSITVL 83
Query: 184 SANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLS------VS 237
SA+G I++VTL + T T G F ++SL+G+++ + + S + L +S
Sbjct: 84 SASGTIASVTLNYTDSYASTFTLYGPFSLISLTGTYINNTAISSSSSSCNLDHPCCFRIS 143
Query: 238 LSGPDGRVLGGSVAGLLTAATPVQVV 263
S G+ + G V G L AA V V+
Sbjct: 144 FSTISGQSIIGFVRGKLVAANGVIVM 169
>gi|226528577|ref|NP_001150385.1| DNA binding protein [Zea mays]
gi|195638812|gb|ACG38874.1| DNA binding protein [Zea mays]
gi|414875546|tpg|DAA52677.1| TPA: hypothetical protein ZEAMMB73_741073 [Zea mays]
Length = 197
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 44/116 (37%), Positives = 64/116 (55%), Gaps = 7/116 (6%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILS 214
H++ V AG DV S + +F++ G +L A G +++V LR+ A + G EILS
Sbjct: 46 HLVEVPAGRDVLSCVSAFARRGRCGAMVLGAAGHVTDVVLREPA-----LVLRGTMEILS 100
Query: 215 LSGSFLLSESSGQRSRTGGLSVSLSGPDG-RVLGGSVAGLLTAATPVQVVVGSFLA 269
LSG F G + T G +V ++GP G + GG G L AA PV V+V +F+A
Sbjct: 101 LSGCFFPFPGPGSVAAT-GTAVFMAGPRGSVLGGGVALGGLVAAGPVVVMVATFVA 155
>gi|428306194|ref|YP_007143019.1| hypothetical protein Cri9333_2652 [Crinalium epipsammum PCC 9333]
gi|428247729|gb|AFZ13509.1| protein of unknown function DUF296 [Crinalium epipsammum PCC 9333]
Length = 137
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 71/129 (55%), Gaps = 13/129 (10%)
Query: 164 DVSSKIMSFSQ-NGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLS 222
D+ ++S+ + G +A CI+S G++ ++T+R A S TV E +FEI+SL+G+ +S
Sbjct: 19 DLKKSLISYCEFYGIQAACIISCVGSLRSLTIRFANKSNLTVI-EEKFEIISLAGT--IS 75
Query: 223 ESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLAD--GRKESKSSHR 280
+ L +S+S +G++LGG +A T ++V+G L D ++E S+
Sbjct: 76 QHEAH------LHISISDGEGKMLGGHLAEGSLIYTTCEIVIG-ILDDVVFKRELDSTTG 128
Query: 281 MESLPVPPK 289
+ L + K
Sbjct: 129 YKELKIYQK 137
>gi|415796434|ref|ZP_11497570.1| hypothetical protein ECE128010_1244 [Escherichia coli E128010]
gi|323162479|gb|EFZ48329.1| hypothetical protein ECE128010_1244 [Escherichia coli E128010]
Length = 142
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 37/115 (32%), Positives = 62/115 (53%), Gaps = 12/115 (10%)
Query: 162 GEDVSSKIMSFSQNGP-RAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F+Q A I G++++V LR A G T+ G FE++SL+G+
Sbjct: 21 GQEVFSQLHAFAQQQQLHAAWIAGCTGSLTDVALRYAGQEGTTLL-NGTFEVISLNGTL- 78
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E SG+ L + +S P G +LGG + T T +++V+G LA R+
Sbjct: 79 --EQSGEH-----LHLCVSDPHGTMLGGHMMPGCTVRTTLELVIGCLEELAFNRQ 126
>gi|193063542|ref|ZP_03044631.1| conserved hypothetical protein [Escherichia coli E22]
gi|417251755|ref|ZP_12043520.1| PF03479 domain protein [Escherichia coli 4.0967]
gi|417624935|ref|ZP_12275230.1| hypothetical protein ECSTECH18_3706 [Escherichia coli STEC_H.1.8]
gi|419290982|ref|ZP_13833070.1| putative DNA-binding protein [Escherichia coli DEC11A]
gi|419296264|ref|ZP_13838306.1| putative DNA-binding protein [Escherichia coli DEC11B]
gi|419307859|ref|ZP_13849756.1| putative DNA-binding protein [Escherichia coli DEC11D]
gi|419312863|ref|ZP_13854723.1| putative DNA-binding protein [Escherichia coli DEC11E]
gi|419318255|ref|ZP_13860056.1| putative DNA-binding protein [Escherichia coli DEC12A]
gi|419324548|ref|ZP_13866238.1| putative DNA-binding protein [Escherichia coli DEC12B]
gi|419330526|ref|ZP_13872125.1| putative DNA-binding protein [Escherichia coli DEC12C]
gi|419336033|ref|ZP_13877554.1| putative DNA-binding protein [Escherichia coli DEC12D]
gi|419341393|ref|ZP_13882854.1| putative DNA-binding protein [Escherichia coli DEC12E]
gi|420392949|ref|ZP_14892197.1| putative DNA-binding protein [Escherichia coli EPEC C342-62]
gi|192930819|gb|EDV83424.1| conserved hypothetical protein [Escherichia coli E22]
gi|345376021|gb|EGX07967.1| hypothetical protein ECSTECH18_3706 [Escherichia coli STEC_H.1.8]
gi|378127994|gb|EHW89380.1| putative DNA-binding protein [Escherichia coli DEC11A]
gi|378140332|gb|EHX01560.1| putative DNA-binding protein [Escherichia coli DEC11B]
gi|378146786|gb|EHX07936.1| putative DNA-binding protein [Escherichia coli DEC11D]
gi|378156940|gb|EHX17986.1| putative DNA-binding protein [Escherichia coli DEC11E]
gi|378163763|gb|EHX24715.1| putative DNA-binding protein [Escherichia coli DEC12B]
gi|378168052|gb|EHX28963.1| putative DNA-binding protein [Escherichia coli DEC12A]
gi|378168219|gb|EHX29128.1| putative DNA-binding protein [Escherichia coli DEC12C]
gi|378180436|gb|EHX41123.1| putative DNA-binding protein [Escherichia coli DEC12D]
gi|378185942|gb|EHX46566.1| putative DNA-binding protein [Escherichia coli DEC12E]
gi|386218604|gb|EII35087.1| PF03479 domain protein [Escherichia coli 4.0967]
gi|391311548|gb|EIQ69184.1| putative DNA-binding protein [Escherichia coli EPEC C342-62]
Length = 143
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 37/115 (32%), Positives = 62/115 (53%), Gaps = 12/115 (10%)
Query: 162 GEDVSSKIMSFSQNGP-RAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F+Q A I G++++V LR A G T+ G FE++SL+G+
Sbjct: 22 GQEVFSQLHAFAQQQQLHAAWIAGCTGSLTDVALRYAGQEGTTLL-NGTFEVISLNGTL- 79
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E SG+ L + +S P G +LGG + T T +++V+G LA R+
Sbjct: 80 --EQSGEH-----LHLCVSDPHGTMLGGHMMPGCTVRTTLELVIGCLEELAFNRQ 127
>gi|449494648|ref|XP_004159608.1| PREDICTED: uncharacterized protein LOC101232466 [Cucumis sativus]
Length = 120
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/55 (65%), Positives = 45/55 (81%)
Query: 222 SESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESK 276
S+S G +SR GG+SVSL+ PDGRV+GG VAGLL AA+PVQVVVGSF++ + E K
Sbjct: 3 SDSIGTKSRIGGMSVSLASPDGRVVGGGVAGLLVAASPVQVVVGSFISGNQHEQK 57
>gi|357127813|ref|XP_003565572.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Brachypodium
distachyon]
Length = 252
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 44/116 (37%), Positives = 64/116 (55%), Gaps = 2/116 (1%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILS 214
HV+ V AG DV S + +F++ G R +L A G +++ L ++ + G EIL
Sbjct: 77 HVLEVPAGRDVLSCVAAFARRGRRGAMVLGAAGRVADAVL-TSSDPAAALVLRGTAEILG 135
Query: 215 LSGSFLLSESSGQRSRTGGLSVSLSGPDG-RVLGGSVAGLLTAATPVQVVVGSFLA 269
L+G F S S + + G++V LSGP G + GG AG L AA PV V+V +F A
Sbjct: 136 LAGCFFPSASPSSAAASAGVAVFLSGPRGGVLGGGVAAGGLVAAGPVVVMVATFAA 191
>gi|125583443|gb|EAZ24374.1| hypothetical protein OsJ_08128 [Oryza sativa Japonica Group]
Length = 158
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/63 (50%), Positives = 40/63 (63%), Gaps = 3/63 (4%)
Query: 205 TYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVV 264
T GRFEILSL+G+ L + S GL+V LSG G+V+GGSV G L AA PV ++
Sbjct: 4 TLRGRFEILSLTGTVLPPPAPPGAS---GLTVFLSGGQGQVIGGSVVGPLVAAGPVVLMA 60
Query: 265 GSF 267
SF
Sbjct: 61 ASF 63
>gi|223973355|gb|ACN30865.1| unknown [Zea mays]
Length = 155
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/116 (37%), Positives = 64/116 (55%), Gaps = 7/116 (6%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILS 214
H++ V AG DV S + +F++ G +L A G +++V LR+ A + G EILS
Sbjct: 4 HLVEVPAGRDVLSCVSAFARRGRCGAMVLGAAGHVTDVVLREPA-----LVLRGTMEILS 58
Query: 215 LSGSFLLSESSGQRSRTGGLSVSLSGPDG-RVLGGSVAGLLTAATPVQVVVGSFLA 269
LSG F G + T G +V ++GP G + GG G L AA PV V+V +F+A
Sbjct: 59 LSGCFFPFPGPGSVAAT-GTAVFMAGPRGSVLGGGVALGGLVAAGPVVVMVATFVA 113
>gi|423125726|ref|ZP_17113405.1| hypothetical protein HMPREF9694_02417 [Klebsiella oxytoca 10-5250]
gi|376398807|gb|EHT11430.1| hypothetical protein HMPREF9694_02417 [Klebsiella oxytoca 10-5250]
Length = 141
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 59/107 (55%), Gaps = 10/107 (9%)
Query: 162 GEDVSSKIMSF-SQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
GE+V S++ +F Q+ +A I G++SNV LR A T+ G +E++SL+G+
Sbjct: 21 GEEVFSRLRAFVQQHHIQAAWIAGCTGSLSNVALRFAGQDETTLL-NGIYEVISLNGTL- 78
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
E +G+ L +SLS P G +LGG + T T +++V+G
Sbjct: 79 --ELTGEH-----LHLSLSDPQGAMLGGHMMPGCTVRTTLELVIGEL 118
>gi|291234187|ref|XP_002737023.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 189
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 59/111 (53%), Gaps = 14/111 (12%)
Query: 162 GEDVSSKIMSF-SQNGPRAVCILSANGAISNVTLRQA-ATSGGT---VTYEGRFEILSLS 216
GE++ S +M F +N + I++ G+++ +R A AT+ T + + +FEI+SL
Sbjct: 61 GEEIFSTLMKFVDENQLDSAFIVTCVGSVTRAKIRLAHATAEETNKILELDDKFEIVSLV 120
Query: 217 GSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
G+ S+G G L +SLS G V+GG V G L T ++V+G
Sbjct: 121 GTL----SAG-----GHLHISLSDRKGAVIGGHVLGDLKVFTTAEIVIGQL 162
>gi|357493939|ref|XP_003617258.1| hypothetical protein MTR_5g089600 [Medicago truncatula]
gi|355518593|gb|AET00217.1| hypothetical protein MTR_5g089600 [Medicago truncatula]
Length = 236
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 40/146 (27%), Positives = 68/146 (46%), Gaps = 8/146 (5%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
K+SRGR GS + K + I + AG DV I+ + + ++
Sbjct: 77 KRSRGRSKGSKNKPKPPVVITVEPESFMKQIFIEISAGCDVVESIIKMAWRHQADISVMR 136
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRT------GGLSVSL 238
+G +SN+T+R + + +T EG +++SLSG+++ S S S+ L
Sbjct: 137 GSGLVSNITIRNSTSHSPALTIEGPIKMMSLSGTYINPNSDTVPSEFITNPNHSSFSIFL 196
Query: 239 S--GPDGRVLGGSVAGLLTAATPVQV 262
S G +G+V GG V G + A+ V +
Sbjct: 197 SGNGNEGQVYGGIVIGKIMASGNVMI 222
>gi|194431601|ref|ZP_03063892.1| conserved hypothetical protein [Shigella dysenteriae 1012]
gi|416279881|ref|ZP_11645026.1| hypothetical protein SGB_00538 [Shigella boydii ATCC 9905]
gi|417673905|ref|ZP_12323350.1| hypothetical protein SD15574_3425 [Shigella dysenteriae 155-74]
gi|417691199|ref|ZP_12340416.1| hypothetical protein SB521682_3478 [Shigella boydii 5216-82]
gi|420348920|ref|ZP_14850301.1| putative DNA-binding protein [Shigella boydii 965-58]
gi|194419957|gb|EDX36035.1| conserved hypothetical protein [Shigella dysenteriae 1012]
gi|320182168|gb|EFW57071.1| hypothetical protein SGB_00538 [Shigella boydii ATCC 9905]
gi|332086852|gb|EGI91988.1| hypothetical protein SB521682_3478 [Shigella boydii 5216-82]
gi|332087737|gb|EGI92864.1| hypothetical protein SD15574_3425 [Shigella dysenteriae 155-74]
gi|391267106|gb|EIQ26043.1| putative DNA-binding protein [Shigella boydii 965-58]
Length = 143
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 63/115 (54%), Gaps = 12/115 (10%)
Query: 162 GEDVSSKIMSFSQNGP-RAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F+Q A I + G+++++ LR A G T+ G FE++SL+G+
Sbjct: 22 GQEVFSQLHAFAQQQQLHAAWIAGSTGSLTDIALRYAGQEGTTLL-NGTFEVISLNGTL- 79
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E SG+ L + +S P G +LGG + T T +++++G LA R+
Sbjct: 80 --EQSGEH-----LHLCVSDPHGTMLGGHMMPGCTVRTTLELMIGCLEELAFSRQ 127
>gi|161616029|ref|YP_001589994.1| hypothetical protein SPAB_03830 [Salmonella enterica subsp.
enterica serovar Paratyphi B str. SPB7]
gi|161365393|gb|ABX69161.1| hypothetical protein SPAB_03830 [Salmonella enterica subsp.
enterica serovar Paratyphi B str. SPB7]
Length = 141
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 59/107 (55%), Gaps = 10/107 (9%)
Query: 162 GEDVSSKIMSF-SQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F QN RA I G++++V LR A T + G FE++SL+G+
Sbjct: 21 GQEVFSQLHAFVQQNQLRAAWIAGCTGSLTDVALRYAGQEA-TTSLTGTFEVISLNGTL- 78
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
E +G+ L +++S P G +LGG + T T +++V+G
Sbjct: 79 --ELTGEH-----LHLAVSDPYGAMLGGHMMPGCTVRTTLELVIGEL 118
>gi|194426320|ref|ZP_03058875.1| conserved hypothetical protein [Escherichia coli B171]
gi|194415628|gb|EDX31895.1| conserved hypothetical protein [Escherichia coli B171]
Length = 143
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 37/115 (32%), Positives = 61/115 (53%), Gaps = 12/115 (10%)
Query: 162 GEDVSSKIMSFSQNGP-RAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F+Q A I G++++V LR A G T+ G FE++SL+G+
Sbjct: 22 GQEVFSQLHAFAQQQQLHAAWIAGCTGSLTDVALRYAGQEGTTLL-NGTFEVISLNGTL- 79
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E SG+ L + +S P G +LGG + T T +++V G LA R+
Sbjct: 80 --EQSGEH-----LHLCVSDPHGTMLGGHMMPGCTVRTTLELVSGCLEELAFNRQ 127
>gi|327270676|ref|XP_003220115.1| PREDICTED: bifunctional protein glmU-like [Anolis carolinensis]
Length = 146
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 61/120 (50%), Gaps = 16/120 (13%)
Query: 162 GEDVSSKIMSFSQNGP-RAVCILSANGAISNVTLRQA----ATSGGTVTYEGRFEILSLS 216
GED+ S ++ F ++ ++ +++ G+IS TLR A + + V RFEI+SL
Sbjct: 18 GEDILSTLVKFVKDRKLKSPFVMTCVGSISKATLRLANAIASNTNKIVHLNERFEIVSLV 77
Query: 217 GSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESK 276
G+ L+E+ L + LS DG+ +GG V L T ++VVG DG S+
Sbjct: 78 GT--LNEAPH-------LHICLSDKDGKTIGGHVVSDLIVFTTAEIVVGE--CDGLWFSR 126
>gi|16766372|ref|NP_461987.1| DNA-binding protein [Salmonella enterica subsp. enterica serovar
Typhimurium str. LT2]
gi|167990408|ref|ZP_02571508.1| putative DNA-binding protein [Salmonella enterica subsp. enterica
serovar 4,[5],12:i:- str. CVM23701]
gi|197265357|ref|ZP_03165431.1| putative DNA-binding protein [Salmonella enterica subsp. enterica
serovar Saintpaul str. SARA23]
gi|374979088|ref|ZP_09720427.1| hypothetical protein SEE_01103 [Salmonella enterica subsp. enterica
serovar Typhimurium str. TN061786]
gi|378446425|ref|YP_005234057.1| hypothetical protein [Salmonella enterica subsp. enterica serovar
Typhimurium str. D23580]
gi|378451855|ref|YP_005239215.1| putative DNA-binding protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. 14028S]
gi|378700979|ref|YP_005182936.1| hypothetical protein SL1344_3047 [Salmonella enterica subsp.
enterica serovar Typhimurium str. SL1344]
gi|378985664|ref|YP_005248820.1| putative DNA-binding protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. T000240]
gi|378990391|ref|YP_005253555.1| putative DNA-binding protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. UK-1]
gi|379702328|ref|YP_005244056.1| putative DNA-binding protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. ST4/74]
gi|383497735|ref|YP_005398424.1| hypothetical protein UMN798_3339 [Salmonella enterica subsp.
enterica serovar Typhimurium str. 798]
gi|422027288|ref|ZP_16373631.1| hypothetical protein B571_15298 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm1]
gi|422032323|ref|ZP_16378437.1| hypothetical protein B572_15419 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm2]
gi|427554026|ref|ZP_18928928.1| hypothetical protein B576_15355 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm8]
gi|427571580|ref|ZP_18933643.1| hypothetical protein B577_14775 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm9]
gi|427592328|ref|ZP_18938442.1| hypothetical protein B573_14810 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm3]
gi|427615872|ref|ZP_18943332.1| hypothetical protein B574_15204 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm4]
gi|427639718|ref|ZP_18948212.1| hypothetical protein B575_15430 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm6]
gi|427657303|ref|ZP_18952957.1| hypothetical protein B578_15011 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm10]
gi|427662621|ref|ZP_18957922.1| hypothetical protein B579_15918 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm11]
gi|427676244|ref|ZP_18962737.1| hypothetical protein B580_15679 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm12]
gi|427800287|ref|ZP_18968065.1| hypothetical protein B581_18177 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm5]
gi|16421623|gb|AAL21946.1| putative DNA-binding protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. LT2]
gi|197243612|gb|EDY26232.1| putative DNA-binding protein [Salmonella enterica subsp. enterica
serovar Saintpaul str. SARA23]
gi|205331224|gb|EDZ17988.1| putative DNA-binding protein [Salmonella enterica subsp. enterica
serovar 4,[5],12:i:- str. CVM23701]
gi|261248204|emb|CBG26040.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. D23580]
gi|267995234|gb|ACY90119.1| putative DNA-binding protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. 14028S]
gi|301159627|emb|CBW19146.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. SL1344]
gi|312914093|dbj|BAJ38067.1| putative DNA-binding protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. T000240]
gi|321225748|gb|EFX50802.1| hypothetical protein SEE_01103 [Salmonella enterica subsp. enterica
serovar Typhimurium str. TN061786]
gi|323131427|gb|ADX18857.1| putative DNA-binding protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. ST4/74]
gi|332989938|gb|AEF08921.1| putative DNA-binding protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. UK-1]
gi|380464556|gb|AFD59959.1| hypothetical protein UMN798_3339 [Salmonella enterica subsp.
enterica serovar Typhimurium str. 798]
gi|414015085|gb|EKS98912.1| hypothetical protein B571_15298 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm1]
gi|414015936|gb|EKS99726.1| hypothetical protein B576_15355 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm8]
gi|414016613|gb|EKT00376.1| hypothetical protein B572_15419 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm2]
gi|414029363|gb|EKT12523.1| hypothetical protein B577_14775 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm9]
gi|414030857|gb|EKT13938.1| hypothetical protein B573_14810 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm3]
gi|414033964|gb|EKT16905.1| hypothetical protein B574_15204 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm4]
gi|414044196|gb|EKT26652.1| hypothetical protein B575_15430 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm6]
gi|414044913|gb|EKT27343.1| hypothetical protein B578_15011 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm10]
gi|414049665|gb|EKT31864.1| hypothetical protein B579_15918 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm11]
gi|414057325|gb|EKT39083.1| hypothetical protein B580_15679 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm12]
gi|414063546|gb|EKT44669.1| hypothetical protein B581_18177 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm5]
Length = 141
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 59/107 (55%), Gaps = 10/107 (9%)
Query: 162 GEDVSSKIMSF-SQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F QN RA I G++++V LR A T + G FE++SL+G+
Sbjct: 21 GQEVFSQLHAFVQQNQLRAAWIAGCTGSLTDVALRYAGQEA-TTSLTGTFEVISLNGTL- 78
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
E +G+ L +++S P G +LGG + T T +++V+G
Sbjct: 79 --ELTGEH-----LHLAVSDPYGVMLGGHMMPGCTVRTTLELVIGEL 118
>gi|340000605|ref|YP_004731489.1| hypothetical protein SBG_2675 [Salmonella bongori NCTC 12419]
gi|339513967|emb|CCC31726.1| conserved hypothetical protein [Salmonella bongori NCTC 12419]
Length = 141
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 59/109 (54%), Gaps = 10/109 (9%)
Query: 162 GEDVSSKIMSF-SQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F QN A + G++++V LR A T+ G FE++SL+G+
Sbjct: 21 GQEVFSQLHTFVQQNQLHAAWVAGCTGSLTDVALRYAGQESTTL-LTGTFEVISLNGTL- 78
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLA 269
ES G+ L +S++ P G +LGG + T T +++++G A
Sbjct: 79 --ESGGEH-----LHLSIADPCGAMLGGHMMSGCTVRTTLELIIGELTA 120
>gi|422780134|ref|ZP_16832919.1| hypothetical protein ERFG_00372 [Escherichia coli TW10509]
gi|432888178|ref|ZP_20101930.1| hypothetical protein A31C_03669 [Escherichia coli KTE158]
gi|323978781|gb|EGB73862.1| hypothetical protein ERFG_00372 [Escherichia coli TW10509]
gi|431414633|gb|ELG97184.1| hypothetical protein A31C_03669 [Escherichia coli KTE158]
Length = 143
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 61/115 (53%), Gaps = 12/115 (10%)
Query: 162 GEDVSSKIMSFSQNGP-RAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F+Q A I G++++V LR A T G+FE+++L+G+
Sbjct: 22 GQEVLSQLRAFAQQQQLHAAWIAGCTGSLTDVALRYAGQEN-TALLSGKFEVIALNGTL- 79
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E SG+ L + +S P G +LGG + T T +++V+G LA R+
Sbjct: 80 --EQSGEH-----LHLCVSDPHGTILGGHMMPGCTVRTTLELVIGCLEELAFSRQ 127
>gi|331648676|ref|ZP_08349764.1| conserved hypothetical protein [Escherichia coli M605]
gi|417663484|ref|ZP_12313064.1| hypothetical protein ECAA86_03127 [Escherichia coli AA86]
gi|330908957|gb|EGH37471.1| hypothetical protein ECAA86_03127 [Escherichia coli AA86]
gi|331042423|gb|EGI14565.1| conserved hypothetical protein [Escherichia coli M605]
Length = 143
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 62/115 (53%), Gaps = 12/115 (10%)
Query: 162 GEDVSSKIMSFSQNGP-RAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F+Q A I G++++V LR A T+ G+FE+++L+G+
Sbjct: 22 GQEVLSQLRAFAQQQQLHAAWIAGCTGSLTDVALRYAGQENTTLL-SGKFEVIALNGTL- 79
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E SG+ L + +S P G +LGG + T T +++V+G LA R+
Sbjct: 80 --EQSGEH-----LHLCVSDPHGTMLGGHMMPGCTVRTTLELVIGCLEELAFSRQ 127
>gi|432623146|ref|ZP_19859168.1| hypothetical protein A1UO_03030 [Escherichia coli KTE76]
gi|431157785|gb|ELE58419.1| hypothetical protein A1UO_03030 [Escherichia coli KTE76]
Length = 143
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 61/115 (53%), Gaps = 12/115 (10%)
Query: 162 GEDVSSKIMSFSQNGP-RAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F+Q A I G++++V LR A T G+FE+++L+G+
Sbjct: 22 GQEVLSQLRAFAQQQQLHAAWIAGCTGSLTDVALRYAGQEN-TALLSGKFEVIALNGTL- 79
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E SG+ L + +S P G +LGG + T T +++V+G LA R+
Sbjct: 80 --EQSGEH-----LHLCVSDPHGTMLGGHIMPGCTVRTTLELVIGCLEELAFSRQ 127
>gi|125605376|gb|EAZ44412.1| hypothetical protein OsJ_29032 [Oryza sativa Japonica Group]
Length = 243
Score = 47.8 bits (112), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 19/42 (45%), Positives = 29/42 (69%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQ 196
H++ V AG DV + ++++ R VC+LSA G ++NVTLRQ
Sbjct: 46 HILEVAAGCDVFEALTAYARRRQRGVCVLSAAGTVANVTLRQ 87
>gi|157148472|ref|YP_001455791.1| hypothetical protein CKO_04298 [Citrobacter koseri ATCC BAA-895]
gi|157085677|gb|ABV15355.1| hypothetical protein CKO_04298 [Citrobacter koseri ATCC BAA-895]
Length = 141
Score = 47.8 bits (112), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 69/131 (52%), Gaps = 13/131 (9%)
Query: 162 GEDVSSKIMSF-SQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F Q+ RA I G++++V LR A T+ G +E++SL+G+
Sbjct: 21 GQEVFSQLHAFVQQHQLRAAWIAGCTGSLTDVALRFAGQEETTL-LTGTYEVISLNGTL- 78
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRKESKSS 278
E +G+ L +S+S P G +LGG + T T +++V+G LA R+ S
Sbjct: 79 --ELTGEH-----LHLSVSDPKGAMLGGHMMAGCTVRTTLELVIGELNALAFSRQPCAVS 131
Query: 279 HRMESLPVPPK 289
E L + P+
Sbjct: 132 -GYEELVISPR 141
>gi|417140474|ref|ZP_11983724.1| PF03479 domain protein [Escherichia coli 97.0259]
gi|432816619|ref|ZP_20050381.1| hypothetical protein A1Y1_03020 [Escherichia coli KTE115]
gi|386156597|gb|EIH12942.1| PF03479 domain protein [Escherichia coli 97.0259]
gi|431363238|gb|ELG49811.1| hypothetical protein A1Y1_03020 [Escherichia coli KTE115]
Length = 143
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 62/115 (53%), Gaps = 12/115 (10%)
Query: 162 GEDVSSKIMSFSQNGP-RAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F+Q A I G++++V LR A T+ G+FE+++L+G+
Sbjct: 22 GQEVLSQLRAFAQQQQLHAAWIAGCTGSLTDVALRYAGQENTTLL-SGKFEVIALNGTL- 79
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E SG+ L + +S P G +LGG + T T +++V+G LA R+
Sbjct: 80 --EQSGEH-----LHLCVSDPHGTMLGGHMMPGCTVRTTLELVIGCLEELAFSRQ 127
>gi|417360598|ref|ZP_12134681.1| putative regulator [Salmonella enterica subsp. enterica serovar
Give str. S5-487]
gi|353586279|gb|EHC45901.1| putative regulator [Salmonella enterica subsp. enterica serovar
Give str. S5-487]
Length = 141
Score = 47.4 bits (111), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 58/107 (54%), Gaps = 10/107 (9%)
Query: 162 GEDVSSKIMSF-SQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F QN A I G++++V LR A T + G FE++SL+G+
Sbjct: 21 GQEVFSQLHAFVQQNQLHAAWIAGCTGSLADVALRYAGQEA-TTSLTGTFEVISLNGTL- 78
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
E +G+ L +++S P G +LGG + T T +++V+G
Sbjct: 79 --ELTGEH-----LHLAVSDPYGAMLGGHMMSGCTVRTTLELVIGEL 118
>gi|116667883|pdb|2HX0|A Chain A, Three-Dimensional Structure Of The Hypothetical Protein
From Salmonella Cholerae-Suis (Aka Salmonella Enterica)
At The Resolution 1.55 A. Northeast Structural Genomics
Target Scr59
Length = 154
Score = 47.4 bits (111), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 57/107 (53%), Gaps = 10/107 (9%)
Query: 162 GEDVSSKIMSF-SQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F QN RA I G++++V LR A T + G FE++SL+G+
Sbjct: 28 GQEVFSQLHAFVQQNQLRAAWIAGCTGSLTDVALRYAGQEA-TTSLTGTFEVISLNGTL- 85
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
E +G+ L +++S P G LGG T T +++V+G
Sbjct: 86 --ELTGEH-----LHLAVSDPYGVXLGGHXXPGCTVRTTLELVIGEL 125
>gi|50725928|dbj|BAD33456.1| DNA-binding protein-like [Oryza sativa Japonica Group]
Length = 347
Score = 47.0 bits (110), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 34/109 (31%), Positives = 52/109 (47%), Gaps = 14/109 (12%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILS 214
H++ + GEDV+ + F++ V +LRQ G + G EILS
Sbjct: 197 HMMEIADGEDVAEAVADFARRRQSWV-----------ASLRQPGEPGSVIELSGPLEILS 245
Query: 215 LSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVV 263
LSG+F+ S + GL L+G G+V+GG+V G L A V ++
Sbjct: 246 LSGAFMPPPSLANAT---GLKALLAGGQGQVIGGNVVGALRARGHVTIL 291
>gi|334187343|ref|NP_001190975.1| AT hook motif DNA-binding family protein [Arabidopsis thaliana]
gi|332661739|gb|AEE87139.1| AT hook motif DNA-binding family protein [Arabidopsis thaliana]
Length = 62
Score = 47.0 bits (110), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 29/50 (58%), Positives = 33/50 (66%), Gaps = 4/50 (8%)
Query: 194 LRQAATSG--GTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGP 241
LRQA S GTV YEGRFEI+SLSGSFL SE + + G L +LS P
Sbjct: 2 LRQANNSNPTGTVKYEGRFEIISLSGSFLNSERN--ENHGGVLDHTLSHP 49
>gi|432864154|ref|ZP_20087881.1| hypothetical protein A311_03635 [Escherichia coli KTE146]
gi|431403435|gb|ELG86716.1| hypothetical protein A311_03635 [Escherichia coli KTE146]
Length = 143
Score = 47.0 bits (110), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 61/115 (53%), Gaps = 12/115 (10%)
Query: 162 GEDVSSKIMSFSQNGP-RAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F+Q A I G++++V LR A T G+FE+++L+G+
Sbjct: 22 GQEVLSQLRAFAQQQQLHAAWIAGCTGSLTDVALRYAGQEN-TALLSGKFEVIALNGTL- 79
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E SG+ L + +S P G +LGG + T T +++V+G LA R+
Sbjct: 80 --EQSGEH-----LHLCVSDPHGTMLGGHMMPGCTVRTTLELVIGCLEELAFSRQ 127
>gi|194436700|ref|ZP_03068800.1| conserved hypothetical protein [Escherichia coli 101-1]
gi|218701634|ref|YP_002409263.1| hypothetical protein ECIAI39_3342 [Escherichia coli IAI39]
gi|300925088|ref|ZP_07141003.1| hypothetical protein HMPREF9548_03192 [Escherichia coli MS 182-1]
gi|386625651|ref|YP_006145379.1| hypothetical protein CE10_3364 [Escherichia coli O7:K1 str. CE10]
gi|422828289|ref|ZP_16876461.1| hypothetical protein ESNG_00966 [Escherichia coli B093]
gi|432418387|ref|ZP_19660983.1| hypothetical protein WGI_03904 [Escherichia coli KTE44]
gi|194424182|gb|EDX40169.1| conserved hypothetical protein [Escherichia coli 101-1]
gi|218371620|emb|CAR19459.1| conserved hypothetical protein with PD1-like DNA-binding motif
[Escherichia coli IAI39]
gi|300418750|gb|EFK02061.1| hypothetical protein HMPREF9548_03192 [Escherichia coli MS 182-1]
gi|349739387|gb|AEQ14093.1| hypothetical protein CE10_3364 [Escherichia coli O7:K1 str. CE10]
gi|371614991|gb|EHO03451.1| hypothetical protein ESNG_00966 [Escherichia coli B093]
gi|430937665|gb|ELC57919.1| hypothetical protein WGI_03904 [Escherichia coli KTE44]
Length = 143
Score = 47.0 bits (110), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 61/115 (53%), Gaps = 12/115 (10%)
Query: 162 GEDVSSKIMSFSQNGP-RAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F+Q A I G++++V LR A T G+FE+++L+G+
Sbjct: 22 GQEVLSQLRAFAQQQQLHAAWIAGCTGSLTDVALRYAGQEN-TALLSGKFEVIALNGTL- 79
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E SG+ L + +S P G +LGG + T T +++V+G LA R+
Sbjct: 80 --EQSGEH-----LHLCVSDPHGTMLGGHMMPGCTVRTTLELVIGCLEELAFSRQ 127
>gi|351722831|ref|NP_001234954.1| uncharacterized protein LOC100527104 [Glycine max]
gi|255631562|gb|ACU16148.1| unknown [Glycine max]
Length = 187
Score = 47.0 bits (110), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 40/153 (26%), Positives = 63/153 (41%), Gaps = 9/153 (5%)
Query: 121 PDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAV 180
P S K GRP GS + K L + + P I V DV ++ F+++ ++
Sbjct: 29 PPSSNKGCGRPLGSKNKPKIPLVINQDSDLALKPIFIQVPKNSDVIEAVVQFARHCQVSI 88
Query: 181 CILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL---------LSESSGQRSRT 231
+ A+G+I TL Q T G F ++SL+G+++ S
Sbjct: 89 TVQCASGSILEATLCQTLPDTSTFVVFGPFTLISLTGTYINNNLSASSSSLSSPSNLDHN 148
Query: 232 GGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVV 264
++S G+ G V G + AA V VVV
Sbjct: 149 CSFTISFCSNFGQSFNGIVGGKVIAADDVTVVV 181
>gi|26249341|ref|NP_755381.1| hypothetical protein c3506 [Escherichia coli CFT073]
gi|110643072|ref|YP_670802.1| hypothetical protein ECP_2917 [Escherichia coli 536]
gi|300980280|ref|ZP_07174934.1| conserved hypothetical protein [Escherichia coli MS 45-1]
gi|301049289|ref|ZP_07196259.1| conserved hypothetical protein [Escherichia coli MS 185-1]
gi|306812174|ref|ZP_07446372.1| hypothetical protein ECNC101_09684 [Escherichia coli NC101]
gi|331659056|ref|ZP_08359998.1| conserved hypothetical protein [Escherichia coli TA206]
gi|386630673|ref|YP_006150393.1| hypothetical protein i02_3227 [Escherichia coli str. 'clone D i2']
gi|386635593|ref|YP_006155312.1| hypothetical protein i14_3227 [Escherichia coli str. 'clone D i14']
gi|416336908|ref|ZP_11673378.1| hypothetical protein EcoM_02804 [Escherichia coli WV_060327]
gi|419701729|ref|ZP_14229328.1| hypothetical protein OQA_14371 [Escherichia coli SCI-07]
gi|422363361|ref|ZP_16443898.1| conserved hypothetical protein [Escherichia coli MS 153-1]
gi|422372558|ref|ZP_16452915.1| conserved hypothetical protein [Escherichia coli MS 16-3]
gi|422383261|ref|ZP_16463413.1| hypothetical protein HMPREF9532_04828 [Escherichia coli MS 57-2]
gi|432382627|ref|ZP_19625566.1| hypothetical protein WCU_02788 [Escherichia coli KTE15]
gi|432388560|ref|ZP_19631441.1| hypothetical protein WCY_03825 [Escherichia coli KTE16]
gi|432433121|ref|ZP_19675546.1| hypothetical protein A13K_03422 [Escherichia coli KTE187]
gi|432437604|ref|ZP_19679991.1| hypothetical protein A13M_03331 [Escherichia coli KTE188]
gi|432457947|ref|ZP_19700126.1| hypothetical protein A15C_03752 [Escherichia coli KTE201]
gi|432467083|ref|ZP_19709168.1| hypothetical protein A15K_03043 [Escherichia coli KTE205]
gi|432472230|ref|ZP_19714270.1| hypothetical protein A15M_03124 [Escherichia coli KTE206]
gi|432515190|ref|ZP_19752411.1| hypothetical protein A17M_03062 [Escherichia coli KTE224]
gi|432525078|ref|ZP_19762202.1| hypothetical protein A17Y_03208 [Escherichia coli KTE230]
gi|432544565|ref|ZP_19781405.1| hypothetical protein A197_03159 [Escherichia coli KTE236]
gi|432550055|ref|ZP_19786819.1| hypothetical protein A199_03534 [Escherichia coli KTE237]
gi|432554965|ref|ZP_19791684.1| hypothetical protein A1S3_03380 [Escherichia coli KTE47]
gi|432569967|ref|ZP_19806475.1| hypothetical protein A1SE_03563 [Escherichia coli KTE53]
gi|432581987|ref|ZP_19818401.1| hypothetical protein A1SM_01192 [Escherichia coli KTE57]
gi|432594100|ref|ZP_19830413.1| hypothetical protein A1SS_03533 [Escherichia coli KTE60]
gi|432608766|ref|ZP_19844949.1| hypothetical protein A1U7_03783 [Escherichia coli KTE67]
gi|432612908|ref|ZP_19849066.1| hypothetical protein A1UG_03286 [Escherichia coli KTE72]
gi|432647460|ref|ZP_19883246.1| hypothetical protein A1W5_03229 [Escherichia coli KTE86]
gi|432657051|ref|ZP_19892751.1| hypothetical protein A1WE_03178 [Escherichia coli KTE93]
gi|432700319|ref|ZP_19935469.1| hypothetical protein A31M_03081 [Escherichia coli KTE169]
gi|432707168|ref|ZP_19942246.1| hypothetical protein WCG_00438 [Escherichia coli KTE6]
gi|432733662|ref|ZP_19968487.1| hypothetical protein WGK_03522 [Escherichia coli KTE45]
gi|432746884|ref|ZP_19981546.1| hypothetical protein WGG_03006 [Escherichia coli KTE43]
gi|432760748|ref|ZP_19995238.1| hypothetical protein A1S1_02890 [Escherichia coli KTE46]
gi|432784797|ref|ZP_20018975.1| hypothetical protein A1SY_03659 [Escherichia coli KTE63]
gi|432845951|ref|ZP_20078632.1| hypothetical protein A1YS_03396 [Escherichia coli KTE141]
gi|432900132|ref|ZP_20110554.1| hypothetical protein A13U_03335 [Escherichia coli KTE192]
gi|432906285|ref|ZP_20115013.1| hypothetical protein A13Y_03402 [Escherichia coli KTE194]
gi|432939410|ref|ZP_20137513.1| hypothetical protein A13C_01954 [Escherichia coli KTE183]
gi|432973065|ref|ZP_20161926.1| hypothetical protein A15O_03649 [Escherichia coli KTE207]
gi|432975031|ref|ZP_20163866.1| hypothetical protein A15S_00893 [Escherichia coli KTE209]
gi|432986649|ref|ZP_20175366.1| hypothetical protein A175_03117 [Escherichia coli KTE215]
gi|432996590|ref|ZP_20185173.1| hypothetical protein A17A_03667 [Escherichia coli KTE218]
gi|433029819|ref|ZP_20217671.1| hypothetical protein WIA_02926 [Escherichia coli KTE109]
gi|433039891|ref|ZP_20227487.1| hypothetical protein WIE_03251 [Escherichia coli KTE113]
gi|433059369|ref|ZP_20246409.1| hypothetical protein WIM_03145 [Escherichia coli KTE124]
gi|433074126|ref|ZP_20260771.1| hypothetical protein WIS_03088 [Escherichia coli KTE129]
gi|433079077|ref|ZP_20265599.1| hypothetical protein WIU_02944 [Escherichia coli KTE131]
gi|433083819|ref|ZP_20270271.1| hypothetical protein WIW_02972 [Escherichia coli KTE133]
gi|433088564|ref|ZP_20274931.1| hypothetical protein WIY_03025 [Escherichia coli KTE137]
gi|433102474|ref|ZP_20288550.1| hypothetical protein WK5_03031 [Escherichia coli KTE145]
gi|433116772|ref|ZP_20302559.1| hypothetical protein WKA_02967 [Escherichia coli KTE153]
gi|433121463|ref|ZP_20307127.1| hypothetical protein WKC_02895 [Escherichia coli KTE157]
gi|433145491|ref|ZP_20330628.1| hypothetical protein WKO_03036 [Escherichia coli KTE168]
gi|433184599|ref|ZP_20368839.1| hypothetical protein WGO_03039 [Escherichia coli KTE85]
gi|433189673|ref|ZP_20373765.1| hypothetical protein WGS_02758 [Escherichia coli KTE88]
gi|433209006|ref|ZP_20392677.1| hypothetical protein WI1_02787 [Escherichia coli KTE97]
gi|433213790|ref|ZP_20397378.1| hypothetical protein WI3_02980 [Escherichia coli KTE99]
gi|442605058|ref|ZP_21019896.1| Predicted regulator of STY3230 transporter operon [Escherichia coli
Nissle 1917]
gi|26109749|gb|AAN81954.1|AE016766_42 Hypothetical protein c3506 [Escherichia coli CFT073]
gi|110344664|gb|ABG70901.1| hypothetical protein ECP_2917 [Escherichia coli 536]
gi|300298888|gb|EFJ55273.1| conserved hypothetical protein [Escherichia coli MS 185-1]
gi|300409288|gb|EFJ92826.1| conserved hypothetical protein [Escherichia coli MS 45-1]
gi|305854212|gb|EFM54650.1| hypothetical protein ECNC101_09684 [Escherichia coli NC101]
gi|315293895|gb|EFU53247.1| conserved hypothetical protein [Escherichia coli MS 153-1]
gi|315295713|gb|EFU55033.1| conserved hypothetical protein [Escherichia coli MS 16-3]
gi|320195042|gb|EFW69671.1| hypothetical protein EcoM_02804 [Escherichia coli WV_060327]
gi|324005577|gb|EGB74796.1| hypothetical protein HMPREF9532_04828 [Escherichia coli MS 57-2]
gi|331053638|gb|EGI25667.1| conserved hypothetical protein [Escherichia coli TA206]
gi|355421572|gb|AER85769.1| hypothetical protein i02_3227 [Escherichia coli str. 'clone D i2']
gi|355426492|gb|AER90688.1| hypothetical protein i14_3227 [Escherichia coli str. 'clone D i14']
gi|380347191|gb|EIA35480.1| hypothetical protein OQA_14371 [Escherichia coli SCI-07]
gi|430904793|gb|ELC26492.1| hypothetical protein WCY_03825 [Escherichia coli KTE16]
gi|430905687|gb|ELC27295.1| hypothetical protein WCU_02788 [Escherichia coli KTE15]
gi|430951303|gb|ELC70523.1| hypothetical protein A13K_03422 [Escherichia coli KTE187]
gi|430961777|gb|ELC79784.1| hypothetical protein A13M_03331 [Escherichia coli KTE188]
gi|430980949|gb|ELC97693.1| hypothetical protein A15C_03752 [Escherichia coli KTE201]
gi|430992328|gb|ELD08701.1| hypothetical protein A15K_03043 [Escherichia coli KTE205]
gi|430996861|gb|ELD13136.1| hypothetical protein A15M_03124 [Escherichia coli KTE206]
gi|431040565|gb|ELD51100.1| hypothetical protein A17M_03062 [Escherichia coli KTE224]
gi|431050224|gb|ELD59975.1| hypothetical protein A17Y_03208 [Escherichia coli KTE230]
gi|431073500|gb|ELD81151.1| hypothetical protein A197_03159 [Escherichia coli KTE236]
gi|431078777|gb|ELD85817.1| hypothetical protein A199_03534 [Escherichia coli KTE237]
gi|431082316|gb|ELD88630.1| hypothetical protein A1S3_03380 [Escherichia coli KTE47]
gi|431098599|gb|ELE03912.1| hypothetical protein A1SE_03563 [Escherichia coli KTE53]
gi|431122269|gb|ELE25138.1| hypothetical protein A1SM_01192 [Escherichia coli KTE57]
gi|431126502|gb|ELE28849.1| hypothetical protein A1SS_03533 [Escherichia coli KTE60]
gi|431136845|gb|ELE38701.1| hypothetical protein A1U7_03783 [Escherichia coli KTE67]
gi|431147091|gb|ELE48514.1| hypothetical protein A1UG_03286 [Escherichia coli KTE72]
gi|431178807|gb|ELE78714.1| hypothetical protein A1W5_03229 [Escherichia coli KTE86]
gi|431189224|gb|ELE88649.1| hypothetical protein A1WE_03178 [Escherichia coli KTE93]
gi|431241930|gb|ELF36359.1| hypothetical protein A31M_03081 [Escherichia coli KTE169]
gi|431256278|gb|ELF49352.1| hypothetical protein WCG_00438 [Escherichia coli KTE6]
gi|431272570|gb|ELF63669.1| hypothetical protein WGK_03522 [Escherichia coli KTE45]
gi|431289996|gb|ELF80721.1| hypothetical protein WGG_03006 [Escherichia coli KTE43]
gi|431306055|gb|ELF94368.1| hypothetical protein A1S1_02890 [Escherichia coli KTE46]
gi|431327954|gb|ELG15274.1| hypothetical protein A1SY_03659 [Escherichia coli KTE63]
gi|431393461|gb|ELG77025.1| hypothetical protein A1YS_03396 [Escherichia coli KTE141]
gi|431423905|gb|ELH06002.1| hypothetical protein A13U_03335 [Escherichia coli KTE192]
gi|431430676|gb|ELH12507.1| hypothetical protein A13Y_03402 [Escherichia coli KTE194]
gi|431461080|gb|ELH41348.1| hypothetical protein A13C_01954 [Escherichia coli KTE183]
gi|431480225|gb|ELH59952.1| hypothetical protein A15O_03649 [Escherichia coli KTE207]
gi|431487097|gb|ELH66742.1| hypothetical protein A15S_00893 [Escherichia coli KTE209]
gi|431497918|gb|ELH77135.1| hypothetical protein A175_03117 [Escherichia coli KTE215]
gi|431503385|gb|ELH82120.1| hypothetical protein A17A_03667 [Escherichia coli KTE218]
gi|431541501|gb|ELI16940.1| hypothetical protein WIA_02926 [Escherichia coli KTE109]
gi|431550289|gb|ELI24286.1| hypothetical protein WIE_03251 [Escherichia coli KTE113]
gi|431568011|gb|ELI41003.1| hypothetical protein WIM_03145 [Escherichia coli KTE124]
gi|431585287|gb|ELI57239.1| hypothetical protein WIS_03088 [Escherichia coli KTE129]
gi|431595131|gb|ELI65205.1| hypothetical protein WIU_02944 [Escherichia coli KTE131]
gi|431599959|gb|ELI69637.1| hypothetical protein WIW_02972 [Escherichia coli KTE133]
gi|431603580|gb|ELI73005.1| hypothetical protein WIY_03025 [Escherichia coli KTE137]
gi|431617726|gb|ELI86737.1| hypothetical protein WK5_03031 [Escherichia coli KTE145]
gi|431632788|gb|ELJ01075.1| hypothetical protein WKA_02967 [Escherichia coli KTE153]
gi|431640754|gb|ELJ08509.1| hypothetical protein WKC_02895 [Escherichia coli KTE157]
gi|431659740|gb|ELJ26630.1| hypothetical protein WKO_03036 [Escherichia coli KTE168]
gi|431704039|gb|ELJ68673.1| hypothetical protein WGS_02758 [Escherichia coli KTE88]
gi|431704200|gb|ELJ68832.1| hypothetical protein WGO_03039 [Escherichia coli KTE85]
gi|431729161|gb|ELJ92800.1| hypothetical protein WI1_02787 [Escherichia coli KTE97]
gi|431733703|gb|ELJ97138.1| hypothetical protein WI3_02980 [Escherichia coli KTE99]
gi|441714149|emb|CCQ05873.1| Predicted regulator of STY3230 transporter operon [Escherichia coli
Nissle 1917]
Length = 142
Score = 47.0 bits (110), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 67/131 (51%), Gaps = 13/131 (9%)
Query: 162 GEDVSSKIMSFSQNGP-RAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F+Q A I G++++V LR A T G+FE+++L+G+
Sbjct: 22 GQEVLSQLRAFAQQQQLHAAWIAGCTGSLTDVALRYAGQEN-TALLSGKFEVIALNGTL- 79
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRKESKSS 278
E SG+ L + +S P G +LGG + T T +++V+G LA R+ S
Sbjct: 80 --EQSGEH-----LHLCVSDPHGTMLGGHMMPGCTVRTTLELVIGCLEELAFSRQSCALS 132
Query: 279 HRMESLPVPPK 289
+ L + P+
Sbjct: 133 -GYDELHISPR 142
>gi|432505687|ref|ZP_19747408.1| hypothetical protein A17E_02759 [Escherichia coli KTE220]
gi|432652410|ref|ZP_19888161.1| hypothetical protein A1W7_03435 [Escherichia coli KTE87]
gi|433001164|ref|ZP_20189685.1| hypothetical protein A17K_03512 [Escherichia coli KTE223]
gi|433126445|ref|ZP_20311997.1| hypothetical protein WKE_02944 [Escherichia coli KTE160]
gi|433140513|ref|ZP_20325763.1| hypothetical protein WKM_02797 [Escherichia coli KTE167]
gi|433150432|ref|ZP_20335446.1| hypothetical protein WKQ_03089 [Escherichia coli KTE174]
gi|431037203|gb|ELD48191.1| hypothetical protein A17E_02759 [Escherichia coli KTE220]
gi|431189510|gb|ELE88933.1| hypothetical protein A1W7_03435 [Escherichia coli KTE87]
gi|431506589|gb|ELH85184.1| hypothetical protein A17K_03512 [Escherichia coli KTE223]
gi|431642844|gb|ELJ10551.1| hypothetical protein WKE_02944 [Escherichia coli KTE160]
gi|431658368|gb|ELJ25282.1| hypothetical protein WKM_02797 [Escherichia coli KTE167]
gi|431669293|gb|ELJ35720.1| hypothetical protein WKQ_03089 [Escherichia coli KTE174]
Length = 143
Score = 47.0 bits (110), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 61/115 (53%), Gaps = 12/115 (10%)
Query: 162 GEDVSSKIMSFSQNGP-RAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F+Q A I G++++V LR A T G+FE+++L+G+
Sbjct: 22 GQEVLSQLRAFAQQQQLHAAWIAGCTGSLTDVALRYAGQEN-TALLSGKFEVIALNGTL- 79
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E SG+ L + +S P G +LGG + T T +++V+G LA R+
Sbjct: 80 --EQSGEH-----LHLCVSDPHGTMLGGHMMPGCTVRTTLELVIGCLEELAFSRQ 127
>gi|432407975|ref|ZP_19650680.1| hypothetical protein WEO_03178 [Escherichia coli KTE28]
gi|430928471|gb|ELC49020.1| hypothetical protein WEO_03178 [Escherichia coli KTE28]
Length = 143
Score = 47.0 bits (110), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 62/115 (53%), Gaps = 12/115 (10%)
Query: 162 GEDVSSKIMSFSQNGP-RAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V +++ +F+Q A I G++++V LR A T+ G+FE+++L+G+
Sbjct: 22 GQEVLAQLRAFAQQQQLHAAWIAGCTGSLTDVALRYAGQENTTLL-SGKFEVIALNGTL- 79
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E SG+ L + +S P G +LGG + T T +++V+G LA R+
Sbjct: 80 --EQSGEH-----LHLCVSDPHGTMLGGHMMPGCTVRTTLELVIGCLEELAFSRQ 127
>gi|255586936|ref|XP_002534068.1| DNA binding protein, putative [Ricinus communis]
gi|223525895|gb|EEF28312.1| DNA binding protein, putative [Ricinus communis]
Length = 109
Score = 46.6 bits (109), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 28/62 (45%), Positives = 36/62 (58%), Gaps = 10/62 (16%)
Query: 159 VKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGS 218
V +G DVS + +F++ R ++VTLRQ A+SG VT GRFEILSL GS
Sbjct: 3 VSSGCDVSESLANFARRKQRG----------TSVTLRQPASSGAIVTLHGRFEILSLLGS 52
Query: 219 FL 220
L
Sbjct: 53 IL 54
>gi|91212304|ref|YP_542290.1| hypothetical protein UTI89_C3311 [Escherichia coli UTI89]
gi|117625153|ref|YP_854141.1| hypothetical protein APECO1_3605 [Escherichia coli APEC O1]
gi|215488221|ref|YP_002330652.1| DNA-binding protein [Escherichia coli O127:H6 str. E2348/69]
gi|218559915|ref|YP_002392828.1| hypothetical protein ECS88_3204 [Escherichia coli S88]
gi|218691047|ref|YP_002399259.1| hypothetical protein ECED1_3384 [Escherichia coli ED1a]
gi|222157613|ref|YP_002557752.1| hypothetical protein LF82_454 [Escherichia coli LF82]
gi|237706427|ref|ZP_04536908.1| conserved hypothetical protein [Escherichia sp. 3_2_53FAA]
gi|312964814|ref|ZP_07779054.1| conserved hypothetical protein [Escherichia coli 2362-75]
gi|386600922|ref|YP_006102428.1| hypothetical protein ECOK1_3311 [Escherichia coli IHE3034]
gi|386603020|ref|YP_006109320.1| hypothetical protein UM146_01905 [Escherichia coli UM146]
gi|386620501|ref|YP_006140081.1| hypothetical protein ECNA114_2968 [Escherichia coli NA114]
gi|387618195|ref|YP_006121217.1| hypothetical protein NRG857_14355 [Escherichia coli O83:H1 str. NRG
857C]
gi|387830773|ref|YP_003350710.1| hypothetical protein ECSF_2720 [Escherichia coli SE15]
gi|417086424|ref|ZP_11953624.1| hypothetical protein i01_04027 [Escherichia coli cloneA_i1]
gi|417757170|ref|ZP_12405241.1| putative DNA-binding protein [Escherichia coli DEC2B]
gi|418998130|ref|ZP_13545720.1| putative DNA-binding protein [Escherichia coli DEC1A]
gi|419003510|ref|ZP_13551028.1| putative DNA-binding protein [Escherichia coli DEC1B]
gi|419009046|ref|ZP_13556470.1| putative DNA-binding protein [Escherichia coli DEC1C]
gi|419014837|ref|ZP_13562180.1| putative DNA-binding protein [Escherichia coli DEC1D]
gi|419019863|ref|ZP_13567167.1| putative DNA-binding protein [Escherichia coli DEC1E]
gi|419025252|ref|ZP_13572475.1| putative DNA-binding protein [Escherichia coli DEC2A]
gi|419030408|ref|ZP_13577564.1| putative DNA-binding protein [Escherichia coli DEC2C]
gi|419035928|ref|ZP_13583011.1| putative DNA-binding protein [Escherichia coli DEC2D]
gi|419041095|ref|ZP_13588117.1| putative DNA-binding protein [Escherichia coli DEC2E]
gi|419916121|ref|ZP_14434452.1| hypothetical protein ECKD1_23024 [Escherichia coli KD1]
gi|419944486|ref|ZP_14460965.1| hypothetical protein ECHM605_10716 [Escherichia coli HM605]
gi|422356717|ref|ZP_16437390.1| conserved hypothetical protein [Escherichia coli MS 110-3]
gi|422750054|ref|ZP_16803965.1| hypothetical protein ERKG_02280 [Escherichia coli H252]
gi|422754298|ref|ZP_16808124.1| hypothetical protein ERLG_01420 [Escherichia coli H263]
gi|422840917|ref|ZP_16888887.1| hypothetical protein ESPG_03573 [Escherichia coli H397]
gi|432359247|ref|ZP_19602463.1| hypothetical protein WCC_03211 [Escherichia coli KTE4]
gi|432364094|ref|ZP_19607251.1| hypothetical protein WCE_03126 [Escherichia coli KTE5]
gi|432423240|ref|ZP_19665780.1| hypothetical protein A137_03670 [Escherichia coli KTE178]
gi|432442356|ref|ZP_19684693.1| hypothetical protein A13O_03195 [Escherichia coli KTE189]
gi|432447470|ref|ZP_19689768.1| hypothetical protein A13S_03528 [Escherichia coli KTE191]
gi|432501371|ref|ZP_19743125.1| hypothetical protein A177_03481 [Escherichia coli KTE216]
gi|432560108|ref|ZP_19796771.1| hypothetical protein A1S7_03765 [Escherichia coli KTE49]
gi|432575102|ref|ZP_19811576.1| hypothetical protein A1SI_03809 [Escherichia coli KTE55]
gi|432589232|ref|ZP_19825585.1| hypothetical protein A1SO_03602 [Escherichia coli KTE58]
gi|432599097|ref|ZP_19835368.1| hypothetical protein A1SW_03838 [Escherichia coli KTE62]
gi|432695705|ref|ZP_19930899.1| hypothetical protein A31I_03188 [Escherichia coli KTE162]
gi|432755782|ref|ZP_19990328.1| hypothetical protein WEA_02778 [Escherichia coli KTE22]
gi|432779862|ref|ZP_20014083.1| hypothetical protein A1SQ_03524 [Escherichia coli KTE59]
gi|432788854|ref|ZP_20022982.1| hypothetical protein A1U3_02984 [Escherichia coli KTE65]
gi|432803090|ref|ZP_20037045.1| hypothetical protein A1W3_03342 [Escherichia coli KTE84]
gi|432822291|ref|ZP_20055980.1| hypothetical protein A1Y5_03907 [Escherichia coli KTE118]
gi|432823800|ref|ZP_20057470.1| hypothetical protein A1YA_00467 [Escherichia coli KTE123]
gi|432890201|ref|ZP_20103210.1| hypothetical protein A31K_00297 [Escherichia coli KTE165]
gi|432921003|ref|ZP_20124522.1| hypothetical protein A133_03461 [Escherichia coli KTE173]
gi|432928617|ref|ZP_20129737.1| hypothetical protein A135_03805 [Escherichia coli KTE175]
gi|432982264|ref|ZP_20171037.1| hypothetical protein A15W_03408 [Escherichia coli KTE211]
gi|433006381|ref|ZP_20194806.1| hypothetical protein A17S_03966 [Escherichia coli KTE227]
gi|433009049|ref|ZP_20197462.1| hypothetical protein A17W_01768 [Escherichia coli KTE229]
gi|433015167|ref|ZP_20203505.1| hypothetical protein WI5_02994 [Escherichia coli KTE104]
gi|433024754|ref|ZP_20212732.1| hypothetical protein WI9_02920 [Escherichia coli KTE106]
gi|433097688|ref|ZP_20283867.1| hypothetical protein WK3_02896 [Escherichia coli KTE139]
gi|433107144|ref|ZP_20293112.1| hypothetical protein WK7_03013 [Escherichia coli KTE148]
gi|433155000|ref|ZP_20339935.1| hypothetical protein WKS_02934 [Escherichia coli KTE176]
gi|433164885|ref|ZP_20349617.1| hypothetical protein WKW_03102 [Escherichia coli KTE179]
gi|433169870|ref|ZP_20354493.1| hypothetical protein WKY_03122 [Escherichia coli KTE180]
gi|433199623|ref|ZP_20383514.1| hypothetical protein WGW_03173 [Escherichia coli KTE94]
gi|433322106|ref|ZP_20399610.1| hypothetical protein B185_002184 [Escherichia coli J96]
gi|91073878|gb|ABE08759.1| hypothetical protein UTI89_C3311 [Escherichia coli UTI89]
gi|115514277|gb|ABJ02352.1| conserved hypothetical protein [Escherichia coli APEC O1]
gi|215266293|emb|CAS10723.1| predicted DNA-binding protein [Escherichia coli O127:H6 str.
E2348/69]
gi|218366684|emb|CAR04439.1| conserved hypothetical protein with PD1-like DNA-binding motif
[Escherichia coli S88]
gi|218428611|emb|CAR09540.2| conserved hypothetical protein with PD1-like DNA-binding motif
[Escherichia coli ED1a]
gi|222034618|emb|CAP77360.1| hypothetical protein LF82_454 [Escherichia coli LF82]
gi|226899467|gb|EEH85726.1| conserved hypothetical protein [Escherichia sp. 3_2_53FAA]
gi|281179930|dbj|BAI56260.1| conserved hypothetical protein [Escherichia coli SE15]
gi|294491762|gb|ADE90518.1| conserved hypothetical protein [Escherichia coli IHE3034]
gi|307625504|gb|ADN69808.1| hypothetical protein UM146_01905 [Escherichia coli UM146]
gi|312290370|gb|EFR18250.1| conserved hypothetical protein [Escherichia coli 2362-75]
gi|312947456|gb|ADR28283.1| hypothetical protein NRG857_14355 [Escherichia coli O83:H1 str. NRG
857C]
gi|315289465|gb|EFU48860.1| conserved hypothetical protein [Escherichia coli MS 110-3]
gi|323951637|gb|EGB47512.1| hypothetical protein ERKG_02280 [Escherichia coli H252]
gi|323957353|gb|EGB53075.1| hypothetical protein ERLG_01420 [Escherichia coli H263]
gi|333971002|gb|AEG37807.1| hypothetical protein ECNA114_2968 [Escherichia coli NA114]
gi|355350580|gb|EHF99777.1| hypothetical protein i01_04027 [Escherichia coli cloneA_i1]
gi|371605413|gb|EHN94027.1| hypothetical protein ESPG_03573 [Escherichia coli H397]
gi|377842080|gb|EHU07135.1| putative DNA-binding protein [Escherichia coli DEC1A]
gi|377842401|gb|EHU07455.1| putative DNA-binding protein [Escherichia coli DEC1C]
gi|377845233|gb|EHU10256.1| putative DNA-binding protein [Escherichia coli DEC1B]
gi|377855519|gb|EHU20390.1| putative DNA-binding protein [Escherichia coli DEC1D]
gi|377859023|gb|EHU23861.1| putative DNA-binding protein [Escherichia coli DEC1E]
gi|377862610|gb|EHU27422.1| putative DNA-binding protein [Escherichia coli DEC2A]
gi|377872548|gb|EHU37194.1| putative DNA-binding protein [Escherichia coli DEC2B]
gi|377875785|gb|EHU40394.1| putative DNA-binding protein [Escherichia coli DEC2C]
gi|377878446|gb|EHU43033.1| putative DNA-binding protein [Escherichia coli DEC2D]
gi|377888197|gb|EHU52669.1| putative DNA-binding protein [Escherichia coli DEC2E]
gi|388382521|gb|EIL44376.1| hypothetical protein ECKD1_23024 [Escherichia coli KD1]
gi|388418422|gb|EIL78230.1| hypothetical protein ECHM605_10716 [Escherichia coli HM605]
gi|430875109|gb|ELB98652.1| hypothetical protein WCC_03211 [Escherichia coli KTE4]
gi|430883856|gb|ELC06827.1| hypothetical protein WCE_03126 [Escherichia coli KTE5]
gi|430943194|gb|ELC63320.1| hypothetical protein A137_03670 [Escherichia coli KTE178]
gi|430965260|gb|ELC82701.1| hypothetical protein A13O_03195 [Escherichia coli KTE189]
gi|430972316|gb|ELC89314.1| hypothetical protein A13S_03528 [Escherichia coli KTE191]
gi|431027141|gb|ELD40206.1| hypothetical protein A177_03481 [Escherichia coli KTE216]
gi|431089882|gb|ELD95667.1| hypothetical protein A1S7_03765 [Escherichia coli KTE49]
gi|431105685|gb|ELE10019.1| hypothetical protein A1SI_03809 [Escherichia coli KTE55]
gi|431118590|gb|ELE21609.1| hypothetical protein A1SO_03602 [Escherichia coli KTE58]
gi|431128967|gb|ELE31143.1| hypothetical protein A1SW_03838 [Escherichia coli KTE62]
gi|431232333|gb|ELF28001.1| hypothetical protein A31I_03188 [Escherichia coli KTE162]
gi|431301086|gb|ELF90633.1| hypothetical protein WEA_02778 [Escherichia coli KTE22]
gi|431325105|gb|ELG12493.1| hypothetical protein A1SQ_03524 [Escherichia coli KTE59]
gi|431335854|gb|ELG22983.1| hypothetical protein A1U3_02984 [Escherichia coli KTE65]
gi|431347182|gb|ELG34075.1| hypothetical protein A1W3_03342 [Escherichia coli KTE84]
gi|431366080|gb|ELG52578.1| hypothetical protein A1Y5_03907 [Escherichia coli KTE118]
gi|431378325|gb|ELG63316.1| hypothetical protein A1YA_00467 [Escherichia coli KTE123]
gi|431432102|gb|ELH13875.1| hypothetical protein A31K_00297 [Escherichia coli KTE165]
gi|431439517|gb|ELH20851.1| hypothetical protein A133_03461 [Escherichia coli KTE173]
gi|431442604|gb|ELH23693.1| hypothetical protein A135_03805 [Escherichia coli KTE175]
gi|431490388|gb|ELH70005.1| hypothetical protein A15W_03408 [Escherichia coli KTE211]
gi|431512129|gb|ELH90257.1| hypothetical protein A17S_03966 [Escherichia coli KTE227]
gi|431522081|gb|ELH99316.1| hypothetical protein A17W_01768 [Escherichia coli KTE229]
gi|431528874|gb|ELI05579.1| hypothetical protein WI5_02994 [Escherichia coli KTE104]
gi|431533383|gb|ELI09883.1| hypothetical protein WI9_02920 [Escherichia coli KTE106]
gi|431614179|gb|ELI83338.1| hypothetical protein WK3_02896 [Escherichia coli KTE139]
gi|431625501|gb|ELI94081.1| hypothetical protein WK7_03013 [Escherichia coli KTE148]
gi|431672395|gb|ELJ38666.1| hypothetical protein WKS_02934 [Escherichia coli KTE176]
gi|431685241|gb|ELJ50816.1| hypothetical protein WKW_03102 [Escherichia coli KTE179]
gi|431686146|gb|ELJ51712.1| hypothetical protein WKY_03122 [Escherichia coli KTE180]
gi|431719406|gb|ELJ83465.1| hypothetical protein WGW_03173 [Escherichia coli KTE94]
gi|432349313|gb|ELL43742.1| hypothetical protein B185_002184 [Escherichia coli J96]
Length = 143
Score = 46.6 bits (109), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 61/115 (53%), Gaps = 12/115 (10%)
Query: 162 GEDVSSKIMSFSQNGP-RAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F+Q A I G++++V LR A T G+FE+++L+G+
Sbjct: 22 GQEVLSQLRAFAQQQQLHAAWIAGCTGSLTDVALRYAGQEN-TALLSGKFEVIALNGTL- 79
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E SG+ L + +S P G +LGG + T T +++V+G LA R+
Sbjct: 80 --EQSGEH-----LHLCVSDPHGTMLGGHMMPGCTVRTTLELVIGCLEELAFSRQ 127
>gi|260803918|ref|XP_002596836.1| hypothetical protein BRAFLDRAFT_129099 [Branchiostoma floridae]
gi|229282096|gb|EEN52848.1| hypothetical protein BRAFLDRAFT_129099 [Branchiostoma floridae]
Length = 148
Score = 46.6 bits (109), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 56/110 (50%), Gaps = 14/110 (12%)
Query: 162 GEDVSSKIMSF-SQNGPRAVCILSANGAISNVTLRQAATSGG----TVTYEGRFEILSLS 216
GE++ S + F + +A +++ G++S+ LR A + + + ++EI+SL
Sbjct: 15 GEEIKSALQKFVEEKRLKAPFVMTCVGSVSSAKLRLANATAEKPNEVIELDQKYEIVSLV 74
Query: 217 GSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGS 266
G+ + + L +SL+ DG V+GG V G LT T ++V+G
Sbjct: 75 GTL---------NNSCHLHISLADKDGAVIGGHVMGNLTVFTTAEIVIGE 115
>gi|119390481|pdb|2NMU|A Chain A, Crystal Structure Of The Hypothetical Protein From
Salmonella Typhimurium Lt2. Northeast Structural
Genomics Consortium Target Str127
Length = 156
Score = 46.6 bits (109), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 57/107 (53%), Gaps = 10/107 (9%)
Query: 162 GEDVSSKIMSF-SQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F QN RA I G++++V LR A T + G FE++SL+G+
Sbjct: 28 GQEVFSQLHAFVQQNQLRAAWIAGCTGSLTDVALRYAGQEA-TTSLTGTFEVISLNGTL- 85
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
E +G+ L +++S P G LGG T T +++V+G
Sbjct: 86 --ELTGEH-----LHLAVSDPYGVXLGGHXXPGCTVRTTLELVIGEL 125
>gi|348518377|ref|XP_003446708.1| PREDICTED: bifunctional protein glmU-like [Oreochromis niloticus]
Length = 151
Score = 46.6 bits (109), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 62/124 (50%), Gaps = 14/124 (11%)
Query: 148 AGVGFTPHVITVKAGEDVSSKIMSF-SQNGPRAVCILSANGAISNVTLRQA-ATSGGT-- 203
AG H I V+ G+++ + +F + RA I++ G+++ TLR A AT+ T
Sbjct: 6 AGSALRVHAIRVRPGQELLGTLQAFVEEKRLRAPFIVTCVGSLTKATLRLANATATKTNE 65
Query: 204 -VTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQV 262
V G FEI+SL G+ + + +SLS +G+ +GG V G L T +V
Sbjct: 66 VVHLSGHFEIVSLVGTL---------NPDAHVHISLSDFEGKTVGGHVLGDLEVFTTAEV 116
Query: 263 VVGS 266
V+G
Sbjct: 117 VIGE 120
>gi|289808355|ref|ZP_06538984.1| hypothetical protein Salmonellaentericaenterica_29647 [Salmonella
enterica subsp. enterica serovar Typhi str. AG3]
Length = 120
Score = 46.6 bits (109), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 58/107 (54%), Gaps = 10/107 (9%)
Query: 162 GEDVSSKIMSF-SQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F QN A I G++++V LR A T + G FE++SL+G+
Sbjct: 21 GQEVFSQLHAFVQQNQLHAAWIAGCTGSLTDVALRYAGQEA-TTSLTGTFEVISLNGTL- 78
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
E +G+ L +++S P G +LGG + T T +++V+G
Sbjct: 79 --ELTGEH-----LHLAVSDPYGAMLGGHMMPGCTVRTTLELVIGEL 118
>gi|432946154|ref|XP_004083794.1| PREDICTED: bifunctional protein GlmU-like [Oryzias latipes]
Length = 148
Score = 46.6 bits (109), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 35/125 (28%), Positives = 61/125 (48%), Gaps = 8/125 (6%)
Query: 145 LGSAGVGFTPHVITVKA--GEDVSSKIMSF-SQNGPRAVCILSANGAISNVTLRQAATSG 201
+ SAG G V V+ G+++ + +F + +A I++ G+++ TLR A S
Sbjct: 1 MNSAGAGSNLQVYAVRFCPGQEILGSLQAFVEERRLQAPFIMTCVGSVTKATLRLANASA 60
Query: 202 GTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQ 261
E++ L+G + + G +R L +SLS +G+ +GG V G L T +
Sbjct: 61 TNTN-----EVIHLTGHYEIVSLVGTLNRDAHLHISLSDAEGKTIGGHVLGDLEVFTTAE 115
Query: 262 VVVGS 266
VV+G
Sbjct: 116 VVIGE 120
>gi|168819900|ref|ZP_02831900.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Weltevreden str. HI_N05-537]
gi|409246769|ref|YP_006887473.1| Bifunctional protein glmU Includes: UDP-N-acetylglucosamine
pyrophosphorylase; N-acetylglucosamine-1-phosphate
uridyltransferase; Includes: RecName:
Full=Glucosamine-1-phosphate N-acetyltransferase
[Salmonella enterica subsp. enterica serovar Weltevreden
str. 2007-60-3289-1]
gi|205343292|gb|EDZ30056.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Weltevreden str. HI_N05-537]
gi|320087503|emb|CBY97268.1| Bifunctional protein glmU Includes: UDP-N-acetylglucosamine
pyrophosphorylase; N-acetylglucosamine-1-phosphate
uridyltransferase; Includes: RecName:
Full=Glucosamine-1-phosphate N-acetyltransferase
[Salmonella enterica subsp. enterica serovar Weltevreden
str. 2007-60-3289-1]
Length = 141
Score = 46.2 bits (108), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 58/107 (54%), Gaps = 10/107 (9%)
Query: 162 GEDVSSKIMSF-SQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F QN A I G++++V LR A T + G FE++SL+G+
Sbjct: 21 GQEVFSQLHAFVQQNQLHAAWIAGCTGSLTDVALRYAGQEA-TTSLTGTFEVISLNGTL- 78
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
E +G+ L +++S P G +LGG + T T +++V+G
Sbjct: 79 --ELTGEH-----LHLAVSDPYGAMLGGHMMPGCTVRTTLELVIGEL 118
>gi|213612949|ref|ZP_03370775.1| hypothetical protein SentesTyp_10804 [Salmonella enterica subsp.
enterica serovar Typhi str. E98-2068]
Length = 122
Score = 46.2 bits (108), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 58/107 (54%), Gaps = 10/107 (9%)
Query: 162 GEDVSSKIMSF-SQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F QN A I G++++V LR A T + G FE++SL+G+
Sbjct: 21 GQEVFSQLHAFVQQNQLHAAWIAGCTGSLTDVALRYAGQEA-TTSLTGTFEVISLNGTL- 78
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
E +G+ L +++S P G +LGG + T T +++V+G
Sbjct: 79 --ELTGEH-----LHLAVSDPYGAMLGGHMMPGCTVRTTLELVIGEL 118
>gi|16761853|ref|NP_457470.1| hypothetical protein STY3229 [Salmonella enterica subsp. enterica
serovar Typhi str. CT18]
gi|29143340|ref|NP_806682.1| hypothetical protein t2990 [Salmonella enterica subsp. enterica
serovar Typhi str. Ty2]
gi|56415017|ref|YP_152092.1| hypothetical protein SPA2942 [Salmonella enterica subsp. enterica
serovar Paratyphi A str. ATCC 9150]
gi|167553236|ref|ZP_02346986.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Saintpaul str. SARA29]
gi|168242866|ref|ZP_02667798.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Heidelberg str. SL486]
gi|168264491|ref|ZP_02686464.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Hadar str. RI_05P066]
gi|168463750|ref|ZP_02697667.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Newport str. SL317]
gi|194444998|ref|YP_002042325.1| hypothetical protein SNSL254_A3311 [Salmonella enterica subsp.
enterica serovar Newport str. SL254]
gi|194448879|ref|YP_002047058.1| hypothetical protein SeHA_C3308 [Salmonella enterica subsp.
enterica serovar Heidelberg str. SL476]
gi|197249997|ref|YP_002147985.1| hypothetical protein SeAg_B3233 [Salmonella enterica subsp.
enterica serovar Agona str. SL483]
gi|197363946|ref|YP_002143583.1| hypothetical protein SSPA2741 [Salmonella enterica subsp. enterica
serovar Paratyphi A str. AKU_12601]
gi|198246107|ref|YP_002217049.1| hypothetical protein SeD_A3413 [Salmonella enterica subsp. enterica
serovar Dublin str. CT_02021853]
gi|200388456|ref|ZP_03215068.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Virchow str. SL491]
gi|205353995|ref|YP_002227796.1| hypothetical protein SG2966 [Salmonella enterica subsp. enterica
serovar Gallinarum str. 287/91]
gi|207858334|ref|YP_002244985.1| hypothetical protein SEN2914 [Salmonella enterica subsp. enterica
serovar Enteritidis str. P125109]
gi|213162198|ref|ZP_03347908.1| hypothetical protein Salmoneentericaenterica_20263 [Salmonella
enterica subsp. enterica serovar Typhi str. E00-7866]
gi|213421005|ref|ZP_03354071.1| hypothetical protein Salmonentericaenterica_25957 [Salmonella
enterica subsp. enterica serovar Typhi str. E01-6750]
gi|213427980|ref|ZP_03360730.1| hypothetical protein SentesTyphi_21820 [Salmonella enterica subsp.
enterica serovar Typhi str. E02-1180]
gi|213650824|ref|ZP_03380877.1| hypothetical protein SentesTy_28511 [Salmonella enterica subsp.
enterica serovar Typhi str. J185]
gi|213850197|ref|ZP_03381095.1| hypothetical protein SentesT_00833 [Salmonella enterica subsp.
enterica serovar Typhi str. M223]
gi|238909871|ref|ZP_04653708.1| hypothetical protein SentesTe_01875 [Salmonella enterica subsp.
enterica serovar Tennessee str. CDC07-0191]
gi|289829559|ref|ZP_06547143.1| hypothetical protein Salmonellentericaenterica_23458 [Salmonella
enterica subsp. enterica serovar Typhi str. E98-3139]
gi|375120551|ref|ZP_09765718.1| hypothetical protein SD3246_3301 [Salmonella enterica subsp.
enterica serovar Dublin str. SD3246]
gi|375124858|ref|ZP_09770022.1| hypothetical protein SG9_3001 [Salmonella enterica subsp. enterica
serovar Gallinarum str. SG9]
gi|378956688|ref|YP_005214175.1| hypothetical protein SPUL_3073 [Salmonella enterica subsp. enterica
serovar Gallinarum/pullorum str. RKS5078]
gi|378961163|ref|YP_005218649.1| hypothetical protein STBHUCCB_31540 [Salmonella enterica subsp.
enterica serovar Typhi str. P-stx-12]
gi|386592775|ref|YP_006089175.1| putative regulator of STY3230-like transporter operon [Salmonella
enterica subsp. enterica serovar Heidelberg str. B182]
gi|417367854|ref|ZP_12139602.1| putative regulator [Salmonella enterica subsp. enterica serovar
Hvittingfoss str. A4-620]
gi|418760846|ref|ZP_13316998.1| hypothetical protein SEEN185_09770 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 35185]
gi|418766168|ref|ZP_13322247.1| hypothetical protein SEEN199_07998 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 35199]
gi|418771494|ref|ZP_13327501.1| hypothetical protein SEEN539_17452 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21539]
gi|418773737|ref|ZP_13329710.1| hypothetical protein SEEN953_05756 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 33953]
gi|418778456|ref|ZP_13334366.1| hypothetical protein SEEN188_20647 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 35188]
gi|418783366|ref|ZP_13339213.1| hypothetical protein SEEN559_07224 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21559]
gi|418788785|ref|ZP_13344578.1| hypothetical protein SEEN447_01292 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19447]
gi|418795258|ref|ZP_13350967.1| hypothetical protein SEEN449_20239 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19449]
gi|418797374|ref|ZP_13353060.1| hypothetical protein SEEN567_03604 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19567]
gi|418801300|ref|ZP_13356937.1| hypothetical protein SEEN202_18306 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 35202]
gi|418806276|ref|ZP_13361848.1| hypothetical protein SEEN550_06222 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21550]
gi|418810435|ref|ZP_13365975.1| hypothetical protein SEEN513_02176 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 22513]
gi|418818051|ref|ZP_13373530.1| hypothetical protein SEEN538_16315 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21538]
gi|418823120|ref|ZP_13378529.1| hypothetical protein SEEN425_18966 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 22425]
gi|418826763|ref|ZP_13381953.1| hypothetical protein SEEN462_25150 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 22462]
gi|418831015|ref|ZP_13385973.1| hypothetical protein SEEN486_12749 [Salmonella enterica subsp.
enterica serovar Newport str. CVM N18486]
gi|418837252|ref|ZP_13392127.1| hypothetical protein SEEN543_06451 [Salmonella enterica subsp.
enterica serovar Newport str. CVM N1543]
gi|418842515|ref|ZP_13397325.1| hypothetical protein SEEN554_12134 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21554]
gi|418847981|ref|ZP_13402721.1| hypothetical protein SEEN978_07580 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 37978]
gi|418856144|ref|ZP_13410792.1| hypothetical protein SEEN593_15215 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19593]
gi|419731324|ref|ZP_14258237.1| hypothetical protein SEEH1579_07445 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41579]
gi|419735779|ref|ZP_14262652.1| hypothetical protein SEEH1563_13711 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41563]
gi|419739548|ref|ZP_14266293.1| hypothetical protein SEEH1573_05093 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41573]
gi|419741943|ref|ZP_14268621.1| hypothetical protein SEEH1566_20517 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41566]
gi|419748327|ref|ZP_14274825.1| hypothetical protein SEEH1565_00320 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41565]
gi|419786993|ref|ZP_14312708.1| hypothetical protein SEENLE01_09311 [Salmonella enterica subsp.
enterica serovar Newport str. Levine 1]
gi|419793387|ref|ZP_14319010.1| hypothetical protein SEENLE15_01500 [Salmonella enterica subsp.
enterica serovar Newport str. Levine 15]
gi|421360657|ref|ZP_15810933.1| hypothetical protein SEEE3139_21520 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 622731-39]
gi|421363431|ref|ZP_15813673.1| hypothetical protein SEEE0166_12439 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639016-6]
gi|421369755|ref|ZP_15819930.1| hypothetical protein SEEE0631_21289 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 640631]
gi|421374198|ref|ZP_15824329.1| hypothetical protein SEEE0424_20933 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-0424]
gi|421378864|ref|ZP_15828943.1| hypothetical protein SEEE3076_21681 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607307-6]
gi|421383466|ref|ZP_15833504.1| hypothetical protein SEEE4917_21961 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 485549-17]
gi|421384887|ref|ZP_15834910.1| hypothetical protein SEEE6622_06319 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 596866-22]
gi|421389470|ref|ZP_15839453.1| hypothetical protein SEEE6670_06636 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 596866-70]
gi|421396756|ref|ZP_15846681.1| hypothetical protein SEEE6426_20678 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629164-26]
gi|421399535|ref|ZP_15849430.1| hypothetical protein SEEE6437_12385 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629164-37]
gi|421405975|ref|ZP_15855800.1| hypothetical protein SEEE7246_22078 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639672-46]
gi|421408497|ref|ZP_15858296.1| hypothetical protein SEEE7250_12053 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639672-50]
gi|421414872|ref|ZP_15864608.1| hypothetical protein SEEE1427_21380 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-1427]
gi|421417525|ref|ZP_15867235.1| hypothetical protein SEEE2659_12031 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-2659]
gi|421420864|ref|ZP_15870540.1| hypothetical protein SEEE1757_06085 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 78-1757]
gi|421428509|ref|ZP_15878120.1| hypothetical protein SEEE5101_21902 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 22510-1]
gi|421430952|ref|ZP_15880538.1| hypothetical protein SEEE8B1_11501 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 8b-1]
gi|421435618|ref|ZP_15885154.1| hypothetical protein SEEE5518_11701 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648905 5-18]
gi|421440040|ref|ZP_15889520.1| hypothetical protein SEEE1618_11183 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 6-18]
gi|421443901|ref|ZP_15893340.1| hypothetical protein SEEE3079_07636 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 50-3079]
gi|421449371|ref|ZP_15898755.1| hypothetical protein SEEE6482_12614 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 58-6482]
gi|421572949|ref|ZP_16018594.1| putative regulator of STY3230-like transporter operon [Salmonella
enterica subsp. enterica serovar Heidelberg str.
CFSAN00322]
gi|421576928|ref|ZP_16022518.1| putative regulator of STY3230-like transporter operon [Salmonella
enterica subsp. enterica serovar Heidelberg str.
CFSAN00325]
gi|421579426|ref|ZP_16024989.1| putative regulator of STY3230-like transporter operon [Salmonella
enterica subsp. enterica serovar Heidelberg str.
CFSAN00326]
gi|421583278|ref|ZP_16028802.1| putative regulator of STY3230-like transporter operon [Salmonella
enterica subsp. enterica serovar Heidelberg str.
CFSAN00328]
gi|421885558|ref|ZP_16316749.1| putative DNA-binding protein [Salmonella enterica subsp. enterica
serovar Senftenberg str. SS209]
gi|436605939|ref|ZP_20513456.1| hypothetical protein SEE22704_08801 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 22704]
gi|436785513|ref|ZP_20521325.1| hypothetical protein SEE30663_24765 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE30663]
gi|436799730|ref|ZP_20524016.1| hypothetical protein SEECHS44_12089 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CHS44]
gi|436807417|ref|ZP_20527460.1| hypothetical protein SEEE1882_06541 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1882]
gi|436818308|ref|ZP_20534941.1| hypothetical protein SEEE1884_21644 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1884]
gi|436832531|ref|ZP_20536821.1| hypothetical protein SEEE1594_08200 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1594]
gi|436853122|ref|ZP_20543147.1| hypothetical protein SEEE1566_17396 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1566]
gi|436861090|ref|ZP_20548274.1| hypothetical protein SEEE1580_20764 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1580]
gi|436867681|ref|ZP_20552835.1| hypothetical protein SEEE1543_21249 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1543]
gi|436873026|ref|ZP_20555908.1| hypothetical protein SEEE1441_14200 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1441]
gi|436880303|ref|ZP_20560062.1| hypothetical protein SEEE1810_12560 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1810]
gi|436891651|ref|ZP_20566351.1| hypothetical protein SEEE1558_21566 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1558]
gi|436899442|ref|ZP_20570853.1| hypothetical protein SEEE1018_21448 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1018]
gi|436902953|ref|ZP_20573417.1| hypothetical protein SEEE1010_11755 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1010]
gi|436914963|ref|ZP_20579810.1| hypothetical protein SEEE1729_21533 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1729]
gi|436919662|ref|ZP_20582443.1| hypothetical protein SEEE0895_11953 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0895]
gi|436928954|ref|ZP_20588160.1| hypothetical protein SEEE0899_17936 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0899]
gi|436938433|ref|ZP_20593220.1| hypothetical protein SEEE1457_20859 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1457]
gi|436946006|ref|ZP_20597834.1| hypothetical protein SEEE1747_21523 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1747]
gi|436955469|ref|ZP_20602344.1| hypothetical protein SEEE0968_21519 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0968]
gi|436966201|ref|ZP_20606870.1| hypothetical protein SEEE1444_21539 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1444]
gi|436969407|ref|ZP_20608404.1| hypothetical protein SEEE1445_06358 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1445]
gi|436980049|ref|ZP_20613194.1| hypothetical protein SEEE1559_08043 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1559]
gi|436993542|ref|ZP_20618335.1| hypothetical protein SEEE1565_11212 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1565]
gi|437004986|ref|ZP_20622216.1| hypothetical protein SEEE1808_08207 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1808]
gi|437022731|ref|ZP_20628680.1| hypothetical protein SEEE1811_18088 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1811]
gi|437027537|ref|ZP_20630426.1| hypothetical protein SEEE0956_03980 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0956]
gi|437042954|ref|ZP_20636467.1| hypothetical protein SEEE1455_11747 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1455]
gi|437050628|ref|ZP_20640773.1| hypothetical protein SEEE1575_10893 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1575]
gi|437061860|ref|ZP_20647226.1| hypothetical protein SEEE1725_21041 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1725]
gi|437066776|ref|ZP_20649838.1| hypothetical protein SEEE1745_11336 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1745]
gi|437073998|ref|ZP_20653440.1| hypothetical protein SEEE1791_06646 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1791]
gi|437083083|ref|ZP_20658826.1| hypothetical protein SEEE1795_11287 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1795]
gi|437097824|ref|ZP_20665279.1| hypothetical protein SEEE6709_21433 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 576709]
gi|437110609|ref|ZP_20667955.1| hypothetical protein SEEE9058_11973 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 635290-58]
gi|437125166|ref|ZP_20673828.1| hypothetical protein SEEE0816_19064 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-16]
gi|437129567|ref|ZP_20676043.1| hypothetical protein SEEE0819_07287 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-19]
gi|437141721|ref|ZP_20683405.1| hypothetical protein SEEE3072_21828 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607307-2]
gi|437146196|ref|ZP_20685985.1| hypothetical protein SEEE3089_11888 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-9]
gi|437153382|ref|ZP_20690488.1| hypothetical protein SEEE9163_11834 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629163]
gi|437159814|ref|ZP_20694212.1| hypothetical protein SEEE151_07897 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE15-1]
gi|437169276|ref|ZP_20699669.1| hypothetical protein SEEEN202_12911 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_N202]
gi|437175803|ref|ZP_20702979.1| hypothetical protein SEEE3991_06974 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_56-3991]
gi|437184528|ref|ZP_20708393.1| hypothetical protein SEEE3618_11799 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_76-3618]
gi|437230846|ref|ZP_20713404.1| hypothetical protein SEEE1831_14693 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 13183-1]
gi|437264772|ref|ZP_20720048.1| hypothetical protein SEEE2490_21876 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_81-2490]
gi|437269369|ref|ZP_20722612.1| hypothetical protein SEEEL909_12269 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SL909]
gi|437277580|ref|ZP_20726939.1| hypothetical protein SEEEL913_11244 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SL913]
gi|437296969|ref|ZP_20732770.1| hypothetical protein SEEE4941_18207 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_69-4941]
gi|437315904|ref|ZP_20737592.1| hypothetical protein SEEE7015_20054 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 638970-15]
gi|437327737|ref|ZP_20740679.1| hypothetical protein SEEE7927_12613 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 17927]
gi|437341805|ref|ZP_20744928.1| hypothetical protein SEEECHS4_11343 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CHS4]
gi|437374540|ref|ZP_20749693.1| hypothetical protein SEEE2558_14716 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 22558]
gi|437417561|ref|ZP_20753980.1| hypothetical protein SEEE2217_11799 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 22-17]
gi|437445804|ref|ZP_20758526.1| hypothetical protein SEEE4018_11985 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 40-18]
gi|437463408|ref|ZP_20763090.1| hypothetical protein SEEE6211_12130 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 561362 1-1]
gi|437481028|ref|ZP_20768733.1| hypothetical protein SEEE4441_18001 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642044 4-1]
gi|437492521|ref|ZP_20771752.1| hypothetical protein SEEE4647_10501 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642046 4-7]
gi|437509479|ref|ZP_20776618.1| hypothetical protein SEEE9845_12727 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648898 4-5]
gi|437532974|ref|ZP_20781077.1| hypothetical protein SEEE9317_12364 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648899 3-17]
gi|437567131|ref|ZP_20787402.1| hypothetical protein SEEE0116_21558 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648900 1-16]
gi|437580528|ref|ZP_20791931.1| hypothetical protein SEEE1117_21442 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 1-17]
gi|437587958|ref|ZP_20793679.1| hypothetical protein SEEE1392_07527 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 39-2]
gi|437605046|ref|ZP_20799225.1| hypothetical protein SEEE0268_12837 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648902 6-8]
gi|437619385|ref|ZP_20803537.1| hypothetical protein SEEE0316_11751 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648903 1-6]
gi|437650036|ref|ZP_20809670.1| hypothetical protein SEEE0436_20284 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648904 3-6]
gi|437665412|ref|ZP_20814563.1| hypothetical protein SEEE1319_21362 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 653049 13-19]
gi|437667642|ref|ZP_20815044.1| hypothetical protein SEEE4481_00554 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642044 8-1]
gi|437699967|ref|ZP_20823554.1| hypothetical protein SEEE6297_20856 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 561362 9-7]
gi|437703543|ref|ZP_20824586.1| hypothetical protein SEEE4220_03079 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 42-20]
gi|437729747|ref|ZP_20830879.1| hypothetical protein SEEE1616_11972 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 16-16]
gi|437739392|ref|ZP_20833139.1| hypothetical protein SEEE2651_00030 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 76-2651]
gi|437808509|ref|ZP_20840214.1| hypothetical protein SEEE3944_11893 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 33944]
gi|437859705|ref|ZP_20847864.1| hypothetical protein SEEE5621_04484 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 6.0562-1]
gi|438043087|ref|ZP_20855824.1| hypothetical protein SEEE5646_18236 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 50-5646]
gi|438091088|ref|ZP_20860818.1| hypothetical protein SEEE2625_16636 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 81-2625]
gi|438101746|ref|ZP_20864573.1| hypothetical protein SEEE1976_12621 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 62-1976]
gi|438116316|ref|ZP_20870835.1| hypothetical protein SEEE3407_21697 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 53-407]
gi|438148765|ref|ZP_20876429.1| hypothetical protein SEEP9120_24664 [Salmonella enterica subsp.
enterica serovar Pullorum str. ATCC 9120]
gi|440764020|ref|ZP_20943054.1| hypothetical protein F434_13678 [Salmonella enterica subsp.
enterica serovar Agona str. SH11G1113]
gi|440770047|ref|ZP_20949001.1| hypothetical protein F514_20402 [Salmonella enterica subsp.
enterica serovar Agona str. SH08SF124]
gi|440772748|ref|ZP_20951651.1| hypothetical protein F515_10158 [Salmonella enterica subsp.
enterica serovar Agona str. SH10GFN094]
gi|445135462|ref|ZP_21383214.1| hypothetical protein SEEG9184_019543 [Salmonella enterica subsp.
enterica serovar Gallinarum str. 9184]
gi|445145351|ref|ZP_21387313.1| hypothetical protein SEEDSL_011492 [Salmonella enterica subsp.
enterica serovar Dublin str. SL1438]
gi|445151226|ref|ZP_21390176.1| hypothetical protein SEEDHWS_015445 [Salmonella enterica subsp.
enterica serovar Dublin str. HWS51]
gi|445171099|ref|ZP_21396010.1| hypothetical protein SEE8A_013659 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE8a]
gi|445197370|ref|ZP_21400766.1| hypothetical protein SE20037_14325 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 20037]
gi|445235282|ref|ZP_21406859.1| hypothetical protein SEE10_013172 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE10]
gi|445241591|ref|ZP_21407709.1| hypothetical protein SEE436_008935 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 436]
gi|445335046|ref|ZP_21415364.1| hypothetical protein SEE18569_012593 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 18569]
gi|445343760|ref|ZP_21417223.1| hypothetical protein SEE13_009218 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 13-1]
gi|445357967|ref|ZP_21422392.1| hypothetical protein SEE23_003283 [Salmonella enterica subsp.
enterica serovar Enteritidis str. PT23]
gi|25512798|pir||AF0875 conserved hypothetical protein STY3229 [imported] - Salmonella
enterica subsp. enterica serovar Typhi (strain CT18)
gi|16504155|emb|CAD02902.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Typhi]
gi|29138974|gb|AAO70542.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Typhi str. Ty2]
gi|56129274|gb|AAV78780.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Paratyphi A str. ATCC 9150]
gi|194403661|gb|ACF63883.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Newport str. SL254]
gi|194407183|gb|ACF67402.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Heidelberg str. SL476]
gi|195633696|gb|EDX52110.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Newport str. SL317]
gi|197095423|emb|CAR60982.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Paratyphi A str. AKU_12601]
gi|197213700|gb|ACH51097.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Agona str. SL483]
gi|197940623|gb|ACH77956.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Dublin str. CT_02021853]
gi|199605554|gb|EDZ04099.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Virchow str. SL491]
gi|205273776|emb|CAR38771.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Gallinarum str. 287/91]
gi|205322305|gb|EDZ10144.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Saintpaul str. SARA29]
gi|205338187|gb|EDZ24951.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Heidelberg str. SL486]
gi|205347035|gb|EDZ33666.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Hadar str. RI_05P066]
gi|206710137|emb|CAR34492.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Enteritidis str. P125109]
gi|326624818|gb|EGE31163.1| hypothetical protein SD3246_3301 [Salmonella enterica subsp.
enterica serovar Dublin str. SD3246]
gi|326629108|gb|EGE35451.1| hypothetical protein SG9_3001 [Salmonella enterica subsp. enterica
serovar Gallinarum str. SG9]
gi|353588068|gb|EHC47208.1| putative regulator [Salmonella enterica subsp. enterica serovar
Hvittingfoss str. A4-620]
gi|357207299|gb|AET55345.1| hypothetical protein SPUL_3073 [Salmonella enterica subsp. enterica
serovar Gallinarum/pullorum str. RKS5078]
gi|374355035|gb|AEZ46796.1| hypothetical protein STBHUCCB_31540 [Salmonella enterica subsp.
enterica serovar Typhi str. P-stx-12]
gi|379984826|emb|CCF89022.1| putative DNA-binding protein [Salmonella enterica subsp. enterica
serovar Senftenberg str. SS209]
gi|381291505|gb|EIC32742.1| hypothetical protein SEEH1579_07445 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41579]
gi|381294103|gb|EIC35243.1| hypothetical protein SEEH1563_13711 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41563]
gi|381298127|gb|EIC39208.1| hypothetical protein SEEH1573_05093 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41573]
gi|381314752|gb|EIC55519.1| hypothetical protein SEEH1565_00320 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41565]
gi|381315310|gb|EIC56073.1| hypothetical protein SEEH1566_20517 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41566]
gi|383799816|gb|AFH46898.1| putative regulator of STY3230-like transporter operon [Salmonella
enterica subsp. enterica serovar Heidelberg str. B182]
gi|392617366|gb|EIW99791.1| hypothetical protein SEENLE15_01500 [Salmonella enterica subsp.
enterica serovar Newport str. Levine 15]
gi|392620936|gb|EIX03302.1| hypothetical protein SEENLE01_09311 [Salmonella enterica subsp.
enterica serovar Newport str. Levine 1]
gi|392734022|gb|EIZ91213.1| hypothetical protein SEEN539_17452 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21539]
gi|392738886|gb|EIZ96026.1| hypothetical protein SEEN199_07998 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 35199]
gi|392741567|gb|EIZ98663.1| hypothetical protein SEEN185_09770 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 35185]
gi|392752777|gb|EJA09717.1| hypothetical protein SEEN953_05756 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 33953]
gi|392755665|gb|EJA12574.1| hypothetical protein SEEN188_20647 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 35188]
gi|392757214|gb|EJA14104.1| hypothetical protein SEEN559_07224 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21559]
gi|392759400|gb|EJA16253.1| hypothetical protein SEEN449_20239 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19449]
gi|392762451|gb|EJA19266.1| hypothetical protein SEEN447_01292 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19447]
gi|392768813|gb|EJA25559.1| hypothetical protein SEEN567_03604 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19567]
gi|392781021|gb|EJA37672.1| hypothetical protein SEEN202_18306 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 35202]
gi|392781383|gb|EJA38024.1| hypothetical protein SEEN513_02176 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 22513]
gi|392782893|gb|EJA39523.1| hypothetical protein SEEN550_06222 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21550]
gi|392786014|gb|EJA42571.1| hypothetical protein SEEN425_18966 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 22425]
gi|392786463|gb|EJA43019.1| hypothetical protein SEEN538_16315 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21538]
gi|392799328|gb|EJA55587.1| hypothetical protein SEEN543_06451 [Salmonella enterica subsp.
enterica serovar Newport str. CVM N1543]
gi|392800211|gb|EJA56449.1| hypothetical protein SEEN486_12749 [Salmonella enterica subsp.
enterica serovar Newport str. CVM N18486]
gi|392804326|gb|EJA60489.1| hypothetical protein SEEN462_25150 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 22462]
gi|392807086|gb|EJA63170.1| hypothetical protein SEEN554_12134 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21554]
gi|392820494|gb|EJA76344.1| hypothetical protein SEEN593_15215 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19593]
gi|392824040|gb|EJA79831.1| hypothetical protein SEEN978_07580 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 37978]
gi|395981224|gb|EJH90446.1| hypothetical protein SEEE3139_21520 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 622731-39]
gi|395981878|gb|EJH91099.1| hypothetical protein SEEE0631_21289 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 640631]
gi|395987892|gb|EJH97054.1| hypothetical protein SEEE0166_12439 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639016-6]
gi|395994322|gb|EJI03398.1| hypothetical protein SEEE0424_20933 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-0424]
gi|395995199|gb|EJI04264.1| hypothetical protein SEEE3076_21681 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607307-6]
gi|395995700|gb|EJI04764.1| hypothetical protein SEEE4917_21961 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 485549-17]
gi|396009210|gb|EJI18143.1| hypothetical protein SEEE6426_20678 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629164-26]
gi|396017029|gb|EJI25895.1| hypothetical protein SEEE6670_06636 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 596866-70]
gi|396018519|gb|EJI27381.1| hypothetical protein SEEE6622_06319 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 596866-22]
gi|396022203|gb|EJI31017.1| hypothetical protein SEEE7246_22078 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639672-46]
gi|396027629|gb|EJI36392.1| hypothetical protein SEEE6437_12385 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629164-37]
gi|396027912|gb|EJI36674.1| hypothetical protein SEEE7250_12053 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639672-50]
gi|396034907|gb|EJI43588.1| hypothetical protein SEEE1427_21380 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-1427]
gi|396042360|gb|EJI50982.1| hypothetical protein SEEE2659_12031 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-2659]
gi|396043909|gb|EJI52507.1| hypothetical protein SEEE1757_06085 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 78-1757]
gi|396048544|gb|EJI57093.1| hypothetical protein SEEE5101_21902 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 22510-1]
gi|396054778|gb|EJI63270.1| hypothetical protein SEEE8B1_11501 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 8b-1]
gi|396056030|gb|EJI64506.1| hypothetical protein SEEE5518_11701 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648905 5-18]
gi|396068175|gb|EJI76523.1| hypothetical protein SEEE1618_11183 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 6-18]
gi|396069532|gb|EJI77870.1| hypothetical protein SEEE3079_07636 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 50-3079]
gi|396070668|gb|EJI78996.1| hypothetical protein SEEE6482_12614 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 58-6482]
gi|402515025|gb|EJW22440.1| putative regulator of STY3230-like transporter operon [Salmonella
enterica subsp. enterica serovar Heidelberg str.
CFSAN00322]
gi|402516812|gb|EJW24220.1| putative regulator of STY3230-like transporter operon [Salmonella
enterica subsp. enterica serovar Heidelberg str.
CFSAN00325]
gi|402521637|gb|EJW28971.1| putative regulator of STY3230-like transporter operon [Salmonella
enterica subsp. enterica serovar Heidelberg str.
CFSAN00326]
gi|402532204|gb|EJW39401.1| putative regulator of STY3230-like transporter operon [Salmonella
enterica subsp. enterica serovar Heidelberg str.
CFSAN00328]
gi|434938183|gb|ELL45198.1| hypothetical protein SEEP9120_24664 [Salmonella enterica subsp.
enterica serovar Pullorum str. ATCC 9120]
gi|434958281|gb|ELL51850.1| hypothetical protein SEE30663_24765 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE30663]
gi|434959760|gb|ELL53206.1| hypothetical protein SEECHS44_12089 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CHS44]
gi|434968373|gb|ELL61125.1| hypothetical protein SEEE1882_06541 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1882]
gi|434970852|gb|ELL63413.1| hypothetical protein SEEE1884_21644 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1884]
gi|434971476|gb|ELL63985.1| hypothetical protein SEE22704_08801 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 22704]
gi|434981130|gb|ELL73017.1| hypothetical protein SEEE1594_08200 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1594]
gi|434984467|gb|ELL76207.1| hypothetical protein SEEE1566_17396 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1566]
gi|434985534|gb|ELL77221.1| hypothetical protein SEEE1580_20764 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1580]
gi|434992833|gb|ELL84272.1| hypothetical protein SEEE1543_21249 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1543]
gi|434999883|gb|ELL91057.1| hypothetical protein SEEE1441_14200 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1441]
gi|435005147|gb|ELL96069.1| hypothetical protein SEEE1810_12560 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1810]
gi|435005780|gb|ELL96700.1| hypothetical protein SEEE1558_21566 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1558]
gi|435012577|gb|ELM03252.1| hypothetical protein SEEE1018_21448 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1018]
gi|435019383|gb|ELM09827.1| hypothetical protein SEEE1010_11755 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1010]
gi|435023045|gb|ELM13341.1| hypothetical protein SEEE1729_21533 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1729]
gi|435029497|gb|ELM19555.1| hypothetical protein SEEE0895_11953 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0895]
gi|435033644|gb|ELM23536.1| hypothetical protein SEEE0899_17936 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0899]
gi|435033957|gb|ELM23847.1| hypothetical protein SEEE1457_20859 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1457]
gi|435035578|gb|ELM25423.1| hypothetical protein SEEE1747_21523 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1747]
gi|435045845|gb|ELM35471.1| hypothetical protein SEEE0968_21519 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0968]
gi|435046611|gb|ELM36226.1| hypothetical protein SEEE1444_21539 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1444]
gi|435058725|gb|ELM48032.1| hypothetical protein SEEE1445_06358 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1445]
gi|435065219|gb|ELM54325.1| hypothetical protein SEEE1565_11212 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1565]
gi|435068605|gb|ELM57633.1| hypothetical protein SEEE1559_08043 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1559]
gi|435072277|gb|ELM61206.1| hypothetical protein SEEE1808_08207 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1808]
gi|435076668|gb|ELM65451.1| hypothetical protein SEEE1811_18088 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1811]
gi|435083604|gb|ELM72205.1| hypothetical protein SEEE1455_11747 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1455]
gi|435085596|gb|ELM74149.1| hypothetical protein SEEE0956_03980 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0956]
gi|435088344|gb|ELM76801.1| hypothetical protein SEEE1725_21041 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1725]
gi|435093332|gb|ELM81672.1| hypothetical protein SEEE1575_10893 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1575]
gi|435097582|gb|ELM85841.1| hypothetical protein SEEE1745_11336 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1745]
gi|435106468|gb|ELM94485.1| hypothetical protein SEEE6709_21433 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 576709]
gi|435107799|gb|ELM95782.1| hypothetical protein SEEE1791_06646 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1791]
gi|435108656|gb|ELM96621.1| hypothetical protein SEEE1795_11287 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1795]
gi|435118511|gb|ELN06163.1| hypothetical protein SEEE0816_19064 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-16]
gi|435118859|gb|ELN06510.1| hypothetical protein SEEE9058_11973 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 635290-58]
gi|435126787|gb|ELN14181.1| hypothetical protein SEEE0819_07287 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-19]
gi|435127889|gb|ELN15249.1| hypothetical protein SEEE3072_21828 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607307-2]
gi|435136441|gb|ELN23531.1| hypothetical protein SEEE3089_11888 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-9]
gi|435141133|gb|ELN28075.1| hypothetical protein SEEE9163_11834 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629163]
gi|435148593|gb|ELN35309.1| hypothetical protein SEEE151_07897 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE15-1]
gi|435149004|gb|ELN35718.1| hypothetical protein SEEEN202_12911 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_N202]
gi|435156474|gb|ELN42964.1| hypothetical protein SEEE3991_06974 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_56-3991]
gi|435159779|gb|ELN46097.1| hypothetical protein SEEE2490_21876 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_81-2490]
gi|435161139|gb|ELN47381.1| hypothetical protein SEEE3618_11799 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_76-3618]
gi|435172316|gb|ELN57859.1| hypothetical protein SEEEL909_12269 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SL909]
gi|435172976|gb|ELN58501.1| hypothetical protein SEEEL913_11244 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SL913]
gi|435179396|gb|ELN64546.1| hypothetical protein SEEE4941_18207 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_69-4941]
gi|435180380|gb|ELN65488.1| hypothetical protein SEEE7015_20054 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 638970-15]
gi|435191918|gb|ELN76474.1| hypothetical protein SEEE7927_12613 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 17927]
gi|435193471|gb|ELN77950.1| hypothetical protein SEEECHS4_11343 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CHS4]
gi|435197616|gb|ELN81898.1| hypothetical protein SEEE1831_14693 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 13183-1]
gi|435202196|gb|ELN86050.1| hypothetical protein SEEE2217_11799 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 22-17]
gi|435205311|gb|ELN88912.1| hypothetical protein SEEE2558_14716 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 22558]
gi|435210193|gb|ELN93464.1| hypothetical protein SEEE4018_11985 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 40-18]
gi|435218204|gb|ELO00611.1| hypothetical protein SEEE4441_18001 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642044 4-1]
gi|435218685|gb|ELO01086.1| hypothetical protein SEEE6211_12130 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 561362 1-1]
gi|435228813|gb|ELO10236.1| hypothetical protein SEEE4647_10501 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642046 4-7]
gi|435232762|gb|ELO13851.1| hypothetical protein SEEE9845_12727 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648898 4-5]
gi|435234871|gb|ELO15724.1| hypothetical protein SEEE0116_21558 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648900 1-16]
gi|435240779|gb|ELO21169.1| hypothetical protein SEEE1117_21442 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 1-17]
gi|435242523|gb|ELO22828.1| hypothetical protein SEEE9317_12364 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648899 3-17]
gi|435256989|gb|ELO36283.1| hypothetical protein SEEE0268_12837 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648902 6-8]
gi|435258421|gb|ELO37685.1| hypothetical protein SEEE1392_07527 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 39-2]
gi|435258665|gb|ELO37925.1| hypothetical protein SEEE0316_11751 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648903 1-6]
gi|435264999|gb|ELO43884.1| hypothetical protein SEEE1319_21362 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 653049 13-19]
gi|435268077|gb|ELO46698.1| hypothetical protein SEEE0436_20284 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648904 3-6]
gi|435274028|gb|ELO52152.1| hypothetical protein SEEE6297_20856 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 561362 9-7]
gi|435283596|gb|ELO61135.1| hypothetical protein SEEE4481_00554 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642044 8-1]
gi|435289859|gb|ELO66809.1| hypothetical protein SEEE1616_11972 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 16-16]
gi|435293633|gb|ELO70325.1| hypothetical protein SEEE4220_03079 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 42-20]
gi|435300177|gb|ELO76272.1| hypothetical protein SEEE3944_11893 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 33944]
gi|435313886|gb|ELO87409.1| hypothetical protein SEEE2651_00030 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 76-2651]
gi|435316136|gb|ELO89333.1| hypothetical protein SEEE2625_16636 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 81-2625]
gi|435321599|gb|ELO94013.1| hypothetical protein SEEE5646_18236 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 50-5646]
gi|435324429|gb|ELO96362.1| hypothetical protein SEEE1976_12621 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 62-1976]
gi|435327831|gb|ELO99482.1| hypothetical protein SEEE3407_21697 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 53-407]
gi|435336637|gb|ELP06491.1| hypothetical protein SEEE5621_04484 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 6.0562-1]
gi|436412617|gb|ELP10556.1| hypothetical protein F514_20402 [Salmonella enterica subsp.
enterica serovar Agona str. SH08SF124]
gi|436417730|gb|ELP15618.1| hypothetical protein F434_13678 [Salmonella enterica subsp.
enterica serovar Agona str. SH11G1113]
gi|436417905|gb|ELP15792.1| hypothetical protein F515_10158 [Salmonella enterica subsp.
enterica serovar Agona str. SH10GFN094]
gi|444845663|gb|ELX70851.1| hypothetical protein SEEG9184_019543 [Salmonella enterica subsp.
enterica serovar Gallinarum str. 9184]
gi|444846124|gb|ELX71305.1| hypothetical protein SEEDSL_011492 [Salmonella enterica subsp.
enterica serovar Dublin str. SL1438]
gi|444856126|gb|ELX81164.1| hypothetical protein SEEDHWS_015445 [Salmonella enterica subsp.
enterica serovar Dublin str. HWS51]
gi|444860033|gb|ELX84963.1| hypothetical protein SEE10_013172 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE10]
gi|444861769|gb|ELX86642.1| hypothetical protein SEE8A_013659 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE8a]
gi|444863858|gb|ELX88673.1| hypothetical protein SE20037_14325 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 20037]
gi|444874627|gb|ELX98862.1| hypothetical protein SEE18569_012593 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 18569]
gi|444880920|gb|ELY04982.1| hypothetical protein SEE13_009218 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 13-1]
gi|444886421|gb|ELY10178.1| hypothetical protein SEE23_003283 [Salmonella enterica subsp.
enterica serovar Enteritidis str. PT23]
gi|444890969|gb|ELY14258.1| hypothetical protein SEE436_008935 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 436]
Length = 141
Score = 46.2 bits (108), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 58/107 (54%), Gaps = 10/107 (9%)
Query: 162 GEDVSSKIMSF-SQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F QN A I G++++V LR A T + G FE++SL+G+
Sbjct: 21 GQEVFSQLHAFVQQNQLHAAWIAGCTGSLTDVALRYAGQEA-TTSLTGTFEVISLNGTL- 78
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
E +G+ L +++S P G +LGG + T T +++V+G
Sbjct: 79 --ELTGEH-----LHLAVSDPYGAMLGGHMMPGCTVRTTLELVIGEL 118
>gi|227888476|ref|ZP_04006281.1| DNA-binding protein [Escherichia coli 83972]
gi|386640411|ref|YP_006107209.1| putative DNA-binding protein containing a PD1-like DNA-binding
motif [Escherichia coli ABU 83972]
gi|432413049|ref|ZP_19655708.1| hypothetical protein WG9_03546 [Escherichia coli KTE39]
gi|432496941|ref|ZP_19738736.1| hypothetical protein A173_04118 [Escherichia coli KTE214]
gi|227834745|gb|EEJ45211.1| DNA-binding protein [Escherichia coli 83972]
gi|307554903|gb|ADN47678.1| putative DNA-binding protein containing a PD1-like DNA-binding
motif [Escherichia coli ABU 83972]
gi|430934224|gb|ELC54597.1| hypothetical protein WG9_03546 [Escherichia coli KTE39]
gi|431022634|gb|ELD35895.1| hypothetical protein A173_04118 [Escherichia coli KTE214]
Length = 142
Score = 46.2 bits (108), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 61/115 (53%), Gaps = 12/115 (10%)
Query: 162 GEDVSSKIMSFSQNGP-RAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F+Q A I G++++V LR A T G+FE+++L+G+
Sbjct: 22 GQEVLSQLRAFAQQQQLYAAWIAGCTGSLTDVALRYAGQEN-TALLSGKFEVIALNGTL- 79
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E SG+ L + +S P G +LGG + T T +++V+G LA R+
Sbjct: 80 --EQSGEH-----LHLCVSDPHGTMLGGHMMPGCTVRTTLELVIGCLEELAFSRQ 127
>gi|416426450|ref|ZP_11692945.1| hypothetical protein SEEM315_07060 [Salmonella enterica subsp.
enterica serovar Montevideo str. 315996572]
gi|416429023|ref|ZP_11694236.1| hypothetical protein SEEM971_19784 [Salmonella enterica subsp.
enterica serovar Montevideo str. 495297-1]
gi|416439076|ref|ZP_11699953.1| hypothetical protein SEEM973_19905 [Salmonella enterica subsp.
enterica serovar Montevideo str. 495297-3]
gi|416446090|ref|ZP_11704845.1| hypothetical protein SEEM974_21390 [Salmonella enterica subsp.
enterica serovar Montevideo str. 495297-4]
gi|416451482|ref|ZP_11708232.1| hypothetical protein SEEM201_12480 [Salmonella enterica subsp.
enterica serovar Montevideo str. 515920-1]
gi|416459938|ref|ZP_11714383.1| hypothetical protein SEEM202_13588 [Salmonella enterica subsp.
enterica serovar Montevideo str. 515920-2]
gi|416471942|ref|ZP_11719473.1| hypothetical protein SEEM954_11717 [Salmonella enterica subsp.
enterica serovar Montevideo str. 531954]
gi|416474271|ref|ZP_11720122.1| hypothetical protein SEEM054_10552 [Salmonella enterica subsp.
enterica serovar Montevideo str. NC_MB110209-0054]
gi|416492955|ref|ZP_11727742.1| hypothetical protein SEEM675_04541 [Salmonella enterica subsp.
enterica serovar Montevideo str. OH_2009072675]
gi|416500936|ref|ZP_11731798.1| hypothetical protein SEEM965_22191 [Salmonella enterica subsp.
enterica serovar Montevideo str. CASC_09SCPH15965]
gi|416546811|ref|ZP_11754205.1| hypothetical protein SEEM19N_18306 [Salmonella enterica subsp.
enterica serovar Montevideo str. 19N]
gi|416577742|ref|ZP_11770028.1| hypothetical protein SEEM801_04606 [Salmonella enterica subsp.
enterica serovar Montevideo str. 81038-01]
gi|416583980|ref|ZP_11773720.1| hypothetical protein SEEM507_09971 [Salmonella enterica subsp.
enterica serovar Montevideo str. MD_MDA09249507]
gi|416591684|ref|ZP_11778628.1| hypothetical protein SEEM877_00810 [Salmonella enterica subsp.
enterica serovar Montevideo str. 414877]
gi|416598268|ref|ZP_11782655.1| hypothetical protein SEEM867_02002 [Salmonella enterica subsp.
enterica serovar Montevideo str. 366867]
gi|416606784|ref|ZP_11788025.1| hypothetical protein SEEM180_20494 [Salmonella enterica subsp.
enterica serovar Montevideo str. 413180]
gi|416610619|ref|ZP_11790226.1| hypothetical protein SEEM600_15891 [Salmonella enterica subsp.
enterica serovar Montevideo str. 446600]
gi|416620269|ref|ZP_11795627.1| hypothetical protein SEEM581_15712 [Salmonella enterica subsp.
enterica serovar Montevideo str. 609458-1]
gi|416634758|ref|ZP_11802738.1| hypothetical protein SEEM501_20788 [Salmonella enterica subsp.
enterica serovar Montevideo str. 556150-1]
gi|416641842|ref|ZP_11805661.1| hypothetical protein SEEM460_18305 [Salmonella enterica subsp.
enterica serovar Montevideo str. 609460]
gi|416647146|ref|ZP_11808145.1| hypothetical protein SEEM020_011804 [Salmonella enterica subsp.
enterica serovar Montevideo str. 507440-20]
gi|416657039|ref|ZP_11813495.1| hypothetical protein SEEM6152_10213 [Salmonella enterica subsp.
enterica serovar Montevideo str. 556152]
gi|416670223|ref|ZP_11819937.1| hypothetical protein SEEM0077_03189 [Salmonella enterica subsp.
enterica serovar Montevideo str. MB101509-0077]
gi|416675075|ref|ZP_11821398.1| hypothetical protein SEEM0047_09445 [Salmonella enterica subsp.
enterica serovar Montevideo str. MB102109-0047]
gi|416696751|ref|ZP_11828003.1| hypothetical protein SEEM0055_02380 [Salmonella enterica subsp.
enterica serovar Montevideo str. MB110209-0055]
gi|416706037|ref|ZP_11831296.1| hypothetical protein SEEM0052_20019 [Salmonella enterica subsp.
enterica serovar Montevideo str. MB111609-0052]
gi|416712284|ref|ZP_11835995.1| hypothetical protein SEEM3312_05973 [Salmonella enterica subsp.
enterica serovar Montevideo str. 2009083312]
gi|416718480|ref|ZP_11840588.1| hypothetical protein SEEM5258_05850 [Salmonella enterica subsp.
enterica serovar Montevideo str. 2009085258]
gi|416723165|ref|ZP_11843930.1| hypothetical protein SEEM1156_01357 [Salmonella enterica subsp.
enterica serovar Montevideo str. 315731156]
gi|416733152|ref|ZP_11850243.1| hypothetical protein SEEM9199_21415 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2009159199]
gi|416737593|ref|ZP_11852746.1| hypothetical protein SEEM8282_09816 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008282]
gi|416748603|ref|ZP_11858860.1| hypothetical protein SEEM8283_12061 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008283]
gi|416754707|ref|ZP_11861499.1| hypothetical protein SEEM8284_01976 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008284]
gi|416761637|ref|ZP_11865688.1| hypothetical protein SEEM8285_21552 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008285]
gi|416771234|ref|ZP_11872499.1| hypothetical protein SEEM8287_10852 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008287]
gi|418481855|ref|ZP_13050878.1| hypothetical protein SEEM906_07811 [Salmonella enterica subsp.
enterica serovar Montevideo str. 80959-06]
gi|418491243|ref|ZP_13057769.1| hypothetical protein SEEM5278_17706 [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035278]
gi|418495839|ref|ZP_13062277.1| hypothetical protein SEEM5318_17256 [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035318]
gi|418498655|ref|ZP_13065069.1| hypothetical protein SEEM5320_08575 [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035320]
gi|418505573|ref|ZP_13071919.1| hypothetical protein SEEM5321_19264 [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035321]
gi|418509923|ref|ZP_13076214.1| hypothetical protein SEEM5327_15644 [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035327]
gi|418524615|ref|ZP_13090600.1| hypothetical protein SEEM8286_16956 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008286]
gi|322613470|gb|EFY10411.1| hypothetical protein SEEM315_07060 [Salmonella enterica subsp.
enterica serovar Montevideo str. 315996572]
gi|322621062|gb|EFY17920.1| hypothetical protein SEEM971_19784 [Salmonella enterica subsp.
enterica serovar Montevideo str. 495297-1]
gi|322624126|gb|EFY20960.1| hypothetical protein SEEM973_19905 [Salmonella enterica subsp.
enterica serovar Montevideo str. 495297-3]
gi|322628135|gb|EFY24924.1| hypothetical protein SEEM974_21390 [Salmonella enterica subsp.
enterica serovar Montevideo str. 495297-4]
gi|322633254|gb|EFY29996.1| hypothetical protein SEEM201_12480 [Salmonella enterica subsp.
enterica serovar Montevideo str. 515920-1]
gi|322636168|gb|EFY32876.1| hypothetical protein SEEM202_13588 [Salmonella enterica subsp.
enterica serovar Montevideo str. 515920-2]
gi|322639506|gb|EFY36194.1| hypothetical protein SEEM954_11717 [Salmonella enterica subsp.
enterica serovar Montevideo str. 531954]
gi|322647561|gb|EFY44050.1| hypothetical protein SEEM054_10552 [Salmonella enterica subsp.
enterica serovar Montevideo str. NC_MB110209-0054]
gi|322648745|gb|EFY45192.1| hypothetical protein SEEM675_04541 [Salmonella enterica subsp.
enterica serovar Montevideo str. OH_2009072675]
gi|322653800|gb|EFY50126.1| hypothetical protein SEEM965_22191 [Salmonella enterica subsp.
enterica serovar Montevideo str. CASC_09SCPH15965]
gi|322657906|gb|EFY54174.1| hypothetical protein SEEM19N_18306 [Salmonella enterica subsp.
enterica serovar Montevideo str. 19N]
gi|322664009|gb|EFY60208.1| hypothetical protein SEEM801_04606 [Salmonella enterica subsp.
enterica serovar Montevideo str. 81038-01]
gi|322668980|gb|EFY65131.1| hypothetical protein SEEM507_09971 [Salmonella enterica subsp.
enterica serovar Montevideo str. MD_MDA09249507]
gi|322673026|gb|EFY69133.1| hypothetical protein SEEM877_00810 [Salmonella enterica subsp.
enterica serovar Montevideo str. 414877]
gi|322677983|gb|EFY74046.1| hypothetical protein SEEM867_02002 [Salmonella enterica subsp.
enterica serovar Montevideo str. 366867]
gi|322681159|gb|EFY77192.1| hypothetical protein SEEM180_20494 [Salmonella enterica subsp.
enterica serovar Montevideo str. 413180]
gi|322687911|gb|EFY83878.1| hypothetical protein SEEM600_15891 [Salmonella enterica subsp.
enterica serovar Montevideo str. 446600]
gi|323194893|gb|EFZ80080.1| hypothetical protein SEEM581_15712 [Salmonella enterica subsp.
enterica serovar Montevideo str. 609458-1]
gi|323196644|gb|EFZ81792.1| hypothetical protein SEEM501_20788 [Salmonella enterica subsp.
enterica serovar Montevideo str. 556150-1]
gi|323202656|gb|EFZ87696.1| hypothetical protein SEEM460_18305 [Salmonella enterica subsp.
enterica serovar Montevideo str. 609460]
gi|323212591|gb|EFZ97408.1| hypothetical protein SEEM6152_10213 [Salmonella enterica subsp.
enterica serovar Montevideo str. 556152]
gi|323214926|gb|EFZ99674.1| hypothetical protein SEEM0077_03189 [Salmonella enterica subsp.
enterica serovar Montevideo str. MB101509-0077]
gi|323222656|gb|EGA07021.1| hypothetical protein SEEM0047_09445 [Salmonella enterica subsp.
enterica serovar Montevideo str. MB102109-0047]
gi|323225064|gb|EGA09316.1| hypothetical protein SEEM0055_02380 [Salmonella enterica subsp.
enterica serovar Montevideo str. MB110209-0055]
gi|323230586|gb|EGA14704.1| hypothetical protein SEEM0052_20019 [Salmonella enterica subsp.
enterica serovar Montevideo str. MB111609-0052]
gi|323235063|gb|EGA19149.1| hypothetical protein SEEM3312_05973 [Salmonella enterica subsp.
enterica serovar Montevideo str. 2009083312]
gi|323239102|gb|EGA23152.1| hypothetical protein SEEM5258_05850 [Salmonella enterica subsp.
enterica serovar Montevideo str. 2009085258]
gi|323244540|gb|EGA28546.1| hypothetical protein SEEM1156_01357 [Salmonella enterica subsp.
enterica serovar Montevideo str. 315731156]
gi|323247155|gb|EGA31121.1| hypothetical protein SEEM9199_21415 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2009159199]
gi|323253362|gb|EGA37191.1| hypothetical protein SEEM8282_09816 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008282]
gi|323256331|gb|EGA40067.1| hypothetical protein SEEM8283_12061 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008283]
gi|323262493|gb|EGA46049.1| hypothetical protein SEEM8284_01976 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008284]
gi|323267411|gb|EGA50895.1| hypothetical protein SEEM8285_21552 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008285]
gi|323269185|gb|EGA52640.1| hypothetical protein SEEM8287_10852 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008287]
gi|366058355|gb|EHN22644.1| hypothetical protein SEEM5318_17256 [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035318]
gi|366062942|gb|EHN27164.1| hypothetical protein SEEM5278_17706 [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035278]
gi|366064588|gb|EHN28785.1| hypothetical protein SEEM906_07811 [Salmonella enterica subsp.
enterica serovar Montevideo str. 80959-06]
gi|366067880|gb|EHN32028.1| hypothetical protein SEEM5321_19264 [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035321]
gi|366073407|gb|EHN37480.1| hypothetical protein SEEM5320_08575 [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035320]
gi|366077523|gb|EHN41537.1| hypothetical protein SEEM5327_15644 [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035327]
gi|366830589|gb|EHN57459.1| hypothetical protein SEEM020_011804 [Salmonella enterica subsp.
enterica serovar Montevideo str. 507440-20]
gi|372207474|gb|EHP20973.1| hypothetical protein SEEM8286_16956 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008286]
Length = 141
Score = 46.2 bits (108), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 58/107 (54%), Gaps = 10/107 (9%)
Query: 162 GEDVSSKIMSF-SQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F QN A I G++++V LR A T + G FE++SL+G+
Sbjct: 21 GQEVFSQLHAFVQQNQLHAAWIAGCTGSLADVALRYAG-QDATTSLTGTFEVISLNGTL- 78
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
E +G+ L +++S P G +LGG + T T +++V+G
Sbjct: 79 --ELTGEH-----LHLAVSDPYGAMLGGHMMPGCTVRTTLELVIGEL 118
>gi|168236139|ref|ZP_02661197.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Schwarzengrund str. SL480]
gi|194736305|ref|YP_002116019.1| hypothetical protein SeSA_A3244 [Salmonella enterica subsp.
enterica serovar Schwarzengrund str. CVM19633]
gi|204928148|ref|ZP_03219348.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Javiana str. GA_MM04042433]
gi|375002824|ref|ZP_09727164.1| hypothetical protein SEENIN0B_03185 [Salmonella enterica subsp.
enterica serovar Infantis str. SARB27]
gi|416504089|ref|ZP_11733036.1| hypothetical protein SEEM031_11220 [Salmonella enterica subsp.
enterica serovar Montevideo str. SARB31]
gi|416515628|ref|ZP_11738755.1| hypothetical protein SEEM710_02731 [Salmonella enterica subsp.
enterica serovar Montevideo str. ATCC BAA710]
gi|416557856|ref|ZP_11759836.1| hypothetical protein SEEM42N_00420 [Salmonella enterica subsp.
enterica serovar Montevideo str. 42N]
gi|418512410|ref|ZP_13078653.1| hypothetical protein SEEPO729_05791 [Salmonella enterica subsp.
enterica serovar Pomona str. ATCC 10729]
gi|437821381|ref|ZP_20843330.1| hypothetical protein SEEERB17_014318 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SARB17]
gi|452123112|ref|YP_007473360.1| hypothetical protein CFSAN001992_18210 [Salmonella enterica subsp.
enterica serovar Javiana str. CFSAN001992]
gi|194711807|gb|ACF91028.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Schwarzengrund str. CVM19633]
gi|197290875|gb|EDY30229.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Schwarzengrund str. SL480]
gi|204322470|gb|EDZ07667.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Javiana str. GA_MM04042433]
gi|353077512|gb|EHB43272.1| hypothetical protein SEENIN0B_03185 [Salmonella enterica subsp.
enterica serovar Infantis str. SARB27]
gi|363558465|gb|EHL42656.1| hypothetical protein SEEM031_11220 [Salmonella enterica subsp.
enterica serovar Montevideo str. SARB31]
gi|363563659|gb|EHL47726.1| hypothetical protein SEEM710_02731 [Salmonella enterica subsp.
enterica serovar Montevideo str. ATCC BAA710]
gi|363578066|gb|EHL61883.1| hypothetical protein SEEM42N_00420 [Salmonella enterica subsp.
enterica serovar Montevideo str. 42N]
gi|366083917|gb|EHN47833.1| hypothetical protein SEEPO729_05791 [Salmonella enterica subsp.
enterica serovar Pomona str. ATCC 10729]
gi|435306854|gb|ELO82083.1| hypothetical protein SEEERB17_014318 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SARB17]
gi|451912116|gb|AGF83922.1| hypothetical protein CFSAN001992_18210 [Salmonella enterica subsp.
enterica serovar Javiana str. CFSAN001992]
Length = 141
Score = 46.2 bits (108), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 58/107 (54%), Gaps = 10/107 (9%)
Query: 162 GEDVSSKIMSF-SQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F QN A I G++++V LR A T + G FE++SL+G+
Sbjct: 21 GQEVFSQLHAFVQQNQLHAAWIAGCTGSLADVALRYAGQEA-TTSLTGTFEVISLNGTL- 78
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
E +G+ L +++S P G +LGG + T T +++V+G
Sbjct: 79 --ELTGEH-----LHLAVSDPYGAMLGGHMMPGCTVRTTLELVIGEL 118
>gi|170680119|ref|YP_001745085.1| hypothetical protein EcSMS35_3065 [Escherichia coli SMS-3-5]
gi|218706436|ref|YP_002413955.1| hypothetical protein ECUMN_3273 [Escherichia coli UMN026]
gi|293406431|ref|ZP_06650357.1| conserved hypothetical protein [Escherichia coli FVEC1412]
gi|298382167|ref|ZP_06991764.1| hypothetical protein ECFG_01912 [Escherichia coli FVEC1302]
gi|300896171|ref|ZP_07114720.1| conserved hypothetical protein [Escherichia coli MS 198-1]
gi|331664504|ref|ZP_08365410.1| conserved hypothetical protein [Escherichia coli TA143]
gi|417309393|ref|ZP_12096231.1| putative DNA-binding protein [Escherichia coli PCN033]
gi|419934731|ref|ZP_14451833.1| hypothetical protein EC5761_13342 [Escherichia coli 576-1]
gi|422331947|ref|ZP_16412962.1| hypothetical protein HMPREF0986_01456 [Escherichia coli 4_1_47FAA]
gi|432354850|ref|ZP_19598119.1| hypothetical protein WCA_03842 [Escherichia coli KTE2]
gi|432403202|ref|ZP_19645950.1| hypothetical protein WEK_03407 [Escherichia coli KTE26]
gi|432427472|ref|ZP_19669963.1| hypothetical protein A139_02873 [Escherichia coli KTE181]
gi|432461934|ref|ZP_19704076.1| hypothetical protein A15I_02809 [Escherichia coli KTE204]
gi|432477169|ref|ZP_19719161.1| hypothetical protein A15Q_03370 [Escherichia coli KTE208]
gi|432519072|ref|ZP_19756254.1| hypothetical protein A17U_02048 [Escherichia coli KTE228]
gi|432539200|ref|ZP_19776097.1| hypothetical protein A195_02831 [Escherichia coli KTE235]
gi|432632700|ref|ZP_19868622.1| hypothetical protein A1UW_03087 [Escherichia coli KTE80]
gi|432642410|ref|ZP_19878238.1| hypothetical protein A1W1_03287 [Escherichia coli KTE83]
gi|432667402|ref|ZP_19902979.1| hypothetical protein A1Y3_04019 [Escherichia coli KTE116]
gi|432771856|ref|ZP_20006176.1| hypothetical protein A1S9_04654 [Escherichia coli KTE50]
gi|432775989|ref|ZP_20010254.1| hypothetical protein A1SG_04081 [Escherichia coli KTE54]
gi|432914243|ref|ZP_20119783.1| hypothetical protein A13Q_03416 [Escherichia coli KTE190]
gi|432963277|ref|ZP_20152696.1| hypothetical protein A15E_03634 [Escherichia coli KTE202]
gi|433020023|ref|ZP_20208195.1| hypothetical protein WI7_03023 [Escherichia coli KTE105]
gi|433054581|ref|ZP_20241749.1| hypothetical protein WIK_03387 [Escherichia coli KTE122]
gi|433064344|ref|ZP_20251257.1| hypothetical protein WIO_03170 [Escherichia coli KTE125]
gi|433069229|ref|ZP_20256007.1| hypothetical protein WIQ_03113 [Escherichia coli KTE128]
gi|433160006|ref|ZP_20344836.1| hypothetical protein WKU_03088 [Escherichia coli KTE177]
gi|433179768|ref|ZP_20364158.1| hypothetical protein WGM_03414 [Escherichia coli KTE82]
gi|170517837|gb|ACB16015.1| conserved hypothetical protein [Escherichia coli SMS-3-5]
gi|218433533|emb|CAR14436.1| conserved hypothetical protein with PD1-like DNA-binding motif
[Escherichia coli UMN026]
gi|291426437|gb|EFE99469.1| conserved hypothetical protein [Escherichia coli FVEC1412]
gi|298277307|gb|EFI18823.1| hypothetical protein ECFG_01912 [Escherichia coli FVEC1302]
gi|300359905|gb|EFJ75775.1| conserved hypothetical protein [Escherichia coli MS 198-1]
gi|331058435|gb|EGI30416.1| conserved hypothetical protein [Escherichia coli TA143]
gi|338769054|gb|EGP23836.1| putative DNA-binding protein [Escherichia coli PCN033]
gi|373247162|gb|EHP66609.1| hypothetical protein HMPREF0986_01456 [Escherichia coli 4_1_47FAA]
gi|388406958|gb|EIL67335.1| hypothetical protein EC5761_13342 [Escherichia coli 576-1]
gi|430873758|gb|ELB97324.1| hypothetical protein WCA_03842 [Escherichia coli KTE2]
gi|430924361|gb|ELC45082.1| hypothetical protein WEK_03407 [Escherichia coli KTE26]
gi|430953998|gb|ELC72885.1| hypothetical protein A139_02873 [Escherichia coli KTE181]
gi|430987907|gb|ELD04430.1| hypothetical protein A15I_02809 [Escherichia coli KTE204]
gi|431003298|gb|ELD18784.1| hypothetical protein A15Q_03370 [Escherichia coli KTE208]
gi|431049469|gb|ELD59431.1| hypothetical protein A17U_02048 [Escherichia coli KTE228]
gi|431067986|gb|ELD76495.1| hypothetical protein A195_02831 [Escherichia coli KTE235]
gi|431168783|gb|ELE69021.1| hypothetical protein A1UW_03087 [Escherichia coli KTE80]
gi|431179942|gb|ELE79833.1| hypothetical protein A1W1_03287 [Escherichia coli KTE83]
gi|431199542|gb|ELE98294.1| hypothetical protein A1Y3_04019 [Escherichia coli KTE116]
gi|431313269|gb|ELG01244.1| hypothetical protein A1S9_04654 [Escherichia coli KTE50]
gi|431316740|gb|ELG04540.1| hypothetical protein A1SG_04081 [Escherichia coli KTE54]
gi|431437774|gb|ELH19282.1| hypothetical protein A13Q_03416 [Escherichia coli KTE190]
gi|431471852|gb|ELH51744.1| hypothetical protein A15E_03634 [Escherichia coli KTE202]
gi|431529047|gb|ELI05751.1| hypothetical protein WI7_03023 [Escherichia coli KTE105]
gi|431568289|gb|ELI41277.1| hypothetical protein WIK_03387 [Escherichia coli KTE122]
gi|431579660|gb|ELI52240.1| hypothetical protein WIO_03170 [Escherichia coli KTE125]
gi|431581289|gb|ELI53742.1| hypothetical protein WIQ_03113 [Escherichia coli KTE128]
gi|431675941|gb|ELJ42067.1| hypothetical protein WKU_03088 [Escherichia coli KTE177]
gi|431699258|gb|ELJ64265.1| hypothetical protein WGM_03414 [Escherichia coli KTE82]
Length = 143
Score = 46.2 bits (108), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 61/115 (53%), Gaps = 12/115 (10%)
Query: 162 GEDVSSKIMSFSQNGP-RAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F+Q A I G++++V LR T G+FE+++L+G+
Sbjct: 22 GQEVLSQLRAFAQQQQLHAAWIAGCTGSLTDVALRYGGQEN-TALLSGKFEVIALNGTL- 79
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E SG+ L + +S P G +LGG + T T +++V+GS LA R+
Sbjct: 80 --EQSGEH-----LHLCVSDPHGTMLGGHMMPGCTVRTTLELVIGSLEELAFSRQ 127
>gi|168234313|ref|ZP_02659371.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Kentucky str. CDC 191]
gi|194470425|ref|ZP_03076409.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Kentucky str. CVM29188]
gi|194456789|gb|EDX45628.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Kentucky str. CVM29188]
gi|205331732|gb|EDZ18496.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Kentucky str. CDC 191]
Length = 141
Score = 46.2 bits (108), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 58/107 (54%), Gaps = 10/107 (9%)
Query: 162 GEDVSSKIMSF-SQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F QN A I G++++V LR A T + G FE++SL+G+
Sbjct: 21 GQEVFSQLHAFVQQNQLHAAWIAGCTGSLTDVALRYAGQEA-TTSLTGTFEVISLNGTL- 78
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
E +G+ L +++S P G +LGG + T T +++V+G
Sbjct: 79 --ELTGEH-----LHLAVSDPYGAMLGGHMMPGCTVRTTLELVIGEL 118
>gi|418846797|ref|ZP_13401562.1| hypothetical protein SEEN443_22121 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19443]
gi|418857548|ref|ZP_13412175.1| hypothetical protein SEEN470_14961 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19470]
gi|418862623|ref|ZP_13417162.1| hypothetical protein SEEN536_07149 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19536]
gi|418869712|ref|ZP_13424145.1| hypothetical protein SEEN176_07637 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 4176]
gi|392809268|gb|EJA65305.1| hypothetical protein SEEN443_22121 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19443]
gi|392834020|gb|EJA89630.1| hypothetical protein SEEN536_07149 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19536]
gi|392835022|gb|EJA90622.1| hypothetical protein SEEN470_14961 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19470]
gi|392836175|gb|EJA91763.1| hypothetical protein SEEN176_07637 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 4176]
Length = 141
Score = 45.8 bits (107), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 58/107 (54%), Gaps = 10/107 (9%)
Query: 162 GEDVSSKIMSF-SQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F QN A I G++++V LR A T + G FE++SL+G+
Sbjct: 21 GQEVFSQLHAFVQQNQLHAAWIAGCTGSLTDVALRYAGQEA-TTSLTGTFEVISLNGTL- 78
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
E +G+ L +++S P G +LGG + T T +++V+G
Sbjct: 79 --ELTGEH-----LHLAVSDPYGVMLGGHMMPGCTVRTTLELVIGEL 118
>gi|366159908|ref|ZP_09459770.1| hypothetical protein ETW09_13280 [Escherichia sp. TW09308]
Length = 143
Score = 45.8 bits (107), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 61/114 (53%), Gaps = 12/114 (10%)
Query: 163 EDVSSKIMSFS-QNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLL 221
++V S++ +F+ Q A I G++++V LR A T G+FE+++L+G+
Sbjct: 23 QEVFSQLRTFARQQQLHAAWIAGCTGSLTDVALRYAGQEN-TTHLRGKFEVIALNGTL-- 79
Query: 222 SESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E +G+ L + +S P G +LGG + T T +++V+GS LA R+
Sbjct: 80 -EQTGEH-----LHLCISDPHGAMLGGHMMPGCTVRTTLELVIGSLEELAFSRQ 127
>gi|62181583|ref|YP_218000.1| hypothetical protein SC3013 [Salmonella enterica subsp. enterica
serovar Choleraesuis str. SC-B67]
gi|224584864|ref|YP_002638662.1| hypothetical protein SPC_3134 [Salmonella enterica subsp. enterica
serovar Paratyphi C strain RKS4594]
gi|375115918|ref|ZP_09761088.1| Putative DNA-binding protein [Salmonella enterica subsp. enterica
serovar Choleraesuis str. SCSA50]
gi|62129216|gb|AAX66919.1| putative DNA-binding protein [Salmonella enterica subsp. enterica
serovar Choleraesuis str. SC-B67]
gi|224469391|gb|ACN47221.1| hypothetical protein SPC_3134 [Salmonella enterica subsp. enterica
serovar Paratyphi C strain RKS4594]
gi|322716064|gb|EFZ07635.1| Putative DNA-binding protein [Salmonella enterica subsp. enterica
serovar Choleraesuis str. SCSA50]
Length = 141
Score = 45.8 bits (107), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 58/107 (54%), Gaps = 10/107 (9%)
Query: 162 GEDVSSKIMSF-SQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F QN A I G++++V LR A T + G FE++SL+G+
Sbjct: 21 GQEVFSQLHAFVQQNQLHAAWIAGCTGSLTDVALRYAGQEA-TTSLTGTFEVISLNGTL- 78
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
E +G+ L +++S P G +LGG + T T +++V+G
Sbjct: 79 --ELTGEH-----LHLAVSDPYGAMLGGHMMPGCTVRTTLELVIGEL 118
>gi|416527203|ref|ZP_11743041.1| hypothetical protein SEEM010_04570 [Salmonella enterica subsp.
enterica serovar Montevideo str. LQC 10]
gi|416533865|ref|ZP_11746683.1| hypothetical protein SEEM030_02675 [Salmonella enterica subsp.
enterica serovar Montevideo str. SARB30]
gi|416549598|ref|ZP_11755441.1| hypothetical protein SEEM29N_06432 [Salmonella enterica subsp.
enterica serovar Montevideo str. 29N]
gi|416568551|ref|ZP_11764903.1| hypothetical protein SEEM41H_21788 [Salmonella enterica subsp.
enterica serovar Montevideo str. 4441 H]
gi|417469716|ref|ZP_12166018.1| putative regulator [Salmonella enterica subsp. enterica serovar
Montevideo str. S5-403]
gi|353626865|gb|EHC75313.1| putative regulator [Salmonella enterica subsp. enterica serovar
Montevideo str. S5-403]
gi|363556858|gb|EHL41071.1| hypothetical protein SEEM010_04570 [Salmonella enterica subsp.
enterica serovar Montevideo str. LQC 10]
gi|363567489|gb|EHL51487.1| hypothetical protein SEEM030_02675 [Salmonella enterica subsp.
enterica serovar Montevideo str. SARB30]
gi|363569547|gb|EHL53497.1| hypothetical protein SEEM29N_06432 [Salmonella enterica subsp.
enterica serovar Montevideo str. 29N]
gi|363577896|gb|EHL61715.1| hypothetical protein SEEM41H_21788 [Salmonella enterica subsp.
enterica serovar Montevideo str. 4441 H]
Length = 141
Score = 45.8 bits (107), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 58/107 (54%), Gaps = 10/107 (9%)
Query: 162 GEDVSSKIMSF-SQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F QN A I G++++V LR A T + G FE++SL+G+
Sbjct: 21 GQEVFSQLHAFVQQNQLHAAWIAGCTGSLADVALRYAG-QDATTSLTGTFEVISLNGTL- 78
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
E +G+ L +++S P G +LGG + T T +++V+G
Sbjct: 79 --ELTGEH-----LHLAVSDPYGAMLGGHMMPGCTVRTTLELVIGEL 118
>gi|284035642|ref|YP_003385572.1| hypothetical protein Slin_0710 [Spirosoma linguale DSM 74]
gi|283814935|gb|ADB36773.1| protein of unknown function DUF296 [Spirosoma linguale DSM 74]
Length = 169
Score = 45.8 bits (107), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 59/113 (52%), Gaps = 12/113 (10%)
Query: 157 ITVKAGEDVSSKIMSF-SQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSL 215
+ +K G+D+ + + Q A IL+ G++++VTLR A + ++G FEI+SL
Sbjct: 41 VRLKPGQDIKKALEAIVRQERIGAGAILTCVGSLTDVTLRLANQENAS-EWKGHFEIVSL 99
Query: 216 SGSFLLSESSGQRSRTGG-LSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
G+ S TG L +S+S GR LGG + T V++V+G+F
Sbjct: 100 VGTL---------STTGSHLHLSVSDSTGRTLGGHLLDGCRVYTTVELVIGTF 143
>gi|300936001|ref|ZP_07150949.1| conserved hypothetical protein [Escherichia coli MS 21-1]
gi|301027769|ref|ZP_07191075.1| conserved hypothetical protein [Escherichia coli MS 196-1]
gi|423703718|ref|ZP_17678143.1| hypothetical protein ESSG_03119 [Escherichia coli H730]
gi|432565165|ref|ZP_19801738.1| hypothetical protein A1SA_03812 [Escherichia coli KTE51]
gi|432681532|ref|ZP_19916897.1| hypothetical protein A1YW_03286 [Escherichia coli KTE143]
gi|433049327|ref|ZP_20236667.1| hypothetical protein WII_03264 [Escherichia coli KTE120]
gi|299879103|gb|EFI87314.1| conserved hypothetical protein [Escherichia coli MS 196-1]
gi|300458793|gb|EFK22286.1| conserved hypothetical protein [Escherichia coli MS 21-1]
gi|385707752|gb|EIG44779.1| hypothetical protein ESSG_03119 [Escherichia coli H730]
gi|431091560|gb|ELD97277.1| hypothetical protein A1SA_03812 [Escherichia coli KTE51]
gi|431218757|gb|ELF16190.1| hypothetical protein A1YW_03286 [Escherichia coli KTE143]
gi|431563173|gb|ELI36406.1| hypothetical protein WII_03264 [Escherichia coli KTE120]
Length = 143
Score = 45.8 bits (107), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 61/115 (53%), Gaps = 12/115 (10%)
Query: 162 GEDVSSKIMSFSQNGP-RAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F+Q A I G++++V L A T+ G+FE+++L+G+
Sbjct: 22 GQEVLSQLRAFAQQQQLHAAWIAGCTGSLTDVALHYAGQENTTLL-SGKFEVIALNGTL- 79
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E SG+ L + +S P G +LGG + T T +++V+G LA R+
Sbjct: 80 --EQSGEH-----LHLCVSDPHGTMLGGHMMPGCTVRTTLELVIGCLEELAFSRQ 127
>gi|260803920|ref|XP_002596837.1| hypothetical protein BRAFLDRAFT_99737 [Branchiostoma floridae]
gi|229282097|gb|EEN52849.1| hypothetical protein BRAFLDRAFT_99737 [Branchiostoma floridae]
Length = 148
Score = 45.8 bits (107), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 55/114 (48%), Gaps = 18/114 (15%)
Query: 162 GEDVSSKIMSFSQN-GPRAVCILSANGAISNVTLRQAATSGG--------TVTYEGRFEI 212
G ++ S + F Q G +A +++ G++S LR A G + + R+EI
Sbjct: 15 GVEIQSALQKFVQEKGLKAPFVMTCVGSVSAAKLRLAKAIGDKPGAGKHEIIELDERYEI 74
Query: 213 LSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGS 266
+SL G+ + G L VSL+ DG ++GG V G LT T ++V+G
Sbjct: 75 VSLVGTL----NDGTH-----LHVSLADKDGAIVGGHVMGNLTVFTTAEIVIGE 119
>gi|432373492|ref|ZP_19616527.1| hypothetical protein WCO_02538 [Escherichia coli KTE11]
gi|430894533|gb|ELC16821.1| hypothetical protein WCO_02538 [Escherichia coli KTE11]
Length = 143
Score = 45.8 bits (107), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 61/114 (53%), Gaps = 12/114 (10%)
Query: 163 EDVSSKIMSFS-QNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLL 221
++V S++ +F+ Q A I G++++V LR A T G+FE+++L+G+
Sbjct: 23 QEVFSQLRAFARQQQLHAAWIAGCTGSLTDVALRYAGQEN-TTHLRGKFEVIALNGTL-- 79
Query: 222 SESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E +G+ L + +S P G +LGG + T T +++V+GS LA R+
Sbjct: 80 -EQTGEH-----LHLCISDPHGAMLGGHMMPGCTVRTTLELVIGSLEELAFSRQ 127
>gi|357482405|ref|XP_003611488.1| hypothetical protein MTR_5g014460 [Medicago truncatula]
gi|355512823|gb|AES94446.1| hypothetical protein MTR_5g014460 [Medicago truncatula]
Length = 111
Score = 45.8 bits (107), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%)
Query: 234 LSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKSSHRME 282
VSL PD R+L G VA AA+ V+V+VGSF DG+K ++ + E
Sbjct: 4 FKVSLVNPDSRLLVGVVADKFIAASLVKVIVGSFTLDGKKNGLNNLKYE 52
>gi|387915350|gb|AFK11284.1| bifunctional protein glmU-like protein [Callorhinchus milii]
Length = 145
Score = 45.4 bits (106), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 58/110 (52%), Gaps = 14/110 (12%)
Query: 162 GEDVSSKIMSFSQNGP-RAVCILSANGAISNVTLRQA----ATSGGTVTYEGRFEILSLS 216
GE++ + ++ F Q+ +A I++ G+++ TLR A + + +G +EI+SL
Sbjct: 17 GEEILTSLIKFVQDKKLKAAFIITCVGSVTKATLRLANAIATNTNQIIELKGNYEIVSLV 76
Query: 217 GSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGS 266
G+ L+E + L ++L+ +G +GG V G L T ++V+G
Sbjct: 77 GT--LNEDAH-------LHINLADMEGHTVGGHVLGNLEVFTTAEIVIGE 117
>gi|417285975|ref|ZP_12073266.1| PF03479 domain protein [Escherichia coli TW07793]
gi|425301755|ref|ZP_18691640.1| bifunctional protein glmU [Escherichia coli 07798]
gi|386251216|gb|EII97383.1| PF03479 domain protein [Escherichia coli TW07793]
gi|408211837|gb|EKI36378.1| bifunctional protein glmU [Escherichia coli 07798]
Length = 143
Score = 45.4 bits (106), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 60/115 (52%), Gaps = 12/115 (10%)
Query: 162 GEDVSSKIMSFSQNGP-RAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ F+Q A I G++++V LR A T G+FE+++L+G+
Sbjct: 22 GQEVLSQLRIFAQQQQLHAAWIAGCTGSLTDVALRYAGQEN-TALLSGKFEVIALNGTL- 79
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E SG+ L + +S P G +LGG + T T +++V+G LA R+
Sbjct: 80 --EQSGEH-----LHLCVSDPHGTMLGGHMMPGCTVRTTLELVIGCLEELAFSRQ 127
>gi|432398850|ref|ZP_19641626.1| hypothetical protein WEI_03791 [Escherichia coli KTE25]
gi|432724370|ref|ZP_19959285.1| hypothetical protein WE1_03419 [Escherichia coli KTE17]
gi|432728951|ref|ZP_19963826.1| hypothetical protein WE3_03421 [Escherichia coli KTE18]
gi|432742640|ref|ZP_19977356.1| hypothetical protein WEE_03352 [Escherichia coli KTE23]
gi|432992003|ref|ZP_20180663.1| hypothetical protein A179_03798 [Escherichia coli KTE217]
gi|433112134|ref|ZP_20297991.1| hypothetical protein WK9_03012 [Escherichia coli KTE150]
gi|430914038|gb|ELC35148.1| hypothetical protein WEI_03791 [Escherichia coli KTE25]
gi|431264259|gb|ELF55986.1| hypothetical protein WE1_03419 [Escherichia coli KTE17]
gi|431271547|gb|ELF62666.1| hypothetical protein WE3_03421 [Escherichia coli KTE18]
gi|431282480|gb|ELF73364.1| hypothetical protein WEE_03352 [Escherichia coli KTE23]
gi|431492977|gb|ELH72574.1| hypothetical protein A179_03798 [Escherichia coli KTE217]
gi|431626724|gb|ELI95268.1| hypothetical protein WK9_03012 [Escherichia coli KTE150]
Length = 143
Score = 45.1 bits (105), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 61/115 (53%), Gaps = 12/115 (10%)
Query: 162 GEDVSSKIMSFSQNGP-RAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F+Q A I G++++V LR A + G+FE+++L+G+
Sbjct: 22 GQEVLSQLRAFAQQQQLHAAWIAGCTGSLTDVALRYAGQENPALL-SGKFEVIALNGTL- 79
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E SG+ L + +S P G +LGG + T T +++V+G LA R+
Sbjct: 80 --EQSGEH-----LHLCVSDPHGTMLGGHMMPGCTVRTTLELVIGCLEELAFSRQ 127
>gi|423110277|ref|ZP_17097972.1| hypothetical protein HMPREF9687_03523 [Klebsiella oxytoca 10-5243]
gi|423116210|ref|ZP_17103901.1| hypothetical protein HMPREF9689_03958 [Klebsiella oxytoca 10-5245]
gi|376379031|gb|EHS91787.1| hypothetical protein HMPREF9689_03958 [Klebsiella oxytoca 10-5245]
gi|376380262|gb|EHS93010.1| hypothetical protein HMPREF9687_03523 [Klebsiella oxytoca 10-5243]
Length = 141
Score = 44.7 bits (104), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 36/115 (31%), Positives = 61/115 (53%), Gaps = 12/115 (10%)
Query: 162 GEDVSSKIMSFSQ-NGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
GE+V S++ +F Q + +A I G++SN LR A T+ G +E++SL+G+
Sbjct: 21 GEEVFSRLRAFLQLHHIQAAWIAGCTGSLSNAALRFAGQDETTLL-NGTYEVISLNGTL- 78
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E G+ L +++S P G +LGG + T T +++V+G LA R+
Sbjct: 79 --ELKGEH-----LHLAISDPQGAMLGGHMMPGCTVRTTLELVIGELTSLAFSRQ 126
>gi|386741636|ref|YP_006214815.1| putative DNA-binding protein [Providencia stuartii MRSN 2154]
gi|384478329|gb|AFH92124.1| putative DNA-binding protein [Providencia stuartii MRSN 2154]
Length = 152
Score = 44.7 bits (104), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 34/112 (30%), Positives = 57/112 (50%), Gaps = 10/112 (8%)
Query: 157 ITVKAGEDVSSKIMSF-SQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSL 215
+ ++ GEDV + F QN +A I G+++ V LR A T + GR+EI+SL
Sbjct: 16 LRLRPGEDVIPTLRHFIQQNHLKAAFIAGCVGSLTRVNLRFAGKET-TDQFVGRYEIVSL 74
Query: 216 SGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
G+ +S G+ L +++S +G V GG + T T +++V+G
Sbjct: 75 IGTL---DSEGEH-----LHLAISDENGHVQGGHMMLDCTVRTTLELVIGEL 118
>gi|222641439|gb|EEE69571.1| hypothetical protein OsJ_29091 [Oryza sativa Japonica Group]
Length = 1254
Score = 44.7 bits (104), Expect = 0.066, Method: Composition-based stats.
Identities = 32/111 (28%), Positives = 50/111 (45%), Gaps = 24/111 (21%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQA--ATSGGTVTYEGRFEI 212
H++ + GEDV+ + F++ RQ+ G + G EI
Sbjct: 119 HMMEIADGEDVAEAVADFARR-------------------RQSWPGEPGSVIELSGPLEI 159
Query: 213 LSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVV 263
LSLSG+F+ S + GL L+G G+V+GG+V G L A V ++
Sbjct: 160 LSLSGAFMPPPSLANAT---GLKALLAGGQGQVIGGNVVGALRARGHVTIL 207
>gi|183599387|ref|ZP_02960880.1| hypothetical protein PROSTU_02856 [Providencia stuartii ATCC 25827]
gi|188021625|gb|EDU59665.1| hypothetical protein PROSTU_02856 [Providencia stuartii ATCC 25827]
Length = 152
Score = 44.7 bits (104), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 34/112 (30%), Positives = 57/112 (50%), Gaps = 10/112 (8%)
Query: 157 ITVKAGEDVSSKIMSF-SQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSL 215
+ ++ GEDV + F QN +A I G+++ V LR A T + GR+EI+SL
Sbjct: 16 LRLRPGEDVIPTLRHFIQQNHLKAAFIAGCVGSLTRVNLRFAGKET-TDQFVGRYEIVSL 74
Query: 216 SGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
G+ +S G+ L +++S +G V GG + T T +++V+G
Sbjct: 75 IGTL---DSEGEH-----LHLAISDENGHVQGGHMMLDCTVRTTLELVIGEL 118
>gi|161506376|ref|YP_001573488.1| hypothetical protein SARI_04573 [Salmonella enterica subsp.
arizonae serovar 62:z4,z23:- str. RSK2980]
gi|160867723|gb|ABX24346.1| hypothetical protein SARI_04573 [Salmonella enterica subsp.
arizonae serovar 62:z4,z23:-]
Length = 141
Score = 44.7 bits (104), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 32/107 (29%), Positives = 58/107 (54%), Gaps = 10/107 (9%)
Query: 162 GEDVSSKIMSF-SQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F Q+ A I G++++V LR A T + G FE++SL+G+
Sbjct: 21 GQEVFSQLHAFVQQHQLHAAWIAGCTGSLTDVALRYAGQEA-TTSLTGTFEVISLNGTL- 78
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
E +G+ L +++S P G +LGG + T T +++V+G
Sbjct: 79 --ELTGEH-----LHLAVSDPYGAMLGGHMMPGCTVRTTLELVIGEL 118
>gi|332297216|ref|YP_004439138.1| hypothetical protein Trebr_0564 [Treponema brennaborense DSM 12168]
gi|332180319|gb|AEE16007.1| protein of unknown function DUF296 [Treponema brennaborense DSM
12168]
Length = 134
Score = 44.3 bits (103), Expect = 0.089, Method: Compositional matrix adjust.
Identities = 41/130 (31%), Positives = 65/130 (50%), Gaps = 17/130 (13%)
Query: 153 TPHVITVKAGEDVSSKIMSFS-QNGPRAVCILSANGAISNVTLRQAATSGGTV-TYEGRF 210
T HV ++ G+D+ +KI ++ ++ A C+LS G + +R A SG TV T +
Sbjct: 3 TVHVFRLRRGDDLLTKITEYARKHHIEAGCVLSCAGCVLRAHIRDA--SGLTVRTVDEPM 60
Query: 211 EILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVG----- 265
EI+SL+G+ S R+ L VS S D +GG + T T ++V+
Sbjct: 61 EIVSLTGTV-----SAARTH---LHVSFSKEDLSTVGGHLVEGCTVNTTAEIVLQHLEGI 112
Query: 266 SFLADGRKES 275
SF A+ KE+
Sbjct: 113 SFAAEFDKET 122
>gi|423104799|ref|ZP_17092501.1| hypothetical protein HMPREF9686_03405 [Klebsiella oxytoca 10-5242]
gi|376382762|gb|EHS95495.1| hypothetical protein HMPREF9686_03405 [Klebsiella oxytoca 10-5242]
Length = 141
Score = 43.9 bits (102), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 62/115 (53%), Gaps = 12/115 (10%)
Query: 162 GEDVSSKIMSFSQNGP-RAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F Q +A I G++S+ LR A T+ G +E++SL+G+
Sbjct: 21 GDEVFSRLRAFIQEQQIQAAWIAGCTGSLSHAALRFAGQDETTLL-TGTYEVISLNGTL- 78
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVG--SFLADGRK 273
E G+ L +++S P G +LGG + T T +++V+G +FLA R+
Sbjct: 79 --EWQGEH-----LHLAISDPQGAMLGGHMMPGCTVRTTLELVIGELTFLAFSRQ 126
>gi|358339341|dbj|GAA47424.1| hypothetical protein CLF_100341 [Clonorchis sinensis]
Length = 619
Score = 43.9 bits (102), Expect = 0.100, Method: Compositional matrix adjust.
Identities = 35/130 (26%), Positives = 65/130 (50%), Gaps = 16/130 (12%)
Query: 141 QLEALGSAGVGFTPHVITVKAGEDV----SSKIMSFSQNGPRAVCILSANGAISNVTLRQ 196
+ ++ SAG GF H++ + G++V S ++S G I++ G+++ +R
Sbjct: 437 DVSSVHSAGEGFGVHILRLAPGQEVRSCLSHYVLSHHLTG---AFIITCCGSLTKAHIRL 493
Query: 197 AATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTA 256
A + EG FEI+S+ G+ S G L ++L+ +G+VLGG + G
Sbjct: 494 ANLQESEL--EGPFEIVSMVGTL---ASDGH----PHLHIALADSNGQVLGGHLLGSCQV 544
Query: 257 ATPVQVVVGS 266
T ++V+G+
Sbjct: 545 NTTAEIVLGA 554
>gi|395228483|ref|ZP_10406806.1| UDP-N-acetylglucosamine diphosphorylase [Citrobacter sp. A1]
gi|421845285|ref|ZP_16278440.1| hypothetical protein D186_09608 [Citrobacter freundii ATCC 8090 =
MTCC 1658]
gi|424731896|ref|ZP_18160477.1| udp-n-acetylglucosamine diphosphorylase [Citrobacter sp. L17]
gi|394718132|gb|EJF23776.1| UDP-N-acetylglucosamine diphosphorylase [Citrobacter sp. A1]
gi|411773606|gb|EKS57151.1| hypothetical protein D186_09608 [Citrobacter freundii ATCC 8090 =
MTCC 1658]
gi|422893524|gb|EKU33371.1| udp-n-acetylglucosamine diphosphorylase [Citrobacter sp. L17]
gi|455642823|gb|EMF21974.1| hypothetical protein H262_15582 [Citrobacter freundii GTC 09479]
Length = 141
Score = 43.9 bits (102), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 35/122 (28%), Positives = 65/122 (53%), Gaps = 12/122 (9%)
Query: 155 HVITVKAGEDVSSKIMSF-SQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEIL 213
H + + G++V S + +F Q+ +A I G+++N+ LR A T+ G +EI+
Sbjct: 14 HALRLLPGQEVFSALHAFIQQHQLQAAWIAGCTGSLTNIALRFAGREETTL-LTGTWEII 72
Query: 214 SLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADG 271
SL+G+ E +G+ L ++++ P G +LGG + T T +++V+G LA
Sbjct: 73 SLNGTL---ELTGEH-----LHLAVADPHGAMLGGHMMPGCTVRTTLELVIGELTSLAFS 124
Query: 272 RK 273
R+
Sbjct: 125 RQ 126
>gi|377577203|ref|ZP_09806186.1| hypothetical protein EH105704_02_03800 [Escherichia hermannii NBRC
105704]
gi|377541731|dbj|GAB51351.1| hypothetical protein EH105704_02_03800 [Escherichia hermannii NBRC
105704]
Length = 148
Score = 43.9 bits (102), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 63/115 (54%), Gaps = 12/115 (10%)
Query: 162 GEDVSSKIMSFSQNGP-RAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F Q+ +A I G++S+V LR A T+ G +E++S+SG+
Sbjct: 24 GDEVLSQLRAFIQHHQLQAAWIAGCTGSLSDVALRYAGQEETTLLC-GTYEVISMSGTL- 81
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E +G+ L +S+S P G +LGG + T T +++++G LA R+
Sbjct: 82 --ELTGEH-----LHLSISDPHGAMLGGHMMPGSTVRTTLEIIIGELTELAFSRQ 129
>gi|357481873|ref|XP_003611222.1| DNA binding protein [Medicago truncatula]
gi|355512557|gb|AES94180.1| DNA binding protein [Medicago truncatula]
Length = 124
Score = 43.5 bits (101), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 32/63 (50%), Gaps = 2/63 (3%)
Query: 147 SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTY 206
S G TPH+ITV EDV+ K+M+F A+ IL A+G IS + + SG
Sbjct: 64 SVGTNLTPHIITVNPREDVAMKVMTFCPQ--EAIRILYASGVISRAIVNRPQASGTLYNL 121
Query: 207 EGR 209
R
Sbjct: 122 HMR 124
>gi|303281476|ref|XP_003060030.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226458685|gb|EEH55982.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 139
Score = 43.5 bits (101), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 61/117 (52%), Gaps = 13/117 (11%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGP-RAVCILSANGAISNVTLRQAATS----GGTVTYEGR 209
H + GED+ + +++ + RA +L+ G++S VTLR A ++ V+ + R
Sbjct: 4 HAFRLTPGEDLKKALCAYAASRKLRASFVLTCVGSLSAVTLRLANSARDGKNEVVSLDER 63
Query: 210 FEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGS 266
FEI+SL+G+ LS + L VS++ +G V+GG + T ++V+G
Sbjct: 64 FEIVSLTGT--LSANGAH------LHVSIADFEGNVVGGHLMDGCVVFTTAEIVLGE 112
>gi|420368941|ref|ZP_14869672.1| putative DNA-binding protein [Shigella flexneri 1235-66]
gi|391321712|gb|EIQ78429.1| putative DNA-binding protein [Shigella flexneri 1235-66]
Length = 141
Score = 43.1 bits (100), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 35/122 (28%), Positives = 65/122 (53%), Gaps = 12/122 (9%)
Query: 155 HVITVKAGEDVSSKIMSF-SQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEIL 213
H + + G++V S + +F Q+ +A I G+++N+ LR A T+ G +EI+
Sbjct: 14 HALRLLPGQEVFSALHAFIQQHQLQAAWIAGCTGSLTNIALRFAGREETTL-LTGTWEII 72
Query: 214 SLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADG 271
SL+G+ E +G+ L ++++ P G +LGG + T T +++V+G LA
Sbjct: 73 SLNGTL---ELTGEH-----LHLAVADPRGAMLGGHMMPGCTVRTTLELVIGELTSLAFS 124
Query: 272 RK 273
R+
Sbjct: 125 RQ 126
>gi|125597060|gb|EAZ36840.1| hypothetical protein OsJ_21183 [Oryza sativa Japonica Group]
Length = 293
Score = 43.1 bits (100), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 43/88 (48%), Positives = 55/88 (62%), Gaps = 6/88 (6%)
Query: 183 LSANGAISNVTLRQ--AATSGGTV-TYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLS 239
LS GA++NV LRQ A+ G V T G+FEILSL+G+ L + GL+V LS
Sbjct: 112 LSGGGAVANVALRQPGASPPGSLVATMRGQFEILSLTGTVLPPPAP---PSASGLTVFLS 168
Query: 240 GPDGRVLGGSVAGLLTAATPVQVVVGSF 267
G G+V+GGSVAG L AA PV ++ SF
Sbjct: 169 GGQGQVVGGSVAGQLIAAGPVFLMAASF 196
>gi|428309754|ref|YP_007120731.1| DNA-binding protein with PD1-like DNA-binding motif [Microcoleus
sp. PCC 7113]
gi|428251366|gb|AFZ17325.1| putative DNA-binding protein with PD1-like DNA-binding motif
[Microcoleus sp. PCC 7113]
Length = 136
Score = 42.7 bits (99), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 57/113 (50%), Gaps = 14/113 (12%)
Query: 157 ITVKAGEDVSSKIMSF-SQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSL 215
I +K ED+ ++ F QN +A IL+ G++ TLR A+ + V ++ +FEI+SL
Sbjct: 6 IRLKPDEDLRQSLIHFVQQNNIQAGFILTGVGSLKQATLRFASQNYSQV-FKQQFEIVSL 64
Query: 216 SGSFLLSESSGQRSRTGGLSV--SLSGPDGRVLGGSVAGLLTAATPVQVVVGS 266
G+ T G+ + SLS G+ LGG + T ++V+G+
Sbjct: 65 VGTL----------STHGIHIHISLSNRQGKTLGGHLLEGCIIYTTAEIVIGT 107
>gi|326435717|gb|EGD81287.1| hypothetical protein PTSG_11324 [Salpingoeca sp. ATCC 50818]
Length = 178
Score = 42.7 bits (99), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 33/121 (27%), Positives = 57/121 (47%), Gaps = 11/121 (9%)
Query: 153 TPHVITVKAGEDVSSKIMSFSQNGP-RAVCILSANGAISNVTLRQAATSGGT------VT 205
T +V+ V+ G+++ + F++ RA + + G++S +R A+ + T
Sbjct: 17 TSYVLRVQPGQEIVGALTWFAKRARMRAGFVQTCVGSVSEAVIRMASATADTDQANHIKR 76
Query: 206 YEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVG 265
EG EI+SL G+ + E + L V LS DG +GG V L T ++V+G
Sbjct: 77 VEGCHEIVSLVGTLAVDEDLSYKQH---LHVCLSDKDGNTIGGHVIS-LKVFTTAEIVLG 132
Query: 266 S 266
Sbjct: 133 E 133
>gi|218185063|gb|EEC67490.1| hypothetical protein OsI_34755 [Oryza sativa Indica Group]
Length = 114
Score = 42.4 bits (98), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 31/70 (44%), Positives = 43/70 (61%), Gaps = 2/70 (2%)
Query: 260 VQVVVGSFLADGRK-ESKSSHRMESLPVPPKLAPGGQPAGQCSPPSRGTLSESSGGPGSP 318
QVVV SF+A+G+K + + ++E + PP++A PA SPPS GT S SS GSP
Sbjct: 5 YQVVVASFIAEGKKSKPVETRKVEPMSAPPQMA-TYVPAPVASPPSEGTSSGSSDDSGSP 63
Query: 319 LNHSTGACNN 328
+NHS N+
Sbjct: 64 INHSGMPYNH 73
>gi|125604350|gb|EAZ43675.1| hypothetical protein OsJ_28300 [Oryza sativa Japonica Group]
Length = 239
Score = 42.4 bits (98), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 26/67 (38%), Positives = 36/67 (53%), Gaps = 1/67 (1%)
Query: 155 HVITVKAGEDVSSKIMSFSQNGPRAVCILSAN-GAISNVTLRQAATSGGTVTYEGRFEIL 213
HV+ V G DV+ I F++ S+ G +++V L Q A V GRFEIL
Sbjct: 65 HVMEVAGGADVAESIAHFARAAEARRLACSSGAGTVTDVALGQPAAPSAVVALRGRFEIL 124
Query: 214 SLSGSFL 220
SL+G+FL
Sbjct: 125 SLTGTFL 131
>gi|283835308|ref|ZP_06355049.1| UDP-N-acetylglucosamine diphosphorylase [Citrobacter youngae ATCC
29220]
gi|291068466|gb|EFE06575.1| UDP-N-acetylglucosamine diphosphorylase [Citrobacter youngae ATCC
29220]
Length = 141
Score = 42.0 bits (97), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 61/115 (53%), Gaps = 12/115 (10%)
Query: 162 GEDVSSKIMSF-SQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S + F Q+ +A I G+++NV LR A T+ G +EI+SL+G+
Sbjct: 21 GQEVFSALRDFVQQHQLQAAWIAGCTGSLTNVALRYAGRDETTL-LTGTWEIISLNGTL- 78
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E +G+ L ++++ P G +LGG + T T +++V+G LA R+
Sbjct: 79 --ELTGEH-----LHLAVADPHGAMLGGHMMPGCTVRTTLELVIGELASLAFSRQ 126
>gi|42523205|ref|NP_968585.1| DNA-binding protein [Bdellovibrio bacteriovorus HD100]
gi|39575410|emb|CAE79578.1| putative DNA-binding protein [Bdellovibrio bacteriovorus HD100]
Length = 140
Score = 42.0 bits (97), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 59/115 (51%), Gaps = 14/115 (12%)
Query: 153 TPHVITVKAGEDVSSKIMSFSQNGP-RAVCILSANGAISNVTLRQAATSGG--TVTYEGR 209
T + ++ G+D+ +++ + Q A C++SA G++ LR SGG V ++G
Sbjct: 11 TSYCFRLRPGQDLKKELLFYCQKYHLHAACVVSAVGSVDKAHLRM---SGGKDVVEFQGP 67
Query: 210 FEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVV 264
FEI+SLSG+ L L +++S +G+V+GG + T ++V+
Sbjct: 68 FEIVSLSGT--LGPDGAH------LHMAISNYEGQVIGGHLMDGSVIHTTAEIVL 114
>gi|357440691|ref|XP_003590623.1| AT-hook DNA-binding protein [Medicago truncatula]
gi|355479671|gb|AES60874.1| AT-hook DNA-binding protein [Medicago truncatula]
Length = 192
Score = 42.0 bits (97), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 42/158 (26%), Positives = 64/158 (40%), Gaps = 36/158 (22%)
Query: 151 GFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRF 210
G HV+ + DVS + +++ R +CIL+ NG + TL + G VT R
Sbjct: 55 GLCSHVLDITTEVDVSIVLFDYARRRGRLICILNGNGVVDKTTLCKPI--GRIVTVHRRS 112
Query: 211 EILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLAD 270
ILS+S R+L GSV L A+ V+++V SF +
Sbjct: 113 NILSIS---------------------------RIL-GSVIPSLVASYSVKLMVVSFANN 144
Query: 271 GRKE------SKSSHRMESLPVPPKLAPGGQPAGQCSP 302
+E S S + L ++A G + QCS
Sbjct: 145 ASEELYLAAYSFGSAQTICLDAARQVAEGKRWHDQCSK 182
>gi|242280979|ref|YP_002993108.1| hypothetical protein Desal_3523 [Desulfovibrio salexigens DSM 2638]
gi|242123873|gb|ACS81569.1| protein of unknown function DUF296 [Desulfovibrio salexigens DSM
2638]
Length = 134
Score = 41.6 bits (96), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 31/116 (26%), Positives = 54/116 (46%), Gaps = 12/116 (10%)
Query: 154 PHVITVKAGEDVSSKIMSFSQNGP-RAVCILSANGAISNVTLRQAATSGGTVTYEGRFEI 212
P I K G+D+ ++ +Q A C+L+ G+++N LR A G + G FEI
Sbjct: 3 PLAIRFKPGQDILLELERIAQEREIEAACVLTCVGSLTNAVLR-FANQGESTELNGHFEI 61
Query: 213 LSLSGSFLLSESSGQRSRTGG-LSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
+SL+G SR G +++S +G+ +G + T ++V+ F
Sbjct: 62 VSLTGVL---------SRHGSHFHIAISDGEGKTIGAHLMEGSRVYTTAEIVLALF 108
>gi|375257329|ref|YP_005016499.1| hypothetical protein KOX_02595 [Klebsiella oxytoca KCTC 1686]
gi|365906807|gb|AEX02260.1| hypothetical protein KOX_02595 [Klebsiella oxytoca KCTC 1686]
Length = 136
Score = 41.6 bits (96), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 59/115 (51%), Gaps = 12/115 (10%)
Query: 162 GEDVSSKIMSFSQNGP-RAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
GE+V S++ +F Q +A I G++S+ LR A T G +E++SL+G+
Sbjct: 16 GEEVFSRLRAFIQEQQIQAAWIAGCTGSLSDAALRFAGQDETTFL-TGTYEVISLNGTL- 73
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E G+ L +++S P G +LGG + T T +++V+G LA R+
Sbjct: 74 --EWQGEH-----LHLAISDPQGAMLGGHMMPGCTVRTTLELVIGELTSLAFSRQ 121
>gi|426371143|ref|XP_004052513.1| PREDICTED: condensin-2 complex subunit D3 [Gorilla gorilla gorilla]
Length = 1498
Score = 41.6 bits (96), Expect = 0.57, Method: Composition-based stats.
Identities = 33/103 (32%), Positives = 48/103 (46%), Gaps = 6/103 (5%)
Query: 214 SLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRK 273
+L+ S +S+ R+ G ++VS P+ G + LL A P+ + + L +K
Sbjct: 1305 NLAMSPAVSQPCTPRASAGHVAVSSPTPET----GPLQRLLPKARPMSLSTIAILNSVKK 1360
Query: 274 --ESKSSHRMESLPVPPKLAPGGQPAGQCSPPSRGTLSESSGG 314
ESKS HR SL V P G P CS S +L + S G
Sbjct: 1361 AVESKSRHRSRSLGVLPFTLNSGSPEKTCSQVSSYSLEQESNG 1403
>gi|436837495|ref|YP_007322711.1| hypothetical protein FAES_4118 [Fibrella aestuarina BUZ 2]
gi|384068908|emb|CCH02118.1| hypothetical protein FAES_4118 [Fibrella aestuarina BUZ 2]
Length = 169
Score = 41.6 bits (96), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 58/110 (52%), Gaps = 10/110 (9%)
Query: 157 ITVKAGEDVSSKIMSF-SQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSL 215
I ++ G+D+ +++ +Q+ A +L+ G++++V+LR A G T Y G FEI+SL
Sbjct: 42 IRLRPGQDLKTELDKLVAQHRIEAGLVLTCVGSLTDVSLRLANQEGATA-YHGHFEIVSL 100
Query: 216 SGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVG 265
G+ ++G L +++S GR GG + T ++V+G
Sbjct: 101 VGTL---STNGSH-----LHLAVSDSTGRTTGGHLMAGNIIYTTAEIVLG 142
>gi|332264020|ref|XP_003281046.1| PREDICTED: condensin-2 complex subunit D3 [Nomascus leucogenys]
Length = 1498
Score = 41.2 bits (95), Expect = 0.69, Method: Composition-based stats.
Identities = 32/96 (33%), Positives = 43/96 (44%), Gaps = 6/96 (6%)
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRK--ESKSS 278
+S+ R+ G + VS P+ G + LL A PV + + L +K ESKS
Sbjct: 1312 VSQPCTPRASAGHVPVSSPTPET----GPLQRLLPKARPVSLSTIAILNSVKKAVESKSR 1367
Query: 279 HRMESLPVPPKLAPGGQPAGQCSPPSRGTLSESSGG 314
HR SL V P G P CS S +L + S G
Sbjct: 1368 HRSRSLGVLPFTLNSGSPEKTCSQVSSYSLEQESNG 1403
>gi|125584179|gb|EAZ25110.1| hypothetical protein OsJ_08906 [Oryza sativa Japonica Group]
Length = 239
Score = 41.2 bits (95), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 4/82 (4%)
Query: 187 GAISNVTLRQAATSG-GTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRV 245
G ++NV LRQ + G + GRFEILSL+G+FL + + GL+V L+G G+V
Sbjct: 92 GTVANVALRQPSAPGRPSSPSTGRFEILSLTGNFLPGPAPPGST---GLTVYLAGGQGQV 148
Query: 246 LGGSVAGLLTAATPVQVVVGSF 267
+GGSV G L AA PV V+ +F
Sbjct: 149 VGGSVVGSLIAAGPVMVIASTF 170
>gi|402895861|ref|XP_003911031.1| PREDICTED: condensin-2 complex subunit D3 [Papio anubis]
Length = 1498
Score = 41.2 bits (95), Expect = 0.72, Method: Composition-based stats.
Identities = 31/96 (32%), Positives = 43/96 (44%), Gaps = 6/96 (6%)
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRK--ESKSS 278
+S+ R+ G + VS P+ G + LL A P+ + + L +K ESKS
Sbjct: 1312 VSQPCTHRASAGHVPVSSPAPET----GPLQRLLPKARPMSLSTIAILNSVKKAVESKSR 1367
Query: 279 HRMESLPVPPKLAPGGQPAGQCSPPSRGTLSESSGG 314
HR SL V P G P CS S +L + S G
Sbjct: 1368 HRSRSLGVLPFTLNSGSPEKTCSQVSSYSLEQESNG 1403
>gi|417328611|ref|ZP_12113694.1| putative regulator [Salmonella enterica subsp. enterica serovar
Adelaide str. A4-669]
gi|417343644|ref|ZP_12124173.1| putative regulator [Salmonella enterica subsp. enterica serovar
Baildon str. R6-199]
gi|417375766|ref|ZP_12145134.1| putative regulator [Salmonella enterica subsp. enterica serovar
Inverness str. R8-3668]
gi|417513390|ref|ZP_12177452.1| putative regulator [Salmonella enterica subsp. enterica serovar
Senftenberg str. A4-543]
gi|417520449|ref|ZP_12182359.1| putative regulator of transporter operon [Salmonella enterica
subsp. enterica serovar Uganda str. R8-3404]
gi|417541649|ref|ZP_12193320.1| putative regulator [Salmonella enterica subsp. enterica serovar
Wandsworth str. A4-580]
gi|353567373|gb|EHC32592.1| putative regulator [Salmonella enterica subsp. enterica serovar
Adelaide str. A4-669]
gi|353595236|gb|EHC52535.1| putative regulator [Salmonella enterica subsp. enterica serovar
Inverness str. R8-3668]
gi|353636939|gb|EHC82881.1| putative regulator [Salmonella enterica subsp. enterica serovar
Senftenberg str. A4-543]
gi|353643946|gb|EHC88021.1| putative regulator of transporter operon [Salmonella enterica
subsp. enterica serovar Uganda str. R8-3404]
gi|353660318|gb|EHC99975.1| putative regulator [Salmonella enterica subsp. enterica serovar
Wandsworth str. A4-580]
gi|357955160|gb|EHJ81072.1| putative regulator [Salmonella enterica subsp. enterica serovar
Baildon str. R6-199]
Length = 110
Score = 41.2 bits (95), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 49/96 (51%), Gaps = 9/96 (9%)
Query: 172 FSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRT 231
QN A I G++++V LR A T + G FE++SL+G+ E +G+
Sbjct: 1 MQQNQLHAAWIAGCTGSLTDVALRYAGQEA-TTSLTGTFEVISLNGTL---ELTGEH--- 53
Query: 232 GGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
L +++S P G +LGG + T T +++V+G
Sbjct: 54 --LHLAVSDPYGAMLGGHMMPGCTVRTTLELVIGEL 87
>gi|417427576|ref|ZP_12160751.1| putative regulator [Salmonella enterica subsp. enterica serovar
Mississippi str. A4-633]
gi|353616447|gb|EHC67714.1| putative regulator [Salmonella enterica subsp. enterica serovar
Mississippi str. A4-633]
Length = 110
Score = 41.2 bits (95), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 49/96 (51%), Gaps = 9/96 (9%)
Query: 172 FSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRT 231
QN A I G++++V LR A T + G FE++SL+G+ E +G+
Sbjct: 1 MQQNQLHAAWIAGCTGSLTDVALRYAGQEA-TTSLTGTFEVISLNGTL---ELTGEH--- 53
Query: 232 GGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
L +++S P G +LGG + T T +++V+G
Sbjct: 54 --LHLAVSDPYGAMLGGHMMPGCTVRTTLELVIGEL 87
>gi|86610066|ref|YP_478828.1| bifunctional N-acetylglucosamine-1-phosphate
uridyltransferase/glucosamine-1-phosphate
acetyltransferase [Synechococcus sp. JA-2-3B'a(2-13)]
gi|109892125|sp|Q2JII9.1|GLMU_SYNJB RecName: Full=Bifunctional protein GlmU; Includes: RecName:
Full=UDP-N-acetylglucosamine pyrophosphorylase; AltName:
Full=N-acetylglucosamine-1-phosphate uridyltransferase;
Includes: RecName: Full=Glucosamine-1-phosphate
N-acetyltransferase
gi|86558608|gb|ABD03565.1| UDP-N-acetylglucosamine pyrophosphorylase [Synechococcus sp.
JA-2-3B'a(2-13)]
Length = 632
Score = 40.8 bits (94), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 39/130 (30%), Positives = 61/130 (46%), Gaps = 16/130 (12%)
Query: 98 SSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPP-------GSGSGKKHQLEALGSAGV 150
S VT A G T P PL D + +R RP G S + + + + +
Sbjct: 416 SDVTIAAGST-----IPARYPLPDDCLVIARSRPVVKPGWRLGIRSSRPQEPQPMPPGSL 470
Query: 151 GFTPHVITVKAGEDVSSKIMSFSQNGP-RAVCILSANGAISNVTLRQAATSGGTVTYEGR 209
P + + G+D+ ++ ++ P +A +LSA G++S TLR A +G + E R
Sbjct: 471 KIYP--LRLFPGQDLKQELERLARQQPLQAGFVLSAVGSLSQATLRLADQTGDHLLSE-R 527
Query: 210 FEILSLSGSF 219
EIL+LSGS
Sbjct: 528 LEILALSGSL 537
>gi|417352145|ref|ZP_12129438.1| putative regulator [Salmonella enterica subsp. enterica serovar
Gaminara str. A4-567]
gi|417385490|ref|ZP_12150536.1| putative regulator [Salmonella enterica subsp. enterica serovar
Johannesburg str. S5-703]
gi|417393381|ref|ZP_12155904.1| putative regulator [Salmonella enterica subsp. enterica serovar
Minnesota str. A4-603]
gi|417481002|ref|ZP_12171898.1| putative regulator [Salmonella enterica subsp. enterica serovar
Rubislaw str. A4-653]
gi|417533650|ref|ZP_12187630.1| putative regulator [Salmonella enterica subsp. enterica serovar
Urbana str. R8-2977]
gi|353567383|gb|EHC32600.1| putative regulator [Salmonella enterica subsp. enterica serovar
Gaminara str. A4-567]
gi|353605665|gb|EHC60111.1| putative regulator [Salmonella enterica subsp. enterica serovar
Johannesburg str. S5-703]
gi|353608917|gb|EHC62367.1| putative regulator [Salmonella enterica subsp. enterica serovar
Minnesota str. A4-603]
gi|353635925|gb|EHC82104.1| putative regulator [Salmonella enterica subsp. enterica serovar
Rubislaw str. A4-653]
gi|353660236|gb|EHC99910.1| putative regulator [Salmonella enterica subsp. enterica serovar
Urbana str. R8-2977]
Length = 110
Score = 40.8 bits (94), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 49/96 (51%), Gaps = 9/96 (9%)
Query: 172 FSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRT 231
QN A I G++++V LR A T + G FE++SL+G+ E +G+
Sbjct: 1 MQQNQLHAAWIAGCTGSLADVALRYAGQEA-TTSLTGTFEVISLNGTL---ELTGEH--- 53
Query: 232 GGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
L +++S P G +LGG + T T +++V+G
Sbjct: 54 --LHLAVSDPYGAMLGGHMMPGCTVRTTLELVIGEL 87
>gi|426403685|ref|YP_007022656.1| DNA-binding protein [Bdellovibrio bacteriovorus str. Tiberius]
gi|425860353|gb|AFY01389.1| putative DNA-binding protein [Bdellovibrio bacteriovorus str.
Tiberius]
Length = 124
Score = 40.8 bits (94), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 50/92 (54%), Gaps = 14/92 (15%)
Query: 162 GEDVSSKIMSFSQNGP-RAVCILSANGAISNVTLRQAATSGG--TVTYEGRFEILSLSGS 218
G+D+ +++ + Q +A C++SA G++ LR SGG V ++G FEI+SLSG+
Sbjct: 4 GQDLKKELLFYCQKYHLQAACVVSAVGSVDKAHLR---MSGGKDVVEFQGPFEIVSLSGT 60
Query: 219 FLLSESSGQRSRTGGLSVSLSGPDGRVLGGSV 250
L +S+S +G+V+GG +
Sbjct: 61 L--------GPDGAHLHMSISNFEGQVIGGHL 84
>gi|301620226|ref|XP_002939482.1| PREDICTED: bifunctional protein glmU-like [Xenopus (Silurana)
tropicalis]
Length = 160
Score = 40.8 bits (94), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 31/111 (27%), Positives = 56/111 (50%), Gaps = 14/111 (12%)
Query: 162 GEDVSSKIMSFSQN-GPRAVCILSANGAISNVTLRQA---ATSGGTVTY-EGRFEILSLS 216
GE++ + + F Q ++ +L+ G+++ TLR A A + + Y + + EI+SL
Sbjct: 29 GEEILTSLFKFVQEKNLKSPFVLTCVGSVTKATLRLANSDALNTNEIIYLKEKLEIVSLV 88
Query: 217 GSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
G+ L+E + L +SL DG+ +GG G L T ++V+G
Sbjct: 89 GT--LNEGAH-------LHISLGDKDGKTIGGHAIGDLEVFTTAEIVIGEL 130
>gi|90410315|ref|ZP_01218331.1| hypothetical DNA binding protein [Photobacterium profundum 3TCK]
gi|90328556|gb|EAS44840.1| hypothetical DNA binding protein [Photobacterium profundum 3TCK]
Length = 135
Score = 40.8 bits (94), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 27/98 (27%), Positives = 52/98 (53%), Gaps = 10/98 (10%)
Query: 154 PHVITVKAGEDVSSKIMSFSQ-NGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEI 212
PH + G+D+ + ++++ + N +A +LS G ++ +R A S ++T +G EI
Sbjct: 4 PHAFRLTQGDDLKASVLAYVKANNIKAGSLLSCAGCLTTARIRLADESK-SLTLDGPLEI 62
Query: 213 LSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSV 250
L+LSG+ + L +S++ +GRV GG +
Sbjct: 63 LTLSGTL--------TADHVHLHISVADKEGRVFGGHL 92
>gi|54308966|ref|YP_129986.1| DNA-binding protein [Photobacterium profundum SS9]
gi|46913396|emb|CAG20184.1| hypothetical DNA binding protein [Photobacterium profundum SS9]
Length = 131
Score = 40.8 bits (94), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 29/116 (25%), Positives = 59/116 (50%), Gaps = 10/116 (8%)
Query: 154 PHVITVKAGEDVSSKIMSFSQ-NGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEI 212
PH + G+D+ + ++++ + N +A +LS G ++ +R A S ++T +G EI
Sbjct: 4 PHAFRLTQGDDLKASVLAYVKANSIKAGSLLSCAGCLTTARIRLADESK-SLTLDGPLEI 62
Query: 213 LSLSGSFLLSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFL 268
L+LSG+ + L +S++ +GRV GG + + ++ + SF+
Sbjct: 63 LTLSGTL--------TADHVHLHISVADKEGRVFGGHLMDGSDVSYTAEICLLSFI 110
>gi|186472420|ref|YP_001859762.1| hypothetical protein Bphy_3577 [Burkholderia phymatum STM815]
gi|184194752|gb|ACC72716.1| protein of unknown function DUF296 [Burkholderia phymatum STM815]
Length = 135
Score = 40.8 bits (94), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 55/113 (48%), Gaps = 12/113 (10%)
Query: 155 HVITVKAGEDVSSKI-MSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEIL 213
H + ++ G+D+ + + G AV ++ G++S LR A T EIL
Sbjct: 4 HPLRLQPGQDLRDALEHTMHPVGATAVFVVQGIGSLSVARLRFAGVEHPT-ELRADLEIL 62
Query: 214 SLSGSFLLSESSGQRSRTGG-LSVSLSGPDGRVLGGSVAGLLTAATPVQVVVG 265
+L+G+ +R G L +S+SGPDGRV GG VA T V++++
Sbjct: 63 TLAGTV---------ARNGAHLHMSVSGPDGRVFGGHVAHGCIVRTTVEILLA 106
>gi|147818545|emb|CAN65179.1| hypothetical protein VITISV_021779 [Vitis vinifera]
Length = 229
Score = 40.8 bits (94), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 26/43 (60%)
Query: 178 RAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
R + +LS +G V+LRQ G +T GR EI SLSGSFL
Sbjct: 91 RGIGVLSGSGLEMKVSLRQPXPIGAFLTLHGRLEIFSLSGSFL 133
>gi|397498239|ref|XP_003819892.1| PREDICTED: condensin-2 complex subunit D3 [Pan paniscus]
Length = 1498
Score = 40.8 bits (94), Expect = 0.99, Method: Composition-based stats.
Identities = 31/96 (32%), Positives = 44/96 (45%), Gaps = 6/96 (6%)
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRK--ESKSS 278
+S+ R+ G ++VS P+ G + LL A P+ + + L +K ESKS
Sbjct: 1312 VSQPCTPRASAGHVAVSSPTPET----GPLQRLLPKARPMSLSTIAILNSVKKAVESKSR 1367
Query: 279 HRMESLPVPPKLAPGGQPAGQCSPPSRGTLSESSGG 314
HR SL V P G P CS S +L + S G
Sbjct: 1368 HRSRSLGVLPFTLNSGSPEKTCSQVSSYSLEQESNG 1403
>gi|332838236|ref|XP_003313467.1| PREDICTED: LOW QUALITY PROTEIN: condensin-2 complex subunit D3 [Pan
troglodytes]
Length = 1498
Score = 40.8 bits (94), Expect = 0.99, Method: Composition-based stats.
Identities = 31/96 (32%), Positives = 44/96 (45%), Gaps = 6/96 (6%)
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRK--ESKSS 278
+S+ R+ G ++VS P+ G + LL A P+ + + L +K ESKS
Sbjct: 1312 VSQPCTPRASAGHVAVSSPTPET----GPLQRLLPKARPMSLSTIAILNSVKKAVESKSR 1367
Query: 279 HRMESLPVPPKLAPGGQPAGQCSPPSRGTLSESSGG 314
HR SL V P G P CS S +L + S G
Sbjct: 1368 HRSRSLGVLPFTLNSGSPEKTCSQVSSYSLEQESNG 1403
>gi|410212230|gb|JAA03334.1| non-SMC condensin II complex, subunit D3 [Pan troglodytes]
gi|410255978|gb|JAA15956.1| non-SMC condensin II complex, subunit D3 [Pan troglodytes]
gi|410305454|gb|JAA31327.1| non-SMC condensin II complex, subunit D3 [Pan troglodytes]
gi|410336057|gb|JAA36975.1| non-SMC condensin II complex, subunit D3 [Pan troglodytes]
Length = 1498
Score = 40.4 bits (93), Expect = 1.1, Method: Composition-based stats.
Identities = 31/96 (32%), Positives = 44/96 (45%), Gaps = 6/96 (6%)
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRK--ESKSS 278
+S+ R+ G ++VS P+ G + LL A P+ + + L +K ESKS
Sbjct: 1312 VSQPCTPRASAGHVAVSSPTPET----GPLQRLLPKARPMSLSTIAILNSVKKAVESKSR 1367
Query: 279 HRMESLPVPPKLAPGGQPAGQCSPPSRGTLSESSGG 314
HR SL V P G P CS S +L + S G
Sbjct: 1368 HRSRSLGVLPFTLNSGSPEKTCSQVSSYSLEQESNG 1403
>gi|45356151|ref|NP_056076.1| condensin-2 complex subunit D3 [Homo sapiens]
gi|82654946|sp|P42695.2|CNDD3_HUMAN RecName: Full=Condensin-2 complex subunit D3; AltName: Full=Non-SMC
condensin II complex subunit D3; Short=hCAP-D3
gi|68534207|gb|AAH98398.1| Non-SMC condensin II complex, subunit D3 [Homo sapiens]
gi|119588228|gb|EAW67824.1| KIAA0056 protein, isoform CRA_d [Homo sapiens]
Length = 1498
Score = 40.4 bits (93), Expect = 1.1, Method: Composition-based stats.
Identities = 31/96 (32%), Positives = 44/96 (45%), Gaps = 6/96 (6%)
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRK--ESKSS 278
+S+ R+ G ++VS P+ G + LL A P+ + + L +K ESKS
Sbjct: 1312 VSQPCTPRASAGHVAVSSPTPET----GPLQRLLPKARPMSLSTIAILNSVKKAVESKSR 1367
Query: 279 HRMESLPVPPKLAPGGQPAGQCSPPSRGTLSESSGG 314
HR SL V P G P CS S +L + S G
Sbjct: 1368 HRSRSLGVLPFTLNSGSPEKTCSQVSSYSLEQESNG 1403
>gi|423141570|ref|ZP_17129208.1| hypothetical protein SEHO0A_03127 [Salmonella enterica subsp.
houtenae str. ATCC BAA-1581]
gi|379050742|gb|EHY68634.1| hypothetical protein SEHO0A_03127 [Salmonella enterica subsp.
houtenae str. ATCC BAA-1581]
Length = 110
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 49/96 (51%), Gaps = 9/96 (9%)
Query: 172 FSQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRT 231
QN A I G++++V LR A T + G FE++SL+G+ E +G+
Sbjct: 1 MQQNQLHAAWIAGCTGSLTDVALRYAGQEE-TTSLTGTFEVISLNGTL---ELTGEH--- 53
Query: 232 GGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
L +++S P G +LGG + T T +++V+G
Sbjct: 54 --LHLAVSDPYGAMLGGHMMPGCTVRTTLELVIGEL 87
>gi|402845774|ref|ZP_10894107.1| PF03479 domain protein [Klebsiella sp. OBRC7]
gi|402270225|gb|EJU19493.1| PF03479 domain protein [Klebsiella sp. OBRC7]
Length = 136
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 60/115 (52%), Gaps = 12/115 (10%)
Query: 162 GEDVSSKIMSFSQNGP-RAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F Q +A I G++S+ LR A T+ G +E++SL+G+
Sbjct: 16 GDEVFSRLRAFIQEQQIQAAWIAGCTGSLSHAALRFAGQDETTLL-TGTYEVISLNGTL- 73
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E G+ L +++S P G +LGG + T T +++V+G LA R+
Sbjct: 74 --EWQGEH-----LHLAISDPQGAMLGGHMMPGCTVRTTLELVIGELTSLAFSRQ 121
>gi|56751709|ref|YP_172410.1| hypothetical protein syc1700_d [Synechococcus elongatus PCC 6301]
gi|56686668|dbj|BAD79890.1| hypothetical protein [Synechococcus elongatus PCC 6301]
Length = 84
Score = 40.4 bits (93), Expect = 1.3, Method: Composition-based stats.
Identities = 26/69 (37%), Positives = 42/69 (60%), Gaps = 6/69 (8%)
Query: 156 VITVKAGEDVSSKIMSFSQ-NGPRAVCILSANGAISNVTLRQAATSGGTVTYEGR--FEI 212
V+ ++ G+D+ + ++Q + P A C+LSA G++ V LR A GG ++ + EI
Sbjct: 5 VLRLRPGQDLKQALWDWTQEHQPSAACLLSAVGSLDAVCLRLA---GGDRQFQRQEPHEI 61
Query: 213 LSLSGSFLL 221
LSLSG+F L
Sbjct: 62 LSLSGTFCL 70
>gi|291326196|ref|ZP_06123547.2| UDP-N-acetylglucosamine diphosphorylase [Providencia rettgeri DSM
1131]
gi|291315351|gb|EFE55804.1| UDP-N-acetylglucosamine diphosphorylase [Providencia rettgeri DSM
1131]
Length = 147
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 53/95 (55%), Gaps = 9/95 (9%)
Query: 173 SQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTG 232
Q+ +A I S+ G++++V LR A T G+FEI+SL G+ +++G+
Sbjct: 39 EQHSLQAAFIASSVGSLTHVALRFAGQEN-TFHTTGKFEIVSLIGTL---DANGEH---- 90
Query: 233 GLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
L +S+S +G+VLGG + T T +++++G
Sbjct: 91 -LHLSVSDEEGKVLGGHMMPGCTVRTTLELIIGEL 124
>gi|319948493|ref|ZP_08022627.1| phosphatidylinositol alpha-mannosyltransferase [Dietzia cinnamea
P4]
gi|319437860|gb|EFV92846.1| phosphatidylinositol alpha-mannosyltransferase [Dietzia cinnamea
P4]
Length = 371
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 44/154 (28%), Positives = 63/154 (40%), Gaps = 36/154 (23%)
Query: 87 GTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGRPPGSGSGKKHQLEALG 146
G +LA+ P + T T TGS + G L+P ++K RGR S ++ Q+EALG
Sbjct: 102 GMFALAMCSGPITATFHTSTTGSMILDAADGVLAP-LLEKIRGRIAVSTLARRWQMEALG 160
Query: 147 SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVTY 206
S V + G DV GA S+V T TVT+
Sbjct: 161 SDAV-------EIPNGVDV---------------------GAFSDVDTSGEPTDVPTVTF 192
Query: 207 EGRFE-------ILSLSGSFLLSESSGQRSRTGG 233
GR++ +L+ + +L E G R R G
Sbjct: 193 LGRYDEPRKGMGVLAAALPAVLREVPGLRLRIMG 226
>gi|413918077|gb|AFW58009.1| hypothetical protein ZEAMMB73_047292 [Zea mays]
Length = 293
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 25/37 (67%)
Query: 182 ILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGS 218
++ +NG IS TLRQ+ATSGG VTYE + + S G
Sbjct: 211 VVRSNGTISKATLRQSATSGGMVTYEVQIVVGSFDGD 247
>gi|397659929|ref|YP_006500631.1| regulator of ECF transporter operon [Klebsiella oxytoca E718]
gi|394348029|gb|AFN34150.1| putative regulator of ECF transporter operon [Klebsiella oxytoca
E718]
Length = 136
Score = 40.0 bits (92), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 59/115 (51%), Gaps = 12/115 (10%)
Query: 162 GEDVSSKIMSFSQNGP-RAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFL 220
G++V S++ +F Q +A I G++S+ LR A T G +E++SL+G+
Sbjct: 16 GDEVFSRLRAFIQEQQIQAAWIAGCTGSLSHAALRFAGQDETTFL-TGTYEVISLNGTL- 73
Query: 221 LSESSGQRSRTGGLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF--LADGRK 273
E G+ L +++S P G +LGG + T T +++V+G LA R+
Sbjct: 74 --EWQGEH-----LHLAISDPQGAMLGGHMMPGCTVRTTLELVIGELTSLAFSRQ 121
>gi|260429912|ref|ZP_05783887.1| bifunctional protein GlmU [Citreicella sp. SE45]
gi|260418835|gb|EEX12090.1| bifunctional protein GlmU [Citreicella sp. SE45]
Length = 167
Score = 40.0 bits (92), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 38/126 (30%), Positives = 63/126 (50%), Gaps = 12/126 (9%)
Query: 155 HV-ITVKAGEDVSSKIMSFSQNGPRAVC-ILSANGAISNVTLRQAATSGGTVTYEGRFEI 212
HV + +K GE+V + F + A C I+S+ G++++ +R A T EG FEI
Sbjct: 15 HVALRLKPGEEVLGTLQQFVVDHGIAACAIVSSVGSLTHAAIRYA-NQNDTTRLEGHFEI 73
Query: 213 LSLSGSFLLSE-----SSGQRSRTGGLSVSLSGPD--GRVLGGSVAGLLTAATPVQVVVG 265
SL G+ + + + S +GG V LS D GR++GG + T +++V+
Sbjct: 74 CSLIGTLECAAPGEALNGAEASASGGAHVHLSISDGAGRMIGGHMMRGCRVYTTLEIVL- 132
Query: 266 SFLADG 271
+ DG
Sbjct: 133 -LVLDG 137
>gi|422008882|ref|ZP_16355866.1| hypothetical protein OOC_12446 [Providencia rettgeri Dmel1]
gi|414095355|gb|EKT57018.1| hypothetical protein OOC_12446 [Providencia rettgeri Dmel1]
Length = 141
Score = 39.7 bits (91), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 52/95 (54%), Gaps = 9/95 (9%)
Query: 173 SQNGPRAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTG 232
Q+G +A I S+ G++++V LR A T G+FEI+SL G+ ++ G+
Sbjct: 33 EQHGLQAAFIASSVGSLTDVALRFAGQEE-TFHTTGKFEIVSLIGTL---DAKGEH---- 84
Query: 233 GLSVSLSGPDGRVLGGSVAGLLTAATPVQVVVGSF 267
L +++S G+VLGG + T T +++++G
Sbjct: 85 -LHLAVSDEQGQVLGGHMMPGCTVRTTLELIIGEL 118
>gi|30687824|ref|NP_173650.3| methyl-CPG-binding domain 8 [Arabidopsis thaliana]
gi|75174757|sp|Q9LME6.1|MBD8_ARATH RecName: Full=Methyl-CpG-binding domain-containing protein 8;
Short=AtMBD8; Short=MBD08; AltName:
Full=Methyl-CpG-binding protein MBD8
gi|9392683|gb|AAF87260.1|AC068562_7 Contains a Methyl-CpG binding domain PF|01429 and two DNA binding
domains with preference for A/T rich regions PF|02178.
ESTs gb|AI998776, gb|N95984 come from this gene
[Arabidopsis thaliana]
gi|26452716|dbj|BAC43440.1| unknown protein [Arabidopsis thaliana]
gi|332192108|gb|AEE30229.1| methyl-CPG-binding domain 8 [Arabidopsis thaliana]
Length = 524
Score = 39.7 bits (91), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 33/98 (33%), Positives = 45/98 (45%), Gaps = 7/98 (7%)
Query: 62 NVMNMGSGSEPMKRKRGRPRKY-GPDGTMSLALVPSPSSVTTATGGTGSGL---SSPG-- 115
NV+ G+ +KRKRGRPRK P + + +S T S L S G
Sbjct: 169 NVLIQGTSGNKIKRKRGRPRKIRNPSEENEVLDLTGEASTYVFVDKTSSNLGMVSRVGSS 228
Query: 116 GGPLSPDSIKKSRGRPPGSGSGKKHQLEALGSAGVGFT 153
G L +S+K+ RGRPP + + LE SA V +
Sbjct: 229 GISLDSNSVKRKRGRPPKNKE-EIMNLEKRDSAIVNIS 265
>gi|191171879|ref|ZP_03033425.1| conserved hypothetical protein [Escherichia coli F11]
gi|300995642|ref|ZP_07181170.1| hypothetical protein HMPREF9553_04638 [Escherichia coli MS 200-1]
gi|422376940|ref|ZP_16457186.1| hypothetical protein HMPREF9533_04220 [Escherichia coli MS 60-1]
gi|432714643|ref|ZP_19949673.1| hypothetical protein WCI_03021 [Escherichia coli KTE8]
gi|190907914|gb|EDV67507.1| conserved hypothetical protein [Escherichia coli F11]
gi|300304750|gb|EFJ59270.1| hypothetical protein HMPREF9553_04638 [Escherichia coli MS 200-1]
gi|324011725|gb|EGB80944.1| hypothetical protein HMPREF9533_04220 [Escherichia coli MS 60-1]
gi|431254449|gb|ELF47719.1| hypothetical protein WCI_03021 [Escherichia coli KTE8]
Length = 143
Score = 39.3 bits (90), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 54/107 (50%), Gaps = 12/107 (11%)
Query: 185 ANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGPDGR 244
G++++V LR A T G+FE+++L+G+ E SG+ L + +S P G
Sbjct: 47 CTGSLTDVALRYAGQEN-TALLSGKFEVIALNGTL---EQSGEH-----LHLCVSDPHGT 97
Query: 245 VLGGSVAGLLTAATPVQVVVGSF--LADGRKESKSSHRMESLPVPPK 289
+LGG + T T +++V+G LA R+ S + L + P+
Sbjct: 98 MLGGHMMPGCTVRTTLELVIGCLEELAFSRQSCALS-GYDELHISPR 143
>gi|392966472|ref|ZP_10331891.1| protein of unknown function DUF296 [Fibrisoma limi BUZ 3]
gi|387845536|emb|CCH53937.1| protein of unknown function DUF296 [Fibrisoma limi BUZ 3]
Length = 166
Score = 38.9 bits (89), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 46/84 (54%), Gaps = 9/84 (10%)
Query: 182 ILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESSGQRSRTGGLSVSLSGP 241
+++ G++++VTLR A G +V + G FEI+SL G+ LS + L +S+S
Sbjct: 64 VVTCVGSLTDVTLRLANQEGSSV-WHGHFEIVSLVGT--LSTNGSH------LHLSVSDS 114
Query: 242 DGRVLGGSVAGLLTAATPVQVVVG 265
GR LGG + T ++V+G
Sbjct: 115 TGRTLGGHLLDGCRIYTTAELVIG 138
>gi|357459563|ref|XP_003600062.1| hypothetical protein MTR_3g051280 [Medicago truncatula]
gi|355489110|gb|AES70313.1| hypothetical protein MTR_3g051280 [Medicago truncatula]
Length = 429
Score = 38.1 bits (87), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 26/46 (56%)
Query: 57 QAQGLNVMNMGSGSEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTT 102
+ Q N GSG P+KRKRGRPRKY ++ +P+PS T
Sbjct: 2 EQQNQNNTPDGSGDVPLKRKRGRPRKYPRPDSVESPYMPTPSKKQT 47
>gi|441166046|ref|ZP_20968661.1| carbohydrate kinase, partial [Streptomyces rimosus subsp. rimosus
ATCC 10970]
gi|440615966|gb|ELQ79127.1| carbohydrate kinase, partial [Streptomyces rimosus subsp. rimosus
ATCC 10970]
Length = 343
Score = 37.7 bits (86), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 45/160 (28%), Positives = 68/160 (42%), Gaps = 18/160 (11%)
Query: 53 GAIPQA-QGLNVMNMGSGSEPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGG-TGSG 110
GA+P A Q L + + +EP R P + L+ P+ TT GG +G+G
Sbjct: 102 GAVPHAAQPLTKLRWLARNEPANAAR-VAEVLQPHDWLVWQLLGRPARRTTDRGGASGTG 160
Query: 111 LSSPGGGPLSPDSIKKSRGRPPGSGSGKKHQL---EALGSAG-VGFTPHVITVKAGE-DV 165
S G PD ++ + G HQ+ E LG +G GFTP + + AG +
Sbjct: 161 YWSAAAGAYRPDLVELALG----------HQVRLPEVLGPSGTAGFTPEGLLISAGTGET 210
Query: 166 SSKIMSFSQNGPRAVCILSANGAISNVTLRQAATSGGTVT 205
+ AV L A+G++ + A S GT+T
Sbjct: 211 MAAAFGLGVGAGDAVVSLGASGSVFGIHHEALADSNGTIT 250
>gi|9502163|gb|AAF88014.1| contains similarity to Antirrhinum majus SAP1 protein (GB:AJ132349)
[Arabidopsis thaliana]
Length = 369
Score = 37.7 bits (86), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 21/42 (50%), Positives = 30/42 (71%)
Query: 236 VSLSGPDGRVLGGSVAGLLTAATPVQVVVGSFLADGRKESKS 277
V LS DG++ GG V GLL AA PVQVV+G+F + +K+ ++
Sbjct: 41 VCLSNSDGQIFGGGVGGLLKAAGPVQVVLGTFQLEKKKDGRN 82
>gi|409203748|ref|ZP_11231951.1| hypothetical protein PflaJ_20598 [Pseudoalteromonas flavipulchra
JG1]
Length = 123
Score = 37.7 bits (86), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 50/107 (46%), Gaps = 5/107 (4%)
Query: 71 EPMKRKRGRPRKYGPDGTMSLALVPSPSSVTTATGGTGSGLSSPGGGPLSPDSIKKSRGR 130
+K +G G +G SL++ +++ T++ G S PG G + I K GR
Sbjct: 5 RSIKIAKGVRVNIGKNGITSLSVGGKGATINTSSKGIRLTSSIPGTGISHSEVIYKHDGR 64
Query: 131 PPG--SGSGKKHQLEALGSAGVGFTPHV---ITVKAGEDVSSKIMSF 172
S SG ++ + G+ F PHV T++ G VSS+ +SF
Sbjct: 65 NDSFRSTSGTSREVSLMLGIGIFFLPHVFAWFTLRKGHSVSSRFISF 111
>gi|297742132|emb|CBI33919.3| unnamed protein product [Vitis vinifera]
Length = 298
Score = 37.4 bits (85), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 19/65 (29%), Positives = 34/65 (52%)
Query: 125 KKSRGRPPGSGSGKKHQLEALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILS 184
++ RGRPPGS + K + + H++ V G DV + ++++ R +CILS
Sbjct: 130 RRPRGRPPGSKNRPKPPVIITRESANTLRAHILEVGNGCDVFDCVATYARRRQRGICILS 189
Query: 185 ANGAI 189
+G+
Sbjct: 190 LSGSF 194
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.308 0.128 0.368
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,201,699,688
Number of Sequences: 23463169
Number of extensions: 313953638
Number of successful extensions: 1073006
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 627
Number of HSP's successfully gapped in prelim test: 2135
Number of HSP's that attempted gapping in prelim test: 1056415
Number of HSP's gapped (non-prelim): 14499
length of query: 342
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 199
effective length of database: 9,003,962,200
effective search space: 1791788477800
effective search space used: 1791788477800
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 77 (34.3 bits)