BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 026010
(245 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|359490779|ref|XP_002269798.2| PREDICTED: uncharacterized protein LOC100252112 [Vitis vinifera]
gi|302144012|emb|CBI23117.3| unnamed protein product [Vitis vinifera]
Length = 397
Score = 263 bits (672), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 143/232 (61%), Positives = 176/232 (75%), Gaps = 18/232 (7%)
Query: 2 DEDAALTGREEKTSVKKVSKEKSPKKGLKSPKKEKDVINNVKRKGNND-----HEDA--- 53
++DA L+ +++ + + SKEKSPKK +K +D I++ K+K N+D H+D
Sbjct: 144 EKDAVLSENGQESLLNEASKEKSPKKRVKV----EDHISSKKKKANSDMKGKGHDDGTEL 199
Query: 54 -EEDIAEKRSMLNVSTSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAE 112
EE+ ++K + VS S LPLV+SEK+ R+KALVECEGES+DLSGDMGAVGRI++ T
Sbjct: 200 LEEEASDKHTEPQVS-SGLPLVLSEKVHRSKALVECEGESIDLSGDMGAVGRIVILDTPS 258
Query: 113 GNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVE 172
NHEMFLDLKGTIYKTT+VPSRTFCIVSFG SEAK+EA+MNDFIQLKPQSNVYEAETMVE
Sbjct: 259 RNHEMFLDLKGTIYKTTIVPSRTFCIVSFGQSEAKVEAVMNDFIQLKPQSNVYEAETMVE 318
Query: 173 GTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEEQTNEKTKVK----SGPCRK 220
G L+GF FDS+DE+DK+ KA Q DQN+G EEQTN KTK K SG RK
Sbjct: 319 GILEGFSFDSDDEADKMPKASTRQTDQNEGDEEQTNGKTKGKAEKASGVVRK 370
>gi|224121130|ref|XP_002318505.1| predicted protein [Populus trichocarpa]
gi|222859178|gb|EEE96725.1| predicted protein [Populus trichocarpa]
Length = 187
Score = 243 bits (620), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 118/159 (74%), Positives = 135/159 (84%), Gaps = 5/159 (3%)
Query: 68 TSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYK 127
TS LPLV+SEK+QR+KALVECEGES+DLSGDMGAVGR+++P T GN EM+LDLKGTIY+
Sbjct: 1 TSRLPLVLSEKVQRSKALVECEGESIDLSGDMGAVGRVVIPDTPSGNSEMYLDLKGTIYR 60
Query: 128 TTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVE-GTLDGFLFDSEDES 186
TT+VPSRTFC+VSFG SEAKIEAIMNDFIQLK QSNVYEAETMVE GTL+GF FDSEDE+
Sbjct: 61 TTIVPSRTFCVVSFGQSEAKIEAIMNDFIQLKTQSNVYEAETMVEVGTLEGFSFDSEDET 120
Query: 187 DKLAKAMPHQIDQNDGIEEQTNEKTKVK----SGPCRKE 221
DK+ KA Q DQN+G+EE N KTK K SG RK+
Sbjct: 121 DKITKATALQTDQNEGVEEPANGKTKRKPVKSSGVARKK 159
>gi|356507532|ref|XP_003522518.1| PREDICTED: uncharacterized protein LOC100795660 [Glycine max]
Length = 380
Score = 224 bits (571), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 115/194 (59%), Positives = 147/194 (75%), Gaps = 10/194 (5%)
Query: 31 SPKKEKDVINNVKRKGNNDHEDAEEDIAEKRSMLNV---STSTLPLVMSEKLQRTKALVE 87
+P K K++ + K KG D + EE+ EK + NV S++ LPL++SEK+QRTKAL+E
Sbjct: 151 TPVKGKEIKASAKGKGIGDLKVEEEETCEKPAEPNVRMVSSARLPLMLSEKVQRTKALIE 210
Query: 88 CEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLK------GTIYKTTLVPSRTFCIVSF 141
C+G+S+DLSGDMGAVGRI++ + G+ EM LDLK GTIYKT++VP RTFC+VSF
Sbjct: 211 CQGDSIDLSGDMGAVGRIIISDSPSGDQEMCLDLKDFAVISGTIYKTSIVPCRTFCVVSF 270
Query: 142 GHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQND 201
G SEAK+EAIMNDFIQLKP SNVYEAETMVEGTLDGF FDS++ES K+ K+ +Q DQN+
Sbjct: 271 GQSEAKVEAIMNDFIQLKPHSNVYEAETMVEGTLDGFFFDSDEESGKIQKST-NQTDQNE 329
Query: 202 GIEEQTNEKTKVKS 215
EEQ N +K K+
Sbjct: 330 NAEEQANGNSKGKA 343
>gi|357464307|ref|XP_003602435.1| DNA-binding protein BIN4 [Medicago truncatula]
gi|355491483|gb|AES72686.1| DNA-binding protein BIN4 [Medicago truncatula]
Length = 416
Score = 224 bits (570), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 111/183 (60%), Positives = 140/183 (76%), Gaps = 1/183 (0%)
Query: 33 KKEKDVINNVKRKGNNDHEDAEEDIAEKRSMLNVSTSTLPLVMSEKLQRTKALVECEGES 92
K +K +N + G+ D E +E+ EK NVS+S LPL++SEK+ RTKALVEC+G+S
Sbjct: 198 KGKKTKVNAKGKGGDGDVEVKDEETFEKPVEPNVSSSILPLMLSEKVHRTKALVECQGDS 257
Query: 93 VDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIM 152
+DLSGDMGAVGR+++ + G+ EM LDLKGTIYKT+++PSRTFC+VSFG +EAKIEAIM
Sbjct: 258 IDLSGDMGAVGRVIISDSPSGDQEMCLDLKGTIYKTSIIPSRTFCVVSFGQTEAKIEAIM 317
Query: 153 NDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEEQTNEKTK 212
NDFIQL P S+VYEAETMVEGTLDGF FDS++E K K H DQN+ EEQT K+K
Sbjct: 318 NDFIQLNPPSDVYEAETMVEGTLDGFSFDSDEEDGKRQKTT-HPSDQNEHAEEQTPGKSK 376
Query: 213 VKS 215
K+
Sbjct: 377 RKA 379
>gi|218190056|gb|EEC72483.1| hypothetical protein OsI_05847 [Oryza sativa Indica Group]
Length = 342
Score = 211 bits (537), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 113/227 (49%), Positives = 151/227 (66%), Gaps = 12/227 (5%)
Query: 18 KVSKEKSPKKGLKSPKKEKDVINNVKRKGNNDHED----------AEEDIAEKRSMLNVS 67
K K +PKK L P + K D D AEED+ +K + +VS
Sbjct: 100 KPPKRSTPKKKLVKPPSGSNASKVTGPKAGPDQIDDTLEHQEEGVAEEDMQDKLTEHSVS 159
Query: 68 TSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYK 127
LPL++ +K+QR+KAL+EC+G+S+DLSGD+GAVGRI++ + GN E+ LDLKGTIYK
Sbjct: 160 -QRLPLIIPDKIQRSKALIECDGDSIDLSGDVGAVGRIIISNSPNGNQELLLDLKGTIYK 218
Query: 128 TTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESD 187
+T+VPSRTFC+VS G +EAKIE+IM+DFIQL+PQSN++EAETM+EGTLDGF FDS++E D
Sbjct: 219 STIVPSRTFCVVSVGQTEAKIESIMDDFIQLEPQSNLFEAETMMEGTLDGFTFDSDEEGD 278
Query: 188 KLAKAMPHQIDQNDGIEEQTNEKTKVKS-GPCRKERYECWRTATATK 233
KL + Q DQN+ +Q KTK K+ P K + + ATK
Sbjct: 279 KLPEPHASQNDQNNEDGDQPKAKTKRKAEKPAGKGQKKAKVAGKATK 325
>gi|45736025|dbj|BAD13052.1| unknown protein [Oryza sativa Japonica Group]
gi|45736067|dbj|BAD13093.1| unknown protein [Oryza sativa Japonica Group]
Length = 320
Score = 209 bits (532), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 102/182 (56%), Positives = 139/182 (76%), Gaps = 2/182 (1%)
Query: 53 AEEDIAEKRSMLNVSTSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAE 112
AEE++ +K + +VS LPL++ +K+QR+KAL+EC+G+S+DLSGD+GAVGRI++ +
Sbjct: 123 AEEEMQDKLTEHSVS-QRLPLIIPDKIQRSKALIECDGDSIDLSGDVGAVGRIIISNSPN 181
Query: 113 GNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVE 172
GN E+ LDLKGTIYK+T+VPSRTFC+VS G +EAKIE+IM+DFIQL+PQSN++EAETM+E
Sbjct: 182 GNQELLLDLKGTIYKSTIVPSRTFCVVSVGQTEAKIESIMDDFIQLEPQSNLFEAETMME 241
Query: 173 GTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEEQTNEKTKVKS-GPCRKERYECWRTATA 231
GTLDGF FDS++E DKL + Q DQN+ +Q KTK K+ P K + + A
Sbjct: 242 GTLDGFTFDSDEEGDKLPEPHASQNDQNNEDGDQPKAKTKRKAEKPAGKGQKKAKVAGKA 301
Query: 232 TK 233
TK
Sbjct: 302 TK 303
>gi|115444213|ref|NP_001045886.1| Os02g0147700 [Oryza sativa Japonica Group]
gi|113535417|dbj|BAF07800.1| Os02g0147700 [Oryza sativa Japonica Group]
gi|215765268|dbj|BAG86965.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222622166|gb|EEE56298.1| hypothetical protein OsJ_05371 [Oryza sativa Japonica Group]
Length = 342
Score = 209 bits (531), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 112/227 (49%), Positives = 151/227 (66%), Gaps = 12/227 (5%)
Query: 18 KVSKEKSPKKGLKSPKKEKDVINNVKRKGNNDHED----------AEEDIAEKRSMLNVS 67
K K +PKK L P + K D D AEE++ +K + +VS
Sbjct: 100 KPPKRSTPKKKLVKPPSGSNASKVTGPKAGPDQIDDTLEHQEEGVAEEEMQDKLTEHSVS 159
Query: 68 TSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYK 127
LPL++ +K+QR+KAL+EC+G+S+DLSGD+GAVGRI++ + GN E+ LDLKGTIYK
Sbjct: 160 -QRLPLIIPDKIQRSKALIECDGDSIDLSGDVGAVGRIIISNSPNGNQELLLDLKGTIYK 218
Query: 128 TTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESD 187
+T+VPSRTFC+VS G +EAKIE+IM+DFIQL+PQSN++EAETM+EGTLDGF FDS++E D
Sbjct: 219 STIVPSRTFCVVSVGQTEAKIESIMDDFIQLEPQSNLFEAETMMEGTLDGFTFDSDEEGD 278
Query: 188 KLAKAMPHQIDQNDGIEEQTNEKTKVKS-GPCRKERYECWRTATATK 233
KL + Q DQN+ +Q KTK K+ P K + + ATK
Sbjct: 279 KLPEPHASQNDQNNEDGDQPKAKTKRKAEKPAGKGQKKAKVAGKATK 325
>gi|218189675|gb|EEC72102.1| hypothetical protein OsI_05068 [Oryza sativa Indica Group]
Length = 473
Score = 209 bits (531), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 110/220 (50%), Positives = 154/220 (70%), Gaps = 14/220 (6%)
Query: 9 GREEKTSVK----KVSKEKSPKKGLKSPKKEKDVINNVKRKGNNDHED---------AEE 55
G++EK +++ K +K +PKK +K P + G + +D AE
Sbjct: 219 GKDEKPTMESNQDKPAKRSTPKKLVKFPSSSNASKVTGPKAGPDQIDDTLEHREEGVAEV 278
Query: 56 DIAEKRSMLNVSTSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNH 115
D+ +K + +VS LPL++ +K+QR+KAL+EC+G+S+DLSGD+GAVGRI++ + GN
Sbjct: 279 DMQDKLTEHSVS-QRLPLIIPDKVQRSKALIECDGDSIDLSGDVGAVGRIVISNSPNGNQ 337
Query: 116 EMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTL 175
++ LDLKGTIYK+T+VPSRTFC+VS G +EAKIE+IMNDFIQL+PQSN++EAETMVEGTL
Sbjct: 338 DLLLDLKGTIYKSTIVPSRTFCVVSVGQTEAKIESIMNDFIQLEPQSNLFEAETMVEGTL 397
Query: 176 DGFLFDSEDESDKLAKAMPHQIDQNDGIEEQTNEKTKVKS 215
DGF FDS++E DKL + Q DQN+ +Q KTK K+
Sbjct: 398 DGFTFDSDEEGDKLPEPHASQNDQNNEDGDQPKAKTKRKA 437
>gi|297812657|ref|XP_002874212.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297320049|gb|EFH50471.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 492
Score = 209 bits (531), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 126/265 (47%), Positives = 162/265 (61%), Gaps = 61/265 (23%)
Query: 2 DEDAALTGREEKTSVKKVSKEKSPK----KGLKSPKK----------EKDVIN------- 40
DED L EE+ +VKKV KEKSPK G K+PK+ E VIN
Sbjct: 186 DEDFVLDATEEEPAVKKVGKEKSPKTKSKSGRKTPKEGNGAQKIFKSEDKVINTTIAEEV 245
Query: 41 ---------------------NVKRKGNNDHEDAEEDIAEKRSM------LNVSTSTLPL 73
+ ++ ++ +D + IAE+ +M + S+S LPL
Sbjct: 246 TMEKSPETKSKSSRKTPKEENSAQKIFKSEDKDTDTTIAEEGTMDQKIKPSSGSSSRLPL 305
Query: 74 VMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKG---------- 123
V+SEK+ RTK LVECEG+S+DLSGDMGAVGR++V T +M+LDLKG
Sbjct: 306 VLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDTT---GDMYLDLKGINTFSFHVSG 362
Query: 124 TIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSE 183
TIYK+T++PSRTFC+V+ G +EAKIEAIMNDFIQL PQSNVYEAETMVEGTL+GF F+S+
Sbjct: 363 TIYKSTIIPSRTFCVVNVGQTEAKIEAIMNDFIQLTPQSNVYEAETMVEGTLEGFSFESD 422
Query: 184 DESDKLAKAMPHQIDQNDGIEEQTN 208
DE++K AK DQ+ G EE+TN
Sbjct: 423 DETNKNAKTASKPADQSGGTEEETN 447
>gi|195625200|gb|ACG34430.1| hypothetical protein [Zea mays]
gi|413935593|gb|AFW70144.1| hypothetical protein ZEAMMB73_185491 [Zea mays]
Length = 364
Score = 208 bits (530), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 101/165 (61%), Positives = 130/165 (78%), Gaps = 2/165 (1%)
Query: 52 DAEEDIAEKRSMLNVSTSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTA 111
DAE+D+ +K + +VS LPL+ +K+QR+KAL+EC+G+S+DLSGD+GAVGRI+V
Sbjct: 165 DAEDDVQDKLTGNSVS-QRLPLIFPDKVQRSKALIECDGDSIDLSGDIGAVGRIVVSNGP 223
Query: 112 EGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMV 171
++ LDLKGTIYKTT+VPSRTFC+VS G SEAKIEAIMNDFIQL+PQSN++EAETMV
Sbjct: 224 TAKQDLLLDLKGTIYKTTIVPSRTFCVVSVGQSEAKIEAIMNDFIQLEPQSNLFEAETMV 283
Query: 172 EGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEE-QTNEKTKVKS 215
EGTLDGF FDS++E DKL + +Q D N+ +E Q KTK K+
Sbjct: 284 EGTLDGFTFDSDEEGDKLYEPQANQNDLNNNEDEGQPKAKTKRKA 328
>gi|238481371|ref|NP_001154738.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
gi|332005958|gb|AED93341.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
Length = 446
Score = 207 bits (527), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 98/142 (69%), Positives = 119/142 (83%), Gaps = 3/142 (2%)
Query: 67 STSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIY 126
S+S LPLV+SEK+ RTK LVECEG+S+DLSGDMGAVGR++V T +M+LDLKGTIY
Sbjct: 263 SSSRLPLVLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDTT---GDMYLDLKGTIY 319
Query: 127 KTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDES 186
K+T++PSRTFC+V+ G +EAKIEAIMNDFIQL PQSNVYEAETMVEGTL+GF F+S+DES
Sbjct: 320 KSTIIPSRTFCVVNVGQTEAKIEAIMNDFIQLIPQSNVYEAETMVEGTLEGFTFESDDES 379
Query: 187 DKLAKAMPHQIDQNDGIEEQTN 208
+K AK DQ+ G EE+TN
Sbjct: 380 NKNAKTAVKPADQSVGTEEETN 401
>gi|240256338|ref|NP_197851.5| DNA-binding protein BIN4 [Arabidopsis thaliana]
gi|332005954|gb|AED93337.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
Length = 451
Score = 207 bits (527), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 98/142 (69%), Positives = 119/142 (83%), Gaps = 3/142 (2%)
Query: 67 STSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIY 126
S+S LPLV+SEK+ RTK LVECEG+S+DLSGDMGAVGR++V T +M+LDLKGTIY
Sbjct: 268 SSSRLPLVLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDTT---GDMYLDLKGTIY 324
Query: 127 KTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDES 186
K+T++PSRTFC+V+ G +EAKIEAIMNDFIQL PQSNVYEAETMVEGTL+GF F+S+DES
Sbjct: 325 KSTIIPSRTFCVVNVGQTEAKIEAIMNDFIQLIPQSNVYEAETMVEGTLEGFTFESDDES 384
Query: 187 DKLAKAMPHQIDQNDGIEEQTN 208
+K AK DQ+ G EE+TN
Sbjct: 385 NKNAKTAVKPADQSVGTEEETN 406
>gi|238481367|ref|NP_001154736.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
gi|238481369|ref|NP_001154737.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
gi|332005956|gb|AED93339.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
gi|332005957|gb|AED93340.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
Length = 450
Score = 207 bits (527), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 98/142 (69%), Positives = 119/142 (83%), Gaps = 3/142 (2%)
Query: 67 STSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIY 126
S+S LPLV+SEK+ RTK LVECEG+S+DLSGDMGAVGR++V T +M+LDLKGTIY
Sbjct: 267 SSSRLPLVLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDTT---GDMYLDLKGTIY 323
Query: 127 KTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDES 186
K+T++PSRTFC+V+ G +EAKIEAIMNDFIQL PQSNVYEAETMVEGTL+GF F+S+DES
Sbjct: 324 KSTIIPSRTFCVVNVGQTEAKIEAIMNDFIQLIPQSNVYEAETMVEGTLEGFTFESDDES 383
Query: 187 DKLAKAMPHQIDQNDGIEEQTN 208
+K AK DQ+ G EE+TN
Sbjct: 384 NKNAKTAVKPADQSVGTEEETN 405
>gi|238481373|ref|NP_001154739.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
gi|257050999|sp|Q9FLU1.2|BIN4_ARATH RecName: Full=DNA-binding protein BIN4; AltName: Full=Protein
BRASSINOSTEROID INSENSITIVE 4; AltName: Full=Protein
MIDGET
gi|332005959|gb|AED93342.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
Length = 454
Score = 207 bits (527), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 98/142 (69%), Positives = 119/142 (83%), Gaps = 3/142 (2%)
Query: 67 STSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIY 126
S+S LPLV+SEK+ RTK LVECEG+S+DLSGDMGAVGR++V T +M+LDLKGTIY
Sbjct: 271 SSSRLPLVLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDT---TGDMYLDLKGTIY 327
Query: 127 KTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDES 186
K+T++PSRTFC+V+ G +EAKIEAIMNDFIQL PQSNVYEAETMVEGTL+GF F+S+DES
Sbjct: 328 KSTIIPSRTFCVVNVGQTEAKIEAIMNDFIQLIPQSNVYEAETMVEGTLEGFTFESDDES 387
Query: 187 DKLAKAMPHQIDQNDGIEEQTN 208
+K AK DQ+ G EE+TN
Sbjct: 388 NKNAKTAVKPADQSVGTEEETN 409
>gi|449459184|ref|XP_004147326.1| PREDICTED: uncharacterized protein LOC101208657 [Cucumis sativus]
Length = 391
Score = 207 bits (527), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 120/240 (50%), Positives = 162/240 (67%), Gaps = 26/240 (10%)
Query: 2 DEDAALTGREEKTSVKKVSKEKSPKKGLKS---PKKEKDVINNVKRKGNNDHEDAEE--- 55
DE+A E K+ +KVS E SPKK +KS +++++N+ KG + E +E
Sbjct: 128 DENAGRRFTEGKSKSRKVSNEMSPKKKVKSEVCTSAKENIMNSGTNKGGSTMEGSEGHVR 187
Query: 56 ----DIAEKRSMLN-----VSTSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRIL 106
+I EK ++ + VS+S LPLV+S+K R KALVECEG S+DLSGDMGAVGR++
Sbjct: 188 NGDVEILEKDALDDCIGPPVSSSRLPLVLSDKAHRLKALVECEGTSIDLSGDMGAVGRVV 247
Query: 107 VPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYE 166
V ++ +E+ LDLKGT+Y+ +VPSRTFCIVSFG SEAKIE+IMNDFIQLK S V E
Sbjct: 248 VSDSSSAKNELCLDLKGTLYRAVIVPSRTFCIVSFGQSEAKIESIMNDFIQLKALSKVDE 307
Query: 167 AETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIE----------EQTNEKTKVKSG 216
AETMVEGTLDGF FDSED+++K+ KA DQN+ +E E+++ + +VK+G
Sbjct: 308 AETMVEGTLDGFSFDSEDDAEKITKAAS-PADQNEPVEGLNTKSKNKAEKSSGRKRVKTG 366
>gi|238481365|ref|NP_001154735.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
gi|332005955|gb|AED93338.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
Length = 432
Score = 207 bits (526), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 98/142 (69%), Positives = 119/142 (83%), Gaps = 3/142 (2%)
Query: 67 STSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIY 126
S+S LPLV+SEK+ RTK LVECEG+S+DLSGDMGAVGR++V T +M+LDLKGTIY
Sbjct: 249 SSSRLPLVLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDT---TGDMYLDLKGTIY 305
Query: 127 KTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDES 186
K+T++PSRTFC+V+ G +EAKIEAIMNDFIQL PQSNVYEAETMVEGTL+GF F+S+DES
Sbjct: 306 KSTIIPSRTFCVVNVGQTEAKIEAIMNDFIQLIPQSNVYEAETMVEGTLEGFTFESDDES 365
Query: 187 DKLAKAMPHQIDQNDGIEEQTN 208
+K AK DQ+ G EE+TN
Sbjct: 366 NKNAKTAVKPADQSVGTEEETN 387
>gi|357138685|ref|XP_003570920.1| PREDICTED: uncharacterized protein LOC100821932 [Brachypodium
distachyon]
Length = 358
Score = 193 bits (491), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 86/137 (62%), Positives = 115/137 (83%), Gaps = 1/137 (0%)
Query: 53 AEEDIAEKRSMLNVSTSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAE 112
AEE + +K ++S LPL++++K+QR+KALVEC+G+S+DLSGD+GAVGRI++
Sbjct: 161 AEEQMQDKLPGNSIS-QRLPLMITDKVQRSKALVECDGDSIDLSGDIGAVGRIVISNGPT 219
Query: 113 GNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVE 172
GN ++ LDLKGT+YK+T+VPSRTFC+VS G +EAKIEAIMNDFIQL+P SN++EAETM+E
Sbjct: 220 GNQDLLLDLKGTVYKSTIVPSRTFCVVSVGQTEAKIEAIMNDFIQLEPHSNLFEAETMME 279
Query: 173 GTLDGFLFDSEDESDKL 189
GTLDGF FDS+ E D+L
Sbjct: 280 GTLDGFTFDSDGEGDRL 296
>gi|255544304|ref|XP_002513214.1| conserved hypothetical protein [Ricinus communis]
gi|223547712|gb|EEF49205.1| conserved hypothetical protein [Ricinus communis]
Length = 155
Score = 192 bits (489), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 88/117 (75%), Positives = 102/117 (87%)
Query: 99 MGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQL 158
MGAVGR+++P + GNHEM+LDLKGTIY+T +VPSRTFC+VSFG SEAKIEAIMNDFIQL
Sbjct: 1 MGAVGRVVIPDGSSGNHEMYLDLKGTIYRTKIVPSRTFCVVSFGQSEAKIEAIMNDFIQL 60
Query: 159 KPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEEQTNEKTKVKS 215
K QSNVYEAETMVEGTL+GF FDSEDE++K+ K + HQ DQNDG EEQ N KTK K+
Sbjct: 61 KAQSNVYEAETMVEGTLEGFSFDSEDEAEKMPKPVSHQTDQNDGNEEQMNGKTKRKA 117
>gi|413935594|gb|AFW70145.1| hypothetical protein ZEAMMB73_185491 [Zea mays]
Length = 354
Score = 180 bits (456), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 92/165 (55%), Positives = 121/165 (73%), Gaps = 12/165 (7%)
Query: 52 DAEEDIAEKRSMLNVSTSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTA 111
DAE+D+ +K + +VS LPL+ +K+QR+KAL+EC+G+S+DLSGD+GAVGRI+V
Sbjct: 165 DAEDDVQDKLTGNSVS-QRLPLIFPDKVQRSKALIECDGDSIDLSGDIGAVGRIVVSNGP 223
Query: 112 EGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMV 171
++ LDLKGTIYKTT+VPSRTFC+VS G SEAKIEAIMNDFIQL+PQSN++EAETMV
Sbjct: 224 TAKQDLLLDLKGTIYKTTIVPSRTFCVVSVGQSEAKIEAIMNDFIQLEPQSNLFEAETMV 283
Query: 172 EGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEE-QTNEKTKVKS 215
E ++E DKL + +Q D N+ +E Q KTK K+
Sbjct: 284 E----------DEEGDKLYEPQANQNDLNNNEDEGQPKAKTKRKA 318
>gi|297598587|ref|NP_001045881.2| Os02g0147100 [Oryza sativa Japonica Group]
gi|255670601|dbj|BAF07795.2| Os02g0147100, partial [Oryza sativa Japonica Group]
Length = 126
Score = 178 bits (451), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 79/118 (66%), Positives = 102/118 (86%)
Query: 84 ALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGH 143
AL+EC+G+S+DLSGD+GAVGRI++ + GN E+ LDLKGTIYK+T+VPSRTFC+VS G
Sbjct: 1 ALIECDGDSIDLSGDVGAVGRIIISNSPNGNQELLLDLKGTIYKSTIVPSRTFCVVSVGQ 60
Query: 144 SEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQND 201
+EAKIE+IM+DFIQL+PQSN++EAETM+EGTLDGF FDS++E DKL + Q DQN+
Sbjct: 61 TEAKIESIMDDFIQLEPQSNLFEAETMMEGTLDGFTFDSDEEGDKLPEPHASQNDQNN 118
>gi|147811095|emb|CAN76881.1| hypothetical protein VITISV_037245 [Vitis vinifera]
Length = 693
Score = 173 bits (438), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 109/238 (45%), Positives = 143/238 (60%), Gaps = 61/238 (25%)
Query: 2 DEDAALTGREEKTSVKKVSKEKSPKKGLKSPKKEKDVINNVKRKGNND-----HEDA--- 53
++DA L+ +++ + + SKEKSPKK +K +D I++ K+K N+D H+D
Sbjct: 174 EKDAVLSENGQESLLNEASKEKSPKKRVKV----EDHISSKKKKANSDMKGKGHDDGTEL 229
Query: 54 -EEDIAEKRSMLNVSTSTLPLVMSEKLQRTKA---------------LVECEGESVDLSG 97
EE+ ++K + VS+S LPLV+SEK+ R+K L+ +GES+DLSG
Sbjct: 230 LEEEASDKHTEPQVSSSGLPLVLSEKVHRSKGFRFSIEKAASLEIRPLLSVKGESIDLSG 289
Query: 98 DMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQ 157
DMGAVGRI VSFG SEAK+EA+MNDFIQ
Sbjct: 290 DMGAVGRI---------------------------------VSFGQSEAKVEAVMNDFIQ 316
Query: 158 LKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEEQTNEKTKVKS 215
LKPQSNVYEAETMVEG L+GF FDS+DE+DK+ KA Q DQN+G EEQTN KTK K+
Sbjct: 317 LKPQSNVYEAETMVEGILEGFSFDSDDEADKMPKASTRQTDQNEGDEEQTNGKTKGKA 374
>gi|410129774|dbj|BAM64852.1| hypothetical protein [Beta vulgaris]
Length = 467
Score = 168 bits (426), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 99/198 (50%), Positives = 131/198 (66%), Gaps = 25/198 (12%)
Query: 12 EKTSVKKVSKEKSPKKGLKSP--KKEKDVINNVKRKGNNDHEDAEEDI-AEKRSMLNVST 68
E+++ + K KSPKK +K EK K +G ++ D E++ EK+S VS+
Sbjct: 272 EESAPRSAVKRKSPKKRVKQEDDATEKRTNEPAKIEGEVENMDVAEEVKPEKQSEPAVSS 331
Query: 69 STLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKT 128
ST+PLV+ EK+QRTKALVECEGES+DLSGDMGAVGR+++ T EMFLDLK ++
Sbjct: 332 STMPLVLPEKVQRTKALVECEGESIDLSGDMGAVGRVIISETPSKESEMFLDLKVSLE-- 389
Query: 129 TLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESDK 188
+IEAIMNDFIQLKPQSNVY AETMVEGTL+GF F+S+DE+DK
Sbjct: 390 ------------------QIEAIMNDFIQLKPQSNVYNAETMVEGTLEGFTFESDDEADK 431
Query: 189 LAKAMPHQIDQNDGIEEQ 206
+ K + + DQ++G EE+
Sbjct: 432 MTKDV--KGDQSEGAEEK 447
>gi|45736024|dbj|BAD13051.1| unknown protein [Oryza sativa Japonica Group]
gi|45736066|dbj|BAD13092.1| unknown protein [Oryza sativa Japonica Group]
Length = 193
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 75/119 (63%), Positives = 96/119 (80%)
Query: 97 GDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFI 156
GD+GAVGRI++ + GN E+ LDLKGTIYK+T+VPSRTFC+VS G +EAKIE+IM+DFI
Sbjct: 39 GDVGAVGRIIISNSPNGNQELLLDLKGTIYKSTIVPSRTFCVVSVGQTEAKIESIMDDFI 98
Query: 157 QLKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEEQTNEKTKVKS 215
QL+PQSN++EAETM+EGTLDGF FDS++E DKL + Q DQN+ +Q KTK K+
Sbjct: 99 QLEPQSNLFEAETMMEGTLDGFTFDSDEEGDKLPEPHASQNDQNNEDGDQPKAKTKRKA 157
>gi|10177863|dbj|BAB11215.1| unnamed protein product [Arabidopsis thaliana]
Length = 531
Score = 160 bits (404), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 90/159 (56%), Positives = 107/159 (67%), Gaps = 20/159 (12%)
Query: 67 STSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIY 126
S+S LPLV+SEK+ RTK LVECEG+S+DLSGDMGAVGR++V T +M+LDLK
Sbjct: 331 SSSRLPLVLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDT---TGDMYLDLKEHFA 387
Query: 127 KTTLVPSRTF------C-----------IVSFGHSEAKIEAIMNDFIQLKPQSNVYEAET 169
S TF C V+ G +EAKIEAIMNDFIQL PQSNVYEAET
Sbjct: 388 LYVFSSSCTFSDCKGTCGTIIHTSDLVFQVNVGQTEAKIEAIMNDFIQLIPQSNVYEAET 447
Query: 170 MVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEEQTN 208
MVEGTL+GF F+S+DES+K AK DQ+ G EE+TN
Sbjct: 448 MVEGTLEGFTFESDDESNKNAKTAVKPADQSVGTEEETN 486
>gi|168037576|ref|XP_001771279.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162677368|gb|EDQ63839.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 379
Score = 142 bits (358), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 66/125 (52%), Positives = 93/125 (74%), Gaps = 3/125 (2%)
Query: 66 VSTSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTI 125
++ S+LP+V EK+ RTK L+ECEG+++DLSGDMGAVGR V + E+ LDLKG +
Sbjct: 179 IAASSLPIVFGEKVNRTKVLLECEGDALDLSGDMGAVGRFTV---NRPDDELLLDLKGVV 235
Query: 126 YKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDE 185
YKTT+VPS T+ +V+ G EAK+E+IM DF+QL+ ++ E ETMVEGTL F F+S++E
Sbjct: 236 YKTTIVPSNTYFVVNVGQMEAKVESIMTDFMQLRADTSWNENETMVEGTLKDFAFESDEE 295
Query: 186 SDKLA 190
++ A
Sbjct: 296 GERPA 300
>gi|168010889|ref|XP_001758136.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162690592|gb|EDQ76958.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 373
Score = 139 bits (349), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 64/122 (52%), Positives = 93/122 (76%), Gaps = 3/122 (2%)
Query: 67 STSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIY 126
+ S+LP+V EK+ +TK L+ECEG+++DLSGDMGAVGR V ++E+ LDLKG IY
Sbjct: 172 AASSLPIVFGEKVNKTKVLLECEGDALDLSGDMGAVGRFTV---NRRDNELLLDLKGVIY 228
Query: 127 KTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDES 186
KTT+VPS TF +V+ G ++AK+E+IM+DF+QL+ + E ET+VEGTL F F+S++E
Sbjct: 229 KTTIVPSNTFFLVNVGQTDAKVESIMSDFVQLRADTIGKENETVVEGTLQDFTFESDEEG 288
Query: 187 DK 188
++
Sbjct: 289 ER 290
>gi|45736061|dbj|BAD13087.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 189
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 60/106 (56%), Positives = 85/106 (80%), Gaps = 4/106 (3%)
Query: 53 AEEDIAEKRSMLNVSTSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAE 112
AEED+ +K + +VS LPL++ +K+QR+KAL+EC+G+S+DLSGD+GAVGRI++ +
Sbjct: 87 AEEDMQDKLTEHSVS-QRLPLIIPDKIQRSKALIECDGDSIDLSGDVGAVGRIIISNSPN 145
Query: 113 GNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQL 158
GN E+ LDLKGTIYK+T+VPSRTFC+VS G +EAK +++D I L
Sbjct: 146 GNQELLLDLKGTIYKSTIVPSRTFCVVSVGQTEAK---VVDDNINL 188
>gi|224132538|ref|XP_002321341.1| predicted protein [Populus trichocarpa]
gi|222868337|gb|EEF05468.1| predicted protein [Populus trichocarpa]
Length = 159
Score = 113 bits (282), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 75/178 (42%), Positives = 91/178 (51%), Gaps = 54/178 (30%)
Query: 47 NNDHEDAEEDIAEKRSMLNVSTSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRIL 106
+ D E EE +EK +VS S LPLV+SEK+QR+KALVECEGES+DLSG+MGAVGR +
Sbjct: 20 HGDVEITEEVTSEKLINAHVSISRLPLVLSEKVQRSKALVECEGESIDLSGEMGAVGRPV 79
Query: 107 VPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYE 166
+P T G+ EM LDLKGT
Sbjct: 80 IPDTPSGDSEMHLDLKGT------------------------------------------ 97
Query: 167 AETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEEQTNEKTKVK----SGPCRK 220
L+GF F SEDE+DK+ KA HQ DQN +EE +N K K SG RK
Sbjct: 98 --------LEGFSFFSEDETDKITKATAHQTDQNMDVEEPSNGMIKRKAEKSSGVARK 147
>gi|384247484|gb|EIE20971.1| hypothetical protein COCSUDRAFT_83530 [Coccomyxa subellipsoidea
C-169]
Length = 378
Score = 90.5 bits (223), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 47/105 (44%), Positives = 68/105 (64%), Gaps = 5/105 (4%)
Query: 60 KRSMLNVSTSTLPLVMSEKLQRTKALVECEG---ESVDLSGDMGAVGRILVPGTAEGNHE 116
K ++L +P++M EKL TK +VE EG SVDL+GD G VGR LV G + N E
Sbjct: 185 KIAVLAPVAGEIPVMMPEKLPVTKFIVELEGGNDSSVDLTGDAGVVGRWLVEGD-DSNPE 243
Query: 117 MFLDLKGTIYKTTLVP-SRTFCIVSFGHSEAKIEAIMNDFIQLKP 160
+ +DLKG +Y T VP + T C+V+ +EA++EA+ + +IQL+P
Sbjct: 244 LRMDLKGVVYAATTVPCATTLCVVNLSATEARVEALGHQYIQLRP 288
>gi|303275097|ref|XP_003056848.1| hypothetical protein MICPUCDRAFT_62312 [Micromonas pusilla
CCMP1545]
gi|226461200|gb|EEH58493.1| hypothetical protein MICPUCDRAFT_62312 [Micromonas pusilla
CCMP1545]
Length = 392
Score = 84.7 bits (208), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 47/119 (39%), Positives = 72/119 (60%), Gaps = 8/119 (6%)
Query: 71 LPLVMSEKLQRTKALVECEG--ESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKT 128
+PL+M + R K ECEG ESVDL GD+G VGR+L +A+ + +DLKG IY
Sbjct: 195 IPLLMPATVNRNKIFFECEGAGESVDLEGDVGVVGRLLSDPSAKYTSGVKVDLKGVIYDA 254
Query: 129 TLVPS-RTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFL-FDSEDE 185
++P+ + +++ +EAK+E I ND++QL+ V GTL+G+L DS+D+
Sbjct: 255 KILPTPASIVVLAVNQTEAKVECIANDYVQLRRDVTANN----VGGTLEGYLGVDSDDD 309
>gi|307107138|gb|EFN55382.1| hypothetical protein CHLNCDRAFT_52595 [Chlorella variabilis]
Length = 402
Score = 84.3 bits (207), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 46/125 (36%), Positives = 73/125 (58%), Gaps = 8/125 (6%)
Query: 58 AEKRSMLNVSTSTLPLVMSEKLQRTKALVECEGES------VDLSGDMGAVGRILVPGTA 111
A + + TS +P+V+ EKL + K LVE E DLSGD GA+GR+++ GT
Sbjct: 168 AAGLGLPHAPTSQMPVVLPEKLPQMKMLVELESNPDDAHHVTDLSGDSGAIGRLVIAGTK 227
Query: 112 EGNHEMFLDLKGTIYKTTLVPSR-TFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETM 170
E EM +DLKG +Y T+VP + +++ G +EAK+E + ++F+QL+ + +T
Sbjct: 228 EAP-EMQIDLKGVLYNATIVPCPVSIAVINMGQTEAKVECLFSEFVQLREDTRFSFVDTE 286
Query: 171 VEGTL 175
V G +
Sbjct: 287 VLGGM 291
>gi|255084389|ref|XP_002508769.1| predicted protein [Micromonas sp. RCC299]
gi|226524046|gb|ACO70027.1| predicted protein [Micromonas sp. RCC299]
Length = 376
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 46/121 (38%), Positives = 72/121 (59%), Gaps = 11/121 (9%)
Query: 71 LPLVMSEKLQRTKALVECEG--ESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKT 128
+PL M + R K E EG E+VDL GD+G VGR+L + + + +D+KG IY
Sbjct: 178 VPLFMPSAVNRNKVFFELEGTGEAVDLEGDVGVVGRLLSNASEKHGSGLQMDMKGVIYNA 237
Query: 129 TLVPS-RTFCIVSFGHSEAKIEAIMNDFIQLK--PQSNVYEAETMVEGTLDGFL-FDSED 184
++P+ + I++ +EAK+E+I +DF+QL+ P +N + TLDG+L DS+D
Sbjct: 238 RILPTPASVVILAVNANEAKVESITHDFVQLRRDPVAN-----GGLGATLDGYLGVDSDD 292
Query: 185 E 185
E
Sbjct: 293 E 293
>gi|449532507|ref|XP_004173222.1| PREDICTED: DNA-binding protein BIN4-like, partial [Cucumis sativus]
Length = 198
Score = 78.2 bits (191), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 36/57 (63%), Positives = 46/57 (80%)
Query: 66 VSTSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLK 122
VS+S LPLV+S+K R KALVECEG S+DLSGDMGAVGR++V ++ +E+ LDLK
Sbjct: 142 VSSSRLPLVLSDKAHRLKALVECEGTSIDLSGDMGAVGRVVVSDSSSAKNELCLDLK 198
>gi|320166916|gb|EFW43815.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 236
Score = 77.8 bits (190), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 41/110 (37%), Positives = 68/110 (61%), Gaps = 3/110 (2%)
Query: 71 LPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTL 130
+PLV+ + T L+E ++LSGD G+VGR++V + + LDLKG I++ T+
Sbjct: 53 IPLVLGKP--ATAFLMETTEPQLNLSGDTGSVGRLVV-NAQDAAAPLVLDLKGLIFQGTV 109
Query: 131 VPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLF 180
VPS TF +V+ G +EA+I+A +N+ I + ++ T+VEG++D F
Sbjct: 110 VPSNTFMLVTIGPNEARIDATVNNVIHTTCLGDALDSMTVVEGSIDADAF 159
>gi|428174918|gb|EKX43811.1| hypothetical protein GUITHDRAFT_140250 [Guillardia theta CCMP2712]
Length = 309
Score = 70.5 bits (171), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 52/82 (63%), Gaps = 5/82 (6%)
Query: 85 LVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHS 144
+V+CE +++D GD G +G I G M LDLKG +Y+ +++P T ++ G +
Sbjct: 164 VVQCEDQNLDFEGDSGVIGTI-----KTGERSMELDLKGQLYEGSILPCCTMMVMDMGAT 218
Query: 145 EAKIEAIMNDFIQLKPQSNVYE 166
E KIE++MNDF+++ P +N ++
Sbjct: 219 EVKIESVMNDFVKVHPIANPHD 240
>gi|449526337|ref|XP_004170170.1| PREDICTED: DNA-binding protein BIN4-like [Cucumis sativus]
Length = 71
Score = 67.0 bits (162), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 35/53 (66%), Positives = 41/53 (77%), Gaps = 1/53 (1%)
Query: 152 MNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIE 204
MNDFIQLK S V EAETMVEGTLDGF FDSED+++K+ KA DQN+ +E
Sbjct: 1 MNDFIQLKALSKVDEAETMVEGTLDGFSFDSEDDAEKITKA-ASPADQNEPVE 52
>gi|302857504|ref|XP_002959890.1| hypothetical protein VOLCADRAFT_101408 [Volvox carteri f.
nagariensis]
gi|300254022|gb|EFJ39045.1| hypothetical protein VOLCADRAFT_101408 [Volvox carteri f.
nagariensis]
Length = 340
Score = 67.0 bits (162), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 42/129 (32%), Positives = 64/129 (49%), Gaps = 35/129 (27%)
Query: 66 VSTSTLPLVMSEKLQRTKALVEC-----EGE----SVDLSGDMGAVGRILVPGTAEGNH- 115
V LP+++ EKL R K L+E GE + DLSGD GAVGR++V N
Sbjct: 38 VEAGKLPIMLPEKLNRNKVLLELPPAAMSGEHYHGAADLSGDSGAVGRLVVGKAPAANSK 97
Query: 116 ------------------------EMFLDLKGTIYKTTLVP-SRTFCIVSFGHSEAKIEA 150
++ +DLKG +Y ++P + T +++ G SEAK+E+
Sbjct: 98 GTAAAAAAAAAGGSEGGGSGGDSVQLQVDLKGLLYNAWVLPLAGTAMVLNIGPSEAKVES 157
Query: 151 IMNDFIQLK 159
+ NDFI+L+
Sbjct: 158 LFNDFIRLR 166
>gi|299471419|emb|CBN79372.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 240
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 68/120 (56%), Gaps = 7/120 (5%)
Query: 71 LPLVMSEKL---QRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYK 127
+PL++ + + LV E+++ +GD GA+GRI V G +G F DLKG +
Sbjct: 59 VPLIVPPAVCKDAKASLLVHLRDEALNFAGDTGAIGRISVGGKKKGRGLTF-DLKGQQFS 117
Query: 128 TTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAET--MVEGTL-DGFLFDSED 184
++PS T +V G +EAK+E + ++ +L + NV++A +++G + D + F+ ED
Sbjct: 118 GQIIPSATVMVVGIGKTEAKVEGLFSEVCRLDYKGNVFDAMGGEVLKGEMDDSYKFEEED 177
>gi|412989095|emb|CCO15686.1| hypothetical protein VOLCADRAFT_101408 [Bathycoccus prasinos]
Length = 422
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 44/117 (37%), Positives = 66/117 (56%), Gaps = 24/117 (20%)
Query: 66 VSTSTLPLVMSE-KLQRTK---ALVECEG--ESVDLSGDMGAVGRILV-----PGTAEG- 113
V S +PL+++ LQ+T L EC+G E+VDLSGD+G VGR+LV P G
Sbjct: 237 VPKSKMPLLINHGALQKTANSTILFECKGSGEAVDLSGDVGVVGRLLVERESQPTGGGGS 296
Query: 114 ---------NHEMFLDLKGTIY--KTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLK 159
+ + +DLKG +Y +T L+ S T IV+ G +AKIE ++++QL+
Sbjct: 297 KASTSKQQPHASVKVDLKGVMYLARTHLL-SSTVAIVNVGAEDAKIETFFDEYVQLR 352
>gi|308804497|ref|XP_003079561.1| unnamed protein product [Ostreococcus tauri]
gi|116058016|emb|CAL54219.1| unnamed protein product [Ostreococcus tauri]
Length = 229
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 47/124 (37%), Positives = 70/124 (56%), Gaps = 11/124 (8%)
Query: 67 STSTLPLVMSEKLQ-RTKALVECEG--ESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKG 123
S++ + LV S L ++K L E EG E+VDL GD GAVGR L AE + +D+KG
Sbjct: 63 SSTRVDLVASAHLGGKSKVLFEIEGNGEAVDLDGDTGAVGRWL----AESARSLKVDMKG 118
Query: 124 TIYKTTLVP-SRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDS 182
+Y +VP + T +V+ AKI+++ +F+QL+ + V A T G G LF+
Sbjct: 119 LMYNARVVPLAGTVVVVAVNGDVAKIDSVHREFVQLR-EDPVANAGTENYGV--GSLFEM 175
Query: 183 EDES 186
DE+
Sbjct: 176 TDEA 179
>gi|356516190|ref|XP_003526779.1| PREDICTED: uncharacterized protein LOC100812582 [Glycine max]
Length = 82
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 26/46 (56%), Positives = 36/46 (78%), Gaps = 1/46 (2%)
Query: 170 MVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEEQTNEKTKVKS 215
MVEGTLDGF FDS++E+ K+ K+ +Q DQN+ +EEQ N K+K K+
Sbjct: 1 MVEGTLDGFFFDSDEEAGKMQKST-NQTDQNENVEEQANGKSKGKT 45
>gi|159483423|ref|XP_001699760.1| predicted protein [Chlamydomonas reinhardtii]
gi|158281702|gb|EDP07456.1| predicted protein [Chlamydomonas reinhardtii]
Length = 308
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 38/138 (27%), Positives = 64/138 (46%), Gaps = 41/138 (29%)
Query: 69 STLPLVMSEKLQRTKALVECEGE---------------SVDLSGDMGAVGRILVPGTAEG 113
+LPL++ +K+ R K L E + DLSGD GAVGR++V A
Sbjct: 140 GSLPLMLPDKINRNKVLFELPPPPASSSNADGYGHGHGATDLSGDAGAVGRLIVLAAAPT 199
Query: 114 NHE-------------------------MFLDLKGTIYKTTLVP-SRTFCIVSFGHSEAK 147
+ + +DLKG IY +P + T +++ G +EAK
Sbjct: 200 GGKGAAAGGGEPGPSSQGGGGPGGAEVCLQVDLKGVIYNAWALPLAGTAMVLNVGPTEAK 259
Query: 148 IEAIMNDFIQLKPQSNVY 165
+E++ NDF++L+ ++V+
Sbjct: 260 VESLFNDFVRLREAADVH 277
>gi|145347277|ref|XP_001418100.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578328|gb|ABO96393.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 999
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 38/97 (39%), Positives = 58/97 (59%), Gaps = 8/97 (8%)
Query: 67 STSTLPLVMSEKLQ-RTKALVECEG--ESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKG 123
S++ + LV + L ++K L E EG E+VDL GD GAVGR L AE + + +D+KG
Sbjct: 33 SSTAVELVATANLGGKSKVLFEIEGAGEAVDLDGDTGAVGRWL----AESSRALKVDMKG 88
Query: 124 TIYKTTLVPSR-TFCIVSFGHSEAKIEAIMNDFIQLK 159
+Y +V S T +V+ AKIE++ +F+QL+
Sbjct: 89 VMYNARVVSSAGTVVVVAVNADVAKIESVHREFVQLR 125
>gi|255544302|ref|XP_002513213.1| conserved hypothetical protein [Ricinus communis]
gi|223547711|gb|EEF49204.1| conserved hypothetical protein [Ricinus communis]
Length = 137
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/77 (44%), Positives = 52/77 (67%), Gaps = 5/77 (6%)
Query: 15 SVKKVSKEKSPKKGLK----SPKKEKDVINNVKRKGNNDH-EDAEEDIAEKRSMLNVSTS 69
SV+K K KSPKK LK KK+++ ++V + G+ND E A+ED++EK + VSTS
Sbjct: 61 SVEKALKGKSPKKLLKVEGRKLKKKENKNDDVPKTGDNDDVEVADEDVSEKHTGPQVSTS 120
Query: 70 TLPLVMSEKLQRTKALV 86
+PL++S+K+ RTK +
Sbjct: 121 MIPLLLSDKVSRTKMFL 137
>gi|397569025|gb|EJK46493.1| hypothetical protein THAOC_34834 [Thalassiosira oceanica]
Length = 315
Score = 47.8 bits (112), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 29/101 (28%), Positives = 52/101 (51%), Gaps = 7/101 (6%)
Query: 85 LVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGH- 143
LV+ + +DL G GAVGR+ + LDLKG Y+ T+ P T +V+
Sbjct: 175 LVQMNDDKLDLHGQSGAVGRLEA-----DERGVILDLKGYQYRGTIRPGPTAMVVALTRE 229
Query: 144 SEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSED 184
+ K+E+I ++F+ L + + +++ + +GF+ ED
Sbjct: 230 GKFKVESITDEFVTLDAK-GAHVMDSLGDNMREGFMVREED 269
>gi|422294628|gb|EKU21928.1| hypothetical protein NGA_2017520, partial [Nannochloropsis gaditana
CCMP526]
Length = 238
Score = 45.8 bits (107), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 39/74 (52%), Gaps = 4/74 (5%)
Query: 94 DLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMN 153
DL+GD GAVGR+ V + + + LDLKG Y L T +VS G EAK+E +++
Sbjct: 40 DLTGDSGAVGRVHV----KKDRGVVLDLKGHQYLAELCRCPTMLLVSLGEEEAKVEDVVD 95
Query: 154 DFIQLKPQSNVYEA 167
L + EA
Sbjct: 96 TLCVLTHHHDELEA 109
>gi|219116542|ref|XP_002179066.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217409833|gb|EEC49764.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 289
Score = 44.7 bits (104), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 55/105 (52%), Gaps = 14/105 (13%)
Query: 85 LVECEGE---SVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSF 141
LV+ + E S+D +G +GA+GR+ AEG + LD KG Y+ + P T ++S
Sbjct: 150 LVQIDPEDSASLDFTGAVGAIGRLET--VAEG---VVLDFKGCQYRGRIYPGSTAVVLST 204
Query: 142 GHSEA-----KIEAIMNDFIQLKPQSNVYE-AETMVEGTLDGFLF 180
+A K+E++ ++F +L + N E +V G + G+ +
Sbjct: 205 NGKKADTQLLKVESMTDEFCRLTKEGNAMEKLNAIVRGDMVGYQY 249
>gi|387199325|gb|AFJ68895.1| hypothetical protein NGATSA_2017510 [Nannochloropsis gaditana
CCMP526]
Length = 226
Score = 44.7 bits (104), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 39/74 (52%), Gaps = 4/74 (5%)
Query: 94 DLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMN 153
DL+GD GAVGR+ V + + + LDLKG Y L T +VS G EAK++ +++
Sbjct: 28 DLTGDSGAVGRVHV----KKDRGVVLDLKGHQYLAELCRCPTMLLVSLGEEEAKVKDVVD 83
Query: 154 DFIQLKPQSNVYEA 167
L + EA
Sbjct: 84 TLCVLTHHHDELEA 97
>gi|449018377|dbj|BAM81779.1| hypothetical protein CYME_CMP364C [Cyanidioschyzon merolae strain
10D]
Length = 263
Score = 43.9 bits (102), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 27/100 (27%), Positives = 51/100 (51%), Gaps = 10/100 (10%)
Query: 84 ALVECEGESVDLSGDMGAVGRILVPGTAE-GNHEMFLDLKGTIYKTTLVPSRTFCI---- 138
AL+ E + DL+GD G +G + A+ G + LDLKG +Y+T RTF +
Sbjct: 110 ALLVFENDEFDLAGDTGTIGCVQWSHNADSGAVRLCLDLKGVVYET-----RTFSLCGSA 164
Query: 139 VSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGF 178
++ + ++EA+++D P + E ++ G ++ +
Sbjct: 165 LAVDPTSQRVEAVLHDITYSIPTNVADERAQLIRGKIEPY 204
>gi|293394374|ref|ZP_06638674.1| cystathionine beta-lyase [Serratia odorifera DSM 4582]
gi|291423352|gb|EFE96581.1| cystathionine beta-lyase [Serratia odorifera DSM 4582]
Length = 392
Score = 39.3 bits (90), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 34/138 (24%), Positives = 65/138 (47%), Gaps = 14/138 (10%)
Query: 30 KSPKKEKDVINNVKRKGNNDHEDAEEDIAEKRSMLNVSTSTLPLVMSEKLQRTKA-LVEC 88
+ P+++ +N +G+ + +DI KR+ + TS P + E L+ + L +
Sbjct: 21 RDPEQQSGFVNAPVYRGSTVVYKSCDDIRHKRARFSYGTSGTPTI--ENLENAWSTLCDA 78
Query: 89 EGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFC---IVSFGHSE 145
G + SG +GA+ L T G+H + D ++Y+ P+R FC +V +G S
Sbjct: 79 AGTVISPSG-LGALALALFSVTRAGDHILVTD---SVYR----PTRKFCNTLLVKYGVSV 130
Query: 146 AKIEAIMNDFIQLKPQSN 163
+ ++ I+ Q+N
Sbjct: 131 TYYDPLIGGDIERLLQNN 148
>gi|307109224|gb|EFN57462.1| hypothetical protein CHLNCDRAFT_142962 [Chlorella variabilis]
Length = 642
Score = 38.5 bits (88), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 20/56 (35%), Positives = 30/56 (53%), Gaps = 1/56 (1%)
Query: 85 LVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVP-SRTFCIV 139
L E G ++L GD G +GR+ V + + LDL G +Y+ LVP + T +V
Sbjct: 441 LCEVPGHLLNLEGDSGTIGRVAVERGSGKEAALQLDLMGIVYQAELVPLAATAAVV 496
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.308 0.126 0.344
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,560,197,107
Number of Sequences: 23463169
Number of extensions: 143378815
Number of successful extensions: 542441
Number of sequences better than 100.0: 159
Number of HSP's better than 100.0 without gapping: 59
Number of HSP's successfully gapped in prelim test: 100
Number of HSP's that attempted gapping in prelim test: 541765
Number of HSP's gapped (non-prelim): 628
length of query: 245
length of database: 8,064,228,071
effective HSP length: 139
effective length of query: 106
effective length of database: 9,097,814,876
effective search space: 964368376856
effective search space used: 964368376856
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 75 (33.5 bits)