BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 019746
(336 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|359490779|ref|XP_002269798.2| PREDICTED: uncharacterized protein LOC100252112 [Vitis vinifera]
gi|302144012|emb|CBI23117.3| unnamed protein product [Vitis vinifera]
Length = 397
Score = 317 bits (811), Expect = 7e-84, Method: Compositional matrix adjust.
Identities = 194/377 (51%), Positives = 235/377 (62%), Gaps = 73/377 (19%)
Query: 1 MSGSREESPDWLRSFQAPTHSVLTLSSDSDS-------------DSSHN----------- 36
M SREESPDWL FQAPT S LTLSSDSDS DS N
Sbjct: 1 MGSSREESPDWLHCFQAPTRSALTLSSDSDSPPEDSPLREDKDADSPFNETSKAKSPRKR 60
Query: 37 -------------------GNGDGGKNIAELTFEKHMEPFAPKSSILASSSDSESCQNSS 77
G+ G+ A + EP AP S+ SSDSESC ++S
Sbjct: 61 LKAEDPKSTKKQKVNNHKKKEGNEGRKKASVEKPSDGEPNAPNHSVWKLSSDSESCPDNS 120
Query: 78 LLREE----------KAFQFQEKEMDEDAALTGREEKTSVKKVSKEKSPKKGLKSPKKEK 127
+RE+ K QF +KE +DA L+ +++ + + SKEKSPKK +K +
Sbjct: 121 SMREDYIHHEESSVHKTTQFPDKE--KDAVLSENGQESLLNEASKEKSPKKRVKV----E 174
Query: 128 DVINNVKRKGNND-----HEDA----EEDIAEKRSMLNVSTSTLPLVMSEKLQRTKALVE 178
D I++ K+K N+D H+D EE+ ++K + VS S LPLV+SEK+ R+KALVE
Sbjct: 175 DHISSKKKKANSDMKGKGHDDGTELLEEEASDKHTEPQVS-SGLPLVLSEKVHRSKALVE 233
Query: 179 CEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAK 238
CEGES+DLSGDMGAVGRI++ T NHEMFLDLKGTIYKTT+VPSRTFCIVSFG SEAK
Sbjct: 234 CEGESIDLSGDMGAVGRIVILDTPSRNHEMFLDLKGTIYKTTIVPSRTFCIVSFGQSEAK 293
Query: 239 IEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEEQT 298
+EA+MNDFIQLKPQSNVYEAETMVEG L+GF FDS+DE+DK+ KA Q DQN+G EEQT
Sbjct: 294 VEAVMNDFIQLKPQSNVYEAETMVEGILEGFSFDSDDEADKMPKASTRQTDQNEGDEEQT 353
Query: 299 NEKTKVK----SGPCRK 311
N KTK K SG RK
Sbjct: 354 NGKTKGKAEKASGVVRK 370
>gi|449459184|ref|XP_004147326.1| PREDICTED: uncharacterized protein LOC101208657 [Cucumis sativus]
Length = 391
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 158/367 (43%), Positives = 219/367 (59%), Gaps = 61/367 (16%)
Query: 1 MSGSREESPDWLRSFQAPTHSVLTLSS--------------DSDSDSSHNGNGD------ 40
MS SRE+SPDW+RSFQAPT L+ +S D SSH D
Sbjct: 1 MSSSREQSPDWMRSFQAPTGVALSSNSGSSKNGSSSMDNAIDQRDPSSHKTTQDLDGDQI 60
Query: 41 ----GGKNIA-ELTFEKHMEPFAPKSSILASSSDSESCQNSSLLREEKAF---------- 85
G N+A E+ ++H K S+ S DSESC +++ ++E+ ++
Sbjct: 61 QGDCGNHNLAKEVKLDRHTGHENSKHSVWMLSLDSESCSDNNFIKEDYSYHEELAELATS 120
Query: 86 QFQEKEMDEDAALTGREEKTSVKKVSKEKSPKKGLKS---PKKEKDVINNVKRKGNNDHE 142
+ Q + DE+A E K+ +KVS E SPKK +KS +++++N+ KG + E
Sbjct: 121 EIQGRRKDENAGRRFTEGKSKSRKVSNEMSPKKKVKSEVCTSAKENIMNSGTNKGGSTME 180
Query: 143 DAEE-------DIAEKRSMLN-----VSTSTLPLVMSEKLQRTKALVECEGESVDLSGDM 190
+E +I EK ++ + VS+S LPLV+S+K R KALVECEG S+DLSGDM
Sbjct: 181 GSEGHVRNGDVEILEKDALDDCIGPPVSSSRLPLVLSDKAHRLKALVECEGTSIDLSGDM 240
Query: 191 GAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLK 250
GAVGR++V ++ +E+ LDLKGT+Y+ +VPSRTFCIVSFG SEAKIE+IMNDFIQLK
Sbjct: 241 GAVGRVVVSDSSSAKNELCLDLKGTLYRAVIVPSRTFCIVSFGQSEAKIESIMNDFIQLK 300
Query: 251 PQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIE----------EQTNE 300
S V EAETMVEGTLDGF FDSED+++K+ KA DQN+ +E E+++
Sbjct: 301 ALSKVDEAETMVEGTLDGFSFDSEDDAEKITKA-ASPADQNEPVEGLNTKSKNKAEKSSG 359
Query: 301 KTKVKSG 307
+ +VK+G
Sbjct: 360 RKRVKTG 366
>gi|224121130|ref|XP_002318505.1| predicted protein [Populus trichocarpa]
gi|222859178|gb|EEE96725.1| predicted protein [Populus trichocarpa]
Length = 187
Score = 243 bits (621), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 118/159 (74%), Positives = 135/159 (84%), Gaps = 5/159 (3%)
Query: 159 TSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYK 218
TS LPLV+SEK+QR+KALVECEGES+DLSGDMGAVGR+++P T GN EM+LDLKGTIY+
Sbjct: 1 TSRLPLVLSEKVQRSKALVECEGESIDLSGDMGAVGRVVIPDTPSGNSEMYLDLKGTIYR 60
Query: 219 TTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVE-GTLDGFLFDSEDES 277
TT+VPSRTFC+VSFG SEAKIEAIMNDFIQLK QSNVYEAETMVE GTL+GF FDSEDE+
Sbjct: 61 TTIVPSRTFCVVSFGQSEAKIEAIMNDFIQLKTQSNVYEAETMVEVGTLEGFSFDSEDET 120
Query: 278 DKLAKAMPHQIDQNDGIEEQTNEKTKVK----SGPCRKE 312
DK+ KA Q DQN+G+EE N KTK K SG RK+
Sbjct: 121 DKITKATALQTDQNEGVEEPANGKTKRKPVKSSGVARKK 159
>gi|218190056|gb|EEC72483.1| hypothetical protein OsI_05847 [Oryza sativa Indica Group]
Length = 342
Score = 233 bits (595), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 145/341 (42%), Positives = 195/341 (57%), Gaps = 36/341 (10%)
Query: 3 GSREESPDWLRSFQAPTHSVLTLSSDSDSDSSHN--------GNGDGGKNIAELTFEKHM 54
G EE PDWLR+FQ PT S + LSS SD DS N G G+N A
Sbjct: 2 GEEEEDPDWLRAFQPPTTSTVMLSSGSD-DSPENSPTRTTPSGEEQKGENKASSDHAGDG 60
Query: 55 EPFAPKSSILASSSDSESCQNSSLLREEKAFQFQEKEMDEDAALTGREEKTSVKKVSKEK 114
+ ++ L + + + +E AF EK E ++ K K
Sbjct: 61 D-----AAALNKGKKATPTRRKTPTSQEDAFDKDEKPTME----------SNQDKPPKRS 105
Query: 115 SPKKGLKSPKKEKDVINNVKRKGNNDHED----------AEEDIAEKRSMLNVSTSTLPL 164
+PKK L P + K D D AEED+ +K + +VS LPL
Sbjct: 106 TPKKKLVKPPSGSNASKVTGPKAGPDQIDDTLEHQEEGVAEEDMQDKLTEHSVS-QRLPL 164
Query: 165 VMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPS 224
++ +K+QR+KAL+EC+G+S+DLSGD+GAVGRI++ + GN E+ LDLKGTIYK+T+VPS
Sbjct: 165 IIPDKIQRSKALIECDGDSIDLSGDVGAVGRIIISNSPNGNQELLLDLKGTIYKSTIVPS 224
Query: 225 RTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAM 284
RTFC+VS G +EAKIE+IM+DFIQL+PQSN++EAETM+EGTLDGF FDS++E DKL +
Sbjct: 225 RTFCVVSVGQTEAKIESIMDDFIQLEPQSNLFEAETMMEGTLDGFTFDSDEEGDKLPEPH 284
Query: 285 PHQIDQNDGIEEQTNEKTKVKS-GPCRKERYECWRTATATK 324
Q DQN+ +Q KTK K+ P K + + ATK
Sbjct: 285 ASQNDQNNEDGDQPKAKTKRKAEKPAGKGQKKAKVAGKATK 325
>gi|115444213|ref|NP_001045886.1| Os02g0147700 [Oryza sativa Japonica Group]
gi|113535417|dbj|BAF07800.1| Os02g0147700 [Oryza sativa Japonica Group]
gi|215765268|dbj|BAG86965.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222622166|gb|EEE56298.1| hypothetical protein OsJ_05371 [Oryza sativa Japonica Group]
Length = 342
Score = 231 bits (590), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 144/341 (42%), Positives = 195/341 (57%), Gaps = 36/341 (10%)
Query: 3 GSREESPDWLRSFQAPTHSVLTLSSDSDSDSSHN--------GNGDGGKNIAELTFEKHM 54
G EE PDWLR+FQ PT S + LSS SD DS N G G+N A
Sbjct: 2 GEEEEDPDWLRAFQPPTTSTVMLSSGSD-DSPENSPTRTTPSGEEQKGENKASSDHAGDG 60
Query: 55 EPFAPKSSILASSSDSESCQNSSLLREEKAFQFQEKEMDEDAALTGREEKTSVKKVSKEK 114
+ ++ L + + + +E AF EK E ++ K K
Sbjct: 61 D-----AAALNKGKKATPTRRKTPTSQEDAFDKDEKPTME----------SNQDKPPKRS 105
Query: 115 SPKKGLKSPKKEKDVINNVKRKGNNDHED----------AEEDIAEKRSMLNVSTSTLPL 164
+PKK L P + K D D AEE++ +K + +VS LPL
Sbjct: 106 TPKKKLVKPPSGSNASKVTGPKAGPDQIDDTLEHQEEGVAEEEMQDKLTEHSVS-QRLPL 164
Query: 165 VMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPS 224
++ +K+QR+KAL+EC+G+S+DLSGD+GAVGRI++ + GN E+ LDLKGTIYK+T+VPS
Sbjct: 165 IIPDKIQRSKALIECDGDSIDLSGDVGAVGRIIISNSPNGNQELLLDLKGTIYKSTIVPS 224
Query: 225 RTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAM 284
RTFC+VS G +EAKIE+IM+DFIQL+PQSN++EAETM+EGTLDGF FDS++E DKL +
Sbjct: 225 RTFCVVSVGQTEAKIESIMDDFIQLEPQSNLFEAETMMEGTLDGFTFDSDEEGDKLPEPH 284
Query: 285 PHQIDQNDGIEEQTNEKTKVKS-GPCRKERYECWRTATATK 324
Q DQN+ +Q KTK K+ P K + + ATK
Sbjct: 285 ASQNDQNNEDGDQPKAKTKRKAEKPAGKGQKKAKVAGKATK 325
>gi|195625200|gb|ACG34430.1| hypothetical protein [Zea mays]
gi|413935593|gb|AFW70144.1| hypothetical protein ZEAMMB73_185491 [Zea mays]
Length = 364
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 141/333 (42%), Positives = 198/333 (59%), Gaps = 35/333 (10%)
Query: 3 GSREESPDWLRSFQAPTHSVLTLSSDSDSDSSHNGNGDGGKNIAELTFEKHMEPFAPKSS 62
G E+ PDWLR+FQAPT + + LSS SD+ S + + EKH P +
Sbjct: 2 GDEEDDPDWLRAFQAPTVAPVMLSSGSDT-SPEASPTRTSTSRKQNKGEKHASPDHARDR 60
Query: 63 ILASSSD---SESCQNSSLLREEKAFQFQEKEMDEDAALTGREEKTSVKKVSKEKSPKKG 119
AS + S + + +L+ +++ EK+ + LT +++ ++ S + SP
Sbjct: 61 DGASQNKNKISAATRRKNLVSKKEGSTMDEKQANTPRRLTPKKDMVTLSSGS-DASPGNS 119
Query: 120 LKSPKK---EKDVINNVKRK----------------------GNNDHEDAEEDIAEKRSM 154
L + E+ ++ KRK GN D AE+D+ +K +
Sbjct: 120 LSRAHEDNHEEGSLSTAKRKNAQQTKTKKTKDAGTKTGADQAGNGD---AEDDVQDKLTG 176
Query: 155 LNVSTSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKG 214
+VS LPL+ +K+QR+KAL+EC+G+S+DLSGD+GAVGRI+V ++ LDLKG
Sbjct: 177 NSVS-QRLPLIFPDKVQRSKALIECDGDSIDLSGDIGAVGRIVVSNGPTAKQDLLLDLKG 235
Query: 215 TIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSE 274
TIYKTT+VPSRTFC+VS G SEAKIEAIMNDFIQL+PQSN++EAETMVEGTLDGF FDS+
Sbjct: 236 TIYKTTIVPSRTFCVVSVGQSEAKIEAIMNDFIQLEPQSNLFEAETMVEGTLDGFTFDSD 295
Query: 275 DESDKLAKAMPHQIDQNDGIEE-QTNEKTKVKS 306
+E DKL + +Q D N+ +E Q KTK K+
Sbjct: 296 EEGDKLYEPQANQNDLNNNEDEGQPKAKTKRKA 328
>gi|147811095|emb|CAN76881.1| hypothetical protein VITISV_037245 [Vitis vinifera]
Length = 693
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 160/383 (41%), Positives = 202/383 (52%), Gaps = 116/383 (30%)
Query: 1 MSGSREESPDWLRSFQAPTHSVLTLSSDSDS-------------DSSHN----------- 36
M SREESPDWL FQAPT S LTLSSDSDS DS N
Sbjct: 31 MGSSREESPDWLHCFQAPTRSALTLSSDSDSPPEDSPLREDKDADSPFNETSKAKSPRKR 90
Query: 37 -------------------GNGDGGKNIAELTFEKHMEPFAPKSSILASSSDSESCQNSS 77
G+ G+ A + EP AP S+ SSDSESC ++S
Sbjct: 91 LKAEXPKSTKKQKVNNHKKKEGNEGRKKASVEKPSDGEPNAPNHSVWXLSSDSESCPDNS 150
Query: 78 LLREE----------KAFQFQEKEMDEDAALTGREEKTSVKKVSKEKSPKKGLKSPKKEK 127
+RE+ K QF +KE +DA L+ +++ + + SKEKSPKK +K +
Sbjct: 151 SMREDYIHHEESSVHKTTQFPDKE--KDAVLSENGQESLLNEASKEKSPKKRVKV----E 204
Query: 128 DVINNVKRKGNND-----HEDA----EEDIAEKRSMLNVSTSTLPLVMSEKLQRTKA--- 175
D I++ K+K N+D H+D EE+ ++K + VS+S LPLV+SEK+ R+K
Sbjct: 205 DHISSKKKKANSDMKGKGHDDGTELLEEEASDKHTEPQVSSSGLPLVLSEKVHRSKGFRF 264
Query: 176 ------------LVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVP 223
L+ +GES+DLSGDMGAVGRI
Sbjct: 265 SIEKAASLEIRPLLSVKGESIDLSGDMGAVGRI--------------------------- 297
Query: 224 SRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKA 283
VSFG SEAK+EA+MNDFIQLKPQSNVYEAETMVEG L+GF FDS+DE+DK+ KA
Sbjct: 298 ------VSFGQSEAKVEAVMNDFIQLKPQSNVYEAETMVEGILEGFSFDSDDEADKMPKA 351
Query: 284 MPHQIDQNDGIEEQTNEKTKVKS 306
Q DQN+G EEQTN KTK K+
Sbjct: 352 STRQTDQNEGDEEQTNGKTKGKA 374
>gi|356507532|ref|XP_003522518.1| PREDICTED: uncharacterized protein LOC100795660 [Glycine max]
Length = 380
Score = 224 bits (571), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 115/194 (59%), Positives = 147/194 (75%), Gaps = 10/194 (5%)
Query: 122 SPKKEKDVINNVKRKGNNDHEDAEEDIAEKRSMLNV---STSTLPLVMSEKLQRTKALVE 178
+P K K++ + K KG D + EE+ EK + NV S++ LPL++SEK+QRTKAL+E
Sbjct: 151 TPVKGKEIKASAKGKGIGDLKVEEEETCEKPAEPNVRMVSSARLPLMLSEKVQRTKALIE 210
Query: 179 CEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLK------GTIYKTTLVPSRTFCIVSF 232
C+G+S+DLSGDMGAVGRI++ + G+ EM LDLK GTIYKT++VP RTFC+VSF
Sbjct: 211 CQGDSIDLSGDMGAVGRIIISDSPSGDQEMCLDLKDFAVISGTIYKTSIVPCRTFCVVSF 270
Query: 233 GHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQND 292
G SEAK+EAIMNDFIQLKP SNVYEAETMVEGTLDGF FDS++ES K+ K+ +Q DQN+
Sbjct: 271 GQSEAKVEAIMNDFIQLKPHSNVYEAETMVEGTLDGFFFDSDEESGKIQKST-NQTDQNE 329
Query: 293 GIEEQTNEKTKVKS 306
EEQ N +K K+
Sbjct: 330 NAEEQANGNSKGKA 343
>gi|357464307|ref|XP_003602435.1| DNA-binding protein BIN4 [Medicago truncatula]
gi|355491483|gb|AES72686.1| DNA-binding protein BIN4 [Medicago truncatula]
Length = 416
Score = 224 bits (571), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 111/183 (60%), Positives = 140/183 (76%), Gaps = 1/183 (0%)
Query: 124 KKEKDVINNVKRKGNNDHEDAEEDIAEKRSMLNVSTSTLPLVMSEKLQRTKALVECEGES 183
K +K +N + G+ D E +E+ EK NVS+S LPL++SEK+ RTKALVEC+G+S
Sbjct: 198 KGKKTKVNAKGKGGDGDVEVKDEETFEKPVEPNVSSSILPLMLSEKVHRTKALVECQGDS 257
Query: 184 VDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIM 243
+DLSGDMGAVGR+++ + G+ EM LDLKGTIYKT+++PSRTFC+VSFG +EAKIEAIM
Sbjct: 258 IDLSGDMGAVGRVIISDSPSGDQEMCLDLKGTIYKTSIIPSRTFCVVSFGQTEAKIEAIM 317
Query: 244 NDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEEQTNEKTK 303
NDFIQL P S+VYEAETMVEGTLDGF FDS++E K K H DQN+ EEQT K+K
Sbjct: 318 NDFIQLNPPSDVYEAETMVEGTLDGFSFDSDEEDGKRQKTT-HPSDQNEHAEEQTPGKSK 376
Query: 304 VKS 306
K+
Sbjct: 377 RKA 379
Score = 41.6 bits (96), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 20/31 (64%), Positives = 25/31 (80%), Gaps = 1/31 (3%)
Query: 1 MSGSREESPDWLRSFQAPT-HSVLTLSSDSD 30
MS S + SPDWLRS+Q PT +S +TLSSDS+
Sbjct: 1 MSSSGDSSPDWLRSYQVPTPNSPVTLSSDSE 31
>gi|218189675|gb|EEC72102.1| hypothetical protein OsI_05068 [Oryza sativa Indica Group]
Length = 473
Score = 224 bits (571), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 137/311 (44%), Positives = 188/311 (60%), Gaps = 26/311 (8%)
Query: 9 PDWLRSFQAPTHSVLTLSSDSDSDSSHNGNGDGGKNIAELTFEKHMEPFAPKSSILASSS 68
PDWLR+FQ PT S + LSSDS DS N + E E ASS
Sbjct: 140 PDWLRAFQPPTTSTVMLSSDS-YDSPENSPTRTTPSGEEQKGENK-----------ASSD 187
Query: 69 DSESCQNSSLLREEKAFQFQEKEMDEDAALTGREEKTSVK----KVSKEKSPKKGLKSPK 124
+S + +KA + K G++EK +++ K +K +PKK +K P
Sbjct: 188 HVGDGDVASPNKGKKATPTRRKTPTSQEDAFGKDEKPTMESNQDKPAKRSTPKKLVKFPS 247
Query: 125 KEKDVINNVKRKGNNDHED---------AEEDIAEKRSMLNVSTSTLPLVMSEKLQRTKA 175
+ G + +D AE D+ +K + +VS LPL++ +K+QR+KA
Sbjct: 248 SSNASKVTGPKAGPDQIDDTLEHREEGVAEVDMQDKLTEHSVS-QRLPLIIPDKVQRSKA 306
Query: 176 LVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHS 235
L+EC+G+S+DLSGD+GAVGRI++ + GN ++ LDLKGTIYK+T+VPSRTFC+VS G +
Sbjct: 307 LIECDGDSIDLSGDVGAVGRIVISNSPNGNQDLLLDLKGTIYKSTIVPSRTFCVVSVGQT 366
Query: 236 EAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIE 295
EAKIE+IMNDFIQL+PQSN++EAETMVEGTLDGF FDS++E DKL + Q DQN+
Sbjct: 367 EAKIESIMNDFIQLEPQSNLFEAETMVEGTLDGFTFDSDEEGDKLPEPHASQNDQNNEDG 426
Query: 296 EQTNEKTKVKS 306
+Q KTK K+
Sbjct: 427 DQPKAKTKRKA 437
>gi|357138685|ref|XP_003570920.1| PREDICTED: uncharacterized protein LOC100821932 [Brachypodium
distachyon]
Length = 358
Score = 213 bits (542), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 123/296 (41%), Positives = 178/296 (60%), Gaps = 16/296 (5%)
Query: 1 MSGSREESPDWLRSFQAPTHSVLTLSSDSDS---------DSSHNGNGDGGKNIAELTFE 51
M+G E PDWL +FQAP+ + + LSS SDS D G G + A T +
Sbjct: 1 MAGQEEGDPDWLTAFQAPSLAPVMLSSGSDSSRENSPTRADQEEKEAGKGNSDTAGRTDD 60
Query: 52 --KHMEPFAPKSSILASSSDSESCQNSSLLREEKAFQFQEKEMDED-AALTGREEKTSVK 108
K +P A + LAS + S + E++ + + ++ AL+ S
Sbjct: 61 ATKGKQPTATRRKALASKKGALGKHEGSSMEEQEDIPPKRRTPKKNLVALSDSSAGNSPS 120
Query: 109 KVSKEKSPKKGLKSPKKEKDVINNVKRKGNNDHEDAEEDIAEKRSM----LNVSTSTLPL 164
+ S+ + LK + + + ++ E EE +AE++ N + LPL
Sbjct: 121 RASEANEEEDSLKDKNAKVVGTESCPDQRDDTLEQLEEGVAEEQMQDKLPGNSISQRLPL 180
Query: 165 VMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPS 224
++++K+QR+KALVEC+G+S+DLSGD+GAVGRI++ GN ++ LDLKGT+YK+T+VPS
Sbjct: 181 MITDKVQRSKALVECDGDSIDLSGDIGAVGRIVISNGPTGNQDLLLDLKGTVYKSTIVPS 240
Query: 225 RTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESDKL 280
RTFC+VS G +EAKIEAIMNDFIQL+P SN++EAETM+EGTLDGF FDS+ E D+L
Sbjct: 241 RTFCVVSVGQTEAKIEAIMNDFIQLEPHSNLFEAETMMEGTLDGFTFDSDGEGDRL 296
>gi|297812657|ref|XP_002874212.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297320049|gb|EFH50471.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 492
Score = 210 bits (535), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 127/267 (47%), Positives = 163/267 (61%), Gaps = 61/267 (22%)
Query: 91 EMDEDAALTGREEKTSVKKVSKEKSPK----KGLKSPKK----------EKDVIN----- 131
E DED L EE+ +VKKV KEKSPK G K+PK+ E VIN
Sbjct: 184 EKDEDFVLDATEEEPAVKKVGKEKSPKTKSKSGRKTPKEGNGAQKIFKSEDKVINTTIAE 243
Query: 132 -----------------------NVKRKGNNDHEDAEEDIAEKRSM------LNVSTSTL 162
+ ++ ++ +D + IAE+ +M + S+S L
Sbjct: 244 EVTMEKSPETKSKSSRKTPKEENSAQKIFKSEDKDTDTTIAEEGTMDQKIKPSSGSSSRL 303
Query: 163 PLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLK--------- 213
PLV+SEK+ RTK LVECEG+S+DLSGDMGAVGR++V T +M+LDLK
Sbjct: 304 PLVLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDTT---GDMYLDLKGINTFSFHV 360
Query: 214 -GTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFD 272
GTIYK+T++PSRTFC+V+ G +EAKIEAIMNDFIQL PQSNVYEAETMVEGTL+GF F+
Sbjct: 361 SGTIYKSTIIPSRTFCVVNVGQTEAKIEAIMNDFIQLTPQSNVYEAETMVEGTLEGFSFE 420
Query: 273 SEDESDKLAKAMPHQIDQNDGIEEQTN 299
S+DE++K AK DQ+ G EE+TN
Sbjct: 421 SDDETNKNAKTASKPADQSGGTEEETN 447
>gi|45736025|dbj|BAD13052.1| unknown protein [Oryza sativa Japonica Group]
gi|45736067|dbj|BAD13093.1| unknown protein [Oryza sativa Japonica Group]
Length = 320
Score = 210 bits (535), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 112/227 (49%), Positives = 151/227 (66%), Gaps = 12/227 (5%)
Query: 109 KVSKEKSPKKGLKSPKKEKDVINNVKRKGNNDHED----------AEEDIAEKRSMLNVS 158
K K +PKK L P + K D D AEE++ +K + +VS
Sbjct: 78 KPPKRSTPKKKLVKPPSGSNASKVTGPKAGPDQIDDTLEHQEEGVAEEEMQDKLTEHSVS 137
Query: 159 TSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYK 218
LPL++ +K+QR+KAL+EC+G+S+DLSGD+GAVGRI++ + GN E+ LDLKGTIYK
Sbjct: 138 -QRLPLIIPDKIQRSKALIECDGDSIDLSGDVGAVGRIIISNSPNGNQELLLDLKGTIYK 196
Query: 219 TTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESD 278
+T+VPSRTFC+VS G +EAKIE+IM+DFIQL+PQSN++EAETM+EGTLDGF FDS++E D
Sbjct: 197 STIVPSRTFCVVSVGQTEAKIESIMDDFIQLEPQSNLFEAETMMEGTLDGFTFDSDEEGD 256
Query: 279 KLAKAMPHQIDQNDGIEEQTNEKTKVKS-GPCRKERYECWRTATATK 324
KL + Q DQN+ +Q KTK K+ P K + + ATK
Sbjct: 257 KLPEPHASQNDQNNEDGDQPKAKTKRKAEKPAGKGQKKAKVAGKATK 303
>gi|238481371|ref|NP_001154738.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
gi|332005958|gb|AED93341.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
Length = 446
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 98/142 (69%), Positives = 119/142 (83%), Gaps = 3/142 (2%)
Query: 158 STSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIY 217
S+S LPLV+SEK+ RTK LVECEG+S+DLSGDMGAVGR++V T +M+LDLKGTIY
Sbjct: 263 SSSRLPLVLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDTT---GDMYLDLKGTIY 319
Query: 218 KTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDES 277
K+T++PSRTFC+V+ G +EAKIEAIMNDFIQL PQSNVYEAETMVEGTL+GF F+S+DES
Sbjct: 320 KSTIIPSRTFCVVNVGQTEAKIEAIMNDFIQLIPQSNVYEAETMVEGTLEGFTFESDDES 379
Query: 278 DKLAKAMPHQIDQNDGIEEQTN 299
+K AK DQ+ G EE+TN
Sbjct: 380 NKNAKTAVKPADQSVGTEEETN 401
>gi|238481367|ref|NP_001154736.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
gi|238481369|ref|NP_001154737.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
gi|332005956|gb|AED93339.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
gi|332005957|gb|AED93340.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
Length = 450
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 98/142 (69%), Positives = 119/142 (83%), Gaps = 3/142 (2%)
Query: 158 STSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIY 217
S+S LPLV+SEK+ RTK LVECEG+S+DLSGDMGAVGR++V T +M+LDLKGTIY
Sbjct: 267 SSSRLPLVLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDT---TGDMYLDLKGTIY 323
Query: 218 KTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDES 277
K+T++PSRTFC+V+ G +EAKIEAIMNDFIQL PQSNVYEAETMVEGTL+GF F+S+DES
Sbjct: 324 KSTIIPSRTFCVVNVGQTEAKIEAIMNDFIQLIPQSNVYEAETMVEGTLEGFTFESDDES 383
Query: 278 DKLAKAMPHQIDQNDGIEEQTN 299
+K AK DQ+ G EE+TN
Sbjct: 384 NKNAKTAVKPADQSVGTEEETN 405
>gi|240256338|ref|NP_197851.5| DNA-binding protein BIN4 [Arabidopsis thaliana]
gi|332005954|gb|AED93337.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
Length = 451
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 98/142 (69%), Positives = 119/142 (83%), Gaps = 3/142 (2%)
Query: 158 STSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIY 217
S+S LPLV+SEK+ RTK LVECEG+S+DLSGDMGAVGR++V T +M+LDLKGTIY
Sbjct: 268 SSSRLPLVLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDT---TGDMYLDLKGTIY 324
Query: 218 KTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDES 277
K+T++PSRTFC+V+ G +EAKIEAIMNDFIQL PQSNVYEAETMVEGTL+GF F+S+DES
Sbjct: 325 KSTIIPSRTFCVVNVGQTEAKIEAIMNDFIQLIPQSNVYEAETMVEGTLEGFTFESDDES 384
Query: 278 DKLAKAMPHQIDQNDGIEEQTN 299
+K AK DQ+ G EE+TN
Sbjct: 385 NKNAKTAVKPADQSVGTEEETN 406
>gi|238481365|ref|NP_001154735.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
gi|332005955|gb|AED93338.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
Length = 432
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 98/142 (69%), Positives = 119/142 (83%), Gaps = 3/142 (2%)
Query: 158 STSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIY 217
S+S LPLV+SEK+ RTK LVECEG+S+DLSGDMGAVGR++V T +M+LDLKGTIY
Sbjct: 249 SSSRLPLVLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDT---TGDMYLDLKGTIY 305
Query: 218 KTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDES 277
K+T++PSRTFC+V+ G +EAKIEAIMNDFIQL PQSNVYEAETMVEGTL+GF F+S+DES
Sbjct: 306 KSTIIPSRTFCVVNVGQTEAKIEAIMNDFIQLIPQSNVYEAETMVEGTLEGFTFESDDES 365
Query: 278 DKLAKAMPHQIDQNDGIEEQTN 299
+K AK DQ+ G EE+TN
Sbjct: 366 NKNAKTAVKPADQSVGTEEETN 387
>gi|238481373|ref|NP_001154739.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
gi|257050999|sp|Q9FLU1.2|BIN4_ARATH RecName: Full=DNA-binding protein BIN4; AltName: Full=Protein
BRASSINOSTEROID INSENSITIVE 4; AltName: Full=Protein
MIDGET
gi|332005959|gb|AED93342.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
Length = 454
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 98/142 (69%), Positives = 119/142 (83%), Gaps = 3/142 (2%)
Query: 158 STSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIY 217
S+S LPLV+SEK+ RTK LVECEG+S+DLSGDMGAVGR++V T +M+LDLKGTIY
Sbjct: 271 SSSRLPLVLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDT---TGDMYLDLKGTIY 327
Query: 218 KTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDES 277
K+T++PSRTFC+V+ G +EAKIEAIMNDFIQL PQSNVYEAETMVEGTL+GF F+S+DES
Sbjct: 328 KSTIIPSRTFCVVNVGQTEAKIEAIMNDFIQLIPQSNVYEAETMVEGTLEGFTFESDDES 387
Query: 278 DKLAKAMPHQIDQNDGIEEQTN 299
+K AK DQ+ G EE+TN
Sbjct: 388 NKNAKTAVKPADQSVGTEEETN 409
>gi|413935594|gb|AFW70145.1| hypothetical protein ZEAMMB73_185491 [Zea mays]
Length = 354
Score = 197 bits (500), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 132/333 (39%), Positives = 189/333 (56%), Gaps = 45/333 (13%)
Query: 3 GSREESPDWLRSFQAPTHSVLTLSSDSDSDSSHNGNGDGGKNIAELTFEKHMEPFAPKSS 62
G E+ PDWLR+FQAPT + + LSS SD+ S + + EKH P +
Sbjct: 2 GDEEDDPDWLRAFQAPTVAPVMLSSGSDT-SPEASPTRTSTSRKQNKGEKHASPDHARDR 60
Query: 63 ILASSSD---SESCQNSSLLREEKAFQFQEKEMDEDAALTGREEKTSVKKVSKEKSPKKG 119
AS + S + + +L+ +++ EK+ + LT +++ ++ S + SP
Sbjct: 61 DGASQNKNKISAATRRKNLVSKKEGSTMDEKQANTPRRLTPKKDMVTLSSGS-DASPGNS 119
Query: 120 LKSPKK---EKDVINNVKRK----------------------GNNDHEDAEEDIAEKRSM 154
L + E+ ++ KRK GN D AE+D+ +K +
Sbjct: 120 LSRAHEDNHEEGSLSTAKRKNAQQTKTKKTKDAGTKTGADQAGNGD---AEDDVQDKLTG 176
Query: 155 LNVSTSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKG 214
+VS LPL+ +K+QR+KAL+EC+G+S+DLSGD+GAVGRI+V ++ LDLKG
Sbjct: 177 NSVS-QRLPLIFPDKVQRSKALIECDGDSIDLSGDIGAVGRIVVSNGPTAKQDLLLDLKG 235
Query: 215 TIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSE 274
TIYKTT+VPSRTFC+VS G SEAKIEAIMNDFIQL+PQSN++EAETMVE +
Sbjct: 236 TIYKTTIVPSRTFCVVSVGQSEAKIEAIMNDFIQLEPQSNLFEAETMVE----------D 285
Query: 275 DESDKLAKAMPHQIDQNDGIEE-QTNEKTKVKS 306
+E DKL + +Q D N+ +E Q KTK K+
Sbjct: 286 EEGDKLYEPQANQNDLNNNEDEGQPKAKTKRKA 318
>gi|255544304|ref|XP_002513214.1| conserved hypothetical protein [Ricinus communis]
gi|223547712|gb|EEF49205.1| conserved hypothetical protein [Ricinus communis]
Length = 155
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 88/117 (75%), Positives = 102/117 (87%)
Query: 190 MGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQL 249
MGAVGR+++P + GNHEM+LDLKGTIY+T +VPSRTFC+VSFG SEAKIEAIMNDFIQL
Sbjct: 1 MGAVGRVVIPDGSSGNHEMYLDLKGTIYRTKIVPSRTFCVVSFGQSEAKIEAIMNDFIQL 60
Query: 250 KPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEEQTNEKTKVKS 306
K QSNVYEAETMVEGTL+GF FDSEDE++K+ K + HQ DQNDG EEQ N KTK K+
Sbjct: 61 KAQSNVYEAETMVEGTLEGFSFDSEDEAEKMPKPVSHQTDQNDGNEEQMNGKTKRKA 117
>gi|297598587|ref|NP_001045881.2| Os02g0147100 [Oryza sativa Japonica Group]
gi|255670601|dbj|BAF07795.2| Os02g0147100, partial [Oryza sativa Japonica Group]
Length = 126
Score = 177 bits (450), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 79/118 (66%), Positives = 102/118 (86%)
Query: 175 ALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGH 234
AL+EC+G+S+DLSGD+GAVGRI++ + GN E+ LDLKGTIYK+T+VPSRTFC+VS G
Sbjct: 1 ALIECDGDSIDLSGDVGAVGRIIISNSPNGNQELLLDLKGTIYKSTIVPSRTFCVVSVGQ 60
Query: 235 SEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQND 292
+EAKIE+IM+DFIQL+PQSN++EAETM+EGTLDGF FDS++E DKL + Q DQN+
Sbjct: 61 TEAKIESIMDDFIQLEPQSNLFEAETMMEGTLDGFTFDSDEEGDKLPEPHASQNDQNN 118
>gi|410129774|dbj|BAM64852.1| hypothetical protein [Beta vulgaris]
Length = 467
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 112/243 (46%), Positives = 154/243 (63%), Gaps = 40/243 (16%)
Query: 58 APKSSILASSSDSESCQNSSLLREEKAFQFQEKEMDEDAALTGREEKTSVKKVSKEKSPK 117
AP SI SSDSE+ ++++ + +EK +E++A +++VK+ KSPK
Sbjct: 242 APNHSICELSSDSETKSDATIRK------IEEKIDNEESA-----PRSAVKR----KSPK 286
Query: 118 KGLKSP--KKEKDVINNVKRKGNNDHEDAEEDI-AEKRSMLNVSTSTLPLVMSEKLQRTK 174
K +K EK K +G ++ D E++ EK+S VS+ST+PLV+ EK+QRTK
Sbjct: 287 KRVKQEDDATEKRTNEPAKIEGEVENMDVAEEVKPEKQSEPAVSSSTMPLVLPEKVQRTK 346
Query: 175 ALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGH 234
ALVECEGES+DLSGDMGAVGR+++ T EMFLDLK ++
Sbjct: 347 ALVECEGESIDLSGDMGAVGRVIISETPSKESEMFLDLKVSLE----------------- 389
Query: 235 SEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGI 294
+IEAIMNDFIQLKPQSNVY AETMVEGTL+GF F+S+DE+DK+ K + + DQ++G
Sbjct: 390 ---QIEAIMNDFIQLKPQSNVYNAETMVEGTLEGFTFESDDEADKMTKDV--KGDQSEGA 444
Query: 295 EEQ 297
EE+
Sbjct: 445 EEK 447
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 20/28 (71%), Positives = 23/28 (82%)
Query: 1 MSGSREESPDWLRSFQAPTHSVLTLSSD 28
M SRE SPDWLR+FQAP+ S+ TLSSD
Sbjct: 115 MGDSREASPDWLRTFQAPSQSITTLSSD 142
>gi|45736024|dbj|BAD13051.1| unknown protein [Oryza sativa Japonica Group]
gi|45736066|dbj|BAD13092.1| unknown protein [Oryza sativa Japonica Group]
Length = 193
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 80/138 (57%), Positives = 103/138 (74%), Gaps = 1/138 (0%)
Query: 188 GDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFI 247
GD+GAVGRI++ + GN E+ LDLKGTIYK+T+VPSRTFC+VS G +EAKIE+IM+DFI
Sbjct: 39 GDVGAVGRIIISNSPNGNQELLLDLKGTIYKSTIVPSRTFCVVSVGQTEAKIESIMDDFI 98
Query: 248 QLKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEEQTNEKTKVKS- 306
QL+PQSN++EAETM+EGTLDGF FDS++E DKL + Q DQN+ +Q KTK K+
Sbjct: 99 QLEPQSNLFEAETMMEGTLDGFTFDSDEEGDKLPEPHASQNDQNNEDGDQPKAKTKRKAE 158
Query: 307 GPCRKERYECWRTATATK 324
P K + + ATK
Sbjct: 159 KPAGKGQKKAKVAGKATK 176
>gi|10177863|dbj|BAB11215.1| unnamed protein product [Arabidopsis thaliana]
Length = 531
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 91/163 (55%), Positives = 108/163 (66%), Gaps = 28/163 (17%)
Query: 158 STSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLK---- 213
S+S LPLV+SEK+ RTK LVECEG+S+DLSGDMGAVGR++V T +M+LDLK
Sbjct: 331 SSSRLPLVLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDT---TGDMYLDLKEHFA 387
Query: 214 -----------------GTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVY 256
GTI T S V+ G +EAKIEAIMNDFIQL PQSNVY
Sbjct: 388 LYVFSSSCTFSDCKGTCGTIIHT----SDLVFQVNVGQTEAKIEAIMNDFIQLIPQSNVY 443
Query: 257 EAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEEQTN 299
EAETMVEGTL+GF F+S+DES+K AK DQ+ G EE+TN
Sbjct: 444 EAETMVEGTLEGFTFESDDESNKNAKTAVKPADQSVGTEEETN 486
>gi|168037576|ref|XP_001771279.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162677368|gb|EDQ63839.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 379
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 66/125 (52%), Positives = 93/125 (74%), Gaps = 3/125 (2%)
Query: 157 VSTSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTI 216
++ S+LP+V EK+ RTK L+ECEG+++DLSGDMGAVGR V + E+ LDLKG +
Sbjct: 179 IAASSLPIVFGEKVNRTKVLLECEGDALDLSGDMGAVGRFTV---NRPDDELLLDLKGVV 235
Query: 217 YKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDE 276
YKTT+VPS T+ +V+ G EAK+E+IM DF+QL+ ++ E ETMVEGTL F F+S++E
Sbjct: 236 YKTTIVPSNTYFVVNVGQMEAKVESIMTDFMQLRADTSWNENETMVEGTLKDFAFESDEE 295
Query: 277 SDKLA 281
++ A
Sbjct: 296 GERPA 300
>gi|168010889|ref|XP_001758136.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162690592|gb|EDQ76958.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 373
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 64/122 (52%), Positives = 93/122 (76%), Gaps = 3/122 (2%)
Query: 158 STSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIY 217
+ S+LP+V EK+ +TK L+ECEG+++DLSGDMGAVGR V ++E+ LDLKG IY
Sbjct: 172 AASSLPIVFGEKVNKTKVLLECEGDALDLSGDMGAVGRFTV---NRRDNELLLDLKGVIY 228
Query: 218 KTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDES 277
KTT+VPS TF +V+ G ++AK+E+IM+DF+QL+ + E ET+VEGTL F F+S++E
Sbjct: 229 KTTIVPSNTFFLVNVGQTDAKVESIMSDFVQLRADTIGKENETVVEGTLQDFTFESDEEG 288
Query: 278 DK 279
++
Sbjct: 289 ER 290
>gi|45736061|dbj|BAD13087.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 189
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 60/106 (56%), Positives = 85/106 (80%), Gaps = 4/106 (3%)
Query: 144 AEEDIAEKRSMLNVSTSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAE 203
AEED+ +K + +VS LPL++ +K+QR+KAL+EC+G+S+DLSGD+GAVGRI++ +
Sbjct: 87 AEEDMQDKLTEHSVS-QRLPLIIPDKIQRSKALIECDGDSIDLSGDVGAVGRIIISNSPN 145
Query: 204 GNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQL 249
GN E+ LDLKGTIYK+T+VPSRTFC+VS G +EAK +++D I L
Sbjct: 146 GNQELLLDLKGTIYKSTIVPSRTFCVVSVGQTEAK---VVDDNINL 188
>gi|224132538|ref|XP_002321341.1| predicted protein [Populus trichocarpa]
gi|222868337|gb|EEF05468.1| predicted protein [Populus trichocarpa]
Length = 159
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 79/196 (40%), Positives = 98/196 (50%), Gaps = 61/196 (31%)
Query: 127 KDVINNVKRKG-------NNDHEDAEEDIAEKRSMLNVSTSTLPLVMSEKLQRTKALVEC 179
K ++ VK +G + D E EE +EK +VS S LPLV+SEK+QR+KALVEC
Sbjct: 2 KAIMCKVKVQGEEGTCGNHGDVEITEEVTSEKLINAHVSISRLPLVLSEKVQRSKALVEC 61
Query: 180 EGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKI 239
EGES+DLSG+MGAVGR ++P T G+ EM LDLKGT
Sbjct: 62 EGESIDLSGEMGAVGRPVIPDTPSGDSEMHLDLKGT------------------------ 97
Query: 240 EAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEEQTN 299
L+GF F SEDE+DK+ KA HQ DQN +EE +N
Sbjct: 98 --------------------------LEGFSFFSEDETDKITKATAHQTDQNMDVEEPSN 131
Query: 300 EKTKVK----SGPCRK 311
K K SG RK
Sbjct: 132 GMIKRKAEKSSGVARK 147
>gi|384247484|gb|EIE20971.1| hypothetical protein COCSUDRAFT_83530 [Coccomyxa subellipsoidea
C-169]
Length = 378
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 84/276 (30%), Positives = 130/276 (47%), Gaps = 36/276 (13%)
Query: 9 PDWLRSFQAPTHSVLTLSSDSD--------SDSSHNGNGDGGKNIAEL--TFEKHMEPFA 58
PDWL +P + L L SDSD D + G G G + AE K A
Sbjct: 16 PDWLSENNSPVKA-LELLSDSDEVLEVASSDDGAEEGAGPGPGSAAEQKEAPRKRGRAAA 74
Query: 59 PKSS-----ILASSSDSES--------CQNSSLLREEKAFQFQEKEMDEDAALTGREEKT 105
P++ +L SS +S+S Q + R++ Q+ + G +K
Sbjct: 75 PQAKRNLNRVLLSSENSDSDDVEGSGKGQQPARKRQKTMVATQQSQRFSSQDSAGPSQKA 134
Query: 106 SVKKVSKEKSPKKGLKSPKKEKDVINN-VKRKG-----NNDHEDAEEDIAEKRSMLNVST 159
+K + K L+ ++E + +N V +G + + K ++L
Sbjct: 135 PSRKAAGSGG-KAALRRIQEEDEQPDNDVGGEGTQAIVGSQAAKGPKSPVSKIAVLAPVA 193
Query: 160 STLPLVMSEKLQRTKALVECEG---ESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTI 216
+P++M EKL TK +VE EG SVDL+GD G VGR LV G + N E+ +DLKG +
Sbjct: 194 GEIPVMMPEKLPVTKFIVELEGGNDSSVDLTGDAGVVGRWLVEGD-DSNPELRMDLKGVV 252
Query: 217 YKTTLVP-SRTFCIVSFGHSEAKIEAIMNDFIQLKP 251
Y T VP + T C+V+ +EA++EA+ + +IQL+P
Sbjct: 253 YAATTVPCATTLCVVNLSATEARVEALGHQYIQLRP 288
>gi|303275097|ref|XP_003056848.1| hypothetical protein MICPUCDRAFT_62312 [Micromonas pusilla
CCMP1545]
gi|226461200|gb|EEH58493.1| hypothetical protein MICPUCDRAFT_62312 [Micromonas pusilla
CCMP1545]
Length = 392
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 47/119 (39%), Positives = 72/119 (60%), Gaps = 8/119 (6%)
Query: 162 LPLVMSEKLQRTKALVECEG--ESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKT 219
+PL+M + R K ECEG ESVDL GD+G VGR+L +A+ + +DLKG IY
Sbjct: 195 IPLLMPATVNRNKIFFECEGAGESVDLEGDVGVVGRLLSDPSAKYTSGVKVDLKGVIYDA 254
Query: 220 TLVPS-RTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFL-FDSEDE 276
++P+ + +++ +EAK+E I ND++QL+ V GTL+G+L DS+D+
Sbjct: 255 KILPTPASIVVLAVNQTEAKVECIANDYVQLRRDVTANN----VGGTLEGYLGVDSDDD 309
>gi|449532507|ref|XP_004173222.1| PREDICTED: DNA-binding protein BIN4-like, partial [Cucumis sativus]
Length = 198
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 61/174 (35%), Positives = 95/174 (54%), Gaps = 25/174 (14%)
Query: 41 GGKNIA-ELTFEKHMEPFAPKSSILASSSDSESCQNSSLLREEKAFQFQEKEMDEDAALT 99
G N+A E+ ++H K S+ S DSESC +++ ++E+ ++ + E+ + +
Sbjct: 49 GNHNLAKEVKLDRHTGHENSKHSVWMLSLDSESCSDNNFIKEDYSYHEELAEL-ATSEIQ 107
Query: 100 GREEKTSVKKVSKEKSPKKGLKSPKKEKDVINNVKRKGNNDHEDAEEDIAEKRSMLNVST 159
GR + E + +G +V+ N D E E+D + VS+
Sbjct: 108 GRRKD--------ENAGSEG------------HVR---NGDVEILEKDALDDCIGPPVSS 144
Query: 160 STLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLK 213
S LPLV+S+K R KALVECEG S+DLSGDMGAVGR++V ++ +E+ LDLK
Sbjct: 145 SRLPLVLSDKAHRLKALVECEGTSIDLSGDMGAVGRVVVSDSSSAKNELCLDLK 198
>gi|307107138|gb|EFN55382.1| hypothetical protein CHLNCDRAFT_52595 [Chlorella variabilis]
Length = 402
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 46/125 (36%), Positives = 73/125 (58%), Gaps = 8/125 (6%)
Query: 149 AEKRSMLNVSTSTLPLVMSEKLQRTKALVECEGES------VDLSGDMGAVGRILVPGTA 202
A + + TS +P+V+ EKL + K LVE E DLSGD GA+GR+++ GT
Sbjct: 168 AAGLGLPHAPTSQMPVVLPEKLPQMKMLVELESNPDDAHHVTDLSGDSGAIGRLVIAGTK 227
Query: 203 EGNHEMFLDLKGTIYKTTLVPSR-TFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETM 261
E EM +DLKG +Y T+VP + +++ G +EAK+E + ++F+QL+ + +T
Sbjct: 228 EAP-EMQIDLKGVLYNATIVPCPVSIAVINMGQTEAKVECLFSEFVQLREDTRFSFVDTE 286
Query: 262 VEGTL 266
V G +
Sbjct: 287 VLGGM 291
>gi|255084389|ref|XP_002508769.1| predicted protein [Micromonas sp. RCC299]
gi|226524046|gb|ACO70027.1| predicted protein [Micromonas sp. RCC299]
Length = 376
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 46/121 (38%), Positives = 72/121 (59%), Gaps = 11/121 (9%)
Query: 162 LPLVMSEKLQRTKALVECEG--ESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKT 219
+PL M + R K E EG E+VDL GD+G VGR+L + + + +D+KG IY
Sbjct: 178 VPLFMPSAVNRNKVFFELEGTGEAVDLEGDVGVVGRLLSNASEKHGSGLQMDMKGVIYNA 237
Query: 220 TLVPS-RTFCIVSFGHSEAKIEAIMNDFIQLK--PQSNVYEAETMVEGTLDGFL-FDSED 275
++P+ + I++ +EAK+E+I +DF+QL+ P +N + TLDG+L DS+D
Sbjct: 238 RILPTPASVVILAVNANEAKVESITHDFVQLRRDPVAN-----GGLGATLDGYLGVDSDD 292
Query: 276 E 276
E
Sbjct: 293 E 293
>gi|320166916|gb|EFW43815.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 236
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 41/110 (37%), Positives = 68/110 (61%), Gaps = 3/110 (2%)
Query: 162 LPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTL 221
+PLV+ + T L+E ++LSGD G+VGR++V + + LDLKG I++ T+
Sbjct: 53 IPLVLGKP--ATAFLMETTEPQLNLSGDTGSVGRLVV-NAQDAAAPLVLDLKGLIFQGTV 109
Query: 222 VPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLF 271
VPS TF +V+ G +EA+I+A +N+ I + ++ T+VEG++D F
Sbjct: 110 VPSNTFMLVTIGPNEARIDATVNNVIHTTCLGDALDSMTVVEGSIDADAF 159
>gi|255544302|ref|XP_002513213.1| conserved hypothetical protein [Ricinus communis]
gi|223547711|gb|EEF49204.1| conserved hypothetical protein [Ricinus communis]
Length = 137
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 54/125 (43%), Positives = 78/125 (62%), Gaps = 9/125 (7%)
Query: 58 APKSSILASSSDSESCQNSSLLREEKAFQFQEKEMDEDAALTGREEKTSVKKVSKEKSPK 117
A K SI A SSDSESC + S R+ K+ Q +E ++DA L SV+K K KSPK
Sbjct: 17 AAKRSIWALSSDSESCPDDSSQRKTKSNQPKESRKEKDAILVD----NSVEKALKGKSPK 72
Query: 118 KGLK----SPKKEKDVINNVKRKGNNDH-EDAEEDIAEKRSMLNVSTSTLPLVMSEKLQR 172
K LK KK+++ ++V + G+ND E A+ED++EK + VSTS +PL++S+K+ R
Sbjct: 73 KLLKVEGRKLKKKENKNDDVPKTGDNDDVEVADEDVSEKHTGPQVSTSMIPLLLSDKVSR 132
Query: 173 TKALV 177
TK +
Sbjct: 133 TKMFL 137
>gi|428174918|gb|EKX43811.1| hypothetical protein GUITHDRAFT_140250 [Guillardia theta CCMP2712]
Length = 309
Score = 70.9 bits (172), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 52/82 (63%), Gaps = 5/82 (6%)
Query: 176 LVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHS 235
+V+CE +++D GD G +G I G M LDLKG +Y+ +++P T ++ G +
Sbjct: 164 VVQCEDQNLDFEGDSGVIGTI-----KTGERSMELDLKGQLYEGSILPCCTMMVMDMGAT 218
Query: 236 EAKIEAIMNDFIQLKPQSNVYE 257
E KIE++MNDF+++ P +N ++
Sbjct: 219 EVKIESVMNDFVKVHPIANPHD 240
>gi|302857504|ref|XP_002959890.1| hypothetical protein VOLCADRAFT_101408 [Volvox carteri f.
nagariensis]
gi|300254022|gb|EFJ39045.1| hypothetical protein VOLCADRAFT_101408 [Volvox carteri f.
nagariensis]
Length = 340
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 42/129 (32%), Positives = 64/129 (49%), Gaps = 35/129 (27%)
Query: 157 VSTSTLPLVMSEKLQRTKALVEC-----EGE----SVDLSGDMGAVGRILVPGTAEGNH- 206
V LP+++ EKL R K L+E GE + DLSGD GAVGR++V N
Sbjct: 38 VEAGKLPIMLPEKLNRNKVLLELPPAAMSGEHYHGAADLSGDSGAVGRLVVGKAPAANSK 97
Query: 207 ------------------------EMFLDLKGTIYKTTLVP-SRTFCIVSFGHSEAKIEA 241
++ +DLKG +Y ++P + T +++ G SEAK+E+
Sbjct: 98 GTAAAAAAAAAGGSEGGGSGGDSVQLQVDLKGLLYNAWVLPLAGTAMVLNIGPSEAKVES 157
Query: 242 IMNDFIQLK 250
+ NDFI+L+
Sbjct: 158 LFNDFIRLR 166
>gi|449526337|ref|XP_004170170.1| PREDICTED: DNA-binding protein BIN4-like [Cucumis sativus]
Length = 71
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 35/53 (66%), Positives = 41/53 (77%), Gaps = 1/53 (1%)
Query: 243 MNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIE 295
MNDFIQLK S V EAETMVEGTLDGF FDSED+++K+ KA DQN+ +E
Sbjct: 1 MNDFIQLKALSKVDEAETMVEGTLDGFSFDSEDDAEKITKA-ASPADQNEPVE 52
>gi|299471419|emb|CBN79372.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 240
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 68/120 (56%), Gaps = 7/120 (5%)
Query: 162 LPLVMSEKL---QRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYK 218
+PL++ + + LV E+++ +GD GA+GRI V G +G F DLKG +
Sbjct: 59 VPLIVPPAVCKDAKASLLVHLRDEALNFAGDTGAIGRISVGGKKKGRGLTF-DLKGQQFS 117
Query: 219 TTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAET--MVEGTL-DGFLFDSED 275
++PS T +V G +EAK+E + ++ +L + NV++A +++G + D + F+ ED
Sbjct: 118 GQIIPSATVMVVGIGKTEAKVEGLFSEVCRLDYKGNVFDAMGGEVLKGEMDDSYKFEEED 177
>gi|412989095|emb|CCO15686.1| hypothetical protein VOLCADRAFT_101408 [Bathycoccus prasinos]
Length = 422
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 44/117 (37%), Positives = 66/117 (56%), Gaps = 24/117 (20%)
Query: 157 VSTSTLPLVMSE-KLQRTK---ALVECEG--ESVDLSGDMGAVGRILV-----PGTAEGN 205
V S +PL+++ LQ+T L EC+G E+VDLSGD+G VGR+LV P G+
Sbjct: 237 VPKSKMPLLINHGALQKTANSTILFECKGSGEAVDLSGDVGVVGRLLVERESQPTGGGGS 296
Query: 206 H----------EMFLDLKGTIY--KTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLK 250
+ +DLKG +Y +T L+ S T IV+ G +AKIE ++++QL+
Sbjct: 297 KASTSKQQPHASVKVDLKGVMYLARTHLL-SSTVAIVNVGAEDAKIETFFDEYVQLR 352
>gi|308804497|ref|XP_003079561.1| unnamed protein product [Ostreococcus tauri]
gi|116058016|emb|CAL54219.1| unnamed protein product [Ostreococcus tauri]
Length = 229
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 47/124 (37%), Positives = 70/124 (56%), Gaps = 11/124 (8%)
Query: 158 STSTLPLVMSEKLQ-RTKALVECEG--ESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKG 214
S++ + LV S L ++K L E EG E+VDL GD GAVGR L AE + +D+KG
Sbjct: 63 SSTRVDLVASAHLGGKSKVLFEIEGNGEAVDLDGDTGAVGRWL----AESARSLKVDMKG 118
Query: 215 TIYKTTLVP-SRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDS 273
+Y +VP + T +V+ AKI+++ +F+QL+ + V A T G G LF+
Sbjct: 119 LMYNARVVPLAGTVVVVAVNGDVAKIDSVHREFVQLR-EDPVANAGTENYGV--GSLFEM 175
Query: 274 EDES 277
DE+
Sbjct: 176 TDEA 179
>gi|159483423|ref|XP_001699760.1| predicted protein [Chlamydomonas reinhardtii]
gi|158281702|gb|EDP07456.1| predicted protein [Chlamydomonas reinhardtii]
Length = 308
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 38/138 (27%), Positives = 64/138 (46%), Gaps = 41/138 (29%)
Query: 160 STLPLVMSEKLQRTKALVECEGE---------------SVDLSGDMGAVGRILVPGTAEG 204
+LPL++ +K+ R K L E + DLSGD GAVGR++V A
Sbjct: 140 GSLPLMLPDKINRNKVLFELPPPPASSSNADGYGHGHGATDLSGDAGAVGRLIVLAAAPT 199
Query: 205 NHE-------------------------MFLDLKGTIYKTTLVP-SRTFCIVSFGHSEAK 238
+ + +DLKG IY +P + T +++ G +EAK
Sbjct: 200 GGKGAAAGGGEPGPSSQGGGGPGGAEVCLQVDLKGVIYNAWALPLAGTAMVLNVGPTEAK 259
Query: 239 IEAIMNDFIQLKPQSNVY 256
+E++ NDF++L+ ++V+
Sbjct: 260 VESLFNDFVRLREAADVH 277
>gi|356516190|ref|XP_003526779.1| PREDICTED: uncharacterized protein LOC100812582 [Glycine max]
Length = 82
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 26/46 (56%), Positives = 36/46 (78%), Gaps = 1/46 (2%)
Query: 261 MVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEEQTNEKTKVKS 306
MVEGTLDGF FDS++E+ K+ K+ +Q DQN+ +EEQ N K+K K+
Sbjct: 1 MVEGTLDGFFFDSDEEAGKMQKST-NQTDQNENVEEQANGKSKGKT 45
>gi|255544300|ref|XP_002513212.1| conserved hypothetical protein [Ricinus communis]
gi|223547710|gb|EEF49203.1| conserved hypothetical protein [Ricinus communis]
Length = 119
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 24/31 (77%), Positives = 25/31 (80%)
Query: 1 MSGSREESPDWLRSFQAPTHSVLTLSSDSDS 31
MS SRE SPDWL FQAP HS +TLSSDSDS
Sbjct: 1 MSSSRENSPDWLHCFQAPIHSTVTLSSDSDS 31
>gi|145347277|ref|XP_001418100.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578328|gb|ABO96393.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 999
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 38/97 (39%), Positives = 58/97 (59%), Gaps = 8/97 (8%)
Query: 158 STSTLPLVMSEKLQ-RTKALVECEG--ESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKG 214
S++ + LV + L ++K L E EG E+VDL GD GAVGR L AE + + +D+KG
Sbjct: 33 SSTAVELVATANLGGKSKVLFEIEGAGEAVDLDGDTGAVGRWL----AESSRALKVDMKG 88
Query: 215 TIYKTTLVPSR-TFCIVSFGHSEAKIEAIMNDFIQLK 250
+Y +V S T +V+ AKIE++ +F+QL+
Sbjct: 89 VMYNARVVSSAGTVVVVAVNADVAKIESVHREFVQLR 125
>gi|397569025|gb|EJK46493.1| hypothetical protein THAOC_34834 [Thalassiosira oceanica]
Length = 315
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 30/102 (29%), Positives = 54/102 (52%), Gaps = 9/102 (8%)
Query: 176 LVECEGESVDLSGDMGAVGRILVPGTAEGNHE-MFLDLKGTIYKTTLVPSRTFCIVSFGH 234
LV+ + +DL G GAVGR+ E + + LDLKG Y+ T+ P T +V+
Sbjct: 175 LVQMNDDKLDLHGQSGAVGRL------EADERGVILDLKGYQYRGTIRPGPTAMVVALTR 228
Query: 235 -SEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSED 275
+ K+E+I ++F+ L + + +++ + +GF+ ED
Sbjct: 229 EGKFKVESITDEFVTLDAK-GAHVMDSLGDNMREGFMVREED 269
>gi|422294628|gb|EKU21928.1| hypothetical protein NGA_2017520, partial [Nannochloropsis gaditana
CCMP526]
Length = 238
Score = 46.2 bits (108), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 39/74 (52%), Gaps = 4/74 (5%)
Query: 185 DLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMN 244
DL+GD GAVGR+ V + + + LDLKG Y L T +VS G EAK+E +++
Sbjct: 40 DLTGDSGAVGRVHV----KKDRGVVLDLKGHQYLAELCRCPTMLLVSLGEEEAKVEDVVD 95
Query: 245 DFIQLKPQSNVYEA 258
L + EA
Sbjct: 96 TLCVLTHHHDELEA 109
>gi|219116542|ref|XP_002179066.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217409833|gb|EEC49764.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 289
Score = 45.4 bits (106), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 55/105 (52%), Gaps = 14/105 (13%)
Query: 176 LVECEGE---SVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSF 232
LV+ + E S+D +G +GA+GR+ AEG + LD KG Y+ + P T ++S
Sbjct: 150 LVQIDPEDSASLDFTGAVGAIGRLET--VAEG---VVLDFKGCQYRGRIYPGSTAVVLST 204
Query: 233 GHSEA-----KIEAIMNDFIQLKPQSNVYE-AETMVEGTLDGFLF 271
+A K+E++ ++F +L + N E +V G + G+ +
Sbjct: 205 NGKKADTQLLKVESMTDEFCRLTKEGNAMEKLNAIVRGDMVGYQY 249
>gi|387199325|gb|AFJ68895.1| hypothetical protein NGATSA_2017510 [Nannochloropsis gaditana
CCMP526]
Length = 226
Score = 45.1 bits (105), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 39/74 (52%), Gaps = 4/74 (5%)
Query: 185 DLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMN 244
DL+GD GAVGR+ V + + + LDLKG Y L T +VS G EAK++ +++
Sbjct: 28 DLTGDSGAVGRVHV----KKDRGVVLDLKGHQYLAELCRCPTMLLVSLGEEEAKVKDVVD 83
Query: 245 DFIQLKPQSNVYEA 258
L + EA
Sbjct: 84 TLCVLTHHHDELEA 97
>gi|449018377|dbj|BAM81779.1| hypothetical protein CYME_CMP364C [Cyanidioschyzon merolae strain
10D]
Length = 263
Score = 44.7 bits (104), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 27/100 (27%), Positives = 51/100 (51%), Gaps = 10/100 (10%)
Query: 175 ALVECEGESVDLSGDMGAVGRILVPGTAE-GNHEMFLDLKGTIYKTTLVPSRTFCI---- 229
AL+ E + DL+GD G +G + A+ G + LDLKG +Y+T RTF +
Sbjct: 110 ALLVFENDEFDLAGDTGTIGCVQWSHNADSGAVRLCLDLKGVVYET-----RTFSLCGSA 164
Query: 230 VSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGF 269
++ + ++EA+++D P + E ++ G ++ +
Sbjct: 165 LAVDPTSQRVEAVLHDITYSIPTNVADERAQLIRGKIEPY 204
>gi|413935595|gb|AFW70146.1| hypothetical protein ZEAMMB73_185491 [Zea mays]
Length = 186
Score = 40.8 bits (94), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 17/29 (58%), Positives = 22/29 (75%)
Query: 3 GSREESPDWLRSFQAPTHSVLTLSSDSDS 31
G E+ PDWLR+FQAPT + + LSS SD+
Sbjct: 2 GDEEDDPDWLRAFQAPTVAPVMLSSGSDT 30
>gi|413935596|gb|AFW70147.1| hypothetical protein ZEAMMB73_185491 [Zea mays]
Length = 198
Score = 40.4 bits (93), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 17/29 (58%), Positives = 22/29 (75%)
Query: 3 GSREESPDWLRSFQAPTHSVLTLSSDSDS 31
G E+ PDWLR+FQAPT + + LSS SD+
Sbjct: 2 GDEEDDPDWLRAFQAPTVAPVMLSSGSDT 30
>gi|293394374|ref|ZP_06638674.1| cystathionine beta-lyase [Serratia odorifera DSM 4582]
gi|291423352|gb|EFE96581.1| cystathionine beta-lyase [Serratia odorifera DSM 4582]
Length = 392
Score = 38.9 bits (89), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 34/138 (24%), Positives = 65/138 (47%), Gaps = 14/138 (10%)
Query: 121 KSPKKEKDVINNVKRKGNNDHEDAEEDIAEKRSMLNVSTSTLPLVMSEKLQRTKA-LVEC 179
+ P+++ +N +G+ + +DI KR+ + TS P + E L+ + L +
Sbjct: 21 RDPEQQSGFVNAPVYRGSTVVYKSCDDIRHKRARFSYGTSGTPTI--ENLENAWSTLCDA 78
Query: 180 EGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFC---IVSFGHSE 236
G + SG +GA+ L T G+H + D ++Y+ P+R FC +V +G S
Sbjct: 79 AGTVISPSG-LGALALALFSVTRAGDHILVTD---SVYR----PTRKFCNTLLVKYGVSV 130
Query: 237 AKIEAIMNDFIQLKPQSN 254
+ ++ I+ Q+N
Sbjct: 131 TYYDPLIGGDIERLLQNN 148
>gi|358336321|dbj|GAA54864.1| midasin [Clonorchis sinensis]
Length = 5090
Score = 38.9 bits (89), Expect = 3.9, Method: Composition-based stats.
Identities = 37/144 (25%), Positives = 59/144 (40%), Gaps = 24/144 (16%)
Query: 25 LSSDSDSDSSHNGNGDGGKNIAELTFEKHM----EPFAPKSSILASSSDSESCQNSSL-- 78
LSS ++ N GDG ++ F++ E P S+I + +D+ Q+ +
Sbjct: 4249 LSSADETQDLENRMGDGASGASDEVFKEMWASDNEEEEPASTIPPAENDTGGVQDQTNGK 4308
Query: 79 -----LREEKAFQFQEKEMDEDAALTGREEKTSVKKVSKEKSPKKGLKSPKKEKDVINNV 133
R+ Q Q+K +D D + E+ ++ + PKKG PK KD N
Sbjct: 4309 QDEKGTRDSSDVQRQKKSLDTDQPAQEKNEEANLGEAESSPPPKKG---PKSAKDFQNKT 4365
Query: 134 KRKG-------NNDHE---DAEED 147
G NN+ E DAE +
Sbjct: 4366 TVAGDEGQMLPNNEQELTPDAESE 4389
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.306 0.124 0.340
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,002,068,949
Number of Sequences: 23463169
Number of extensions: 207109386
Number of successful extensions: 918536
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 204
Number of HSP's successfully gapped in prelim test: 1645
Number of HSP's that attempted gapping in prelim test: 900332
Number of HSP's gapped (non-prelim): 15018
length of query: 336
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 193
effective length of database: 9,003,962,200
effective search space: 1737764704600
effective search space used: 1737764704600
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 77 (34.3 bits)