BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 044000
         (420 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|449448711|ref|XP_004142109.1| PREDICTED: mediator of RNA polymerase II transcription subunit
           4-like [Cucumis sativus]
 gi|449520010|ref|XP_004167027.1| PREDICTED: mediator of RNA polymerase II transcription subunit
           4-like [Cucumis sativus]
          Length = 404

 Score =  535 bits (1379), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 296/420 (70%), Positives = 340/420 (80%), Gaps = 18/420 (4%)

Query: 1   MLQHQIVQSPARLGLTNPNSPSLQNPNPNPSPTKFPTSATHHHLRSATPPTAPITTGTTA 60
           MLQHQIVQSPARLGLTNPNSPS+QNP    +P K P S   HH   +   ++  ++    
Sbjct: 1   MLQHQIVQSPARLGLTNPNSPSIQNP----TPPKLPPSQQSHHQNQSNLSSSTPSSTLLP 56

Query: 61  CTSSTLLSLLPPLPRAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPTFLSSQAQSQP 120
                         RAQALLLQMASLASKLFEVSPNRSLW++AFRGSLPTFL+SQ QS P
Sbjct: 57  LLPPLP--------RAQALLLQMASLASKLFEVSPNRSLWLTAFRGSLPTFLTSQTQSSP 108

Query: 121 PPPIDSSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFA 180
              +DS+PS+ KEI++ F  LQTQLFEAVAELQEILDLQD+KQKIA EI+AKD+A+ AFA
Sbjct: 109 STLLDSTPSSSKEILSLFNVLQTQLFEAVAELQEILDLQDSKQKIAYEIRAKDAALRAFA 168

Query: 181 NKIKEAERVLDILVDDYSDYRRPKRVKSEGGDGNDDDSGSCSTTVESRLNLSDILSYAHR 240
           NK+KEAE+VLD+L DDY+DY+RP+R K+EG   N+     CSTTV S+L LSDILSYAHR
Sbjct: 169 NKLKEAEQVLDVLFDDYADYQRPRRDKTEGDAENE---SLCSTTVASQLRLSDILSYAHR 225

Query: 241 ISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKVVETKEKTIEAI 300
           ISYTTFAPPEFGAGQ PLRGALPPAPQ+EQMRASQLY FA+LDVGLPK VE+ EKTIE +
Sbjct: 226 ISYTTFAPPEFGAGQAPLRGALPPAPQDEQMRASQLYNFANLDVGLPKAVESTEKTIEPM 285

Query: 301 IEPP-PLQPVDANPLKDLAGLLPPNITVPSGWKPGMPVELPKDFPLVPPPGWKPGDAVPL 359
           IEPP P++P DANPL  + GLLPPNITVPSGWKPGMPVELP D P+ PPPGWKPGD V L
Sbjct: 286 IEPPAPVRP-DANPLAAIQGLLPPNITVPSGWKPGMPVELPGDLPM-PPPGWKPGDPVRL 343

Query: 360 PPLDALPMPRIEEQQLRPVPPQGLHKPPEPIQVRHVELDILDQDDDSSDYSSEEGSSEDD 419
           PP+D+LP+ R++E QLRP    GLHKPPEPIQVRHV+LDILDQDDDSSDY+S+EGSSEDD
Sbjct: 344 PPMDSLPVTRVDEPQLRPNALPGLHKPPEPIQVRHVQLDILDQDDDSSDYTSDEGSSEDD 403


>gi|357518055|ref|XP_003629316.1| hypothetical protein MTR_8g075790 [Medicago truncatula]
 gi|355523338|gb|AET03792.1| hypothetical protein MTR_8g075790 [Medicago truncatula]
          Length = 402

 Score =  521 bits (1343), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 284/406 (69%), Positives = 326/406 (80%), Gaps = 31/406 (7%)

Query: 1   MLQHQIVQSPARLGLTNPNSPSLQNPNPNPSPTKFPTSATHHH--LRSATPPTAPITTGT 58
           MLQHQIVQSPARLGLTNPNSPS+ NPNP     K P S T+HH    SATP         
Sbjct: 1   MLQHQIVQSPARLGLTNPNSPSILNPNPQ----KLPPSQTNHHQDRHSATP--------- 47

Query: 59  TACTSSTLLSLLPPLPRAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPTFLSSQAQS 118
               S+ L+SLLPPLPRAQALL QMASLASKLFEVSPNRS+W++AFRGSLPTFLSSQ Q 
Sbjct: 48  ----SAALISLLPPLPRAQALLSQMASLASKLFEVSPNRSVWVTAFRGSLPTFLSSQTQP 103

Query: 119 QPPPPIDSS-PSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAIL 177
                ++SS PST KEII+ FT+LQTQ+FEAV+ELQEI+DLQDAK+KI  EI++KDSA+L
Sbjct: 104 HSFSSLESSLPSTTKEIISLFTTLQTQIFEAVSELQEIIDLQDAKKKIDGEIRSKDSALL 163

Query: 178 AFANKIKEAERVLDILVDDYSDYRRP-KRVKSEGGDGNDDDSGSCSTTVESRLNLSDILS 236
           AFANK+K+AER LDILVDDYSDYRR  KR+KS  GDG++DDS + ST V S+L LSDILS
Sbjct: 164 AFANKLKDAERELDILVDDYSDYRRSIKRLKS--GDGSEDDSLTTST-VSSQLKLSDILS 220

Query: 237 YAHRISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKVVETKEKT 296
           YAHRISYTTFAPPEFGAG  PLRGA+PPAPQ+EQMRASQLY FADLD+GLPK VETKEKT
Sbjct: 221 YAHRISYTTFAPPEFGAGTAPLRGAMPPAPQDEQMRASQLYNFADLDIGLPKAVETKEKT 280

Query: 297 IEAIIEPPPLQPVDANPLKDLA---GLLPPNITVPSGWKPGMPVELPKDFPLVPPPGWKP 353
           +EAIIEPPP   VD NPL +L+   G+LPPN  +P GWKPGMPV+LP D P+ PPPGWKP
Sbjct: 281 VEAIIEPPP--SVDTNPLGNLSAIQGMLPPNFAIPPGWKPGMPVQLPIDMPIKPPPGWKP 338

Query: 354 GDAVPLPPLDALPMPRIEEQQLRPVPPQGLHKPPEPIQVRHVELDI 399
           GD V LPP+D+LP+PR EEQ+L P  PQ   + PE IQV+HV LD+
Sbjct: 339 GDPVALPPIDSLPVPRFEEQKLPPHIPQP--RQPEIIQVQHVNLDL 382


>gi|225439765|ref|XP_002273310.1| PREDICTED: uncharacterized protein LOC100249946 [Vitis vinifera]
          Length = 414

 Score =  515 bits (1327), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 306/423 (72%), Positives = 341/423 (80%), Gaps = 19/423 (4%)

Query: 2   LQHQIVQSPARLGLTNPNSPSLQNPNPNPSPTKFPTSATHHHLRSATPPTAPITTGTTAC 61
           +QHQIVQSPARLGL NPNSPSLQN           T+         +      +  +T  
Sbjct: 5   VQHQIVQSPARLGLPNPNSPSLQNS----------TAPKLSSQIPQSQQPHQQSNLSTTT 54

Query: 62  TSSTLLSLLPPLPRAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPTFLSSQAQSQPP 121
           TSSTLLSLLPPL RAQ+LLLQMASLAS+LFEVSPNRSLW+++FRG+LPTFLSSQ+    P
Sbjct: 55  TSSTLLSLLPPLSRAQSLLLQMASLASRLFEVSPNRSLWLTSFRGTLPTFLSSQSIPPTP 114

Query: 122 PPIDSSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFAN 181
            P++ SPS+ KEI++ FTSLQTQLFEAVAELQEILDLQDAKQKI REI++KDSAILAFAN
Sbjct: 115 TPLEHSPSSTKEILSLFTSLQTQLFEAVAELQEILDLQDAKQKITREIRSKDSAILAFAN 174

Query: 182 KIKEAERVLDILVDDYSDYRRPKRVKSEGGDGNDDDSGSCSTTVESRLNLSDILSYAHRI 241
           K+KEAE VLDILVDDYSDYRRPKR  S   D + +DS SC+TTV S+LNLSDILSYAHRI
Sbjct: 175 KLKEAEAVLDILVDDYSDYRRPKR--SNTAD-DSEDSSSCTTTVASQLNLSDILSYAHRI 231

Query: 242 SYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKVVETKEKTIEAII 301
           SYTTFAPPEFGAGQ PLRGALPPAPQEEQMRASQLY F+DLDVGLPK V  KEKTIE  I
Sbjct: 232 SYTTFAPPEFGAGQAPLRGALPPAPQEEQMRASQLYNFSDLDVGLPKAVGIKEKTIETFI 291

Query: 302 EPPPLQPVDANPLKDL---AGLLPPNITVPSGWKPGMPVELPKDFPLVPPPGWKPGDAVP 358
           EPPP QP D+NPL +L    GLLPPNITVPSGWKPGMPVELP   P+VPPPGWKPGD VP
Sbjct: 292 EPPPAQPADSNPLANLPSIQGLLPPNITVPSGWKPGMPVELPS-LPIVPPPGWKPGDPVP 350

Query: 359 LPPLDALPMP-RIEEQQLRPVPPQGLHKPPEPIQVRHVELDI-LDQDDDSSDYSSEEGSS 416
           LPPL+++P+P R EEQ  RP+   GL K PEPI V HV+LDI LDQDDDSSDYSSE GSS
Sbjct: 351 LPPLESIPIPSRAEEQLPRPLAAHGLQKAPEPIHVPHVQLDINLDQDDDSSDYSSEMGSS 410

Query: 417 EDD 419
           ED+
Sbjct: 411 EDE 413


>gi|147807720|emb|CAN66553.1| hypothetical protein VITISV_018166 [Vitis vinifera]
          Length = 1448

 Score =  504 bits (1298), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 300/417 (71%), Positives = 335/417 (80%), Gaps = 19/417 (4%)

Query: 8    QSPARLGLTNPNSPSLQNPNPNPSPTKFPTSATHHHLRSATPPTAPITTGTTACTSSTLL 67
            +SPARLGL NPNSPSLQN           T+         +      +  +T  TSSTLL
Sbjct: 1045 KSPARLGLPNPNSPSLQNS----------TAPKLSSQIPQSQQPHQQSNLSTTTTSSTLL 1094

Query: 68   SLLPPLPRAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPTFLSSQAQSQPPPPIDSS 127
            SLLPPL RAQ+LLLQMASLAS+LFEVSPNRSLW+++FRG+LPTFLSSQ+    P P++ S
Sbjct: 1095 SLLPPLSRAQSLLLQMASLASRLFEVSPNRSLWLTSFRGTLPTFLSSQSIPPTPTPLEHS 1154

Query: 128  PSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFANKIKEAE 187
            PS+ KEI++ FTSLQTQLFEAVAELQEILDLQDAKQKI REI++KDSAILAFANK+KEAE
Sbjct: 1155 PSSTKEILSLFTSLQTQLFEAVAELQEILDLQDAKQKITREIRSKDSAILAFANKLKEAE 1214

Query: 188  RVLDILVDDYSDYRRPKRVKSEGGDGNDDDSGSCSTTVESRLNLSDILSYAHRISYTTFA 247
             VLDILVDDYSDYRRPKR  S   D + +DS SC+TTV S+LNLSDILSYAHRISYTTFA
Sbjct: 1215 AVLDILVDDYSDYRRPKR--SNTAD-DSEDSSSCTTTVASQLNLSDILSYAHRISYTTFA 1271

Query: 248  PPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKVVETKEKTIEAIIEPPPLQ 307
            PPEFGAGQ PLRGALPPAPQEEQMRASQLY F+DLDVGLPK V  KEKTIE  IEPPP Q
Sbjct: 1272 PPEFGAGQAPLRGALPPAPQEEQMRASQLYNFSDLDVGLPKAVGIKEKTIETFIEPPPAQ 1331

Query: 308  PVDANPLKDL---AGLLPPNITVPSGWKPGMPVELPKDFPLVPPPGWKPGDAVPLPPLDA 364
            P D+NPL +L    GLLPPNITVPSGWKPGMPVELP   P+VPPPGWKPGD VPLPPL++
Sbjct: 1332 PADSNPLANLPSIQGLLPPNITVPSGWKPGMPVELP-SLPIVPPPGWKPGDPVPLPPLES 1390

Query: 365  LPMP-RIEEQQLRPVPPQGLHKPPEPIQVRHVELDI-LDQDDDSSDYSSEEGSSEDD 419
            +P+P R EEQ  RP+   GL K PEPI V HV+LDI LDQDDDSSDYSSE GSSED+
Sbjct: 1391 IPIPSRAEEQLPRPLAAHGLQKAPEPIHVPHVQLDINLDQDDDSSDYSSEMGSSEDE 1447


>gi|224138570|ref|XP_002322847.1| predicted protein [Populus trichocarpa]
 gi|222867477|gb|EEF04608.1| predicted protein [Populus trichocarpa]
          Length = 397

 Score =  477 bits (1228), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 266/401 (66%), Positives = 308/401 (76%), Gaps = 27/401 (6%)

Query: 1   MLQHQIVQSPARLGLTNPNSPSL--QNPNPNPSPTKFPTSATHHHLRSATPPTAPITTGT 58
           M QHQIVQSPARLGLTNP SPSL  Q PNP P     P    HHH  + T  T   +   
Sbjct: 1   MHQHQIVQSPARLGLTNPTSPSLPVQTPNPPPPKFPNPQPPQHHHPSTTTTTTTTSSALL 60

Query: 59  TACTSSTLLSLLPPLPRAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPTFLSSQAQS 118
           +               RAQ+LLLQMASLA+KLFE+SPNR  W+SA+RG+LPTFL SQ+QS
Sbjct: 61  SLLPPLP---------RAQSLLLQMASLATKLFELSPNRPHWLSAYRGTLPTFLPSQSQS 111

Query: 119 QPPPPIDSSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILA 178
              P    +P++ KEII+ FTSLQTQLFE+VAELQEILDLQD+KQ+I++EIK+KDSA+L 
Sbjct: 112 LDEP----TPTSTKEIISLFTSLQTQLFESVAELQEILDLQDSKQRISQEIKSKDSALLN 167

Query: 179 FANKIKEAERVLDILVDDYSDYRRPKRVKSEGGDGNDDDSGSCSTTVESRLNLSDILSYA 238
           FANK+KEAERVLDILVDDY+DYRR + +        ++++ + +TTV S+L LSDILSYA
Sbjct: 168 FANKLKEAERVLDILVDDYADYRRRRPIPP----NEEEENATNTTTVASQLKLSDILSYA 223

Query: 239 HRISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKVVETKEKTIE 298
           HRISYTTFAPPEFGAGQ PLRGALPPAPQEEQMRASQLY FA LDVGLPK VETKEKTIE
Sbjct: 224 HRISYTTFAPPEFGAGQAPLRGALPPAPQEEQMRASQLYNFAHLDVGLPKDVETKEKTIE 283

Query: 299 AIIEPPPLQPVDANPLKDLAGLLPPNITVPSGWKPGMPVELPKDFPLVPPPGWKPGDAVP 358
           AIIEP      +ANPL++LAGL PPNI+VP+GWKPGMPV LP D P VPPPGWKPGD V 
Sbjct: 284 AIIEP----QHEANPLENLAGLFPPNISVPAGWKPGMPVVLPTDLP-VPPPGWKPGDPV- 337

Query: 359 LPPLDALPMPRIEEQQLRPVPPQGLHKPPEPIQVRHVELDI 399
              LD++P+PR  EQ+L P+ PQGLHKPPE IQVRHVELDI
Sbjct: 338 --SLDSIPLPRAVEQKLPPIAPQGLHKPPETIQVRHVELDI 376


>gi|357500889|ref|XP_003620733.1| hypothetical protein MTR_6g089780 [Medicago truncatula]
 gi|355495748|gb|AES76951.1| hypothetical protein MTR_6g089780 [Medicago truncatula]
          Length = 579

 Score =  452 bits (1164), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 280/403 (69%), Positives = 316/403 (78%), Gaps = 26/403 (6%)

Query: 1   MLQHQIVQSPARLGLTNPNSPSLQNPNPNPSPTKFPTSATHHHLRSATPPTAPITTGTTA 60
           MLQHQ+VQSPARLGLTNPNSP + NP P   P    T+  HH  R              A
Sbjct: 1   MLQHQVVQSPARLGLTNPNSPLIPNPTPPKLPPLQTTN--HHQDRH------------LA 46

Query: 61  CTSSTLLSLLPPLPRAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPTFLSSQAQSQP 120
             S  LLSLLPPLPRAQALL QMASL++KLFEVSPN+SLW+SAFRGSLPTFLSSQ Q + 
Sbjct: 47  TPSPALLSLLPPLPRAQALLQQMASLSTKLFEVSPNKSLWVSAFRGSLPTFLSSQGQPRS 106

Query: 121 PPPIDSSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFA 180
              +DSSPST KEI++ FT+LQTQ+FEAV+ELQE+LD +DAKQKI +EI +KDSA+LAFA
Sbjct: 107 SASLDSSPSTTKEILSLFTNLQTQMFEAVSELQEVLDQKDAKQKIDQEICSKDSALLAFA 166

Query: 181 NKIKEAERVLDILVDDYSDYR-RPKRVKSEGGDGNDDDSGSCSTTVESRLNLSDILSYAH 239
           NK+K+AERVLDILVDDYSDYR + KR+K   GDG++D S + S TV S+L LSDILSYAH
Sbjct: 167 NKLKDAERVLDILVDDYSDYRSKTKRLKL--GDGSEDVSLTTS-TVSSQLKLSDILSYAH 223

Query: 240 RISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKVVETKEKTIEA 299
           RISYTTFAPPEFGAGQ PLRGALPPAPQ+EQMRASQLY FADLD+GLPK  ET EKTIEA
Sbjct: 224 RISYTTFAPPEFGAGQAPLRGALPPAPQDEQMRASQLYNFADLDIGLPKSAETTEKTIEA 283

Query: 300 IIEPPPLQPVDANPLKDL---AGLLPPNITVPSGWKPGMPVELPKDFPLVPPPGWKPGDA 356
           IIEPPPLQPVDAN L +L    GLLPPN TVP+GWKPGMPV+LP D P+ PPPGWKPGD 
Sbjct: 284 IIEPPPLQPVDANSLANLPGIQGLLPPNFTVPAGWKPGMPVQLPIDIPIKPPPGWKPGDP 343

Query: 357 VPLPPLDALPMPRIEEQQLRPVPPQGLHKPPEPIQVRHVELDI 399
           V    LD+L +PRIEEQQL P  PQ   K PE IQV  V LD+
Sbjct: 344 V---ALDSLSIPRIEEQQLHPHVPQP--KLPEIIQVAPVNLDL 381


>gi|357501019|ref|XP_003620798.1| hypothetical protein MTR_6g090580 [Medicago truncatula]
 gi|355495813|gb|AES77016.1| hypothetical protein MTR_6g090580 [Medicago truncatula]
          Length = 538

 Score =  452 bits (1163), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 280/403 (69%), Positives = 316/403 (78%), Gaps = 26/403 (6%)

Query: 1   MLQHQIVQSPARLGLTNPNSPSLQNPNPNPSPTKFPTSATHHHLRSATPPTAPITTGTTA 60
           MLQHQ+VQSPARLGLTNPNSP + NP P   P    T+  HH  R              A
Sbjct: 1   MLQHQVVQSPARLGLTNPNSPLIPNPTPPKLPPLQTTN--HHQDRH------------LA 46

Query: 61  CTSSTLLSLLPPLPRAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPTFLSSQAQSQP 120
             S  LLSLLPPLPRAQALL QMASL++KLFEVSPN+SLW+SAFRGSLPTFLSSQ Q + 
Sbjct: 47  TPSPALLSLLPPLPRAQALLQQMASLSTKLFEVSPNKSLWVSAFRGSLPTFLSSQGQPRS 106

Query: 121 PPPIDSSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFA 180
              +DSSPST KEI++ FT+LQTQ+FEAV+ELQE+LD +DAKQKI +EI +KDSA+LAFA
Sbjct: 107 SASLDSSPSTTKEILSLFTNLQTQMFEAVSELQEVLDQKDAKQKIDQEICSKDSALLAFA 166

Query: 181 NKIKEAERVLDILVDDYSDYR-RPKRVKSEGGDGNDDDSGSCSTTVESRLNLSDILSYAH 239
           NK+K+AERVLDILVDDYSDYR + KR+K   GDG++D S + S TV S+L LSDILSYAH
Sbjct: 167 NKLKDAERVLDILVDDYSDYRSKTKRLKL--GDGSEDVSLTTS-TVSSQLKLSDILSYAH 223

Query: 240 RISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKVVETKEKTIEA 299
           RISYTTFAPPEFGAGQ PLRGALPPAPQ+EQMRASQLY FADLD+GLPK  ET EKTIEA
Sbjct: 224 RISYTTFAPPEFGAGQAPLRGALPPAPQDEQMRASQLYNFADLDIGLPKSAETTEKTIEA 283

Query: 300 IIEPPPLQPVDANPLKDL---AGLLPPNITVPSGWKPGMPVELPKDFPLVPPPGWKPGDA 356
           IIEPPPLQPVDAN L +L    GLLPPN TVP+GWKPGMPV+LP D P+ PPPGWKPGD 
Sbjct: 284 IIEPPPLQPVDANSLANLPGIQGLLPPNFTVPAGWKPGMPVQLPIDIPIKPPPGWKPGDP 343

Query: 357 VPLPPLDALPMPRIEEQQLRPVPPQGLHKPPEPIQVRHVELDI 399
           V    LD+L +PRIEEQQL P  PQ   K PE IQV  V LD+
Sbjct: 344 V---ALDSLSIPRIEEQQLHPHVPQP--KLPEIIQVAPVNLDL 381


>gi|356519021|ref|XP_003528173.1| PREDICTED: uncharacterized protein LOC100796958 [Glycine max]
          Length = 388

 Score =  447 bits (1149), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 271/404 (67%), Positives = 308/404 (76%), Gaps = 40/404 (9%)

Query: 1   MLQHQIVQSPARLGLTNPNSPSLQNPNPNPSPTKFPTSATHH--HLRSATPPTAPITTGT 58
           MLQHQIVQSPA LGL NPNSPS+ NP    +P K P + THH     S+TP         
Sbjct: 1   MLQHQIVQSPAMLGLANPNSPSIPNP----TPPKLPPTQTHHPQDRNSSTP--------- 47

Query: 59  TACTSSTLLSLLPPLPRAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPTFLSSQAQS 118
               SS LLSLLPP  RAQALL QMASLASKLFEVSPNRSLW++AFRGSLPTFLSS + +
Sbjct: 48  ----SSALLSLLPPHLRAQALLAQMASLASKLFEVSPNRSLWVTAFRGSLPTFLSSHSST 103

Query: 119 QPPPPIDSSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILA 178
               P+D+SPS  KEII+ FT LQTQ+FEA+AELQEILDLQDAKQKI  EI ++DS +LA
Sbjct: 104 ----PLDTSPSDGKEIISLFTVLQTQIFEAIAELQEILDLQDAKQKIDCEIHSQDSTLLA 159

Query: 179 FANKIKEAERVLDILVDDYSDYRRPKRVKSEGGDGNDDDSGSCSTTVESRLNLSDILSYA 238
           FANK+KEAE  LDILVDDYSDY R KR KS      DDDS + S+T+ S+L LSD+LSYA
Sbjct: 160 FANKLKEAESCLDILVDDYSDYHRSKRSKS-----GDDDSMT-SSTISSQLKLSDMLSYA 213

Query: 239 HRISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKVVETKEKTIE 298
           HRISYTTFAPPEFGAGQ PLRGALPPAPQEEQMRASQLY FADLDVGLPK VETKEK +E
Sbjct: 214 HRISYTTFAPPEFGAGQAPLRGALPPAPQEEQMRASQLYNFADLDVGLPKEVETKEKIVE 273

Query: 299 AIIEPPPLQPVDANPLKDLA---GLLPPNITVPSGWKPGMPVELPKDFPLVPPPGWKPGD 355
           AI+EPPP   VD N + +L+   G+LPP   +P GWKPGMPV+LP D PL PP GWKPGD
Sbjct: 274 AIVEPPP--QVDTNTVPNLSAFQGMLPP---MPPGWKPGMPVQLPIDLPLPPP-GWKPGD 327

Query: 356 AVPLPPLDALPMPRIEEQQLRPVPPQGLHKPPEPIQVRHVELDI 399
            VPLPP+D+L +PR  EQQ++P  PQ   K PE IQV+ V LD+
Sbjct: 328 PVPLPPMDSLQIPRFAEQQMQPHIPQP--KQPEVIQVQPVNLDL 369


>gi|356524055|ref|XP_003530648.1| PREDICTED: uncharacterized protein LOC100812071 [Glycine max]
          Length = 388

 Score =  440 bits (1132), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 258/400 (64%), Positives = 292/400 (73%), Gaps = 32/400 (8%)

Query: 1   MLQHQIVQSPARLGLTNPNSPSLQNPNPNPSPTKFPTSATHHHLRSATPPTAPITTGTTA 60
           MLQHQIVQSPARLGL NPNSPS+ N                HH +     T      +  
Sbjct: 1   MLQHQIVQSPARLGLANPNSPSIPN-----PTPPKLPPTQTHHPQDRNSSTPSSALLSLL 55

Query: 61  CTSSTLLSLLPPLPRAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPTFLSSQAQSQP 120
                         RAQALL QMASLASKLF+VSPNRSLW++AFRGSLPTFLSS + +  
Sbjct: 56  PPLP----------RAQALLAQMASLASKLFDVSPNRSLWVTAFRGSLPTFLSSHSST-- 103

Query: 121 PPPIDSSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFA 180
             P+D+SPS  KEII+ FT LQTQ+FEAVAELQEILDLQDAKQKI REI+++DS +LAFA
Sbjct: 104 --PLDTSPSNAKEIISLFTVLQTQIFEAVAELQEILDLQDAKQKIDREIRSQDSTLLAFA 161

Query: 181 NKIKEAERVLDILVDDYSDYRRPKRVKSEGGDGNDDDSGSCSTTVESRLNLSDILSYAHR 240
           NK+KEAE  LDILVDDYSDYRR KR KS      DDDS + S+TV S+L LSDILSYAHR
Sbjct: 162 NKLKEAESCLDILVDDYSDYRRSKRSKS-----GDDDSMT-SSTVSSQLKLSDILSYAHR 215

Query: 241 ISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKVVETKEKTIEAI 300
           ISYTTFAPPEFGAGQ PLRGALPPAPQEEQMRASQLY FADLDVGLPK VETKEK +EAI
Sbjct: 216 ISYTTFAPPEFGAGQAPLRGALPPAPQEEQMRASQLYNFADLDVGLPKEVETKEKIVEAI 275

Query: 301 IEPPPLQPVDANP-LKDLAGLLPPNITVPSGWKPGMPVELPKDFPLVPPPGWKPGDAVPL 359
           +EPPP    +A P L    G+LPP   +P GWKPGMPV+LP D   +PPPGWKPGD VPL
Sbjct: 276 VEPPPQVDTNAVPNLSAFQGMLPP---MPPGWKPGMPVQLPID-LPLPPPGWKPGDPVPL 331

Query: 360 PPLDALPMPRIEEQQLRPVPPQGLHKPPEPIQVRHVELDI 399
           PP+D+L +PR  EQQ++P  PQ   K PE IQV+ V LD+
Sbjct: 332 PPMDSLQIPRFAEQQMQPHIPQP--KQPEVIQVQPVNLDL 369


>gi|255568436|ref|XP_002525192.1| conserved hypothetical protein [Ricinus communis]
 gi|223535489|gb|EEF37158.1| conserved hypothetical protein [Ricinus communis]
          Length = 401

 Score =  422 bits (1085), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 261/401 (65%), Positives = 303/401 (75%), Gaps = 27/401 (6%)

Query: 1   MLQHQIVQSPARLGLTNPNSPSLQNPNPNPSPTKFPTSATHHHLRSATPPTAPITTGTTA 60
           MLQHQIVQSPARLGLTNPNSPSLQNP    +P KFP +++      ++   + +      
Sbjct: 1   MLQHQIVQSPARLGLTNPNSPSLQNP----TPPKFPAASSQQQQHPSSALLSLLPPLR-- 54

Query: 61  CTSSTLLSLLPPLPRAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPTFLSSQAQSQP 120
                         RAQ+LLLQMASLASK FEVSPNR+LW+SAFRGSLPTFLSSQ QS  
Sbjct: 55  --------------RAQSLLLQMASLASKFFEVSPNRALWVSAFRGSLPTFLSSQTQS-- 98

Query: 121 PPPIDSSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFA 180
             P++S+P + K+I++ FTSLQTQLFEAVAELQEILDLQDAKQ++AR+IK+ DSA+L FA
Sbjct: 99  ISPLESTPFSSKDILSLFTSLQTQLFEAVAELQEILDLQDAKQRLARDIKSHDSALLNFA 158

Query: 181 NKIKEAERVLDILVDDYSDYRRPKRVKS---EGGDGNDDDSGSCSTTVESRLNLSDILSY 237
           NK+KEAE +LDILVDDYSDYR+PK  KS   E  D ++  + + +TTV S+LNLSDILSY
Sbjct: 159 NKLKEAETLLDILVDDYSDYRKPKIAKSSLTEDDDVSNTTTSATTTTVASQLNLSDILSY 218

Query: 238 AHRISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKVVETKEKTI 297
           AHRISYTTFAPPEFGAG  PLRGA PPAPQEEQMR SQLY FADLDVGLPK    +    
Sbjct: 219 AHRISYTTFAPPEFGAGTAPLRGAFPPAPQEEQMRVSQLYAFADLDVGLPKTETRERIIE 278

Query: 298 EAIIEPPPLQPVDANPLKDLAGLLPPNITVPSGWKPGMPVELPKDFPLVPPPGWKPGDAV 357
             I  PPP Q  D NPL +LAGL PP++TVPSGWKPGMPV LP D P  PP   KPGD +
Sbjct: 279 PIIEPPPPPQSADTNPLGNLAGLFPPSLTVPSGWKPGMPVVLPTDLPPPPPGW-KPGDPL 337

Query: 358 PLPPLDALPMPRIEEQQLRP-VPPQGLHKPPEPIQVRHVEL 397
           PLPPLD+LP+PR  EQQL+P V PQGLHKPPE IQVRHV++
Sbjct: 338 PLPPLDSLPLPRTNEQQLQPAVAPQGLHKPPETIQVRHVQI 378


>gi|356550939|ref|XP_003543839.1| PREDICTED: uncharacterized protein LOC100812941 [Glycine max]
          Length = 384

 Score =  419 bits (1076), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 254/400 (63%), Positives = 297/400 (74%), Gaps = 36/400 (9%)

Query: 1   MLQHQIVQSPARLGLTNPNSPSLQNPNPNPSPTKFPTSATHHHLRSATPPTAPITTGTTA 60
           MLQHQIV SPARLGL NPNSPS+       +PT      TH   R+++ P++ + +    
Sbjct: 1   MLQHQIVLSPARLGLANPNSPSIP------NPTPPKLPPTHTQDRNSSTPSSALLSLLPP 54

Query: 61  CTSSTLLSLLPPLPRAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPTFLSSQAQSQP 120
                         RAQALL+Q+ASLASKLFEVSPNRSLW++AFRGSLPTFLSS + +  
Sbjct: 55  LP------------RAQALLVQLASLASKLFEVSPNRSLWVTAFRGSLPTFLSSHSST-- 100

Query: 121 PPPIDSSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFA 180
             P+ +S S  KEII+ FT LQTQ+FEAVAELQEILDLQDAKQKI REI ++DS +L+FA
Sbjct: 101 --PLCAS-SNAKEIISLFTVLQTQIFEAVAELQEILDLQDAKQKIDREIHSQDSTLLSFA 157

Query: 181 NKIKEAERVLDILVDDYSDYRRPKRVKSEGGDGNDDDSGSCSTTVESRLNLSDILSYAHR 240
           NK+K+AE  LDILVDDYSDYRR KR+K       DDDS + S+TV S+L LSDILSYAHR
Sbjct: 158 NKLKDAESCLDILVDDYSDYRRSKRLKF-----GDDDSMT-SSTVSSQLKLSDILSYAHR 211

Query: 241 ISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKVVETKEKTIEAI 300
           ISYTTFAPPEFGAGQ PLRGALPPAPQEEQMRASQLY FADLDVGLPK VETKEK +EAI
Sbjct: 212 ISYTTFAPPEFGAGQAPLRGALPPAPQEEQMRASQLYNFADLDVGLPKEVETKEKIVEAI 271

Query: 301 IEPPPLQPVDANP-LKDLAGLLPPNITVPSGWKPGMPVELPKDFPLVPPPGWKPGDAVPL 359
           +EPPP    +A P L    G+LPP   +P GWKPGMPV+LP D   +PPPGWKPGD VPL
Sbjct: 272 VEPPPQVDTNAVPNLSAFQGMLPP---MPPGWKPGMPVQLPID-LPLPPPGWKPGDPVPL 327

Query: 360 PPLDALPMPRIEEQQLRPVPPQGLHKPPEPIQVRHVELDI 399
           PP+D+L +PR  EQQ++P  PQ   K PE IQV+ V LD+
Sbjct: 328 PPMDSLQIPRFAEQQIQPHIPQP--KQPEVIQVQPVNLDL 365


>gi|15242555|ref|NP_195905.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis
           thaliana]
 gi|75181168|sp|Q9LZ00.1|MED4_ARATH RecName: Full=Mediator of RNA polymerase II transcription subunit 4
 gi|7413560|emb|CAB86039.1| putative protein [Arabidopsis thaliana]
 gi|332003144|gb|AED90527.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis
           thaliana]
          Length = 426

 Score =  416 bits (1068), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 272/413 (65%), Positives = 320/413 (77%), Gaps = 17/413 (4%)

Query: 1   MLQHQIVQSPARLGLTNPNSPSLQNPNP--NPSPTKFPTSATHHHLRSATP--------P 50
           MLQHQIVQSPARLGLT P SPS+QNP P  +  PT   +S + H      P         
Sbjct: 1   MLQHQIVQSPARLGLTGPGSPSVQNPTPTRHGHPTSSSSSQSQHQQIQQQPNLLPSSTVA 60

Query: 51  TAPITTGTTACTSSTLLSLLPPLPRAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPT 110
            A   + + A +SS LLSLLPPLPRAQALL QMA L SKLF+VSPNR++W+SAFRGSLP+
Sbjct: 61  AASSASASAAVSSSALLSLLPPLPRAQALLQQMAVLTSKLFDVSPNRAIWLSAFRGSLPS 120

Query: 111 FLSSQAQSQPPPPIDSSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIK 170
           FLSS +   PPP  + SPS+ KEI++ F SLQTQLFEAV ELQEILDLQDAKQK+AREIK
Sbjct: 121 FLSSHSLPPPPPLENPSPSSTKEILSQFNSLQTQLFEAVTELQEILDLQDAKQKVAREIK 180

Query: 171 AKDSAILAFANKIKEAERVLDILVDDYSDYRRPKRVK-SEGGDGNDDDSGSCSTTVESRL 229
           +KDS++LAFANK+K+AERVLD+LVDDYSDYR+PKR K  E  + ND++S S STTV S+L
Sbjct: 181 SKDSSLLAFANKLKDAERVLDMLVDDYSDYRKPKRSKIEEDDEDNDNESSSSSTTVSSQL 240

Query: 230 NLSDILSYAHRISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKV 289
            L DIL+YAH+ISYTTFAPPEFGAGQ PLRGALPPAPQ+EQMRASQLYTFADLD+GLPK 
Sbjct: 241 KLKDILAYAHKISYTTFAPPEFGAGQAPLRGALPPAPQDEQMRASQLYTFADLDIGLPKT 300

Query: 290 VETKEKTIEAIIEPPPLQPVDANPLKDLAGLLPPNITVPSGWKPGMPVELPKDFPLVPPP 349
           VE  EK +EA+IEPPP  P +A  +  +  LLPPNI VPSGWKPGMPVEL      +PPP
Sbjct: 301 VENMEKKVEALIEPPP--PPEAMDISAIHNLLPPNIAVPSGWKPGMPVEL-PRDLPLPPP 357

Query: 350 GWKPGDAVPLPPLDALPMPRIEE-QQLRPVPPQGLHKPPEPIQVRHVELDILD 401
           GWKPGD V LPPL+++  PR E+ Q +R  P QGLH+PP+ IQVR V+LDIL+
Sbjct: 358 GWKPGDPVVLPPLESIAAPRAEDHQHMR--PSQGLHRPPDVIQVRAVQLDILE 408


>gi|30794029|gb|AAP40461.1| unknown protein [Arabidopsis thaliana]
          Length = 426

 Score =  414 bits (1065), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 270/413 (65%), Positives = 317/413 (76%), Gaps = 17/413 (4%)

Query: 1   MLQHQIVQSPARLGLTNPNSPSLQNPNPN----------PSPTKFPTSATHHHLRSATPP 50
           MLQHQIVQSPARLGLT P SPS+QNP P                       + L S+T  
Sbjct: 1   MLQHQIVQSPARLGLTGPGSPSVQNPTPTRHGHTTSSSSSQSQHQQIQQQPNLLPSSTVA 60

Query: 51  TAPITTGTTACTSSTLLSLLPPLPRAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPT 110
            A   + + A +SS LLSLLPPLPRAQALL QMA L SKLF+VSPNR++W+SAFRGSLP+
Sbjct: 61  AASSASASAAVSSSALLSLLPPLPRAQALLQQMAVLTSKLFDVSPNRAIWLSAFRGSLPS 120

Query: 111 FLSSQAQSQPPPPIDSSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIK 170
           FLSS +   PPP  + SPS+ KEI++ F SLQTQLFEAV ELQEILDLQDAKQK+AREIK
Sbjct: 121 FLSSHSLPPPPPLENPSPSSTKEILSQFNSLQTQLFEAVTELQEILDLQDAKQKVAREIK 180

Query: 171 AKDSAILAFANKIKEAERVLDILVDDYSDYRRPKRVK-SEGGDGNDDDSGSCSTTVESRL 229
           +KDS++LAFANK+K+AERVLD+LVDDYSDYR+PKR K  E  + ND++S S STTV S+L
Sbjct: 181 SKDSSLLAFANKLKDAERVLDMLVDDYSDYRKPKRSKIEEDDEDNDNESSSSSTTVSSQL 240

Query: 230 NLSDILSYAHRISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKV 289
            L DIL+YAH+ISYTTFAPPEFGAGQ PLRGALPPAPQ+EQMRASQLYTFADLD+GLPK 
Sbjct: 241 KLKDILAYAHKISYTTFAPPEFGAGQAPLRGALPPAPQDEQMRASQLYTFADLDIGLPKT 300

Query: 290 VETKEKTIEAIIEPPPLQPVDANPLKDLAGLLPPNITVPSGWKPGMPVELPKDFPLVPPP 349
           VE  EK +EA+IEPPP  P +A  +  +  LLPPNI VPSGWKPGMPVEL      +PPP
Sbjct: 301 VENMEKKVEALIEPPP--PPEAMDISAIHNLLPPNIAVPSGWKPGMPVEL-PRDLPLPPP 357

Query: 350 GWKPGDAVPLPPLDALPMPRIEE-QQLRPVPPQGLHKPPEPIQVRHVELDILD 401
           GWKPGD V LPPL+++  PR E+ Q +R  P QGLH+PP+ IQVR V+LDIL+
Sbjct: 358 GWKPGDPVVLPPLESIAAPRAEDHQHMR--PSQGLHRPPDVIQVRAVQLDILE 408


>gi|297806219|ref|XP_002870993.1| hypothetical protein ARALYDRAFT_487063 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297316830|gb|EFH47252.1| hypothetical protein ARALYDRAFT_487063 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 428

 Score =  405 bits (1040), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 273/414 (65%), Positives = 324/414 (78%), Gaps = 17/414 (4%)

Query: 1   MLQHQIVQSPARLGLTNPNSPSLQNPNP--NPSPTKFPTSATHHHLRSATP--------P 50
           MLQHQIVQSPARLGLT P SPS+QNP P  +  PT   +S + H      P         
Sbjct: 1   MLQHQIVQSPARLGLTGPGSPSVQNPTPTRHGHPTSSSSSQSQHQQIQQQPNLLPSSTVA 60

Query: 51  TAPITTGTTACTSSTLLSLLPPLPRAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPT 110
            A   +  +A +SS LLSLLPPLPRAQALL QMA L SKLF+VSPNR+LW+SAFRGSLP+
Sbjct: 61  AASSASAASAISSSALLSLLPPLPRAQALLQQMAVLTSKLFDVSPNRALWLSAFRGSLPS 120

Query: 111 FLSSQAQSQPPPPIDSSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIK 170
           FLSS +   PPP  + +PS+ KEI++ F SLQTQLFEAV ELQEILDLQDAKQK+AREIK
Sbjct: 121 FLSSHSLPPPPPLENPNPSSTKEILSQFNSLQTQLFEAVTELQEILDLQDAKQKVAREIK 180

Query: 171 AKDSAILAFANKIKEAERVLDILVDDYSDYRRPKRVK--SEGGDGNDDDSGSCSTTVESR 228
           ++DS++LAFANK+K+AERVLD+LV+DYSDYR+PKR K  SE  + ND++S S STTV S+
Sbjct: 181 SRDSSLLAFANKLKDAERVLDMLVEDYSDYRKPKRSKIVSEEEEDNDNESSSSSTTVSSQ 240

Query: 229 LNLSDILSYAHRISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPK 288
           L L DIL+YAH+ISYTTFAPPEFGAGQ PLRGALPPAPQ+EQMRASQLY FADLD+GLPK
Sbjct: 241 LKLKDILAYAHKISYTTFAPPEFGAGQAPLRGALPPAPQDEQMRASQLYAFADLDIGLPK 300

Query: 289 VVETKEKTIEAIIEPPPLQPVDANPLKDLAGLLPPNITVPSGWKPGMPVELPKDFPLVPP 348
            VE  EK +EA+IE PP  P +A  +  +  LLPPNI VPSGWKPGMPVELP+D   +PP
Sbjct: 301 TVENMEKKVEALIE-PPPPPPEAMDISAIHNLLPPNIAVPSGWKPGMPVELPRD-LPLPP 358

Query: 349 PGWKPGDAVPLPPLDALPMPRIEEQQ-LRPVPPQGLHKPPEPIQVRHVELDILD 401
           PGWKPGD V LPPL+++  PR E+QQ +R  PPQGLH+PP+ IQVR V+LDIL+
Sbjct: 359 PGWKPGDPVVLPPLESIAAPRAEDQQHMR--PPQGLHRPPDVIQVRAVQLDILE 410


>gi|116789048|gb|ABK25099.1| unknown [Picea sitchensis]
          Length = 393

 Score =  308 bits (790), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 189/389 (48%), Positives = 249/389 (64%), Gaps = 33/389 (8%)

Query: 23  LQNPNPNPSPTKFPTSATHHHLRSATPPTAPITTGTTACTSSTLLSLLPPLPRAQALLLQ 82
           +QNP   PSP +   S  H    +AT P+ P ++   A  S+ + +L PP  RAQ L+ +
Sbjct: 2   MQNPPIFPSPARLGLSVAHSP--AATVPSVPSSSQKIAERSTPIFNL-PPYRRAQLLIHE 58

Query: 83  MASLASKLFEVSPNRSLWISAFRGSLPTFLSSQAQSQPPPPIDSSPSTIKEIIACFTSLQ 142
           MA L S+LFEVS NR+ W++A+RGS+PTFL    Q    PP  ++ +T KEI++   S Q
Sbjct: 59  MAGLTSRLFEVSNNRAAWLTAYRGSIPTFL----QGSCIPPEAAAATTTKEILSILESHQ 114

Query: 143 TQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFANKIKEAERVLDILVDDYSDYRR 202
             L EAV ELQEI  ++  KQ +A+EI +KDSA+  FA K+ +A++VL+  ++DY  YR 
Sbjct: 115 NLLEEAVTELQEIQKIEQDKQAVAQEISSKDSAMRVFARKLHDAQQVLEQTLEDYEGYRI 174

Query: 203 PKRVKSEGGDGNDDDSGSCSTTVESRLNLSDILSYAHRISYTTFAPPEFGAGQGPLRGAL 262
            KR KS      D             L L+DIL+YAHRISYTTFA P+FGAG+ PLRGA+
Sbjct: 175 GKRSKSLNSSPGD-------AVGLGHLELNDILTYAHRISYTTFATPDFGAGKAPLRGAI 227

Query: 263 PPAPQEEQMRASQLYTFADLDVGLPKVVETKEKTIEAIIEPPPLQPVDANPLKDLA-GLL 321
           PPAPQEEQMRASQLY +ADLDVG+PK   T+  ++ AI+EP P + V  NP +++  G+L
Sbjct: 228 PPAPQEEQMRASQLYQYADLDVGIPKQAVTEPLSV-AIVEPAPDR-VVVNPKQEVVPGML 285

Query: 322 PPNITVPSGWKPGMPVELPKDFPLVPPPGWKPGDAVPLPPL----------DALPMPRIE 371
           P    VP GWKPGMP+ELP     V PPGWKPGD VPLPP           DA+ +P+ E
Sbjct: 286 PGMPPVPPGWKPGMPIELP-----VMPPGWKPGDPVPLPPAMEVPAGWKPGDAVVLPKPE 340

Query: 372 EQ-QLRPVPPQGLHKPPEPIQVRHVELDI 399
           E  Q  P+ P  L +P EPIQV++V+LDI
Sbjct: 341 EAVQAPPLRPPALARPAEPIQVQYVQLDI 369


>gi|357159717|ref|XP_003578536.1| PREDICTED: uncharacterized protein LOC100835981 [Brachypodium
           distachyon]
          Length = 416

 Score =  300 bits (768), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 216/431 (50%), Positives = 273/431 (63%), Gaps = 30/431 (6%)

Query: 1   MLQHQIVQSPARLGLTNPNSPSLQNPNPNPSPTKFPTSATHHHLRSATPPTAPITTGTTA 60
           M+Q  +  SPARLG+T  +SPSL    PNP+P    +S       S+           T 
Sbjct: 1   MMQSHM-PSPARLGMTA-SSPSLP---PNPAPVNPTSSPPQASSSSSAAAPGAGVPTATL 55

Query: 61  CTSSTLLSLLPPLPRAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPTFLSSQAQSQP 120
            TS +LL LLPPLPRAQ+LL  M+SLA+ LFE+SPNR+ WIS++RGSLPTFL S + S  
Sbjct: 56  TTSPSLLPLLPPLPRAQSLLHIMSSLAANLFELSPNRTAWISSYRGSLPTFLPSSSTSGM 115

Query: 121 PPPIDSSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFA 180
           PPP+ +  S  KE ++  TS+QTQLFE V ELQE LDLQDA+ ++ RE +AKD+A+LAFA
Sbjct: 116 PPPLPTPVSNTKEAMSLLTSVQTQLFETVTELQETLDLQDARARLVREARAKDAALLAFA 175

Query: 181 NKIKEAERVLDILVDDYSDYRR-PKRVKSEGGDGNDD----DSGSCSTTVESRLNLSDIL 235
            K++EA  VLD LVDDY+DYRR PKR +  G    D+      G    ++ S+L L DIL
Sbjct: 176 KKLREAHHVLDRLVDDYADYRRDPKRPR--GAPVVDEPEPVSHGDFGASLHSKLKLDDIL 233

Query: 236 SYAHRISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKV-VETKE 294
           +YAHRISYTTFAPPE GAG  PLRGALPPAPQ+ ++R SQLY FADLDVG+PK  +ETK+
Sbjct: 234 TYAHRISYTTFAPPEHGAGM-PLRGALPPAPQDNELRMSQLYQFADLDVGVPKKPLETKD 292

Query: 295 KTIEAIIEPPPLQPVDANPLKDLAGLLPPNITVPSGWKPGMPVELPKDFPLVPPPGWKPG 354
               A+                   +LP  ITVP GW  G+P +LP D PL PPPGWKPG
Sbjct: 293 GA--AVETEATPLFEPPQEEPPRPSMLP--ITVPPGWPKGLPKDLPVDIPL-PPPGWKPG 347

Query: 355 DAVPLPPLDALPMPRIEEQQLRPVPPQGLHKPP------EPIQVRHVELDILDQDDDSSD 408
           D + L  +    M  ++ ++ RP  P  +   P      EPIQV  VELDI +    S +
Sbjct: 348 DPIELGGI----MASVKAEEPRPSMPMPVGVQPIVPRAQEPIQVAAVELDI-NISSSSDE 402

Query: 409 YSSEEGSSEDD 419
           YSS+ GSSE+D
Sbjct: 403 YSSDVGSSEED 413


>gi|326509687|dbj|BAJ87059.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326515180|dbj|BAK03503.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 415

 Score =  299 bits (765), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 213/415 (51%), Positives = 270/415 (65%), Gaps = 36/415 (8%)

Query: 1   MLQHQIVQSPARLGLTNPNSPSLQNPNPNPS-PTKFPTSATHHHLRSATPPTAPITTGTT 59
           M+QH  + SPARLG+T  +SPSL  PN +P+ PT  P  A      +A           T
Sbjct: 1   MMQH--MPSPARLGVTA-SSPSLP-PNQSPANPTSSPPQANPPSASAAAAAAGAGAAVPT 56

Query: 60  ACTSSTLLSLLPPLPRAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPTFLSSQAQSQ 119
             TS  LL LLPPLPRAQ+LL  ++SLAS LFE+SPNR+ WIS++RGSLP FL+S + S 
Sbjct: 57  LTTSPALLPLLPPLPRAQSLLHLISSLASNLFELSPNRAAWISSYRGSLPNFLASSSASA 116

Query: 120 PPPPIDSSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAF 179
            P P+ +  ST KE ++  TSLQTQLFE VAELQE LDLQDA+ K+ARE +AKD+++LAF
Sbjct: 117 APTPLPAPVSTTKEAMSMLTSLQTQLFETVAELQETLDLQDARAKLAREARAKDASLLAF 176

Query: 180 ANKIKEAERVLDILVDDYSDYRR-PKRVKSEGGDGNDD----DSGSCSTTVESRLNLSDI 234
           A K++EA  VLD LVDDY+DYRR PKR +  G    D+      G    ++ S+L L DI
Sbjct: 177 AKKLREAHHVLDRLVDDYADYRRDPKRPR--GAAAVDEPEPVSQGDFGASLHSKLKLDDI 234

Query: 235 LSYAHRISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKV-VETK 293
           L+YAHRISYTTFAPPE GAG  PLRGALPPAPQ+ ++R SQLY FADLDVG+PK  +ETK
Sbjct: 235 LTYAHRISYTTFAPPEHGAGL-PLRGALPPAPQDNELRMSQLYQFADLDVGVPKKPLETK 293

Query: 294 E---KTIEAIIEPPPLQPVDANPLKDLAGLLPPNITVPSGWKPGMPVELPKDFPLVPPPG 350
           E     +E++      +     P      +LP  ITVP GW+ G+P  LP  FP+ PPPG
Sbjct: 294 EGLTAEMESMPLFELPEEEPPRP-----SMLP--ITVPPGWQKGLPALLPA-FPM-PPPG 344

Query: 351 WKPGDAVPLPPLDALPMPRIEEQQLRP-VP-PQGLH----KPPEPIQVRHVELDI 399
           WKPGD + L  +    +P ++  + RP VP P G+     +  EPIQV  V L+I
Sbjct: 345 WKPGDPIELGGI----IPAVKPDEQRPSVPIPVGVQPMVPRAQEPIQVAAVNLEI 395


>gi|115480431|ref|NP_001063809.1| Os09g0540500 [Oryza sativa Japonica Group]
 gi|52076063|dbj|BAD46576.1| unknown protein [Oryza sativa Japonica Group]
 gi|113632042|dbj|BAF25723.1| Os09g0540500 [Oryza sativa Japonica Group]
 gi|125606481|gb|EAZ45517.1| hypothetical protein OsJ_30176 [Oryza sativa Japonica Group]
 gi|215767917|dbj|BAH00146.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 414

 Score =  280 bits (716), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 187/355 (52%), Positives = 228/355 (64%), Gaps = 22/355 (6%)

Query: 75  RAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPTFLSSQAQSQPPPPIDSSPSTIKEI 134
           RAQALL  +++LAS LFE+SP+R+ WISA+RGSLPTFL S + + PPP      ST KE 
Sbjct: 70  RAQALLQLISALASNLFELSPSRAAWISAYRGSLPTFLPSPSSAPPPPLPAPISST-KEA 128

Query: 135 IACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFANKIKEAERVLDILV 194
           ++   +LQTQLFEAVAELQE LDLQDA+ ++ARE +AKD++ILAFA K++EA  VLD LV
Sbjct: 129 LSLLNTLQTQLFEAVAELQETLDLQDARARLAREARAKDASILAFAKKLREAHHVLDRLV 188

Query: 195 DDYSDYRR-PKRVKSEGGDGNDD----DSGSCSTTVESRLNLSDILSYAHRISYTTFAPP 249
           DDY+DYRR PKR +  G    DD      G    ++ S+LNL D+L+YAHRISYTTFAPP
Sbjct: 189 DDYADYRRDPKRPR--GAAAADDPEPVSDGDFGASLHSKLNLDDVLTYAHRISYTTFAPP 246

Query: 250 EFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKVVETKEKTIEAIIEPPPLQPV 309
           E GAG  PLRGALPPAPQE +MR SQLY FADLDVG+PK  E KE+T       P  QP 
Sbjct: 247 EHGAGL-PLRGALPPAPQENEMRMSQLYQFADLDVGVPKSQEAKERTAAEGDATPLFQP- 304

Query: 310 DANPLKDLAGLLPPNITVPSGWKPGMPVELPKDFPLVPPPGWKPGDAVPLPPLDALP--- 366
             +P ++ A +LP  +  P GW+ G         P     GWKPGD + LPP   L    
Sbjct: 305 --SPTQE-AAVLPITVPHPHGWRNGALPLEIPLPPP----GWKPGDPITLPPDGILAGVK 357

Query: 367 --MPRIEEQQLRPVPPQGLHKPPEPIQVRHVELDILDQDDDSSDYSSEEGSSEDD 419
              PR   QQ+  V P  + K  EPIQVRHV+LDI +        S    S EDD
Sbjct: 358 GEEPRASVQQMPVVVPAMVPKAQEPIQVRHVDLDINNSSSSDEYSSDVGSSEEDD 412


>gi|125564538|gb|EAZ09918.1| hypothetical protein OsI_32214 [Oryza sativa Indica Group]
          Length = 414

 Score =  277 bits (708), Expect = 9e-72,   Method: Compositional matrix adjust.
 Identities = 215/421 (51%), Positives = 262/421 (62%), Gaps = 26/421 (6%)

Query: 9   SPARLGLTNPNSPSLQNPNPNPSPTKFPTSATHHHLRSATPPTAPITTGTTACTSSTLLS 68
           SPARLGLT  +SPSL    PNP+     +S    +  +A    A      T  TS +LL 
Sbjct: 8   SPARLGLTA-SSPSLP---PNPAAVNPTSSPPQGNPSAAAAVGAAAAAAPTLTTSPSLLP 63

Query: 69  LLPPLPRAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPTFLSSQAQSQPPPPIDSSP 128
           LLPPLPRAQALL  +++LAS LFE+SP+R+ WISA+RGSLPTFL S + + PPP      
Sbjct: 64  LLPPLPRAQALLQLISALASNLFELSPSRAAWISAYRGSLPTFLPSPSSAPPPPLPAPIS 123

Query: 129 STIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFANKIKEAER 188
           ST KE ++   +LQTQLFEAVAELQE LDLQDA+ ++ARE +AKD++ILAFA K++EA  
Sbjct: 124 ST-KEALSLLNTLQTQLFEAVAELQETLDLQDARARLAREARAKDASILAFAKKLREAHH 182

Query: 189 VLDILVDDYSDYRR-PKRVKSEGGDGNDD----DSGSCSTTVESRLNLSDILSYAHRISY 243
           VLD LVDDY+DYRR PKR +  G    DD      G    ++ S+LNL D+L+YAHRISY
Sbjct: 183 VLDRLVDDYADYRRDPKRPR--GAAAADDPEPVSDGDFGASLHSKLNLDDVLTYAHRISY 240

Query: 244 TTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKVVETKEKTIEAIIEP 303
           TTFAPPE GAG  PLRGALPPAPQE +MR SQLY FADLDVG+PK  E KE+T       
Sbjct: 241 TTFAPPEHGAGL-PLRGALPPAPQENEMRMSQLYQFADLDVGVPKSQEAKERTAAEGDAT 299

Query: 304 PPLQPVDANPLKDLAGLLPPNITVPSGWKPGMPVELPKDFPLVPPPGWKPGDAVPLPPLD 363
           P  QP   +P ++ A +LP  +  P GW+ G         P     GWKPGD + LPP  
Sbjct: 300 PLFQP---SPTQE-AAVLPITVPHPHGWRNGALPLEIPLPPP----GWKPGDPITLPPDG 351

Query: 364 ALP-----MPRIEEQQLRPVPPQGLHKPPEPIQVRHVELDILDQDDDSSDYSSEEGSSED 418
            L       PR    Q+  V P  + K  EPIQVRHV+LDI +        S    S ED
Sbjct: 352 ILAGVKGEEPRASVPQMPVVVPAMVPKAQEPIQVRHVDLDINNSSSSDEYSSDVGSSEED 411

Query: 419 D 419
           D
Sbjct: 412 D 412


>gi|226498388|ref|NP_001144436.1| uncharacterized protein LOC100277397 [Zea mays]
 gi|195642186|gb|ACG40561.1| hypothetical protein [Zea mays]
 gi|238013174|gb|ACR37622.1| unknown [Zea mays]
 gi|414590084|tpg|DAA40655.1| TPA: hypothetical protein ZEAMMB73_937469 [Zea mays]
 gi|414590085|tpg|DAA40656.1| TPA: hypothetical protein ZEAMMB73_937469 [Zea mays]
          Length = 402

 Score =  275 bits (702), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 178/360 (49%), Positives = 227/360 (63%), Gaps = 43/360 (11%)

Query: 75  RAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPTFLSSQAQSQPPPPIDSSPSTIKEI 134
           RAQ+LL  ++SLA+ LFE+SPNR+ WI ++RGSLPTFL S + S PP P  +S S+ K+ 
Sbjct: 68  RAQSLLQLISSLATNLFELSPNRAAWILSYRGSLPTFLPSASSSAPPLP--TSISSTKDA 125

Query: 135 IACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFANKIKEAERVLDILV 194
           ++  T+LQTQLFEAVAELQE LDLQDA+ ++ RE +AKD++++ FA K+ EA +VLD LV
Sbjct: 126 LSLLTTLQTQLFEAVAELQETLDLQDARARLVREARAKDASLVTFAKKLHEAHQVLDRLV 185

Query: 195 DDYSDYRR-PKRVKS--EGGDGNDDDSGSCSTTVESRLNLSDILSYAHRISYTTFAPPEF 251
           DDY+DYRR PKR +      D      G   T++ S L L DIL+YAHRISYTTFAPPE 
Sbjct: 186 DDYADYRRDPKRPRGAAAADDPEPASEGDFGTSLHSSLKLDDILTYAHRISYTTFAPPEH 245

Query: 252 GAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKV-VETKEKTIEAIIEPPPLQPVD 310
           GAG  PLR +LPP PQE QMRASQLY FADLDVG+PK  ++ KE     +   P  +P  
Sbjct: 246 GAGL-PLRVSLPPVPQENQMRASQLYLFADLDVGVPKKPLDAKEGIAPEVEAAPLYEPPQ 304

Query: 311 ANPLKDLAGLLPPNITVPSGWKPGMPVELPKDFP--LVPPPGWKPGDAVPLPPLDALPMP 368
             P +                   +P  LP  FP  L PPPGWKPGD + +P  + LP  
Sbjct: 305 EGPSR-------------------LPNTLPPMFPKELKPPPGWKPGDPITIPLDEILPGI 345

Query: 369 RIEE---------QQLRPVPPQGLHKPPEPIQVRHVELDILDQDDDSSDYSSEEGSSEDD 419
           +  E           +RPV P G    PEPIQV+ V+LD   +   S +YSS+ GSSE+D
Sbjct: 346 KGAEPQPPVPQAPVSVRPVAPMG----PEPIQVKPVQLDF--ESSSSDEYSSDVGSSEED 399


>gi|224029783|gb|ACN33967.1| unknown [Zea mays]
 gi|414888302|tpg|DAA64316.1| TPA: hypothetical protein ZEAMMB73_260641 [Zea mays]
          Length = 403

 Score =  275 bits (702), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 170/341 (49%), Positives = 213/341 (62%), Gaps = 42/341 (12%)

Query: 75  RAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPTFLSSQAQSQPPPPIDSSPSTIKEI 134
           RAQ+LL  ++SLAS L E+S NR+ WISA+RGSLPTFL S + +   PP+ +  S+ K+ 
Sbjct: 68  RAQSLLQLISSLASNLLELSANRAAWISAYRGSLPTFLPSASSA---PPLATPISSTKDA 124

Query: 135 IACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFANKIKEAERVLDILV 194
           ++  T+LQTQLFEAVAELQE LDLQDA+ ++ RE + KD+A+LAFA K+ EA +VLD LV
Sbjct: 125 LSLVTTLQTQLFEAVAELQETLDLQDARARLVREARGKDAALLAFAKKLHEAHQVLDRLV 184

Query: 195 DDYSDYRR-PKRVKSEGGDGNDD--DSGSCSTTVESRLNLSDILSYAHRISYTTFAPPEF 251
           DDY+DYRR PKR +      + +    G   T++ S L L DIL+YAHRISYTTFAPPE 
Sbjct: 185 DDYADYRRDPKRPRGAAAADDPEPASEGDFGTSLHSSLKLDDILTYAHRISYTTFAPPEH 244

Query: 252 GAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKVVETKEKTIEAIIEPPPLQPVDA 311
           GAG  PLR +LPP PQE QMRASQLY FADLDVG+PK     +  I A +E  PL     
Sbjct: 245 GAGL-PLRVSLPPVPQENQMRASQLYLFADLDVGVPKKPLDAKDGIAAEVEVAPL----- 298

Query: 312 NPLKDLAGLLPPNITVPSGWKPGMPVELPKDFP--LVPPPGWKPGDAVPLPPLDALPMPR 369
                           P      +P  LP  FP  L PPPGWKPGD + LP  + LP  +
Sbjct: 299 -------------YEAPQEGPSRLPNTLPPMFPKELKPPPGWKPGDPITLPLDEILPGIK 345

Query: 370 IEEQQ-----------LRPVPPQGLHKPPEPIQVRHVELDI 399
            EE++           +RPV P G    PE IQV+ V+LD 
Sbjct: 346 GEEREAQAPVPQAPVSVRPVAPMG----PETIQVKPVQLDF 382


>gi|242050012|ref|XP_002462750.1| hypothetical protein SORBIDRAFT_02g031320 [Sorghum bicolor]
 gi|241926127|gb|EER99271.1| hypothetical protein SORBIDRAFT_02g031320 [Sorghum bicolor]
          Length = 402

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 175/338 (51%), Positives = 218/338 (64%), Gaps = 41/338 (12%)

Query: 75  RAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPTFLSSQAQSQPPPPIDSSPSTIKEI 134
           RAQ+LL  ++SLAS LFE+SPNR+ WI A+RGS PTFL S + S PP P  +  S+ K+ 
Sbjct: 68  RAQSLLQLISSLASNLFELSPNRAAWIQAYRGSFPTFLPSASSSAPPLP--TPISSTKDA 125

Query: 135 IACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFANKIKEAERVLDILV 194
           ++  T+LQTQLFEAVAELQE LDLQDA+ ++ARE +AKD+A++AFA K+ EA +VLD LV
Sbjct: 126 LSLLTTLQTQLFEAVAELQETLDLQDARARLAREARAKDAALVAFAKKLHEAHQVLDRLV 185

Query: 195 DDYSDYRR-PKRVKSEGGDGNDD--DSGSCSTTVESRLNLSDILSYAHRISYTTFAPPEF 251
           DDY+DYRR PKR +      + +    G   T++ S L L DIL+YAHRISYTTFAPPE 
Sbjct: 186 DDYADYRRDPKRPRGAAAADDPEPASEGDFGTSLHSSLKLDDILAYAHRISYTTFAPPEH 245

Query: 252 GAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKVVETKEKTIEAIIEPPPL-QPVD 310
           GAG  PLR +LPP PQE QMRASQLY FADLDVG+PK     ++ I A +E  PL +P  
Sbjct: 246 GAGL-PLRVSLPPVPQENQMRASQLYLFADLDVGVPKKPLDAKEGIAAEVEAAPLYEPPQ 304

Query: 311 ANPLKDLAGLLPPNITVPSGWKPGMPVELPKDFP--LVPPPGWKPGDAVPLPPLDALPMP 368
             P +                   +P  LP  FP  L PPPGWKPGD + LPP + LP  
Sbjct: 305 EGPSR-------------------LPNTLPPMFPKELKPPPGWKPGDPITLPPDEILPGI 345

Query: 369 RIEEQQ---------LRPVPPQGLHKPPEPIQVRHVEL 397
           + EE Q         +RPV P G     EPIQV+ V+L
Sbjct: 346 KGEEPQAPVPQPHVPVRPVAPMGA----EPIQVQPVQL 379


>gi|226506328|ref|NP_001145443.1| uncharacterized protein LOC100278821 [Zea mays]
 gi|195656375|gb|ACG47655.1| hypothetical protein [Zea mays]
          Length = 431

 Score =  265 bits (676), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 193/410 (47%), Positives = 245/410 (59%), Gaps = 39/410 (9%)

Query: 1   MLQHQIVQSPARLGLTNPNSPSLQNPNPNPSPTKFPTSATHHHLRSATPPTAPITTGTTA 60
           M+Q  +  SPARLGLT  +SPSL       +PT  P+   H  L    P +A    GT+A
Sbjct: 29  MMQSHL-PSPARLGLTA-SSPSLPPNPVPLNPTSSPS---HGSL----PWSA--AAGTSA 77

Query: 61  CTSSTLLSLLPPLP----RAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPTFLSSQA 116
             + T    L PL     RAQ+LL  ++SLAS   E+S NR+ WISA+RGSLPTFL S +
Sbjct: 78  APTLTTSPSLLPLLPPLPRAQSLLQLISSLASNFLELSANRAAWISAYRGSLPTFLPSAS 137

Query: 117 QSQPPPPIDSSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAI 176
            +   PP+ +  S+ ++ ++  T+LQTQLFEAVAELQE LDLQDA+ ++ RE + KD+A+
Sbjct: 138 SA---PPLATPISSTEDALSLVTTLQTQLFEAVAELQETLDLQDARARLVREARGKDAAL 194

Query: 177 LAFANKIKEAERVLDILVDDYSDYRR-PKRVKS--EGGDGNDDDSGSCSTTVESRLNLSD 233
           LAFA K+ EA +VLD LVDDY+DYRR PKR +      D      G   T++ S L L D
Sbjct: 195 LAFAKKLHEAHQVLDRLVDDYADYRRDPKRPRGAAAADDPEPASEGDFGTSLHSSLKLDD 254

Query: 234 ILSYAHRISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKVVETK 293
           IL+YAHRISYT FAPPE GAG  PLR +LPP PQE QMRASQLY FADLD+G+PK     
Sbjct: 255 ILTYAHRISYTAFAPPEHGAGL-PLRVSLPPVPQENQMRASQLYLFADLDLGVPKKPLDA 313

Query: 294 EKTIEAIIEPPPLQPV-DANPLK---DLAGLLPPNITVPSGWKPGMPVELPKDFPLVPPP 349
           +  I A +E  PL       P +    LA + P  +  P GWKPG P+ LP D  L   P
Sbjct: 314 KDGIAAEVEVAPLYEAPQEGPSRLPNTLAPMFPKELKPPPGWKPGDPITLPLDEIL---P 370

Query: 350 GWKPGDAVPLPPLDALPMPRIEEQQLRPVPPQGLHKPPEPIQVRHVELDI 399
           G K  +     P+   P+       +RPV P G    PE IQV+ V+LD 
Sbjct: 371 GIKGEEREAQAPVPQAPV------SVRPVAPMG----PETIQVKPVQLDF 410


>gi|168013070|ref|XP_001759224.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689537|gb|EDQ75908.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 401

 Score =  182 bits (463), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 122/331 (36%), Positives = 174/331 (52%), Gaps = 53/331 (16%)

Query: 38  SATHHHLRSATPPT----------APITTGTTACTSSTLLSLL--PPLPRAQALLLQMAS 85
           SA H  L+S   P            P+ T  TA  +    +L+  PP  RA  LL +MA 
Sbjct: 54  SAGHGELQSMWGPARSQQQQHDQAVPMDTAATARPAVVRSALMNQPPQRRAHMLLQEMAG 113

Query: 86  LASKLFEVSPNRSLWISAFRGSLPTFLSSQAQSQPPPPIDSSPSTIKEIIACFTSLQTQL 145
           LA++L E++  +S      + S+P FL+        PP   +  T KE+ A   + Q  L
Sbjct: 114 LANRLVELAATKSWQPGHMKVSIPPFLAGAIL----PPEGQAAVTSKEVQALLENHQNLL 169

Query: 146 FEAVAELQEILDLQDAKQKIAREIKAKDSAILAFANKIKEAERVLDILVDDYSDYRRPKR 205
           +EAV+ELQE+ D++  KQ+IA EI  KD  + AFA ++++A +VL+  +++Y+D ++ K 
Sbjct: 170 YEAVSELQELQDIEQEKQQIAEEINTKDVVVRAFAKRVRDAHQVLEDALEEYADLKKGKH 229

Query: 206 VKSEGGDGNDDDSGSCSTTVESRLNLSDILSYAHRISYTTFAPPEFGAGQGPLRGALPPA 265
            KS                    + +++++SYAHRISYTTFAPP++GAG+  LRGALPPA
Sbjct: 230 RKS--------------------VPVAELVSYAHRISYTTFAPPDYGAGETILRGALPPA 269

Query: 266 PQEEQMRASQLYTFADLDVGLPKVVETKEKTIEAIIEPPPLQPVDANPLKDLAGLLPPNI 325
           PQEE MRASQLY     D+G                     +P  A P+           
Sbjct: 270 PQEENMRASQLYHVQVSDLGFTSAFAPPPAPEALEEA----EPPKAEPI----------- 314

Query: 326 TVPSGWKPGMPVELPKDFPLVP-PPGWKPGD 355
            +P+GW+PGMPV+LP    L P P GWK GD
Sbjct: 315 -MPAGWRPGMPVQLPPMAELPPMPAGWKIGD 344


>gi|62701671|gb|AAX92744.1| hypothetical protein LOC_Os11g05150 [Oryza sativa Japonica Group]
 gi|77548691|gb|ABA91488.1| hydroxyproline-rich glycoprotein family protein, putative [Oryza
           sativa Japonica Group]
          Length = 194

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 55/105 (52%), Positives = 73/105 (69%), Gaps = 7/105 (6%)

Query: 132 KEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFANKIKEAERVLD 191
           +E ++   +LQ Q FEAVAELQE LDLQDA+ ++ARE +AKD++ILAFA K++EA  VLD
Sbjct: 86  QEALSLLNTLQMQFFEAVAELQETLDLQDARARLARETRAKDASILAFAKKLREAHHVLD 145

Query: 192 ILVDDYSDYRR-PKRVKSEGGDGNDD----DSGSCSTTVESRLNL 231
            LVDDY+DYRR PKR +  G    DD      G    ++ S+L+L
Sbjct: 146 RLVDDYADYRRDPKRPR--GAAAADDPEPVSDGDFGASLHSKLDL 188


>gi|218185250|gb|EEC67677.1| hypothetical protein OsI_35109 [Oryza sativa Indica Group]
          Length = 157

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 55/108 (50%), Positives = 74/108 (68%), Gaps = 7/108 (6%)

Query: 129 STIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFANKIKEAER 188
           S  K+ ++   +LQ Q FEAVAELQE LDLQDA+ ++AR+ +AKD++ILAFA K++EA  
Sbjct: 46  SYTKKALSLLNTLQMQFFEAVAELQETLDLQDARARLARKTRAKDASILAFAKKLREAHH 105

Query: 189 VLDILVDDYSDYRR-PKRVKSEGGDGNDD----DSGSCSTTVESRLNL 231
           VLD LVDDY+DYRR PKR +  G    DD      G    ++ S+L+L
Sbjct: 106 VLDRLVDDYADYRRDPKRPR--GAAAADDPEPVSDGDFGASLHSKLDL 151


>gi|125576223|gb|EAZ17445.1| hypothetical protein OsJ_32973 [Oryza sativa Japonica Group]
          Length = 174

 Score = 95.1 bits (235), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 52/93 (55%), Positives = 66/93 (70%), Gaps = 7/93 (7%)

Query: 144 QLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFANKIKEAERVLDILVDDYSDYRR- 202
           Q FEAVAELQE LDLQDA+ ++ARE +AKD++ILAFA K++EA  VLD LVDDY+DYRR 
Sbjct: 78  QFFEAVAELQETLDLQDARARLARETRAKDASILAFAKKLREAHHVLDRLVDDYADYRRD 137

Query: 203 PKRVKSEGGDGNDD----DSGSCSTTVESRLNL 231
           PKR +  G    DD      G    ++ S+L+L
Sbjct: 138 PKRPR--GAAAADDPEPVSDGDFGASLHSKLDL 168



 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 45/63 (71%), Positives = 54/63 (85%), Gaps = 1/63 (1%)

Query: 144 QLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFANKIKEAERVLDILVDDYSDYRR- 202
           Q FEAVAELQE LDLQDA+ ++ARE +AKD++ILAFA K++EA  VLD LVDDY+DYRR 
Sbjct: 2   QFFEAVAELQETLDLQDARARLARETRAKDASILAFAKKLREAHHVLDRLVDDYADYRRD 61

Query: 203 PKR 205
           PKR
Sbjct: 62  PKR 64


>gi|301098212|ref|XP_002898199.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262105262|gb|EEY63314.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 352

 Score = 69.3 bits (168), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 81/277 (29%), Positives = 117/277 (42%), Gaps = 56/277 (20%)

Query: 118 SQPPPPIDSSP-STIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAI 176
           S PP  I ++   T+ E           L  AV +L      Q    +I  EI+ K   +
Sbjct: 31  SLPPSTITATRLRTVNETFNKVLEKDRDLLTAVKKLARHQVAQKELLRIEAEIENKKQKV 90

Query: 177 LAFANKIKEAERVLDILVDDYSDYRRPKRVKSEGGDGNDDDSGSCSTTVESRLNLSDILS 236
           + +A +++ ++       +D +D  R  RV  +       ++   S  V   L+  DI++
Sbjct: 91  VQYAAQLRCSQ-------EDIADVLRKHRVTLQ-------NAKQKSKVV---LDPRDIVA 133

Query: 237 YAHRISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDV---GLPKVVETK 293
           YAHRI+ TT AP E+  G  P+ G +PPAPQE  MRA  L      ++   G  K   T 
Sbjct: 134 YAHRIAGTTSAPKEWQPG-FPMFGFMPPAPQEHMMRAGVLSRGIVAEIVTQGPEKYTSTA 192

Query: 294 EKT--------IEAIIEPPPLQPVDANPLKDLAGLLPPNITVPSGWKPGMPVELPKD--- 342
            +         +E +  P  L   DA   K +          P GWKPG PV+LP D   
Sbjct: 193 ARVSKEGDGDAMEKLNIPIGLGASDAQIRKQM----------PPGWKPGDPVDLPLDALL 242

Query: 343 ----------FPLVPPPGWKPGDAVPLPPLDALPMPR 369
                       +  P  WK GD  PLPP DA+ + R
Sbjct: 243 HYMGRNFFTEHGITLPESWKTGD--PLPP-DAMEILR 276


>gi|348675077|gb|EGZ14895.1| hypothetical protein PHYSODRAFT_508758 [Phytophthora sojae]
          Length = 353

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 75/276 (27%), Positives = 124/276 (44%), Gaps = 59/276 (21%)

Query: 122 PPIDSSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAR---EIKAKDSAILA 178
           PP   + + ++ +   F  +  +  + +A ++++   Q A++++ R   EI++K   ++ 
Sbjct: 33  PPSTITATRLRTVNETFNKVLEKDRDLLAAVKKLARHQVAQKELLRVEAEIESKKQKVVQ 92

Query: 179 FANKIKEAERVLDILVDDYSDYRRPKRVKSEGGDGNDDDSGSCSTTVESRLNLSDILSYA 238
           +A +++ ++       +D ++  R  RV  +       ++   S  V   L+  DI++YA
Sbjct: 93  YAAQLRRSQ-------EDIANVLRKHRVVLQ-------NAKQKSKVV---LDPRDIVAYA 135

Query: 239 HRISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDV---GLPKVVETKEK 295
           HRI+ TT AP E+  G  P+ G +PPAPQE  MRA  L      ++   G  K   T  +
Sbjct: 136 HRIAGTTSAPKEWQPG-FPMFGFMPPAPQEHMMRAGVLSRGIVAEIVTQGPEKYTSTAAR 194

Query: 296 T---------IEAIIEPPPLQPVDANPLKDLAGLLPPNITVPSGWKPGMPVELPKD---- 342
                     IE +  P  L   DA   K +          P GWKPG PV+LP D    
Sbjct: 195 VKQDGDGGEAIEKLNIPIGLGASDAQIRKQM----------PPGWKPGDPVDLPLDALLH 244

Query: 343 ---------FPLVPPPGWKPGDAVPLPPLDALPMPR 369
                      +  P  WK GD  PLPP D + + R
Sbjct: 245 YMGRSFFTEHGITLPESWKTGD--PLPP-DVMEILR 277


>gi|325187457|emb|CCA21995.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 335

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 66/225 (29%), Positives = 101/225 (44%), Gaps = 50/225 (22%)

Query: 164 KIAREIKAKDSAILAFANKIKEAERVLDILVDDYS----DYRRPKRVKSEGGDGNDDDSG 219
           ++  EI+AK+  ++ +A +++ A+  +  + + +       R+PK+ +            
Sbjct: 77  QLQNEIQAKEERVIEYATQLRNAQVAIQNVFEKHQYTLQQARKPKQEQ------------ 124

Query: 220 SCSTTVESRLNLSDILSYAHRISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTF 279
                ++ R+    +++YAHRI+ TT AP  +  G  P+ G  PPAPQE+ MRA      
Sbjct: 125 ---VLLDPRV----VVAYAHRIAGTTSAPEHWQPGL-PMSGFAPPAPQEQIMRA------ 170

Query: 280 ADLDVGLPKVVETKE--KTIEAIIEPPPLQPVDANPLKDLAGLLPPNITV----PSGWKP 333
             L  G+     T+E  KT    +   P   V  N LK   GL   +  +    P GWKP
Sbjct: 171 GVLSRGVIAETVTREGFKTTRIDVSSEPETSV-TNDLKTPIGLGATDAEIRNQMPPGWKP 229

Query: 334 GMPVELPKDFPLVP-------------PPGWKPGDAVPLPPLDAL 365
           G PV+LP D  L               P  WK GDAVP   +D L
Sbjct: 230 GDPVDLPLDALLHYMGRSFFQENGINLPEFWKSGDAVPPEAMDLL 274


>gi|307106146|gb|EFN54393.1| hypothetical protein CHLNCDRAFT_53218 [Chlorella variabilis]
          Length = 326

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 52/133 (39%), Positives = 69/133 (51%), Gaps = 11/133 (8%)

Query: 234 ILSYAHRISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFA-DLDVGLPKVVET 292
           +LSYAHR+SY++ AP  +  GQ PL    PPAPQE +MRASQL+ F+ D +         
Sbjct: 116 LLSYAHRLSYSSSAPLGYQTGQ-PLYFFKPPAPQEAEMRASQLHAFSRDWEARQQAAAAQ 174

Query: 293 KEKTIEAI---IEPPPLQPVDANPLKDLAGLLPPNITVPSGWKPGMPVELPKDFPLV--- 346
           ++ T  A+     PP  QP +A P    A   P  +  P G    M  +   D   +   
Sbjct: 175 RQITATAVALGAAPPSAQP-EAAPGPGSATAAP--VPAPVGDVAAMAAQHGLDVAQLLAS 231

Query: 347 PPPGWKPGDAVPL 359
            PPGWKPGD +PL
Sbjct: 232 IPPGWKPGDPIPL 244


>gi|412994053|emb|CCO14564.1| predicted protein [Bathycoccus prasinos]
          Length = 264

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 48/143 (33%), Positives = 68/143 (47%), Gaps = 17/143 (11%)

Query: 229 LNLSDILSYAHRISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPK 288
           +N+  +L YA R++YTT AP  +  G   L G LPPAPQEE    SQ+    +  +   +
Sbjct: 73  VNVETLLEYARRVAYTTSAPLNYQPGVSQLCGQLPPAPQEEHFLVSQMKKRHEEKLVHEQ 132

Query: 289 VVETK-EKTIEAIIEPPPLQPVDANPLKDLAGLLPPNITVPSGWKPG--MPVELPKDFPL 345
            ++ K ++  E       ++ +   P ++L   L         WKPG   P  +PK    
Sbjct: 133 ALQRKRQREEETTKRQKEIEEMAKLPKEELIRRL-------VQWKPGAAWPAGVPK---- 181

Query: 346 VPPPGWKPGDAVPLPPLDALPMP 368
            PP GWKPGD  PL  L A   P
Sbjct: 182 -PPVGWKPGD--PLAFLTAKAAP 201


>gi|145341318|ref|XP_001415760.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144575983|gb|ABO94052.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 235

 Score = 62.0 bits (149), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 31/54 (57%), Positives = 38/54 (70%), Gaps = 1/54 (1%)

Query: 226 ESRLNLSDILSYAHRISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTF 279
           ES L +S++L YA RISYTTFAP  +  G  PL G +PPAPQEE  +AS+L  F
Sbjct: 45  ESELKVSEVLDYARRISYTTFAPAGYVPG-APLCGIMPPAPQEEHFQASRLAAF 97


>gi|301099090|ref|XP_002898637.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262105062|gb|EEY63114.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 232

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 58/165 (35%), Positives = 75/165 (45%), Gaps = 38/165 (23%)

Query: 229 LNLSDILSYAHRISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDV---G 285
           L+  DI++YAHRI+ TT AP E+  G  P+ G +PPAPQE  MRA  L      ++   G
Sbjct: 6   LDPRDIVAYAHRIAGTTSAPKEWQPGF-PMFGFMPPAPQEHMMRAGVLSRGIVAEIVTQG 64

Query: 286 LPKVVETKEKT--------IEAIIEPPPLQPVDANPLKDLAGLLPPNITVPSGWKPGMPV 337
             K   T  +         +E +  P  L   DA   K +          P GWKPG PV
Sbjct: 65  PEKYTSTAARVSKEGDGDAMEKLNIPIGLGASDAQIRKQM----------PPGWKPGDPV 114

Query: 338 ELPKD-------------FPLVPPPGWKPGDAVPLPPLDALPMPR 369
           +LP D               +  P  WK GD  PLPP DA+ + R
Sbjct: 115 DLPLDALLHYMGRSFFTEHGITLPESWKTGD--PLPP-DAMEILR 156


>gi|330844437|ref|XP_003294132.1| hypothetical protein DICPUDRAFT_159085 [Dictyostelium purpureum]
 gi|325075457|gb|EGC29342.1| hypothetical protein DICPUDRAFT_159085 [Dictyostelium purpureum]
          Length = 347

 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 48/156 (30%), Positives = 77/156 (49%), Gaps = 5/156 (3%)

Query: 126 SSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFANKIKE 185
           ++P ++ +I+     +     + + +L++    Q+    I +EI+ KD  I   AN +K+
Sbjct: 79  ATPQSVIQIMGRLIEVDKTYQKTLKKLEKHQKYQNEIANIQKEIEEKDKLISGLANNLKD 138

Query: 186 AERVLDI-LVDDYSDYRRPKRVKSEGGDGNDDDSGSCSTTVESRLNLSDILSYAHRISYT 244
            E  L+  L  D +         SE  D N+  +      +   +   +++SYAH+IS T
Sbjct: 139 IESFLENELSQDNAKMNPNISNVSEILDLNEPSTPVTPDIINREVTPEELISYAHKISGT 198

Query: 245 TFAPPEFGAG-QGPLRGAL-PPAPQEEQMRASQLYT 278
           TFAP  FG     PL     PPAPQ+E MRAS L+T
Sbjct: 199 TFAP--FGYQPHLPLISLFKPPAPQDEMMRASVLFT 232


>gi|66801189|ref|XP_629520.1| hypothetical protein DDB_G0292608 [Dictyostelium discoideum AX4]
 gi|74850970|sp|Q54D00.1|MED4_DICDI RecName: Full=Putative mediator of RNA polymerase II transcription
           subunit 4; AltName: Full=Putative mediator complex
           subunit 4
 gi|60462910|gb|EAL61107.1| hypothetical protein DDB_G0292608 [Dictyostelium discoideum AX4]
          Length = 343

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 46/169 (27%), Positives = 78/169 (46%), Gaps = 24/169 (14%)

Query: 125 DSSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFANKIK 184
           +++P+ + +++     +       + +L++   +Q    +I +EI+ KD  I   A  +K
Sbjct: 62  NATPNMVLQLMNHLIDVDKIYQNTLKKLEKHQKIQKEITEIQKEIEEKDKLISTLALNLK 121

Query: 185 EAERVLDILVDDYSDYRRPKRVKSEGGDGND-------------DDSGSCSTTVESRLNL 231
           + E  L+       +      V    G+ +D             DD+ S    V   ++ 
Sbjct: 122 DIESFLE-------NELSQDNVNINNGNASDGIVNEVLIDLKQPDDAASMDNIVNREVSP 174

Query: 232 SDILSYAHRISYTTFAPPEFGAG-QGPLRGAL-PPAPQEEQMRASQLYT 278
            D++SYAH+IS TT AP  FG     PL     PPAPQ+E MR+S L+T
Sbjct: 175 EDLISYAHKISGTTSAP--FGYQPNAPLASLFKPPAPQDEMMRSSVLFT 221


>gi|328868218|gb|EGG16598.1| putative mediator complex subunit 4 [Dictyostelium fasciculatum]
          Length = 358

 Score = 55.1 bits (131), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 43/134 (32%), Positives = 63/134 (47%), Gaps = 27/134 (20%)

Query: 144 QLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFANKIKEAERVLDILVDDYSDYRRP 203
           QL + +   +EI+D+Q       +EI  KD  I+  A  +K  E+ L+  +++Y      
Sbjct: 113 QLEQHLIYQKEIVDIQ-------KEIIEKDKVIMEMATTLKSLEKTLEEDIEEYG----- 160

Query: 204 KRVKSEGGDGNDDDSGSCSTTVESRLNLSDILSYAHRISYTTFAPPEFGAGQGPLRGALP 263
              K   GD             E  +   D++SYAH+IS TT AP  +      +    P
Sbjct: 161 --AKKTKGD-------------EREVYPEDLISYAHKISGTTSAPFGYQPHLALIPLYKP 205

Query: 264 PAPQEEQMRASQLY 277
           PAPQEE MRAS L+
Sbjct: 206 PAPQEEMMRASVLF 219


>gi|428183182|gb|EKX52040.1| hypothetical protein GUITHDRAFT_133791 [Guillardia theta CCMP2712]
          Length = 324

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 41/145 (28%), Positives = 66/145 (45%), Gaps = 24/145 (16%)

Query: 134 IIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFANKIKEAERVLDIL 193
           ++    S Q +L E VAEL +I++++     +  +I   D+A+  F   +K+A  VL   
Sbjct: 61  LLHLLCSFQVELRENVAELNKIVEVKKKIDDVKSKIDDSDAALKTFGRSLKDAAGVLQEA 120

Query: 194 VDDYSDYRRPKRVKSEGGDGNDDDSGSCSTTVESRLNLSDILSYAHRISYTTFAPPEFGA 253
           +                         S +T     + + D++ YAH ISYT+ A  E   
Sbjct: 121 L-----------------------KTSSNTERVVEVEVEDLIKYAHVISYTSSA-QEGWE 156

Query: 254 GQGPLRGALPPAPQEEQMRASQLYT 278
              PL GALPPAP  + M  S+L++
Sbjct: 157 PNTPLVGALPPAPHSDVMARSRLFS 181


>gi|281201187|gb|EFA75401.1| putative mediator complex subunit 4 [Polysphondylium pallidum
           PN500]
          Length = 355

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 42/143 (29%), Positives = 69/143 (48%), Gaps = 28/143 (19%)

Query: 143 TQLFEAVAELQEILDLQDAKQK-------IAREIKAKDSAILAFANKIKEAERVLDILVD 195
           + L EA   LQ+ L   +  Q        I +EI+ KD  I   A+ +K+ E +L+   +
Sbjct: 96  SMLIEADKLLQKTLKQLEQHQTYQKEIVAIQKEIEEKDKQITTLASNLKDVEILLE---N 152

Query: 196 DYSDYRRPKRVKSEGGDGNDDDSGSCSTTVESRLNLSDILSYAHRISYTTFAPPEFGAGQ 255
           + SD  + + +K                  E  ++  +++SYAH+IS TT AP  + +  
Sbjct: 153 EISDASKNEDIK------------------ERTISPEELISYAHKISGTTSAPYGYQSNL 194

Query: 256 GPLRGALPPAPQEEQMRASQLYT 278
             +    PPAPQE+ MR+S L+T
Sbjct: 195 PLMPLYKPPAPQEDMMRSSTLFT 217


>gi|440799017|gb|ELR20078.1| hypothetical protein ACA1_114570 [Acanthamoeba castellanii str.
           Neff]
          Length = 310

 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 38/147 (25%), Positives = 69/147 (46%), Gaps = 14/147 (9%)

Query: 132 KEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFANKIKEAERVLD 191
           +E++         L  A+ +L+E    Q    ++  E+  KD  +   A ++++A+  L 
Sbjct: 75  EELMERLLRTDQALLRALNQLREHQRQQQRLARLKDEVAKKDGVVEQLALQLQQAQGTLQ 134

Query: 192 ILVDDYS-DYRRPKRVKSEGGDGNDDDSGSCSTTVESRLNLSDILSYAHRISYTTFAPPE 250
            ++DD +    RP  +   GG    D            +++ ++++YAH++SYTT APP 
Sbjct: 135 RVIDDTTTKLDRP--LVPGGGSARRDG-----------VDVDEVVAYAHKVSYTTSAPPN 181

Query: 251 FGAGQGPLRGALPPAPQEEQMRASQLY 277
           +      +R  LPPAPQE  +     Y
Sbjct: 182 WNPNLPHMRMFLPPAPQEINILQGNFY 208


>gi|255073715|ref|XP_002500532.1| hypothetical protein MICPUN_56974 [Micromonas sp. RCC299]
 gi|226515795|gb|ACO61790.1| hypothetical protein MICPUN_56974 [Micromonas sp. RCC299]
          Length = 227

 Score = 48.1 bits (113), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 22/41 (53%), Positives = 30/41 (73%), Gaps = 1/41 (2%)

Query: 229 LNLSDILSYAHRISYTTFAPPEFGAGQGPLRGALPPAPQEE 269
           ++ +D++ YA RISYTTFAP  +  G  PL G +PPAPQ+E
Sbjct: 41  VSAADVVEYARRISYTTFAPAGYQPG-APLHGIMPPAPQDE 80


>gi|299115493|emb|CBN75657.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 333

 Score = 47.0 bits (110), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 44/171 (25%), Positives = 77/171 (45%), Gaps = 19/171 (11%)

Query: 111 FLSSQAQSQPPPPIDSSP-STIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREI 169
            L + A+ Q    +D++  + I++ +      + +L +A   L+E   L    + + +E+
Sbjct: 21  VLGALARCQSGTKLDANARAAIRKALVLLNGKEAKLTKASEALREHQQLHAEVESLRQEV 80

Query: 170 KAKDSAILAFANKIKEAERVLDILVDDYSDYRRPKRVKSEGGDGNDDDSGSCSTTVESRL 229
           +  +  +   A K+ EA+  L          R  KR+          ++G  +   E  +
Sbjct: 81  RYGEQQVQDCARKLTEADAAL------RGPLRDAKRLL---------EAGRKAEAAE--I 123

Query: 230 NLSDILSYAHRISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFA 280
           ++ DI+ YA RIS  T AP  +  G   + G  PPAP+ E MRA  L  FA
Sbjct: 124 DVRDIVDYAQRISGITSAPAYWKPGMAMV-GFAPPAPRPEMMRAGALSAFA 173


>gi|239617708|ref|YP_002941030.1| Chromosome segregation ATPase-like protein [Kosmotoga olearia TBF
           19.5.1]
 gi|239506539|gb|ACR80026.1| Chromosome segregation ATPase-like protein [Kosmotoga olearia TBF
           19.5.1]
          Length = 1455

 Score = 43.9 bits (102), Expect = 0.16,   Method: Composition-based stats.
 Identities = 34/124 (27%), Positives = 64/124 (51%), Gaps = 7/124 (5%)

Query: 124 IDSSPSTIKEIIACF---TSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFA 180
           ++S  + ++E+I  F     L  Q +EA  EL+ + ++++A   I +E+K  D AI    
Sbjct: 831 LNSRKADVEELIIKFERLIELNKQAYEADKELRRLKEIENA---IGKELKDNDDAISRCD 887

Query: 181 NKIKEAERVLDILVDDYSDYRRPKRVKSEGGDGNDDDSGSCSTTVESRLNL-SDILSYAH 239
            ++K+   +  IL  D SD R+ +    E G G +  S S +  ++S   L S++ S++ 
Sbjct: 888 EELKKIHALKAILDKDLSDRRKEREFYREKGKGRNPVSLSLTEAIDSYRRLESELGSHSA 947

Query: 240 RISY 243
            I +
Sbjct: 948 TIKH 951


>gi|320166264|gb|EFW43163.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 371

 Score = 43.5 bits (101), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 50/170 (29%), Positives = 78/170 (45%), Gaps = 24/170 (14%)

Query: 110 TFLSSQAQS--QPPPPIDSSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAR 167
           T  + QAQ   +   P+  S +  + ++A    LQ    +AV +LQ          ++ R
Sbjct: 104 TEATEQAQRLVEMADPVARSTAIAEALMATEARLQ----QAVTKLQVHQSGMARLNELKR 159

Query: 168 EIKAKDSAILAFANKIKEAERVLDILVDDYSDYRRPKRVKSEGGDGNDDDSGSCSTTVES 227
            +  +D  I +FA ++K A++ L ++VD  S  +     ++E GD            V+ 
Sbjct: 160 VLGEQDELIASFAAELKRADQRLGLIVD-RSKTKLDVMKQAERGD------------VDP 206

Query: 228 RLNLSDILSYAHRISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLY 277
           R    D++ YAH IS    APP++  G  PL     P P E QMR S LY
Sbjct: 207 R----DVIRYAHYISTVATAPPDWKEGM-PLGIYSGPYPLETQMRDSLLY 251


>gi|395332718|gb|EJF65096.1| hypothetical protein DICSQDRAFT_50584 [Dichomitus squalens LYAD-421
           SS1]
          Length = 1081

 Score = 41.6 bits (96), Expect = 0.77,   Method: Compositional matrix adjust.
 Identities = 35/117 (29%), Positives = 59/117 (50%), Gaps = 11/117 (9%)

Query: 188 RVLDILVDDYSD-YRRPKRVKSEGGDGNDDDSGSCSTTVESRLNLSDILSYAHRISYTTF 246
           R+LD LVD+ ++ Y R  R + +G DG +D   + +T  E+ L L   +S     SYT F
Sbjct: 727 RLLD-LVDELTNWYIRFNRRRLKGEDGEEDTVAALNTLFETLLTLCQTMS-----SYTPF 780

Query: 247 APPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADL-DVGLPKVVETKEKTIEAIIE 302
                  G   LR  +P AP+   +R+     F ++ +    +V+E + K ++ +IE
Sbjct: 781 LTENIYQG---LREFIPEAPKAGDVRSIHFLPFPEVKEEYFDEVIERRVKRMQTVIE 834


>gi|406862031|gb|EKD15083.1| hypothetical protein MBM_06844 [Marssonina brunnea f. sp.
           'multigermtubi' MB_m1]
          Length = 155

 Score = 39.7 bits (91), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 26/98 (26%), Positives = 49/98 (50%), Gaps = 7/98 (7%)

Query: 118 SQPPPPIDSSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAIL 177
           SQPP  +D  P T++++ A    L+  L +A   L+++ DL+       R IK ++  + 
Sbjct: 61  SQPPADLDHGPLTVRDLPAATDGLKHNLQKAKVLLKQLPDLE-------RSIKEQEDEMR 113

Query: 178 AFANKIKEAERVLDILVDDYSDYRRPKRVKSEGGDGND 215
               K+ E + VL  LV+  +  +R +  ++  G  +D
Sbjct: 114 DLEAKVGEQKAVLAQLVEVGASVKREREQRNRNGRADD 151


>gi|410926535|ref|XP_003976734.1| PREDICTED: mediator of RNA polymerase II transcription subunit
           4-like [Takifugu rubripes]
          Length = 253

 Score = 39.3 bits (90), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 41/171 (23%), Positives = 74/171 (43%), Gaps = 30/171 (17%)

Query: 114 SQAQSQPPPPIDSSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKD 173
           S++Q  PPP  D       +++        +  E +   Q+   + +  Q + +E++ +D
Sbjct: 34  SRSQKLPPPGEDV------QVLELLVQRDAEFQELMEVAQQQGKVHEEMQVLEKEVEKRD 87

Query: 174 SAILAFANKIKEAERVLDILVDDYSDYRRPKRVKSEGGDGNDDDSGSCSTTVESRLNLSD 233
           S I     ++KEAE +L   V     Y+  +++K            S     +  ++  +
Sbjct: 88  SDIQQLQKQLKEAEHILATAV-----YQAKEKLK------------SIDKARKGNISSEE 130

Query: 234 ILSYAHRISYTT--FAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADL 282
           I+ YAHRIS +    AP  +  G  P R    P P + +MR+  L   A+L
Sbjct: 131 IIKYAHRISASNAVCAPLNWVPGD-PRR----PYPTDLEMRSGLLGHMANL 176


>gi|331696553|ref|YP_004332792.1| FAD dependent oxidoreductase [Pseudonocardia dioxanivorans CB1190]
 gi|326951242|gb|AEA24939.1| FAD dependent oxidoreductase [Pseudonocardia dioxanivorans CB1190]
          Length = 393

 Score = 38.1 bits (87), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 27/90 (30%), Positives = 42/90 (46%), Gaps = 4/90 (4%)

Query: 281 DLDVGLPKVVETKEKTIEAIIEPPPLQPVDANPLKDLAGLLP-PNITVPSGWKPGMPVEL 339
           D  V  P VV+      + + +   + P+   PL+  A ++P  +  VP+G +  M +E+
Sbjct: 207 DGHVEAPDVVDAAGAWADVVAQAAGVDPIGLVPLRRTAAIVPVADEAVPAGGRRSMVIEV 266

Query: 340 PKDFPLVPPPGW---KPGDAVPLPPLDALP 366
            + F     PG     PGD  P PP DA P
Sbjct: 267 GESFYFKSEPGGLLVSPGDETPSPPCDARP 296


>gi|99082818|ref|YP_614972.1| flagellar basal body rod modification protein [Ruegeria sp. TM1040]
 gi|99039098|gb|ABF65710.1| flagellar basal-body rod modification protein FlgD [Ruegeria sp.
           TM1040]
          Length = 231

 Score = 37.7 bits (86), Expect = 9.6,   Method: Compositional matrix adjust.
 Identities = 46/160 (28%), Positives = 74/160 (46%), Gaps = 27/160 (16%)

Query: 102 SAFRGSLPTFL---SSQAQSQPP-PPIDSSPSTIKEIIACFTSLQTQLF--EAVAELQEI 155
           SA      TFL   ++QA+ Q P  P+DSS  T +  +A F+S++ Q+   E +  +   
Sbjct: 27  SALTSDFETFLRLMTTQAKYQDPLEPMDSSDYTAQ--LAQFSSVEQQVLTNETLTSMLTA 84

Query: 156 LDLQDAKQ---KIAREIKAKDSAILAFANKIK-------EAERVLDILVDDY-SDYRR-- 202
           L L +  Q    + RE++A+ +     ++ +         A++V  I+ DD  ++  R  
Sbjct: 85  LSLSNMSQLTGWVGREVRAQAAGYFDGSSSVSIAPNPAAVADKVEMIIYDDQEAEVGRVD 144

Query: 203 ------PKRVKSEGGDGNDDDSGSCSTTVESRLNLSDILS 236
                 P +   E GDGN    GS S  +ES  N   ILS
Sbjct: 145 LPVSAEPYQWNGEDGDGNTLPEGSYSFVIESSSNGEVILS 184


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.312    0.132    0.389 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,422,129,033
Number of Sequences: 23463169
Number of extensions: 360503379
Number of successful extensions: 1598637
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 87
Number of HSP's successfully gapped in prelim test: 2841
Number of HSP's that attempted gapping in prelim test: 1568437
Number of HSP's gapped (non-prelim): 26288
length of query: 420
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 275
effective length of database: 8,957,035,862
effective search space: 2463184862050
effective search space used: 2463184862050
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 78 (34.7 bits)