BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 017655
         (368 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|224100559|ref|XP_002311923.1| predicted protein [Populus trichocarpa]
 gi|222851743|gb|EEE89290.1| predicted protein [Populus trichocarpa]
          Length = 659

 Score =  268 bits (684), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 175/391 (44%), Positives = 222/391 (56%), Gaps = 67/391 (17%)

Query: 11  NFIQNGQGSISKSNSKKVSGKNFIKNRTKEFSQ-GFQKSHFHQMHNGKRKFGFYNDQKPK 69
           N  +NGQ S  KS  +K SGKN   NR +  SQ G QKS F+ M NGKRK  F N+   K
Sbjct: 303 NTARNGQSSTPKSKWEKPSGKNLKNNRNRGRSQSGHQKSDFNCMDNGKRKLEFSNEHGRK 362

Query: 70  GHGNGKAIKFGLANQMNQPQEKKRK-SLALTYTEQEVKQWCEERRKNYPTKANINKKLTE 128
           G+GN +  KFG  +  +Q  E+K K S    YTEQE+KQW E RRK+YPTK +I KK  E
Sbjct: 363 GNGNERVAKFGRTDLTDQATEEKSKPSRTFFYTEQEIKQWRESRRKHYPTKTSIEKKQME 422

Query: 129 KQSNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDS---------------- 172
                 VID+EA  R +QLKEILAKQAELGVEVAEIP  YLLDS                
Sbjct: 423 ------VIDREANFRHKQLKEILAKQAELGVEVAEIPPDYLLDSEKLGVEVAEIPPPQVL 476

Query: 173 ---------------KKQVRARVENTMPLNKRGRFQNNYDKRGRYKRKDQFSKEQKLADK 217
                          +KQ   R +N   L K+G+F N +DKRGR+ +K + +K+   A++
Sbjct: 477 SSEKLGVEVAEIPPPQKQEHGREDNRRSLTKKGKFWNKHDKRGRFNKKGRSAKQVGSANE 536

Query: 218 DSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPLVMVK 277
           +         +KPTLL+KLLS D+KRDK  LLQ FRFM  NSFFKDWPEKP+ FP V+VK
Sbjct: 537 E---------RKPTLLEKLLSTDIKRDKRQLLQVFRFMVANSFFKDWPEKPLKFPSVVVK 587

Query: 278 DGGAAGEVVEEKSLLLGEEDSEGKGRRMVEHFEHDENRERMHHNIQGFEDENDDEEEDND 337
           + G   E+VE+KS L+GEE SE +   + E+F                  + DD  E + 
Sbjct: 588 EDGYEDEIVEKKSSLVGEEVSEDRNNTIAENF-----------------GDRDDNIEHDA 630

Query: 338 NGEQMKVYIAGKGNTNDEHVRLEEEEGEIID 368
             E    ++ GK +  DE  R+  EEGEI+D
Sbjct: 631 QVELGNCFVRGKCDIVDEVDRV--EEGEIVD 659


>gi|147775405|emb|CAN73830.1| hypothetical protein VITISV_043067 [Vitis vinifera]
          Length = 605

 Score =  268 bits (684), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 160/315 (50%), Positives = 199/315 (63%), Gaps = 16/315 (5%)

Query: 11  NFIQNGQGSISKSNSKKVSGKNFIKN-RTKEFSQGFQKSHFHQMHNGKRKFGFYNDQKPK 69
           NF+++   + S  N K  S KNF+ N + K    G +KS  H M NG+ K G  N+ + K
Sbjct: 252 NFVKDAGVNNSNPNWKNSSRKNFMXNPKGKNSHWGSRKSQLHHMQNGRGKAGISNENRGK 311

Query: 70  GHGNGKAIKFGLANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEK 129
           G  N  A      N   Q + +K++ L L YTEQE++ W EER+KNYP+K N+ KK  EK
Sbjct: 312 GLSNNMAGNLCRPNFTYQDKVEKKRPLPLNYTEQEIQNWREERKKNYPSKINLEKKSAEK 371

Query: 130 QSNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDS-KKQVRARVENT-MPLN 187
            +NS+VI+ E K RR+QLKEILAKQAELGVEVAEIP +YL DS K+QV  R EN      
Sbjct: 372 LTNSEVIEAEVKSRRQQLKEILAKQAELGVEVAEIPPHYLSDSEKQQVHGREENNKKAFG 431

Query: 188 KRGRFQNNYDKRGRYKRK--------DQFSKEQKLADKDS-----SNTSSFNKKKPTLLQ 234
           K+ RFQN  +KR R+ RK        D F+K+Q+LA  DS     SN    NKKK TLLQ
Sbjct: 432 KKERFQNRGNKRRRHDRKQWQRHDQEDGFTKKQRLAGTDSGDTNASNQPPLNKKKQTLLQ 491

Query: 235 KLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLG 294
           KLLS D+KRDK HLLQ FRFMA+NSFFKDWPEKP+ FPLV VK+ G  GEVV+ KS    
Sbjct: 492 KLLSTDIKRDKRHLLQVFRFMAMNSFFKDWPEKPLKFPLVAVKETGCQGEVVDRKSSPTS 551

Query: 295 EEDSEGKGRRMVEHF 309
           +   +G  +   E F
Sbjct: 552 KGVPQGGRKTXAEEF 566


>gi|224113339|ref|XP_002316461.1| predicted protein [Populus trichocarpa]
 gi|222865501|gb|EEF02632.1| predicted protein [Populus trichocarpa]
          Length = 674

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 184/408 (45%), Positives = 222/408 (54%), Gaps = 81/408 (19%)

Query: 11  NFIQNGQGSISKSNSKKVSGKNFIKNRTKEFSQ-GFQKSHFHQMHNGKRKFGFYNDQKPK 69
           N  +NGQGS   S     SGKNF  N  +E SQ G QKS FH+M NGKRK GF N    K
Sbjct: 298 NPAKNGQGSTPNSKQGIFSGKNFKNNPKREQSQSGHQKSEFHRMDNGKRKLGFSNKHGGK 357

Query: 70  GHGNGKAIKFGLANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEK 129
           G GN +A KFG ++  NQ  E+KR  +   YTEQE+KQW E RRK++PTK NI KK TEK
Sbjct: 358 GKGNERAAKFGRSDPSNQVMEQKRTHI---YTEQEIKQWRESRRKHFPTKTNIEKKQTEK 414

Query: 130 QSNSDVIDKEAKMRRE-------------QLKEILAKQAELGVEVAEIPSYY-------- 168
             +S VIDKEA  RR+             QLKEILAKQAELGVEVAEIP  Y        
Sbjct: 415 LIDSGVIDKEANFRRKVLNALPRKLDDKLQLKEILAKQAELGVEVAEIPPEYMLDSEKLG 474

Query: 169 ----------------------------LLDSKKQVRARVENTMPLNKRGRFQNNYDKRG 200
                                       LLDS+KQ   R +N   L K+GRF N +D+RG
Sbjct: 475 VEVAETPLSYLLDSEKLGVEAAEIPPHHLLDSEKQEHGREDNRRSLTKKGRFWNKHDRRG 534

Query: 201 RYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSF 260
           RYKRK +   +  L +++         +KPTLL+KLL AD+KRDK  LLQ FRFM  NSF
Sbjct: 535 RYKRKGRSDMQLGLENEE---------RKPTLLEKLLCADIKRDKHRLLQVFRFMVANSF 585

Query: 261 FKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKGRRMVEHFEHDENRERMHH 320
           FKDW +KP+ FP V+VK+ G   E  EEK  L+GEE+SE      VE F           
Sbjct: 586 FKDWSDKPLKFPSVVVKEDGCKDEPQEEKPSLVGEEESEVPNNTTVEDF----------- 634

Query: 321 NIQGFEDENDDEEEDNDNGEQMKVYIAGKGNTNDEHVRLEEEEGEIID 368
                  + DD +E +   E     + GK +  DE  R+  EEGEIID
Sbjct: 635 ------GDRDDGDEHDAQVEPGNGSVMGKCDIVDEVYRV--EEGEIID 674


>gi|255554855|ref|XP_002518465.1| conserved hypothetical protein [Ricinus communis]
 gi|223542310|gb|EEF43852.1| conserved hypothetical protein [Ricinus communis]
          Length = 273

 Score =  249 bits (635), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 128/218 (58%), Positives = 166/218 (76%), Gaps = 4/218 (1%)

Query: 94  KSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILAK 153
           +S AL YTEQE+K W EERRKN+P+KANI K  +E+ +NS  IDKEAK+RRE+LKEILAK
Sbjct: 9   RSPALIYTEQEIKLWREERRKNFPSKANIEKMCSERVTNSGGIDKEAKLRRERLKEILAK 68

Query: 154 QAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYDKRGRYKRKDQFSKEQK 213
           QAELGVEVAEIPS+YL DS+KQV+ + ++     K+GR ++ +D+RGRY +KD  +++  
Sbjct: 69  QAELGVEVAEIPSHYLSDSEKQVKVKKDSRRSAPKKGRSRHKHDRRGRYNKKDGLTQQNT 128

Query: 214 LADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPL 273
           LA KDSSN SSF++ KPTLLQKLLSA+V++DK HLLQ FRFM +NSFF D PEKP+ FP 
Sbjct: 129 LAKKDSSNGSSFSRTKPTLLQKLLSAEVRKDKHHLLQVFRFMVMNSFFDDGPEKPLKFPS 188

Query: 274 VMVKDGGAAGEVVEEKSLLLGEEDSEGKGRRMVEHFEH 311
           V+VK+     EVV EK+ + G+  SE   +  VE+  H
Sbjct: 189 VVVKE----DEVVAEKTSITGKGISEVSNKTAVENSGH 222


>gi|359487059|ref|XP_002271061.2| PREDICTED: uncharacterized protein LOC100252331 [Vitis vinifera]
          Length = 1031

 Score =  237 bits (604), Expect = 7e-60,   Method: Compositional matrix adjust.
 Identities = 136/236 (57%), Positives = 163/236 (69%), Gaps = 15/236 (6%)

Query: 70  GHGNGKAIKFGLANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEK 129
           G  N  A      N   Q + +K++ L L YTEQE++ W EER+KNYP+K N+ KK  EK
Sbjct: 738 GLSNNMAGNLCRPNFTYQDKVEKKRPLPLNYTEQEIQNWREERKKNYPSKINLEKKSAEK 797

Query: 130 QSNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDS-KKQVRARVENT-MPLN 187
            +NS+VI+ E K RR+QLKEILAKQAELGVEVAEIP +YL DS K+QV  R EN      
Sbjct: 798 LTNSEVIEAEVKSRRQQLKEILAKQAELGVEVAEIPPHYLSDSEKQQVHGREENNKKAFG 857

Query: 188 KRGRFQNNYDKRGRYKRK--------DQFSKEQKLADKDS-----SNTSSFNKKKPTLLQ 234
           K+ RFQN  +KR R+ RK        D F+K+Q+LA KDS     SN    NKKK TLLQ
Sbjct: 858 KKERFQNRGNKRRRHDRKQWQRHDQEDGFTKKQRLAGKDSGDMNASNQPPLNKKKQTLLQ 917

Query: 235 KLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKS 290
           KLLS D+KRDK HLLQ FRFMA+NSFFKDWPEKP+ FPLV+VK+ G  GEVV+ KS
Sbjct: 918 KLLSTDIKRDKRHLLQVFRFMAMNSFFKDWPEKPLKFPLVVVKETGCQGEVVDRKS 973


>gi|449446325|ref|XP_004140922.1| PREDICTED: uncharacterized protein LOC101213190 [Cucumis sativus]
          Length = 599

 Score =  237 bits (604), Expect = 8e-60,   Method: Compositional matrix adjust.
 Identities = 156/352 (44%), Positives = 214/352 (60%), Gaps = 45/352 (12%)

Query: 31  KNFIKNRTKEFSQGFQKSHFHQMHNGKRKFGFYNDQKPKGHGNGKAIKFGLANQMNQPQE 90
           +NF++N  K    GFQK+  H + N K+KFGF   QK KG  N +  KF   N  +Q +E
Sbjct: 279 RNFMRNSKK----GFQKNQTHHLKNEKKKFGFPGGQKEKGFHNERRNKFCGTNPTDQVKE 334

Query: 91  KKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEI 150
           +KR SL+L YT+QE++QW E RRKNYP+  NI KKLT KQ+N  ++DKEAK+ R++LKEI
Sbjct: 335 QKR-SLSLVYTDQEIRQWREARRKNYPSSTNIQKKLTGKQTNCTLVDKEAKLLRQELKEI 393

Query: 151 LAKQAELGVEVAEIPSYYLL-----DSKKQVRARV-------ENTMPLNKRGRFQNNYDK 198
           LAKQAELGVEVAEIP  YL      D++KQ   R        E ++   ++   QN  +K
Sbjct: 394 LAKQAELGVEVAEIPPEYLSYSEKHDNRKQRGGRSTLGEEAEEASI---EKENSQNRLNK 450

Query: 199 RGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAIN 258
           RGR K+K++  K+ K  +K  SN     K++PTLLQKLL ADV++DKS LLQ  RF  +N
Sbjct: 451 RGRCKKKNRPRKKGKF-EKHLSNKPPLKKREPTLLQKLLKADVRKDKSQLLQALRFTVMN 509

Query: 259 SFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEED-SEGKGRRMVEH-FEHDENRE 316
           SFFK+WP KP+ FP V VK+      VV+E SL  G  +  E     +VE+   HD    
Sbjct: 510 SFFKEWPNKPLKFPSVTVKENEGETNVVDETSLSTGNFNLQETNNNSLVENDGSHD---- 565

Query: 317 RMHHNIQGFEDENDDEEEDNDNGEQMKVYIAGKGNTNDEHVRLEEEEGEIID 368
                    + +N+++ +D++  E++K    G G      +++ EEEGEIID
Sbjct: 566 --------IDSDNENDIKDSNKDEKLK----GDG------IQVLEEEGEIID 599


>gi|449494110|ref|XP_004159451.1| PREDICTED: uncharacterized LOC101213190 [Cucumis sativus]
          Length = 552

 Score =  236 bits (602), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 156/352 (44%), Positives = 214/352 (60%), Gaps = 45/352 (12%)

Query: 31  KNFIKNRTKEFSQGFQKSHFHQMHNGKRKFGFYNDQKPKGHGNGKAIKFGLANQMNQPQE 90
           +NF++N  K    GFQK+  H + N K+KFGF   QK KG  N +  KF   N  +Q +E
Sbjct: 232 RNFMRNSKK----GFQKNQTHHLKNEKKKFGFPGGQKEKGFHNERRNKFCGTNPTDQVKE 287

Query: 91  KKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEI 150
           +KR SL+L YT+QE++QW E RRKNYP+  NI KKLT KQ+N  ++DKEAK+ R++LKEI
Sbjct: 288 QKR-SLSLVYTDQEIRQWREARRKNYPSSTNIQKKLTGKQTNCTLVDKEAKLLRQELKEI 346

Query: 151 LAKQAELGVEVAEIPSYYLL-----DSKKQVRARV-------ENTMPLNKRGRFQNNYDK 198
           LAKQAELGVEVAEIP  YL      D++KQ   R        E ++   ++   QN  +K
Sbjct: 347 LAKQAELGVEVAEIPPEYLSYSEKHDNRKQRGGRSTLGEEAEEASI---EKENSQNRLNK 403

Query: 199 RGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAIN 258
           RGR K+K++  K+ K  +K  SN     K++PTLLQKLL ADV++DKS LLQ  RF  +N
Sbjct: 404 RGRCKKKNRPRKKGKF-EKHLSNKPPLKKREPTLLQKLLKADVRKDKSQLLQALRFTVMN 462

Query: 259 SFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEED-SEGKGRRMVEH-FEHDENRE 316
           SFFK+WP KP+ FP V VK+      VV+E SL  G  +  E     +VE+   HD    
Sbjct: 463 SFFKEWPNKPLKFPSVTVKENEGETNVVDETSLSTGNFNLQETNNNSLVENDGSHD---- 518

Query: 317 RMHHNIQGFEDENDDEEEDNDNGEQMKVYIAGKGNTNDEHVRLEEEEGEIID 368
                    + +N+++ +D++  E++K    G G      +++ EEEGEIID
Sbjct: 519 --------IDSDNENDIKDSNKDEKLK----GDG------IQVLEEEGEIID 552


>gi|145334515|ref|NP_001078603.1| uncharacterized protein [Arabidopsis thaliana]
 gi|145358174|ref|NP_197345.2| uncharacterized protein [Arabidopsis thaliana]
 gi|60547897|gb|AAX23912.1| hypothetical protein At5g18440 [Arabidopsis thaliana]
 gi|71905555|gb|AAZ52755.1| hypothetical protein At5g18440 [Arabidopsis thaliana]
 gi|71905557|gb|AAZ52756.1| hypothetical protein At5g18440 [Arabidopsis thaliana]
 gi|332005180|gb|AED92563.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332005181|gb|AED92564.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 470

 Score =  232 bits (591), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 131/252 (51%), Positives = 177/252 (70%), Gaps = 10/252 (3%)

Query: 43  QGFQKSHFHQMHNGKRKFGFYNDQKPKGHGNGKAIKFGL--ANQMNQPQEKKRKSLALTY 100
           QGFQ+   HQ  NGKRK GF  D +  G GN   +K GL  ++  N  +EKKR S AL Y
Sbjct: 202 QGFQRPQLHQADNGKRKSGFNKDHR--GKGNNNKMKTGLDGSDTGNIAKEKKR-SYALMY 258

Query: 101 TEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILAKQAELGVE 160
           T +EV+QW E RRKNYPTK  + KK+ +K  ++ ++D+EAKMRR+QL+E+LAKQAELGVE
Sbjct: 259 TPREVQQWREARRKNYPTKFLVEKKV-KKNVSASILDEEAKMRRQQLREVLAKQAELGVE 317

Query: 161 VAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYDKRGRYKRKDQF-SKEQKLADKDS 219
           VAE+PS+YL ++ +QV     N     ++GRFQNN   + R+ RKD+F +K+ +L DK S
Sbjct: 318 VAEVPSHYLSNNDEQVNGDRGNNN--GRKGRFQNNRRNKRRHDRKDKFDNKKPRLEDKKS 375

Query: 220 SNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPLVMVKDG 279
           S  SS   +KPTLL+KLLSAD+KRDKS LLQ FRFM +NS  K++PE+P+  PL+ VK+ 
Sbjct: 376 SQDSSITTRKPTLLEKLLSADIKRDKSQLLQVFRFMVMNSLLKEFPEQPLKLPLITVKET 435

Query: 280 GAAGEVVEEKSL 291
           G   + +E+ S+
Sbjct: 436 GCE-DAMEDPSI 446


>gi|297807865|ref|XP_002871816.1| hypothetical protein ARALYDRAFT_488722 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297317653|gb|EFH48075.1| hypothetical protein ARALYDRAFT_488722 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 454

 Score =  224 bits (572), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 119/239 (49%), Positives = 170/239 (71%), Gaps = 6/239 (2%)

Query: 43  QGFQKSHFHQMHNGKRKFGFYNDQKPKGHGNGKAIKFGL-ANQMNQPQEKKRKSLALTYT 101
           QGFQ+   HQ  NGKRK GF  D +  G GN   +K GL  +  +   ++KR+S AL YT
Sbjct: 203 QGFQRPQLHQADNGKRKSGFNKDHR--GKGNYNKMKNGLDGSDADNIAKEKRRSYALMYT 260

Query: 102 EQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILAKQAELGVEV 161
            ++V QW E RRKN+PT+ N+ KK+ +K  ++ ++D+EAKMRR+QL+E+LAKQAELG+EV
Sbjct: 261 PKDVNQWREARRKNFPTRLNVEKKV-KKNVSASILDEEAKMRRQQLREVLAKQAELGIEV 319

Query: 162 AEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYDKRGRYKRKDQFSKEQKLADKDSSN 221
           A++PS+YL ++ ++V    +N     ++ +FQNN  K+ R+ RKD+F K  +L DK+SS 
Sbjct: 320 ADVPSHYLSNTDERVHG--DNGANDGQKRKFQNNRHKQRRHGRKDKFDKTPRLDDKNSSQ 377

Query: 222 TSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPLVMVKDGG 280
            S    KKPTLL+KLLSA++KRDK HLLQ FRFM +NSF K++PE+P+  PL+ V++ G
Sbjct: 378 ESPMTTKKPTLLEKLLSANIKRDKIHLLQVFRFMVMNSFLKEFPEQPLKLPLITVEETG 436


>gi|356553962|ref|XP_003545319.1| PREDICTED: uncharacterized protein LOC100786384 [Glycine max]
          Length = 822

 Score =  216 bits (549), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 149/359 (41%), Positives = 215/359 (59%), Gaps = 17/359 (4%)

Query: 15  NGQGSISKSNSKKVSGKNFIKNRTKE--FSQGFQKSHFHQMHNGKRKFGFYNDQKPKGHG 72
           N + ++  SN K    KNF KN+     F  GFQKS FH ++NGK+  GF  +   KG  
Sbjct: 476 NIKTNVPNSNWKGSPSKNF-KNKPNRGGFQAGFQKSKFHDVNNGKKGSGFPIEHNGKGPN 534

Query: 73  NGKAIKFGLANQMNQPQEKKR---KSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEK 129
           +G+   +GL     +P+E K+   +SL++TYT QE++QW E R+KN+P   NI KK +E 
Sbjct: 535 SGRGGHYGL-----KPKEHKQQPERSLSVTYTVQEIQQWREARKKNHPFNNNIQKKHSEH 589

Query: 130 QSNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKR 189
             +   I++E   R  +LKE+LAKQAELGVEVAEIPSYYL +S  Q   + E       +
Sbjct: 590 PKDRKAINREVLQR--ELKEVLAKQAELGVEVAEIPSYYLKNSDNQA-LQSEGKNKFTDK 646

Query: 190 GRFQNNYDKRGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLL 249
            +FQN ++K+    RK +F+K QK  DKD S + S  K+KPTLLQKLLS+DVKRDKSHL+
Sbjct: 647 RKFQNKFNKKS--DRKGRFAKRQKFDDKDFSESPSLKKRKPTLLQKLLSSDVKRDKSHLI 704

Query: 250 QTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKGRRMVEHF 309
           Q FRFM +NSFFK   +KP+ +PLV+VK+ G+  +  EEK L  G++  +G     V+  
Sbjct: 705 QVFRFMVMNSFFKHCLDKPLRYPLVVVKEKGSEVD-GEEKYLHTGKDVLKGGNEETVQKI 763

Query: 310 EHDENRERMHHNIQGFEDENDDEEEDNDNGEQMKVYIAGKGNTNDEHVRLEEEEGEIID 368
               N        +  +D+ +D   DN+  +     +  + ++ +   + +EEEGEII+
Sbjct: 764 VTFNNDNSHDCEDEDSDDDENDSIVDNNLHKDPSSLVKRQCDSGEGIGKFDEEEGEIIE 822


>gi|358347440|ref|XP_003637765.1| hypothetical protein MTR_101s0034 [Medicago truncatula]
 gi|355503700|gb|AES84903.1| hypothetical protein MTR_101s0034 [Medicago truncatula]
          Length = 553

 Score =  214 bits (545), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 152/355 (42%), Positives = 209/355 (58%), Gaps = 49/355 (13%)

Query: 19  SISKSNSKKVSGKNFIKNRTKEFSQG-FQKSHFHQMHNGKRKFGFYNDQKPKGHGNGKAI 77
           + + SN K    KNF   + +  SQG FQKS F+  + GKR   F  + K KG  N +A 
Sbjct: 243 NFANSNWKGSPNKNFKNKQNRGGSQGGFQKSKFNDTNKGKR---FSKEHKGKGPNNERAG 299

Query: 78  KFGLANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVID 137
            FGL    N  QE KR S +LTY+EQE++QW E RRKN+P++  I K+ +E+  +S VID
Sbjct: 300 HFGL----NSKQEPKR-SFSLTYSEQEIQQWREARRKNHPSREKIEKQQSEQSKDSKVID 354

Query: 138 KEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQN--- 194
           +E   R  +LKE+LAKQAELG+EVAEIPSYYL++   Q     +       + +F+N   
Sbjct: 355 RELLQR--ELKEVLAKQAELGIEVAEIPSYYLMNGTNQGLQSEDKKNSFTDKRKFKNKLM 412

Query: 195 -NYDKRGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFR 253
            N D+R R  +K      QK AD+D       NKKKPTLLQKLLSAD+KRDKSHL Q FR
Sbjct: 413 RNPDRRSRNNKK------QKFADRDLLE----NKKKPTLLQKLLSADIKRDKSHLFQVFR 462

Query: 254 FMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKGRRMVEHFEHDE 313
           F+  NSFFKD+P+KP+ +P V VK+ G+  EV   K  L G       G  ++EH     
Sbjct: 463 FITANSFFKDYPDKPLVYPPVSVKEMGS--EVYGGKKHLQG-------GEAVLEH----G 509

Query: 314 NRERMHHNIQGFEDENDDEEEDNDNGEQMKVYIAGKGNTNDEHVRLEEEEGEIID 368
            +E +   ++G ++ +D+E+E++D           +G+  D     +EEEGEII+
Sbjct: 510 TKEIVQKFVKGSDNGHDNEDEESD-----------EGDNEDGINEFDEEEGEIIE 553


>gi|115440647|ref|NP_001044603.1| Os01g0814000 [Oryza sativa Japonica Group]
 gi|56785048|dbj|BAD82687.1| unknown protein [Oryza sativa Japonica Group]
 gi|113534134|dbj|BAF06517.1| Os01g0814000 [Oryza sativa Japonica Group]
          Length = 533

 Score =  167 bits (422), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 90/186 (48%), Positives = 121/186 (65%), Gaps = 2/186 (1%)

Query: 93  RKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILA 152
           RK   + Y   EVKQW E R+KNYPT  NINKKL+E + + +  D+EA+MRR++LKE+L 
Sbjct: 281 RKRPRIIYDANEVKQWLEARKKNYPTSVNINKKLSESRPDGEKKDEEAQMRRQELKEVLE 340

Query: 153 KQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYD-KRGRYKRKDQFSKE 211
           KQ ELG E+ E+P  YL + + Q   R  N     +  RF N  D KR RY R D  SK 
Sbjct: 341 KQKELGFELPELPPGYLSEHEDQGNGRRSNWKTQRRDCRFGNRADNKRSRYDRNDFQSKR 400

Query: 212 QKLADKDSSNTSSFNK-KKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVN 270
            K+ ++   +  +  K ++PTLLQKLLS+DVKRD+  LL TF+FM +N+FF D+P+KP+ 
Sbjct: 401 PKVRNQTHCDDGAVLKSREPTLLQKLLSSDVKRDRHRLLHTFKFMVLNNFFSDYPDKPLE 460

Query: 271 FPLVMV 276
           FP V V
Sbjct: 461 FPSVKV 466


>gi|242054757|ref|XP_002456524.1| hypothetical protein SORBIDRAFT_03g037800 [Sorghum bicolor]
 gi|241928499|gb|EES01644.1| hypothetical protein SORBIDRAFT_03g037800 [Sorghum bicolor]
          Length = 568

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 84/188 (44%), Positives = 122/188 (64%), Gaps = 3/188 (1%)

Query: 92  KRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEIL 151
           +RK   + Y   EVK+W E R+KNYPT  N++KKL+E  S+++  DK+A++RR++LKE+L
Sbjct: 295 RRKPPRIIYDANEVKRWVEARKKNYPTSVNVHKKLSEINSDNENKDKDAQLRRQELKEVL 354

Query: 152 AKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQN--NYDKRGRYKRKDQFS 209
           AKQ ELG ++ E+P  YL ++  Q      N     +   F N  N +KR RY+R    S
Sbjct: 355 AKQQELGFDLPELPPGYLSETGDQCIENKNNRKAQCRDSHFGNRSNNNKRPRYERGGFQS 414

Query: 210 KEQKLADKDSSNTSSFNK-KKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKP 268
           K  K+ ++      +  K ++PTLLQKLLS+D+KRD+  LL  F+FM +N+FFKDWP+KP
Sbjct: 415 KRSKVWNRTPCADDAMAKSREPTLLQKLLSSDIKRDRHRLLHVFKFMTLNNFFKDWPDKP 474

Query: 269 VNFPLVMV 276
           + FP V V
Sbjct: 475 LQFPSVKV 482


>gi|223943225|gb|ACN25696.1| unknown [Zea mays]
 gi|414880010|tpg|DAA57141.1| TPA: hypothetical protein ZEAMMB73_684772 [Zea mays]
          Length = 562

 Score =  161 bits (407), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 84/191 (43%), Positives = 125/191 (65%), Gaps = 13/191 (6%)

Query: 92  KRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEIL 151
           +RK   + Y   EVKQW E R+KNYPT  N+NKKL+E +S+++  DK+A++RR++LKE++
Sbjct: 290 RRKPPPIIYDANEVKQWVEARKKNYPTSVNVNKKLSEMKSDNENKDKDAQLRRQELKEVI 349

Query: 152 AKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQN-------NYDKRGRYKR 204
           AKQ ELG +  E+P  YL +   Q    +EN    N++ +F++       N +KR RY+R
Sbjct: 350 AKQQELGFDPPELPPGYLSEIGDQC---IENKN--NRKTQFRDSHFGNRFNNNKRSRYER 404

Query: 205 KDQFSKEQKLAD-KDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKD 263
               SK  K+ +    ++      ++PTLLQKLLS+D+KRD+  LL  FRFM +N+FFKD
Sbjct: 405 GGFQSKRSKVWNCTPHADHGMVKSREPTLLQKLLSSDIKRDRHRLLHVFRFMTLNNFFKD 464

Query: 264 WPEKPVNFPLV 274
           WP++P+ FP V
Sbjct: 465 WPDRPLQFPSV 475


>gi|357125563|ref|XP_003564462.1| PREDICTED: uncharacterized protein LOC100829078 [Brachypodium
           distachyon]
          Length = 609

 Score =  161 bits (407), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 87/195 (44%), Positives = 120/195 (61%), Gaps = 15/195 (7%)

Query: 93  RKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILA 152
           RK   + Y   EVKQW E R+KNYPT  N+NKKL++ QS+    D+EA+MRR +LKE+LA
Sbjct: 333 RKRPPICYDTNEVKQWVEARKKNYPTSVNVNKKLSQSQSDDQKKDEEAQMRRLELKEVLA 392

Query: 153 KQAELGVEVAEIPSYYLLDSKKQVRARVEN-----------TMPLNKRGRFQNNYDKRGR 201
           KQ ELG E+ E+P  YL +++ Q +    N                 RGR     +KR R
Sbjct: 393 KQKELGFELPELPPGYLSETEGQPKGPQGNDKESKWKTQQRDCRFGNRGR----GNKRQR 448

Query: 202 YKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFF 261
           Y R D  SK  +  +    +  +  K++PTLLQKLLS+D+KRD+  LL TF+FMA+N+FF
Sbjct: 449 YDRADFQSKRPREWNNSRHDGGAVAKREPTLLQKLLSSDIKRDRHRLLHTFKFMALNNFF 508

Query: 262 KDWPEKPVNFPLVMV 276
           ++ P+KP+ FP V V
Sbjct: 509 RNLPDKPLEFPSVKV 523


>gi|218189267|gb|EEC71694.1| hypothetical protein OsI_04190 [Oryza sativa Indica Group]
          Length = 500

 Score =  160 bits (404), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 91/199 (45%), Positives = 122/199 (61%), Gaps = 15/199 (7%)

Query: 93  RKSLALTYTEQEVKQWCEERRKNYPTKANINK-------------KLTEKQSNSDVIDKE 139
           RK   + Y   EVKQW E R+KNYPT  NINK             KL+E + + +  D+E
Sbjct: 237 RKRPRIIYDANEVKQWLEARKKNYPTSVNINKTIREASSTDHLMQKLSESRPDGEKKDEE 296

Query: 140 AKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYD-K 198
           A+MRR++LKE+LAKQ ELG E+ E+P  YL + + Q   R  N     +  RF N  D K
Sbjct: 297 AQMRRQELKEVLAKQKELGFELPELPPGYLSEHEDQGNGRRSNWKTQRRDCRFGNRADNK 356

Query: 199 RGRYKRKDQFSKEQKLADKDSSNTSSFNK-KKPTLLQKLLSADVKRDKSHLLQTFRFMAI 257
           R RY R D  SK  K+ ++   +  +  K ++PTLLQKLLS+DVKRD+  LL TF+FM +
Sbjct: 357 RSRYDRNDFQSKRPKVRNQTHCDDGAVLKSREPTLLQKLLSSDVKRDRHRLLHTFKFMVL 416

Query: 258 NSFFKDWPEKPVNFPLVMV 276
           N+FF D+P+KP+ FP V V
Sbjct: 417 NNFFSDYPDKPLEFPSVKV 435


>gi|296085358|emb|CBI29090.3| unnamed protein product [Vitis vinifera]
          Length = 467

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 101/214 (47%), Positives = 132/214 (61%), Gaps = 11/214 (5%)

Query: 11  NFIQNGQGSISKSNSKKVSGKNFIKN-RTKEFSQGFQKSHFHQMHNGKRKFGFYNDQKPK 69
           NF+++   + S  N K  S KNF+KN + K    G +KS  H M N + K G  N+ + K
Sbjct: 246 NFVKDAGVNNSNPNWKNSSRKNFMKNPKGKNSHWGSRKSQLHHMQNARGKAGISNENRGK 305

Query: 70  GHGNGKAIKFGLANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEK 129
           G  N  A      N   Q + +K++ L L YTEQE++ W EER+KNYP+K N+ KK  EK
Sbjct: 306 GLSNNMAGNLCRPNFTYQDKVEKKRPLPLNYTEQEIQNWREERKKNYPSKINLEKKSAEK 365

Query: 130 QSNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDS-KKQVRARVENT-MPLN 187
            +NS+VI+ E K RR+QLKEILAKQAELGVEVAEIP +YL DS K+QV  R EN      
Sbjct: 366 LTNSEVIEAEVKSRRQQLKEILAKQAELGVEVAEIPPHYLSDSEKQQVHGREENNKKAFG 425

Query: 188 KRGRFQNNYDKRGRYKRK--------DQFSKEQK 213
           K+ RFQN  +KR R+ RK        D F+K+Q+
Sbjct: 426 KKERFQNRGNKRRRHDRKQWQRHDQEDGFTKKQR 459


>gi|293335934|ref|NP_001167960.1| uncharacterized protein LOC100381676 [Zea mays]
 gi|223945137|gb|ACN26652.1| unknown [Zea mays]
 gi|414880008|tpg|DAA57139.1| TPA: hypothetical protein ZEAMMB73_684772 [Zea mays]
          Length = 295

 Score =  153 bits (387), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 83/187 (44%), Positives = 119/187 (63%), Gaps = 4/187 (2%)

Query: 92  KRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRRE-QLKEI 150
           +RK   + Y   EVKQW E R+KNYPT  N+NKKL+E +S+++  DK+A++RR+ +LKE+
Sbjct: 22  RRKPPPIIYDANEVKQWVEARKKNYPTSVNVNKKLSEMKSDNENKDKDAQLRRQVELKEV 81

Query: 151 LAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQN--NYDKRGRYKRKDQF 208
           +AKQ ELG +  E+P  YL +   Q      N     +   F N  N +KR RY+R    
Sbjct: 82  IAKQQELGFDPPELPPGYLSEIGDQCIENKNNRKTQFRDSHFGNRFNNNKRSRYERGGFQ 141

Query: 209 SKEQKLADKDS-SNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEK 267
           SK  K+ +    ++      ++PTLLQKLLS+D+KRD+  LL  FRFM +N+FFKDWP++
Sbjct: 142 SKRSKVWNCTPHADHGMVKSREPTLLQKLLSSDIKRDRHRLLHVFRFMTLNNFFKDWPDR 201

Query: 268 PVNFPLV 274
           P+ FP V
Sbjct: 202 PLQFPSV 208


>gi|20161477|dbj|BAB90401.1| P0432B10.22 [Oryza sativa Japonica Group]
          Length = 541

 Score =  153 bits (387), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 92/220 (41%), Positives = 124/220 (56%), Gaps = 24/220 (10%)

Query: 93  RKSLALTYTEQEVKQWCEERRKNYPTKANINK----------------------KLTEKQ 130
           RK   + Y   EVKQW E R+KNYPT  NINK                      KL+E +
Sbjct: 267 RKRPRIIYDANEVKQWLEARKKNYPTSVNINKASATLCPHQTIREASSTDHLMQKLSESR 326

Query: 131 SNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRG 190
            + +  D+EA+MRR++LKE+L KQ ELG E+ E+P  YL + + Q   R  N     +  
Sbjct: 327 PDGEKKDEEAQMRRQELKEVLEKQKELGFELPELPPGYLSEHEDQGNGRRSNWKTQRRDC 386

Query: 191 RFQNNYD-KRGRYKRKDQFSKEQKLADKDSSNTSSFNK-KKPTLLQKLLSADVKRDKSHL 248
           RF N  D KR RY R D  SK  K+ ++   +  +  K ++PTLLQKLLS+DVKRD+  L
Sbjct: 387 RFGNRADNKRSRYDRNDFQSKRPKVRNQTHCDDGAVLKSREPTLLQKLLSSDVKRDRHRL 446

Query: 249 LQTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEE 288
           L TF+FM +N+FF D+P+KP+ FP V V        + EE
Sbjct: 447 LHTFKFMVLNNFFSDYPDKPLEFPSVKVNQIELESNIAEE 486


>gi|326528619|dbj|BAJ97331.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 625

 Score =  152 bits (383), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 90/218 (41%), Positives = 128/218 (58%), Gaps = 20/218 (9%)

Query: 93  RKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILA 152
           RK   + Y + EVKQW + R+KNYPT+AN+NKKL + Q +    D+EA+MRR++LKE++A
Sbjct: 335 RKRPPIIYDKNEVKQWVQARKKNYPTRANVNKKLCQNQLDEQKKDEEAQMRRQELKEVIA 394

Query: 153 KQAELGVEVAEIPSYYLLDSKKQVR----------------ARVENTMPLNKRGRFQNNY 196
           KQ ELG+E+ E+P  YL D++ Q R                           RGR     
Sbjct: 395 KQKELGLELPELPPGYLSDNEGQPRDPQKESNWKTRQGGGRFGNRGRGRGQGRGRGSGRD 454

Query: 197 DKRGRY-KRKDQFSKEQKLADKDS---SNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTF 252
           +KR RY  R+D  SK  +  + +S      +    ++PTLLQKLL++D+KRD+  LL TF
Sbjct: 455 NKRQRYDSREDSPSKRPREWNNNSRCHDGGAVAKIREPTLLQKLLNSDIKRDRHRLLHTF 514

Query: 253 RFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKS 290
           +FMA+N+FFKDWP KP+ FP V +       ++ EE S
Sbjct: 515 KFMALNNFFKDWPAKPLEFPSVKLNQIELESDIAEEGS 552


>gi|356562267|ref|XP_003549393.1| PREDICTED: uncharacterized protein LOC100789443 [Glycine max]
          Length = 684

 Score =  148 bits (373), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 104/250 (41%), Positives = 155/250 (62%), Gaps = 21/250 (8%)

Query: 124 KKLTEKQSNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQ-VRARVEN 182
           +K +E   +S  I++E  ++RE LKE+LAKQAELGVEVAEIPSYYL +S  Q +++  +N
Sbjct: 451 QKHSECPKDSKAINREV-LQRE-LKEVLAKQAELGVEVAEIPSYYLKNSDNQGLQSEAKN 508

Query: 183 TMPLNKRGRFQNNYDKRGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVK 242
               +KR +FQN ++K+    RK +F+++QK ADKD S + S  K+KPTLLQKLLS+DVK
Sbjct: 509 KY-TDKR-KFQNKFNKKS--DRKGRFAEKQKFADKDFSESPSLKKRKPTLLQKLLSSDVK 564

Query: 243 RDKSHLLQTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKG 302
           RDKSHLLQ  RFM +NSFF+  P+K + +P V VK+ G+  EV  EK  L   +D   +G
Sbjct: 565 RDKSHLLQVLRFMVMNSFFRHIPDKALRYPSVEVKEKGS--EVSGEKKHLHTGKDVLNRG 622

Query: 303 R----RMVEHFEHDENRERMHHNIQGFEDENDDEEEDNDNGEQMKVYIAGKGNTNDEHVR 358
                + +  F +D   +         +D+ +D   DN+  +     +  + +  +   +
Sbjct: 623 SEETVQKIVIFNNDNGHD--------CDDDENDSIVDNNLHKDPSSLVKRQCDGGEGIKK 674

Query: 359 LEEEEGEIID 368
            +EEEGEI++
Sbjct: 675 SDEEEGEILE 684



 Score = 42.0 bits (97), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 27/64 (42%), Positives = 36/64 (56%), Gaps = 3/64 (4%)

Query: 20  ISKSNSKKVSGKNFIKNRTKE--FSQGFQKSHFHQMHNGKRKFGFYNDQKPKGHGNGKAI 77
           +  SN K    KN +KN+     F  GFQKS FH ++NGKR  GF  +   KG  +G+A 
Sbjct: 282 VPNSNWKGSPSKN-LKNKPNRGRFQGGFQKSKFHDVNNGKRGSGFPKEHNGKGPNSGRAG 340

Query: 78  KFGL 81
            +GL
Sbjct: 341 HYGL 344


>gi|222619440|gb|EEE55572.1| hypothetical protein OsJ_03849 [Oryza sativa Japonica Group]
          Length = 484

 Score = 85.1 bits (209), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 49/118 (41%), Positives = 65/118 (55%), Gaps = 13/118 (11%)

Query: 93  RKSLALTYTEQEVKQWCEERRKNYPTKANINK-------------KLTEKQSNSDVIDKE 139
           RK   + Y   EVKQW E R+KNYPT  NINK             KL+E + + +  D+E
Sbjct: 281 RKRPRIIYDANEVKQWLEARKKNYPTSVNINKTIREASSTDHLMQKLSESRPDGEKKDEE 340

Query: 140 AKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYD 197
           A+MRR++LKE+L KQ ELG E+ E+P  YL + + Q   R  N     +  RF N  D
Sbjct: 341 AQMRRQELKEVLEKQKELGFELPELPPGYLSEHEDQGNGRRSNWKTQRRDCRFGNRAD 398


>gi|168019512|ref|XP_001762288.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162686366|gb|EDQ72755.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 614

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 42/90 (46%), Positives = 65/90 (72%), Gaps = 6/90 (6%)

Query: 96  LALTYTEQ---EVKQWCEERRKNYPTKANINKK---LTEKQSNSDVIDKEAKMRREQLKE 149
           +AL  +E+   E++ W EER +NYPT++NI +K   L  K +  ++ID++ + R++++KE
Sbjct: 278 VALKASEEDDDEIRAWREERMRNYPTRSNIQRKKEALEGKVARGELIDEDTRKRQQRMKE 337

Query: 150 ILAKQAELGVEVAEIPSYYLLDSKKQVRAR 179
           ILAKQAELGV VAE+P++YL  S K+ R R
Sbjct: 338 ILAKQAELGVPVAEVPAHYLSQSYKEDRGR 367



 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 40/129 (31%), Positives = 58/129 (44%), Gaps = 18/129 (13%)

Query: 185 PLNKRGRFQNNYDKRGRYKR--KDQFSKEQKLADKDSSNTSSFN---------KKKPTLL 233
           P +K G+ Q  + +RGR K+  K  F+       K   N +            K+ PTLL
Sbjct: 461 PSDKPGKTQCYFYQRGRCKKGFKCHFAHTGPRGKKGGDNEAGRGGPKGVGVEKKRPPTLL 520

Query: 234 QKLLSADVKRDKSHLLQTFRFMAINSFFKDWP-------EKPVNFPLVMVKDGGAAGEVV 286
            KLL  ++ RDKS++LQ  R+   N+F  +WP       E P            + GEVV
Sbjct: 521 SKLLQGEINRDKSYILQCLRYFVNNNFLLNWPNQLSESIELPEQSDDAHESSDYSEGEVV 580

Query: 287 EEKSLLLGE 295
           E   +LL +
Sbjct: 581 EAPDVLLAD 589


>gi|219363611|ref|NP_001136910.1| uncharacterized protein LOC100217067 [Zea mays]
 gi|194697570|gb|ACF82869.1| unknown [Zea mays]
          Length = 138

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 30/47 (63%), Positives = 40/47 (85%)

Query: 228 KKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPLV 274
           ++PTLLQKLLS+D+KRD+  LL  FRFM +N+FFKDWP++P+ FP V
Sbjct: 5   REPTLLQKLLSSDIKRDRHRLLHVFRFMTLNNFFKDWPDRPLQFPSV 51


>gi|255546771|ref|XP_002514444.1| conserved hypothetical protein [Ricinus communis]
 gi|223546440|gb|EEF47940.1| conserved hypothetical protein [Ricinus communis]
          Length = 123

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 55/129 (42%), Positives = 62/129 (48%), Gaps = 39/129 (30%)

Query: 146 QLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYDKRGRYKRK 205
           +LKE LAKQAELGVEVAEIPS YL DS+KQV+                            
Sbjct: 28  RLKETLAKQAELGVEVAEIPSRYLSDSEKQVK---------------------------- 59

Query: 206 DQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVK-RDKSHLLQTFRFMAINSFF--- 261
                   L +KDS+N SSF+  KPTL QKLLSA+ K    S  LQ         FF   
Sbjct: 60  ------DTLTNKDSTNGSSFSNTKPTLRQKLLSAEGKGAFLSECLQASSTFTCEQFFSSN 113

Query: 262 KDWP-EKPV 269
           K W   KPV
Sbjct: 114 KLWNFTKPV 122


>gi|384254158|gb|EIE27632.1| hypothetical protein COCSUDRAFT_55624 [Coccomyxa subellipsoidea
           C-169]
          Length = 539

 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 22/50 (44%), Positives = 33/50 (66%)

Query: 223 SSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFP 272
           S   +++PTLLQKLL+ D++ + S LLQ  RF+  N+F  D+   P+ FP
Sbjct: 313 SPVQQRQPTLLQKLLAPDIRAEHSRLLQCLRFLVTNNFLLDFGRAPLIFP 362


>gi|328766403|gb|EGF76457.1| hypothetical protein BATDEDRAFT_28378 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 411

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 49/185 (26%), Positives = 88/185 (47%), Gaps = 25/185 (13%)

Query: 101 TEQEVKQWCEERRKNYPTKANINKKL---TEKQSNSDVID---KEAKMRREQLKEILAK- 153
           T ++++QW +ER++ +PT++NI +K+    ++Q   ++ID   K+ K R    ++++ K 
Sbjct: 217 TPKDIEQWIQERKRKFPTESNIKQKMEDEKQRQKQGNIIDVNGKKLKGRNADKRQLIRKN 276

Query: 154 ---QAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNN--YDKRGRYKRKDQF 208
              Q + G    +  + YL D+         +T+ L+  G+   +       R+    + 
Sbjct: 277 DTVQEKHGTSEQKDSTIYLDDASNAKLEASHDTLALDMTGKLNQHKGASSCCRHGSNCRH 336

Query: 209 SKE-QKLADKDSSNTSSFNKKKPT-----------LLQKLLSADVKRDKSHLLQTFRFMA 256
           S +  KLA+  S+   +     PT           LL  LL  D K+DK+ LLQ  RF+ 
Sbjct: 337 SHDLSKLAESRSA-PKTHTPLNPTAAPIKGTNRRPLLNMLLETDFKKDKNALLQCIRFII 395

Query: 257 INSFF 261
            N FF
Sbjct: 396 ENKFF 400


>gi|308800704|ref|XP_003075133.1| Polyadenylation factor I complex, subunit, Yth1 (CPSF subunit)
           (ISS) [Ostreococcus tauri]
 gi|116061687|emb|CAL52405.1| Polyadenylation factor I complex, subunit, Yth1 (CPSF subunit)
           (ISS) [Ostreococcus tauri]
          Length = 401

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 28/63 (44%), Positives = 37/63 (58%), Gaps = 2/63 (3%)

Query: 199 RGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAIN 258
           RGR K   +      + DK +S  SS N +  TLL+KLL  ++KRD+  LLQ FRF+  N
Sbjct: 261 RGRCKAGKRCPFRHDIVDKKTSGVSSGNPQ--TLLKKLLHKEIKRDEERLLQLFRFLVNN 318

Query: 259 SFF 261
            FF
Sbjct: 319 DFF 321


>gi|145343958|ref|XP_001416510.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144576735|gb|ABO94803.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 291

 Score = 46.6 bits (109), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 62/223 (27%), Positives = 100/223 (44%), Gaps = 44/223 (19%)

Query: 103 QEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILAKQAELG---- 158
           ++V+++ EER K +P+  N+  K    +       +    RRE+L+EILAKQ E+G    
Sbjct: 28  EDVRRYREERAKYWPSDRNVRAKTEAGEEEE----RARAARRERLREILAKQREMGHFEA 83

Query: 159 -VEVAE----IPSYYLLDSKKQVRARVENTMPLNKRGRF--------------------- 192
             E+ E    + +   LD K  V AR  +    +K  RF                     
Sbjct: 84  SQEIGEEEGGVAAPAALDGKG-VEARAADG---SKVCRFWLQGGCRKGSACDFKHESAPN 139

Query: 193 ---QNNYDKRGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLL 249
              +  +  RGR K   +   + ++ ++ SS        + TLL+KLL  ++KRD+  LL
Sbjct: 140 KDQKCRFFARGRCKAGARCPFKHEVTERKSSAADGGGNPQ-TLLKKLLDKEIKRDEQRLL 198

Query: 250 QTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLL 292
           Q FRF   N FF     K V+ PL M     +  +V++ ++ L
Sbjct: 199 QLFRFFVNNDFFTG--TKGVDEPLWMFPWADSGKKVIDRRAAL 239


>gi|118358218|ref|XP_001012358.1| hypothetical protein TTHERM_00107150 [Tetrahymena thermophila]
 gi|89294124|gb|EAR92112.1| hypothetical protein TTHERM_00107150 [Tetrahymena thermophila
           SB210]
          Length = 762

 Score = 45.8 bits (107), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 22/63 (34%), Positives = 41/63 (65%)

Query: 204 RKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKD 263
           ++D++S+E  L+    SN+S+F  K  TLL  L+  ++ R++++LLQ  R++  N+FF +
Sbjct: 574 KRDKYSQELDLSFNYKSNSSNFRYKANTLLTNLVLDEIYRERNYLLQAIRYIVSNNFFDE 633

Query: 264 WPE 266
             E
Sbjct: 634 QAE 636



 Score = 38.9 bits (89), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 15/25 (60%), Positives = 20/25 (80%)

Query: 104 EVKQWCEERRKNYPTKANINKKLTE 128
           E+++W E+R+KNYPT  NI KK TE
Sbjct: 289 EIRKWIEQRKKNYPTSENIKKKQTE 313


>gi|68073975|ref|XP_678902.1| hypothetical protein [Plasmodium berghei strain ANKA]
 gi|56499510|emb|CAH94055.1| conserved hypothetical protein [Plasmodium berghei]
          Length = 611

 Score = 43.5 bits (101), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 49/182 (26%), Positives = 90/182 (49%), Gaps = 18/182 (9%)

Query: 89  QEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLK 148
            EK  K+  +  +++E+++W EER+KNYPTK    KK+ + ++N++ I+ E K ++ + K
Sbjct: 426 HEKNEKNENILESKEEIQKWIEERKKNYPTK----KKIMDIKNNNN-INSETKEKKNKEK 480

Query: 149 EILAKQAELGVE-----VAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNN--YDKRGR 201
           +I +   +L VE     +     Y     +K +   + N +  N       N  Y    +
Sbjct: 481 KITSLIEKLLVEKYSSAIGRNIYYQTQSQRKSIFVPILNKLIENNHKNVYENTYYSISDK 540

Query: 202 YKRKD-QFSKEQKLADK-DSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINS 259
           Y +K+ +F+K  K  +  DS N      K   L+ +L+  D+   +  L+Q   F+  N+
Sbjct: 541 YNKKNIKFNKRVKKHNLIDSLNVH----KNAPLIYQLMKNDIYVYEQKLMQCIEFIVNNN 596

Query: 260 FF 261
           FF
Sbjct: 597 FF 598


>gi|126337727|ref|XP_001369929.1| PREDICTED: nuclear fragile X mental retardation-interacting protein
           1-like [Monodelphis domestica]
          Length = 469

 Score = 43.1 bits (100), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 51/104 (49%), Gaps = 16/104 (15%)

Query: 159 VEVAEIPSYYLLDSKKQVR-ARVENTMPLNKRGRFQNNYDKRGRYKRKDQFSKEQKLADK 217
           V ++++P     D KKQ R + V  T  L K    +N   KRG Y++K            
Sbjct: 358 VILSDVPQNQSEDIKKQERNSPVAKTKSLRKNWMRKNCRYKRGLYRKKP----------- 406

Query: 218 DSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFF 261
                S   K++P LL+ LL+ D++ +++ +LQ  R++  N FF
Sbjct: 407 ----VSEPQKRRPHLLEMLLAQDIRHERNVILQCVRYIIQNDFF 446



 Score = 38.1 bits (87), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 16/25 (64%), Positives = 22/25 (88%)

Query: 101 TEQEVKQWCEERRKNYPTKANINKK 125
           T +E+++W EERRKN+PT+ANI KK
Sbjct: 178 TPEEIEKWREERRKNFPTRANIEKK 202


>gi|303286377|ref|XP_003062478.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226455995|gb|EEH53297.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 552

 Score = 42.7 bits (99), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 17/35 (48%), Positives = 26/35 (74%)

Query: 229 KPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKD 263
           +PTLL+KL + +V+  +S LLQ FRF+  N+F +D
Sbjct: 460 QPTLLKKLFAKEVRVSRSRLLQVFRFLVNNNFLRD 494


>gi|255088323|ref|XP_002506084.1| predicted protein [Micromonas sp. RCC299]
 gi|226521355|gb|ACO67342.1| predicted protein [Micromonas sp. RCC299]
          Length = 567

 Score = 42.4 bits (98), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 17/32 (53%), Positives = 24/32 (75%)

Query: 231 TLLQKLLSADVKRDKSHLLQTFRFMAINSFFK 262
           TLL+KLL+ +V+ D+S LLQ FRF+  N F +
Sbjct: 458 TLLRKLLAREVRADRSRLLQVFRFLVNNDFLR 489


>gi|392567649|gb|EIW60824.1| hypothetical protein TRAVEDRAFT_71098 [Trametes versicolor
           FP-101664 SS1]
          Length = 590

 Score = 41.6 bits (96), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 34/121 (28%), Positives = 64/121 (52%), Gaps = 15/121 (12%)

Query: 102 EQEVKQWCEERRKNYPTKANINKKL----TEKQSNSDVIDKEAKMRREQLKEILAKQAEL 157
           EQ ++++ EE++ +   + N +  +    TE +S     D+ A  R E+  +I+AK  EL
Sbjct: 211 EQALQRFIEEQKTSALQRFNFHMAVLIWETEARSAKATGDEAA--RSERHADIIAKLVEL 268

Query: 158 GVEVAEIPSY-----YLLDSKKQVRARVENTMPLNKRGRFQNNYDKRGRYKRKDQFSKEQ 212
           G + +EIP+Y      LLD  +++  R+ NT+    R +     ++R R + +D F K+ 
Sbjct: 269 GYQSSEIPAYDHAFQRLLDQPRKLTPRIWNTI----RPKLVEMLEERRRAREQDAFKKKW 324

Query: 213 K 213
           +
Sbjct: 325 R 325


>gi|307107626|gb|EFN55868.1| hypothetical protein CHLNCDRAFT_52119 [Chlorella variabilis]
          Length = 684

 Score = 40.8 bits (94), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 22/70 (31%), Positives = 41/70 (58%), Gaps = 2/70 (2%)

Query: 104 EVKQWCEERRKNYPTKANINKKLTEKQS--NSDVIDKEAKMRREQLKEILAKQAELGVEV 161
           EV+QW +ER++++PT   + +K  E ++      +D    +R+++L  ILA+Q+ +G+  
Sbjct: 388 EVEQWIKERKRSFPTSEAVARKEAEAEARRQRGELDAARTLRQQRLAAILAQQSAMGLSR 447

Query: 162 AEIPSYYLLD 171
           A      LLD
Sbjct: 448 AAGTLDMLLD 457


>gi|82594577|ref|XP_725484.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
 gi|23480507|gb|EAA17049.1| Zinc finger, C2H2 type, putative [Plasmodium yoelii yoelii]
          Length = 601

 Score = 40.8 bits (94), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 49/187 (26%), Positives = 89/187 (47%), Gaps = 29/187 (15%)

Query: 89  QEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLK 148
            EK  K+  +  +++E+++W EER+KNYPTK    KK+ E ++N+  I+ E K ++   K
Sbjct: 419 HEKNEKNENILESKEEIQKWIEERKKNYPTK----KKIMEMKNNN--INSEEKEKKNXQK 472

Query: 149 EILAKQAELGVE-----VAEIPSYYLLDSKKQVRARVENTMPLNK-RGRFQNNYDKRGRY 202
           +I +   +L VE     +     Y     KK +   + N +  N  +  ++N Y     Y
Sbjct: 473 KITSLIEKLLVEKYSSAIGRNIYYQTQSQKKSIFVPLLNKLIENNYKNVYENTY-----Y 527

Query: 203 KRKDQFSKEQKLADK--------DSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRF 254
              D+++++    +K        DS N      K   L+ +L+  D+   +  L+Q   F
Sbjct: 528 SISDKYNQKNIKFNKKIKKNNLIDSLNVH----KNAPLIYQLMKNDIYVYEQKLMQCIEF 583

Query: 255 MAINSFF 261
           +  N+FF
Sbjct: 584 IVNNNFF 590


>gi|47218661|emb|CAG04990.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 350

 Score = 40.0 bits (92), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 15/34 (44%), Positives = 25/34 (73%)

Query: 228 KKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFF 261
           ++PTLL+ LL+ D++ +++ LLQ  RF+  N FF
Sbjct: 306 RRPTLLEMLLAPDIRHERNVLLQCVRFVVRNHFF 339



 Score = 37.7 bits (86), Expect = 9.6,   Method: Compositional matrix adjust.
 Identities = 17/25 (68%), Positives = 19/25 (76%)

Query: 101 TEQEVKQWCEERRKNYPTKANINKK 125
           T +EV +W EERRKNYPT  NI KK
Sbjct: 69  TPEEVAKWREERRKNYPTLQNIEKK 93


>gi|158297378|ref|XP_001237955.2| AGAP007875-PA [Anopheles gambiae str. PEST]
 gi|157015166|gb|EAU76388.2| AGAP007875-PA [Anopheles gambiae str. PEST]
          Length = 474

 Score = 40.0 bits (92), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 14/31 (45%), Positives = 24/31 (77%)

Query: 231 TLLQKLLSADVKRDKSHLLQTFRFMAINSFF 261
           T+L+KLL AD++ +++ LLQ  R++ +N FF
Sbjct: 416 TMLEKLLDADIRHERNVLLQCVRYVLVNKFF 446


>gi|17540726|ref|NP_499991.1| Protein F56B3.11, isoform a [Caenorhabditis elegans]
 gi|351063723|emb|CCD71947.1| Protein F56B3.11, isoform a [Caenorhabditis elegans]
          Length = 213

 Score = 39.7 bits (91), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 34/120 (28%), Positives = 57/120 (47%), Gaps = 8/120 (6%)

Query: 248 LLQTFRFMAINSFF-KDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKGRRMV 306
           L+  F ++    FF +D P+   N  L +V+D G   E+        GEE S G+ RR V
Sbjct: 93  LIGAFIYVLAGEFFAQDSPQTIFNKALALVRDDGRCQEIFGASIAGFGEETSRGR-RRHV 151

Query: 307 EHFEHDEN---RERMHHNIQGFEDEN--DDEEEDNDNGEQMK-VYIAGKGNTNDEHVRLE 360
            H +++++   R R+  +++G  DE     E E  D   Q + +Y+  K      HV ++
Sbjct: 152 AHHKYEKDGMQRIRVLFHVKGDRDEGIAQAEMEQRDGDWQWRFLYVENKRRPKTTHVLID 211


>gi|291235450|ref|XP_002737657.1| PREDICTED: nuclear fragile X mental retardation protein interacting
           protein-like [Saccoglossus kowalevskii]
          Length = 639

 Score = 39.7 bits (91), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 16/35 (45%), Positives = 26/35 (74%)

Query: 227 KKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFF 261
           KKKPTLL+ LL+ D++ +++ LLQ  R++   +FF
Sbjct: 458 KKKPTLLEMLLAGDIRHERNVLLQCVRYVKKKNFF 492


>gi|345325138|ref|XP_001514551.2| PREDICTED: nuclear fragile X mental retardation-interacting protein
           1-like [Ornithorhynchus anatinus]
          Length = 446

 Score = 39.7 bits (91), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 29/52 (55%), Gaps = 12/52 (23%)

Query: 101 TEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILA 152
           T +E+ +W EERRKN+PT ANI KK            K  ++ REQ  E+L 
Sbjct: 125 TPEEIAKWREERRKNFPTMANIEKK------------KALQLDREQRGEVLT 164


>gi|313219540|emb|CBY30463.1| unnamed protein product [Oikopleura dioica]
 gi|313226347|emb|CBY21491.1| unnamed protein product [Oikopleura dioica]
          Length = 310

 Score = 39.7 bits (91), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 20/49 (40%), Positives = 35/49 (71%), Gaps = 1/49 (2%)

Query: 101 TEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKE 149
           TE+++++W EERRKN+PTKA I +K   +++ + +  K+   +REQ +E
Sbjct: 96  TEEDIRKWREERRKNFPTKARIAEKEAARKAGTSIETKKFS-KREQERE 143


>gi|270007929|gb|EFA04377.1| hypothetical protein TcasGA2_TC014675 [Tribolium castaneum]
          Length = 327

 Score = 38.9 bits (89), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 22/63 (34%), Positives = 39/63 (61%), Gaps = 3/63 (4%)

Query: 203 KRKDQFSKEQKLADKDSSNTSS---FNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINS 259
           +R+    K+QKL +K      S   F K+K TLL+KLL ++++ +++ LLQ  + +  N+
Sbjct: 265 RRRLSSGKKQKLNNKPEPGGFSRMKFVKRKVTLLEKLLDSEIRHERNVLLQCVQHVVTNN 324

Query: 260 FFK 262
           FF+
Sbjct: 325 FFR 327


>gi|156353142|ref|XP_001622934.1| predicted protein [Nematostella vectensis]
 gi|156209570|gb|EDO30834.1| predicted protein [Nematostella vectensis]
          Length = 401

 Score = 38.9 bits (89), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 17/47 (36%), Positives = 31/47 (65%), Gaps = 2/47 (4%)

Query: 215 ADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFF 261
           A++   NT S   + PTLL+ LL+ +++ +++ +LQ  R++  NSFF
Sbjct: 355 ANRTKQNTKS--NRNPTLLEMLLAPEIRHERNAILQCLRYIVRNSFF 399


>gi|156099300|ref|XP_001615652.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148804526|gb|EDL45925.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 589

 Score = 38.9 bits (89), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 44/192 (22%), Positives = 81/192 (42%), Gaps = 28/192 (14%)

Query: 103 QEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILAKQAELGVEVA 162
           +E+++W +ER+KNYPT+  I   L   ++      K+     E+L    +  + +G  + 
Sbjct: 418 KEIEKWIQERKKNYPTREKIADSLKNNENGKKKKKKKPNCLIEELL-FESYCSAVGRNI- 475

Query: 163 EIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYDKRGRYKRKDQF-----SKEQKLADK 217
               Y+    +K +   +   M  N    F N Y+K       +       +K+Q+   K
Sbjct: 476 ----YFKSKWEKSIFIPLLTKMEQNN---FHNIYEKSYYPLFNNSMGKFFPTKQQRRPFK 528

Query: 218 DSSNTSSFN-KKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPLVMV 276
           +     S N  +KP LL +L+  ++   +  L++   F+  + FF D             
Sbjct: 529 ERQGIDSLNIHRKPPLLYQLMKKEIYLYEKRLIKCIEFITRSGFFDD------------- 575

Query: 277 KDGGAAGEVVEE 288
            +GGAAG  + E
Sbjct: 576 SEGGAAGTDILE 587


>gi|312375641|gb|EFR22971.1| hypothetical protein AND_13906 [Anopheles darlingi]
          Length = 575

 Score = 38.9 bits (89), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 14/31 (45%), Positives = 24/31 (77%)

Query: 231 TLLQKLLSADVKRDKSHLLQTFRFMAINSFF 261
           T+L+KLL AD++ +++ LLQ  R++  N+FF
Sbjct: 459 TMLEKLLDADIRHERNVLLQCVRYVVSNNFF 489


>gi|341888654|gb|EGT44589.1| hypothetical protein CAEBREN_21252 [Caenorhabditis brenneri]
          Length = 219

 Score = 38.9 bits (89), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 33/120 (27%), Positives = 57/120 (47%), Gaps = 8/120 (6%)

Query: 248 LLQTFRFMAINSFF-KDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKGRRMV 306
           L+  F ++    FF +D P+   N  L +V+D G   ++        GEE S G+ RR V
Sbjct: 99  LIGAFIYVLCGEFFAQDSPQTIFNKALAIVRDDGRCQDIFGATIAGFGEETSRGR-RRHV 157

Query: 307 EHFEHDEN---RERMHHNIQGFEDEN--DDEEEDNDNGEQMK-VYIAGKGNTNDEHVRLE 360
            H +++++   R R+  +++G  DE     E E  D   Q + +Y+  K      HV ++
Sbjct: 158 AHHKYEKDGMQRIRVLFHVKGDRDEGIAQAEMEQRDGDWQWRFLYVENKRRPKTTHVLID 217


>gi|256053309|ref|XP_002570140.1| hypothetical protein [Schistosoma mansoni]
          Length = 356

 Score = 38.5 bits (88), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 13/34 (38%), Positives = 26/34 (76%)

Query: 229 KPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFK 262
           +PTLLQ LL+ D++++++ L+Q  R++   +FF+
Sbjct: 323 RPTLLQMLLAEDIRKERNQLMQCIRYIVKENFFQ 356


>gi|350646629|emb|CCD58749.1| hypothetical protein Smp_099560.1 [Schistosoma mansoni]
          Length = 336

 Score = 38.5 bits (88), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 13/34 (38%), Positives = 26/34 (76%)

Query: 229 KPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFK 262
           +PTLLQ LL+ D++++++ L+Q  R++   +FF+
Sbjct: 303 RPTLLQMLLAEDIRKERNQLMQCIRYIVKENFFQ 336


>gi|256053307|ref|XP_002570139.1| hypothetical protein [Schistosoma mansoni]
          Length = 371

 Score = 38.5 bits (88), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 13/34 (38%), Positives = 26/34 (76%)

Query: 229 KPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFK 262
           +PTLLQ LL+ D++++++ L+Q  R++   +FF+
Sbjct: 338 RPTLLQMLLAEDIRKERNQLMQCIRYIVKENFFQ 371


>gi|7305335|ref|NP_038773.1| nuclear fragile X mental retardation-interacting protein 1 [Mus
           musculus]
 gi|81917719|sp|Q9QXX8.1|NUFP1_MOUSE RecName: Full=Nuclear fragile X mental retardation-interacting
           protein 1; AltName: Full=Nuclear FMRP-interacting
           protein 1
 gi|6525073|gb|AAF15316.1| nuclear FMRP interacting protein 1 [Mus musculus]
 gi|26328149|dbj|BAC27815.1| unnamed protein product [Mus musculus]
 gi|33525186|gb|AAH56192.1| Nuclear fragile X mental retardation protein interacting protein 1
           [Mus musculus]
 gi|74220737|dbj|BAE33625.1| unnamed protein product [Mus musculus]
 gi|148703863|gb|EDL35810.1| nuclear fragile X mental retardation protein interacting protein 1
           [Mus musculus]
          Length = 484

 Score = 38.5 bits (88), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 24/51 (47%), Positives = 31/51 (60%), Gaps = 5/51 (9%)

Query: 75  KAIKFGLANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKK 125
           K ++F   N M+ P  KK K      T +E+ +W EERRKNYPT ANI +K
Sbjct: 202 KIVQFHWRN-MHAPGMKKIK----LDTPEEIARWREERRKNYPTLANIERK 247


>gi|350646628|emb|CCD58748.1| hypothetical protein Smp_099560.2 [Schistosoma mansoni]
          Length = 351

 Score = 38.5 bits (88), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 13/34 (38%), Positives = 26/34 (76%)

Query: 229 KPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFK 262
           +PTLLQ LL+ D++++++ L+Q  R++   +FF+
Sbjct: 318 RPTLLQMLLAEDIRKERNQLMQCIRYIVKENFFQ 351


>gi|327283872|ref|XP_003226664.1| PREDICTED: nuclear fragile X mental retardation-interacting protein
           1-like [Anolis carolinensis]
          Length = 451

 Score = 38.5 bits (88), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 22/50 (44%), Positives = 29/50 (58%), Gaps = 8/50 (16%)

Query: 101 TEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEI 150
           T +E+ +W EERRKN+PT ANI KK          + KE + R E LK +
Sbjct: 92  TPEEIAKWREERRKNFPTLANIEKK--------KALQKEKEQRGEVLKTL 133


>gi|67474753|ref|XP_653120.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
 gi|56470045|gb|EAL47734.1| hypothetical protein EHI_019110 [Entamoeba histolytica HM-1:IMSS]
 gi|449703311|gb|EMD43787.1| Hypothetical protein EHI5A_022200 [Entamoeba histolytica KU27]
          Length = 129

 Score = 38.5 bits (88), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 17/39 (43%), Positives = 29/39 (74%)

Query: 100 YTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDK 138
            TE+++++W EER+K +PTKA   K L+E++  +D I+K
Sbjct: 1   MTEEQLQKWIEERKKRFPTKAKREKCLSEEKKKTDEIEK 39


>gi|395527571|ref|XP_003765917.1| PREDICTED: nuclear fragile X mental retardation-interacting protein
           1 [Sarcophilus harrisii]
          Length = 480

 Score = 38.1 bits (87), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 16/25 (64%), Positives = 22/25 (88%)

Query: 101 TEQEVKQWCEERRKNYPTKANINKK 125
           T +E+++W EERRKN+PT+ANI KK
Sbjct: 187 TPEEIEKWREERRKNFPTRANIEKK 211


>gi|407040593|gb|EKE40213.1| hypothetical protein ENU1_097960 [Entamoeba nuttalli P19]
          Length = 129

 Score = 38.1 bits (87), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 17/39 (43%), Positives = 29/39 (74%)

Query: 100 YTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDK 138
            TE+++++W EER+K +PTKA   K L+E++  +D I+K
Sbjct: 1   MTEEQLQKWIEERKKRFPTKAKREKCLSEEKKKTDEIEK 39


>gi|157108497|ref|XP_001650253.1| hypothetical protein AaeL_AAEL000699 [Aedes aegypti]
 gi|108884013|gb|EAT48238.1| AAEL000699-PA [Aedes aegypti]
          Length = 492

 Score = 38.1 bits (87), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 15/31 (48%), Positives = 23/31 (74%)

Query: 231 TLLQKLLSADVKRDKSHLLQTFRFMAINSFF 261
           TLL+KLL  +++ +++ LLQ  RF+  NSFF
Sbjct: 444 TLLEKLLEPEIRHERNVLLQCVRFVVQNSFF 474


>gi|448516074|ref|XP_003867484.1| hypothetical protein CORT_0B03380 [Candida orthopsilosis Co 90-125]
 gi|380351823|emb|CCG22046.1| hypothetical protein CORT_0B03380 [Candida orthopsilosis]
          Length = 243

 Score = 37.7 bits (86), Expect = 8.4,   Method: Compositional matrix adjust.
 Identities = 13/25 (52%), Positives = 22/25 (88%)

Query: 101 TEQEVKQWCEERRKNYPTKANINKK 125
           TE+++ +W EER+KN+PT+AN+ +K
Sbjct: 69  TEEDIAKWIEERKKNWPTRANVERK 93


>gi|12843948|dbj|BAB26177.1| unnamed protein product [Mus musculus]
          Length = 249

 Score = 37.7 bits (86), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 16/25 (64%), Positives = 20/25 (80%)

Query: 101 TEQEVKQWCEERRKNYPTKANINKK 125
           T +E+ +W EERRKNYPT ANI +K
Sbjct: 223 TPEEIARWREERRKNYPTLANIERK 247


>gi|354543450|emb|CCE40169.1| hypothetical protein CPAR2_102070 [Candida parapsilosis]
          Length = 253

 Score = 37.7 bits (86), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 13/25 (52%), Positives = 22/25 (88%)

Query: 101 TEQEVKQWCEERRKNYPTKANINKK 125
           TE+++ +W EER+KN+PT+AN+ +K
Sbjct: 77  TEEDIAKWIEERKKNWPTRANVERK 101


>gi|167387090|ref|XP_001738022.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165898944|gb|EDR25671.1| hypothetical protein EDI_290620 [Entamoeba dispar SAW760]
          Length = 137

 Score = 37.7 bits (86), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 17/40 (42%), Positives = 29/40 (72%)

Query: 99  TYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDK 138
           + TE+++++W EER+K +PTKA   K L E++  +D I+K
Sbjct: 8   SMTEEQLQKWIEERKKRFPTKAKREKCLFEEKKKTDEIEK 47


>gi|348545180|ref|XP_003460058.1| PREDICTED: nuclear fragile X mental retardation-interacting protein
           1-like [Oreochromis niloticus]
          Length = 544

 Score = 37.7 bits (86), Expect = 9.3,   Method: Compositional matrix adjust.
 Identities = 16/25 (64%), Positives = 19/25 (76%)

Query: 101 TEQEVKQWCEERRKNYPTKANINKK 125
           T +E+ +W EERRKNYPT  NI KK
Sbjct: 152 TPEEITKWREERRKNYPTLVNIEKK 176


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.311    0.130    0.363 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,072,966,266
Number of Sequences: 23463169
Number of extensions: 278780249
Number of successful extensions: 1279746
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 202
Number of HSP's successfully gapped in prelim test: 1964
Number of HSP's that attempted gapping in prelim test: 1252546
Number of HSP's gapped (non-prelim): 18829
length of query: 368
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 224
effective length of database: 8,980,499,031
effective search space: 2011631782944
effective search space used: 2011631782944
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 77 (34.3 bits)