BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 008521
         (563 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|147775405|emb|CAN73830.1| hypothetical protein VITISV_043067 [Vitis vinifera]
          Length = 605

 Score =  298 bits (763), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 224/550 (40%), Positives = 282/550 (51%), Gaps = 103/550 (18%)

Query: 35  SVPMQPQTGIMNPQVPFPLSNSNMPATAPLMNQNNFMNASNPLLIAQSNHLGLPQLGHMF 94
           ++P   Q G++ PQ   P    NMP      NQ  FM    PL   Q+NHLG+P LG   
Sbjct: 40  ALPNPLQMGMLXPQTSIPFI-PNMP------NQFGFM----PL---QNNHLGMPHLGSAQ 85

Query: 95  PVHGNFG--NMPLNVSQ-----------GQLLGPNLSNLPQQFNQNMGFSNGQMCLQSVN 141
              G+F   N   N+ Q             L G N + LP          N Q CLQ+ N
Sbjct: 86  SYQGSFNPQNCVPNLGQSAXRSNPSQPPANLFGHNSAILP---------GNLQFCLQNXN 136

Query: 142 QLLSMQMSNPCLFG------------------NPQFGLVNQNQQNFVQPIKDVS------ 177
           Q   M M N    G                   PQ      NQQ     +  V       
Sbjct: 137 QNFPMPMPNLNQVGPYNIPQASHHMFQLAQALGPQSSAFIPNQQFVPMCVSGVGENVHQD 196

Query: 178 ---------GNNTLK-----------------SCSSEPRQ-GQNLQPSAFMRSQRNFIQN 210
                    G+N LK                 + S +P+Q   N QPS FM+SQ NF+++
Sbjct: 197 NHKLGPPAMGSNALKQPPIANPQLQGNWPTMATSSVQPQQTWNNQQPSNFMKSQGNFVKD 256

Query: 211 GQGSISKSNSKKVSGKNFIKN-RTKEFSQGFQKSHFHQMHNGKRKFGFYNDQKPKGHGNG 269
              + S  N K  S KNF+ N + K    G +KS  H M NG+ K G  N+ + KG  N 
Sbjct: 257 AGVNNSNPNWKNSSRKNFMXNPKGKNSHWGSRKSQLHHMQNGRGKAGISNENRGKGLSNN 316

Query: 270 KAIKFGLANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSD 329
            A      N   Q + +K++ L L YTEQE++ W EER+KNYP+K N+ KK  EK +NS+
Sbjct: 317 MAGNLCRPNFTYQDKVEKKRPLPLNYTEQEIQNWREERKKNYPSKINLEKKSAEKLTNSE 376

Query: 330 VIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDS-KKQVRARVENT-MPLNKRGRF 387
           VI+ E K RR+QLKEILAKQAELGVEVAEIP +YL DS K+QV  R EN      K+ RF
Sbjct: 377 VIEAEVKSRRQQLKEILAKQAELGVEVAEIPPHYLSDSEKQQVHGREENNKKAFGKKERF 436

Query: 388 QNNYDKRGRYKRK--------DQFSKEQKLADKDSSNTSS-----FNKKKPTLLQKLLSA 434
           QN  +KR R+ RK        D F+K+Q+LA  DS +T++      NKKK TLLQKLLS 
Sbjct: 437 QNRGNKRRRHDRKQWQRHDQEDGFTKKQRLAGTDSGDTNASNQPPLNKKKQTLLQKLLST 496

Query: 435 DVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSE 494
           D+KRDK HLLQ FRFMA+NSFFKDWPEKP+ FPLV VK+ G  GEVV+ KS    +   +
Sbjct: 497 DIKRDKRHLLQVFRFMAMNSFFKDWPEKPLKFPLVAVKETGCQGEVVDRKSSPTSKGVPQ 556

Query: 495 GKGRRMVEHF 504
           G  +   E F
Sbjct: 557 GGRKTXAEEF 566


>gi|449446325|ref|XP_004140922.1| PREDICTED: uncharacterized protein LOC101213190 [Cucumis sativus]
          Length = 599

 Score =  296 bits (758), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 233/587 (39%), Positives = 313/587 (53%), Gaps = 113/587 (19%)

Query: 58  MPATAPLMNQNNFM-NASNPLLIAQSNHLGLPQL--GHM--------FPVHGNFGNMPLN 106
           +P   P M Q   M N  NPL+   +N LG      GHM        FP  G F  MP N
Sbjct: 45  VPTMPPPMFQPGLMMNLQNPLMGLPNNSLGASPFAPGHMGFANSAANFPAQGQFNLMP-N 103

Query: 107 VSQ---------GQLLGPNLSNLPQQFNQNMGFSNGQMCL--QSVNQ-LLSMQMSN---- 150
           V+Q          Q  G N+ NL QQ  QNMG +NGQ CL  Q++NQ ++  QM N    
Sbjct: 104 VNQMNMNSCLPLAQFFGQNMPNLVQQLGQNMGLNNGQFCLPFQNMNQHVIPGQMMNMPQV 163

Query: 151 --------------PCLFGNPQFGL------------VNQNQQNFV-------------- 170
                         P  F NP F              VNQN QNF+              
Sbjct: 164 PSYTSYGGPNQQAVPMPFQNPGFSTAQSFGVNQGMQPVNQNPQNFIPQAMGGAGSNQFPA 223

Query: 171 --QPIKDVSGNNTLK-SCSSEPRQGQNLQPSAFMRSQRNFIQNGQGSISKSNSKKVSGKN 227
             QP++   GN+T+  + S++P+Q +NLQ  AF  +Q N   +  G+ S S S   + +N
Sbjct: 224 SAQPLQ---GNSTMPINSSTQPQQARNLQSPAFAGTQGNSSISDGGNGSNSISNNSAHRN 280

Query: 228 FIKNRTKEFSQGFQKSHFHQMHNGKRKFGFYNDQKPKGHGNGKAIKFGLANQMNQPQEKK 287
           F++N  K    GFQK+  H + N K+KFGF   QK KG  N +  KF   N  +Q +E+K
Sbjct: 281 FMRNSKK----GFQKNQTHHLKNEKKKFGFPGGQKEKGFHNERRNKFCGTNPTDQVKEQK 336

Query: 288 RKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILA 347
           R SL+L YT+QE++QW E RRKNYP+  NI KKLT KQ+N  ++DKEAK+ R++LKEILA
Sbjct: 337 R-SLSLVYTDQEIRQWREARRKNYPSSTNIQKKLTGKQTNCTLVDKEAKLLRQELKEILA 395

Query: 348 KQAELGVEVAEIPSYYLL-----DSKKQVRARV----ENTMPLNKRGRFQNNYDKRGRYK 398
           KQAELGVEVAEIP  YL      D++KQ   R     E      ++   QN  +KRGR K
Sbjct: 396 KQAELGVEVAEIPPEYLSYSEKHDNRKQRGGRSTLGEEAEEASIEKENSQNRLNKRGRCK 455

Query: 399 RKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKD 458
           +K++  K+ K  +K  SN     K++PTLLQKLL ADV++DKS LLQ  RF  +NSFFK+
Sbjct: 456 KKNRPRKKGKF-EKHLSNKPPLKKREPTLLQKLLKADVRKDKSQLLQALRFTVMNSFFKE 514

Query: 459 WPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEED-SEGKGRRMVEH-FEHDENRERMHHN 516
           WP KP+ FP V VK+      VV+E SL  G  +  E     +VE+   HD         
Sbjct: 515 WPNKPLKFPSVTVKENEGETNVVDETSLSTGNFNLQETNNNSLVENDGSHD--------- 565

Query: 517 IQGFEDENDDEEEDNDNGEQMKVYIAGKGNTNDEHVRLEEEEGEIID 563
               + +N+++ +D++  E++K    G G      +++ EEEGEIID
Sbjct: 566 ---IDSDNENDIKDSNKDEKLK----GDG------IQVLEEEGEIID 599


>gi|449494110|ref|XP_004159451.1| PREDICTED: uncharacterized LOC101213190 [Cucumis sativus]
          Length = 552

 Score =  295 bits (756), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 230/581 (39%), Positives = 310/581 (53%), Gaps = 112/581 (19%)

Query: 63  PLMNQNNFMNASNPLLIAQSNHLGLPQL--GHM--------FPVHGNFGNMPLNVSQ--- 109
           P+      MN  NPL+   +N LG      GHM        FP  G F  MP NV+Q   
Sbjct: 4   PMFQPGLMMNMQNPLMGLPNNSLGASPFAPGHMGFANSAANFPAQGQFNLMP-NVNQMNM 62

Query: 110 ------GQLLGPNLSNLPQQFNQNMGFSNGQMCL--QSVNQ-LLSMQMSN---------- 150
                  Q  G N+ NL QQ  QNMG +NGQ CL  Q++NQ ++  QM N          
Sbjct: 63  NSCLPLAQFFGQNMPNLVQQLGQNMGLNNGQFCLPFQNMNQHVIPGQMMNMPQVPSYTSY 122

Query: 151 --------PCLFGNPQFGL------------VNQNQQNFV----------------QPIK 174
                   P  F NP F              VNQN QNF+                QP++
Sbjct: 123 GGPNQQAVPMPFQNPGFSTAQSFGVNQGMQPVNQNPQNFIPQAMGGAGSNQFPASAQPLQ 182

Query: 175 DVSGNNTLK-SCSSEPRQGQNLQPSAFMRSQRNFIQNGQGSISKSNSKKVSGKNFIKNRT 233
              GN+T+  + S++P+Q +NLQ  AF  +Q N   +  G+ S S S   + +NF++N  
Sbjct: 183 ---GNSTMPINSSTQPQQARNLQSPAFAGTQGNSSISDGGNGSNSISNNSAHRNFMRNSK 239

Query: 234 KEFSQGFQKSHFHQMHNGKRKFGFYNDQKPKGHGNGKAIKFGLANQMNQPQEKKRKSLAL 293
           K    GFQK+  H + N K+KFGF   QK KG  N +  KF   N  +Q +E+KR SL+L
Sbjct: 240 K----GFQKNQTHHLKNEKKKFGFPGGQKEKGFHNERRNKFCGTNPTDQVKEQKR-SLSL 294

Query: 294 TYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILAKQAELG 353
            YT+QE++QW E RRKNYP+  NI KKLT KQ+N  ++DKEAK+ R++LKEILAKQAELG
Sbjct: 295 VYTDQEIRQWREARRKNYPSSTNIQKKLTGKQTNCTLVDKEAKLLRQELKEILAKQAELG 354

Query: 354 VEVAEIPSYYLL-----DSKKQVRARV----ENTMPLNKRGRFQNNYDKRGRYKRKDQFS 404
           VEVAEIP  YL      D++KQ   R     E      ++   QN  +KRGR K+K++  
Sbjct: 355 VEVAEIPPEYLSYSEKHDNRKQRGGRSTLGEEAEEASIEKENSQNRLNKRGRCKKKNRPR 414

Query: 405 KEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPV 464
           K+ K  +K  SN     K++PTLLQKLL ADV++DKS LLQ  RF  +NSFFK+WP KP+
Sbjct: 415 KKGKF-EKHLSNKPPLKKREPTLLQKLLKADVRKDKSQLLQALRFTVMNSFFKEWPNKPL 473

Query: 465 NFPLVMVKDGGAAGEVVEEKSLLLGEED-SEGKGRRMVEH-FEHDENRERMHHNIQGFED 522
            FP V VK+      VV+E SL  G  +  E     +VE+   HD             + 
Sbjct: 474 KFPSVTVKENEGETNVVDETSLSTGNFNLQETNNNSLVENDGSHD------------IDS 521

Query: 523 ENDDEEEDNDNGEQMKVYIAGKGNTNDEHVRLEEEEGEIID 563
           +N+++ +D++  E++K    G G      +++ EEEGEIID
Sbjct: 522 DNENDIKDSNKDEKLK----GDG------IQVLEEEGEIID 552


>gi|224100559|ref|XP_002311923.1| predicted protein [Populus trichocarpa]
 gi|222851743|gb|EEE89290.1| predicted protein [Populus trichocarpa]
          Length = 659

 Score =  291 bits (745), Expect = 6e-76,   Method: Compositional matrix adjust.
 Identities = 184/411 (44%), Positives = 233/411 (56%), Gaps = 67/411 (16%)

Query: 186 SSEPRQGQNLQPSAFMRSQRNFIQNGQGSISKSNSKKVSGKNFIKNRTKEFSQ-GFQKSH 244
           SS  +Q  N QPS + R Q N  +NGQ S  KS  +K SGKN   NR +  SQ G QKS 
Sbjct: 283 SSASQQSWNSQPSTYNRWQGNTARNGQSSTPKSKWEKPSGKNLKNNRNRGRSQSGHQKSD 342

Query: 245 FHQMHNGKRKFGFYNDQKPKGHGNGKAIKFGLANQMNQPQEKKRK-SLALTYTEQEVKQW 303
           F+ M NGKRK  F N+   KG+GN +  KFG  +  +Q  E+K K S    YTEQE+KQW
Sbjct: 343 FNCMDNGKRKLEFSNEHGRKGNGNERVAKFGRTDLTDQATEEKSKPSRTFFYTEQEIKQW 402

Query: 304 CEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYY 363
            E RRK+YPTK +I KK  E      VID+EA  R +QLKEILAKQAELGVEVAEIP  Y
Sbjct: 403 RESRRKHYPTKTSIEKKQME------VIDREANFRHKQLKEILAKQAELGVEVAEIPPDY 456

Query: 364 LLDS-------------------------------KKQVRARVENTMPLNKRGRFQNNYD 392
           LLDS                               +KQ   R +N   L K+G+F N +D
Sbjct: 457 LLDSEKLGVEVAEIPPPQVLSSEKLGVEVAEIPPPQKQEHGREDNRRSLTKKGKFWNKHD 516

Query: 393 KRGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAI 452
           KRGR+ +K + +K+   A+++         +KPTLL+KLLS D+KRDK  LLQ FRFM  
Sbjct: 517 KRGRFNKKGRSAKQVGSANEE---------RKPTLLEKLLSTDIKRDKRQLLQVFRFMVA 567

Query: 453 NSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKGRRMVEHFEHDENRER 512
           NSFFKDWPEKP+ FP V+VK+ G   E+VE+KS L+GEE SE +   + E+F        
Sbjct: 568 NSFFKDWPEKPLKFPSVVVKEDGYEDEIVEKKSSLVGEEVSEDRNNTIAENF-------- 619

Query: 513 MHHNIQGFEDENDDEEEDNDNGEQMKVYIAGKGNTNDEHVRLEEEEGEIID 563
                     + DD  E +   E    ++ GK +  DE  R+  EEGEI+D
Sbjct: 620 ---------GDRDDNIEHDAQVELGNCFVRGKCDIVDEVDRV--EEGEIVD 659



 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 68/216 (31%), Positives = 92/216 (42%), Gaps = 41/216 (18%)

Query: 1   MQPLFNAFANQTQSNSSTSTLPQMQAFRPNM-LQNSVPMQPQTGIMNPQVPFPLSNSNMP 59
           M+P FN+   +       +     Q F  N  + NS+PMQPQ G++NPQ+P   +NSN  
Sbjct: 1   MKPFFNSRRPKHHKQKGNTAPLNQQVFSTNQSVANSMPMQPQLGLVNPQIPITFNNSNT- 59

Query: 60  ATAPLMNQNNFMNASNPLLIAQSNHLGLPQLGHMFPVHGNFGNM--------PLNVSQGQ 111
               L++    M    PL+  Q   L  P    +  +      +         +N  QG+
Sbjct: 60  ----LLSNGQAMANMPPLIAQQPGILSGPNDLAILQLQNQVNKLNALKMLMDQVNQLQGE 115

Query: 112 LLGPNLSNLPQQFNQNMGF--------SNGQMCLQ--------------SVNQLLSMQMS 149
           L GP+ SNLPQQ NQNMG          N  M LQ              S +Q++  Q  
Sbjct: 116 LFGPSFSNLPQQLNQNMGLLQNPMQNMMNPVMPLQMPINSQVGSFNVPSSNHQVVGAQSQ 175

Query: 150 NPCLFGNPQFGL---VNQNQQNFVQPIKDVSGNNTL 182
           N   F NP FG    VN NQ N+V P     G+  L
Sbjct: 176 N--FFVNPPFGAEQRVNPNQPNYVMPTTSAYGSKLL 209


>gi|224113339|ref|XP_002316461.1| predicted protein [Populus trichocarpa]
 gi|222865501|gb|EEF02632.1| predicted protein [Populus trichocarpa]
          Length = 674

 Score =  286 bits (732), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 261/718 (36%), Positives = 330/718 (45%), Gaps = 199/718 (27%)

Query: 1   MQPLFNAFANQTQSNSSTSTLPQMQAFRPN-MLQNSVPMQPQTGIMNPQVPF-------P 52
           M+P FN+   +       +     Q F  N  L NS  MQPQ G++NPQ+P        P
Sbjct: 1   MEPFFNSRRPRHHKQKGNTAPYNQQGFSANPSLTNS--MQPQLGLVNPQIPIPFNNSNTP 58

Query: 53  LSNS----NMP----------------ATAPLMNQNNFMNASNPLLIAQSNHLGLPQLGH 92
           L N     NMP                A   L NQ N +NA   +L+ Q N L     G 
Sbjct: 59  LRNGQAMPNMPPLINQQHGLVSGPNDLAILQLQNQVNKLNALK-MLMNQVNQL----QGE 113

Query: 93  MFPVHGNFGNMP--LNVSQGQLLGP--NLSN-----------------LPQQFNQNMG-- 129
           +F     F N+P  +N + G L  P  N+ N                 +P + +Q +G  
Sbjct: 114 LF--GPGFSNLPQQINQNMGLLQNPMQNMMNPVMPMQMPMTSQVGSFNVPSRSHQVVGAQ 171

Query: 130 ----FSNGQMCLQSVNQLLSMQMSNPCLFGNPQFGLVNQNQQ----------NFVQPIKD 175
               F N Q+     N ++    +N    G+ Q    NQ  Q          NFV P   
Sbjct: 172 SPNFFVNQQVKQNQPNFVMPTTGAN----GSKQLSFENQQMQGNLSATQKNQNFVMPAVG 227

Query: 176 VSGNNTLKSCSSE----------------------------------------PRQGQNL 195
            +G+N+L   + +                                         +Q QNL
Sbjct: 228 TNGSNSLHVATQQVQGISPASQQSEKFVMPTMAANGPKPLPAATQQVQGNPFTSQQSQNL 287

Query: 196 QPSAFMRSQRNFIQNGQGSISKSNSKKVSGKNFIKNRTKEFSQ-GFQKSHFHQMHNGKRK 254
           QPSA+ R Q N  +NGQGS   S     SGKNF  N  +E SQ G QKS FH+M NGKRK
Sbjct: 288 QPSAYNRWQGNPAKNGQGSTPNSKQGIFSGKNFKNNPKREQSQSGHQKSEFHRMDNGKRK 347

Query: 255 FGFYNDQKPKGHGNGKAIKFGLANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTK 314
            GF N    KG GN +A KFG ++  NQ  E+KR  +   YTEQE+KQW E RRK++PTK
Sbjct: 348 LGFSNKHGGKGKGNERAAKFGRSDPSNQVMEQKRTHI---YTEQEIKQWRESRRKHFPTK 404

Query: 315 ANINKKLTEKQSNSDVIDKEAKMRRE-------------QLKEILAKQAELGVEVAEIPS 361
            NI KK TEK  +S VIDKEA  RR+             QLKEILAKQAELGVEVAEIP 
Sbjct: 405 TNIEKKQTEKLIDSGVIDKEANFRRKVLNALPRKLDDKLQLKEILAKQAELGVEVAEIPP 464

Query: 362 YY------------------------------------LLDSKKQVRARVENTMPLNKRG 385
            Y                                    LLDS+KQ   R +N   L K+G
Sbjct: 465 EYMLDSEKLGVEVAETPLSYLLDSEKLGVEAAEIPPHHLLDSEKQEHGREDNRRSLTKKG 524

Query: 386 RFQNNYDKRGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQ 445
           RF N +D+RGRYKRK +   +  L +++         +KPTLL+KLL AD+KRDK  LLQ
Sbjct: 525 RFWNKHDRRGRYKRKGRSDMQLGLENEE---------RKPTLLEKLLCADIKRDKHRLLQ 575

Query: 446 TFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKGRRMVEHFE 505
            FRFM  NSFFKDW +KP+ FP V+VK+ G   E  EEK  L+GEE+SE      VE F 
Sbjct: 576 VFRFMVANSFFKDWSDKPLKFPSVVVKEDGCKDEPQEEKPSLVGEEESEVPNNTTVEDF- 634

Query: 506 HDENRERMHHNIQGFEDENDDEEEDNDNGEQMKVYIAGKGNTNDEHVRLEEEEGEIID 563
                            + DD +E +   E     + GK +  DE  R+  EEGEIID
Sbjct: 635 ----------------GDRDDGDEHDAQVEPGNGSVMGKCDIVDEVYRV--EEGEIID 674


>gi|356553962|ref|XP_003545319.1| PREDICTED: uncharacterized protein LOC100786384 [Glycine max]
          Length = 822

 Score =  256 bits (653), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 219/607 (36%), Positives = 301/607 (49%), Gaps = 122/607 (20%)

Query: 59  PATAPLMNQNNFMNASN----PLLIAQSNHLGLPQLGH---------------------- 92
           P   P MN   FMNA+N    PL +    H+GLP  GH                      
Sbjct: 236 PFMQPPMNAAPFMNAANHNHFPLHMP---HMGLP--GHQQGQPHVGGLGPQNSVGNANYN 290

Query: 93  --MFP-----VHGNFGNMPLNVSQGQLLGPNLSNLPQQFNQNMGFSNGQMC----LQSVN 141
             MFP     V  N     L+  QGQ L  ++ N+ QQ N NM   NGQ C    +Q++N
Sbjct: 291 NPMFPPVQGQVMQNQAQFNLSPLQGQFLAQSILNMLQQPNMNMSMPNGQFCGPYLMQNMN 350

Query: 142 QLLSMQMSNP------------C-LFG------------------NPQFGLVNQNQQNFV 170
             L MQM NP            C +FG                  NPQ G V  NQ   V
Sbjct: 351 PQLPMQMPNPSQGVPYGMHPGSCPMFGFPNQVPQAMVPQNSMFSTNPQLGFVPGNQ---V 407

Query: 171 QPIKD----------VSGN---------------NTLKSCSSEPRQGQNLQPSAFMR--- 202
           +P  D          VS N               NT  S +       N QP AFM+   
Sbjct: 408 RPQIDPNEKNLNPPNVSANSFVSSSPFSSQQLQGNTSGSLNPNLAHTNNSQPPAFMKQVL 467

Query: 203 -SQRNFIQNGQGSISKSNSKKVSGKNFIKNRTKE--FSQGFQKSHFHQMHNGKRKFGFYN 259
            ++     N + ++  SN K    KNF KN+     F  GFQKS FH ++NGK+  GF  
Sbjct: 468 KTEEKPNSNIKTNVPNSNWKGSPSKNF-KNKPNRGGFQAGFQKSKFHDVNNGKKGSGFPI 526

Query: 260 DQKPKGHGNGKAIKFGLANQMNQPQEKKR---KSLALTYTEQEVKQWCEERRKNYPTKAN 316
           +   KG  +G+   +GL     +P+E K+   +SL++TYT QE++QW E R+KN+P   N
Sbjct: 527 EHNGKGPNSGRGGHYGL-----KPKEHKQQPERSLSVTYTVQEIQQWREARKKNHPFNNN 581

Query: 317 INKKLTEKQSNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVE 376
           I KK +E   +   I++E   R  +LKE+LAKQAELGVEVAEIPSYYL +S  Q   + E
Sbjct: 582 IQKKHSEHPKDRKAINREVLQR--ELKEVLAKQAELGVEVAEIPSYYLKNSDNQA-LQSE 638

Query: 377 NTMPLNKRGRFQNNYDKRGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADV 436
                  + +FQN ++K+    RK +F+K QK  DKD S + S  K+KPTLLQKLLS+DV
Sbjct: 639 GKNKFTDKRKFQNKFNKKS--DRKGRFAKRQKFDDKDFSESPSLKKRKPTLLQKLLSSDV 696

Query: 437 KRDKSHLLQTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGK 496
           KRDKSHL+Q FRFM +NSFFK   +KP+ +PLV+VK+ G+  +  EEK L  G++  +G 
Sbjct: 697 KRDKSHLIQVFRFMVMNSFFKHCLDKPLRYPLVVVKEKGSEVD-GEEKYLHTGKDVLKGG 755

Query: 497 GRRMVEHFEHDENRERMHHNIQGFEDENDDEEEDNDNGEQMKVYIAGKGNTNDEHVRLEE 556
               V+      N        +  +D+ +D   DN+  +     +  + ++ +   + +E
Sbjct: 756 NEETVQKIVTFNNDNSHDCEDEDSDDDENDSIVDNNLHKDPSSLVKRQCDSGEGIGKFDE 815

Query: 557 EEGEIID 563
           EEGEII+
Sbjct: 816 EEGEIIE 822


>gi|255554855|ref|XP_002518465.1| conserved hypothetical protein [Ricinus communis]
 gi|223542310|gb|EEF43852.1| conserved hypothetical protein [Ricinus communis]
          Length = 273

 Score =  249 bits (635), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 128/218 (58%), Positives = 166/218 (76%), Gaps = 4/218 (1%)

Query: 289 KSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILAK 348
           +S AL YTEQE+K W EERRKN+P+KANI K  +E+ +NS  IDKEAK+RRE+LKEILAK
Sbjct: 9   RSPALIYTEQEIKLWREERRKNFPSKANIEKMCSERVTNSGGIDKEAKLRRERLKEILAK 68

Query: 349 QAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYDKRGRYKRKDQFSKEQK 408
           QAELGVEVAEIPS+YL DS+KQV+ + ++     K+GR ++ +D+RGRY +KD  +++  
Sbjct: 69  QAELGVEVAEIPSHYLSDSEKQVKVKKDSRRSAPKKGRSRHKHDRRGRYNKKDGLTQQNT 128

Query: 409 LADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPL 468
           LA KDSSN SSF++ KPTLLQKLLSA+V++DK HLLQ FRFM +NSFF D PEKP+ FP 
Sbjct: 129 LAKKDSSNGSSFSRTKPTLLQKLLSAEVRKDKHHLLQVFRFMVMNSFFDDGPEKPLKFPS 188

Query: 469 VMVKDGGAAGEVVEEKSLLLGEEDSEGKGRRMVEHFEH 506
           V+VK+     EVV EK+ + G+  SE   +  VE+  H
Sbjct: 189 VVVKE----DEVVAEKTSITGKGISEVSNKTAVENSGH 222


>gi|145334515|ref|NP_001078603.1| uncharacterized protein [Arabidopsis thaliana]
 gi|145358174|ref|NP_197345.2| uncharacterized protein [Arabidopsis thaliana]
 gi|60547897|gb|AAX23912.1| hypothetical protein At5g18440 [Arabidopsis thaliana]
 gi|71905555|gb|AAZ52755.1| hypothetical protein At5g18440 [Arabidopsis thaliana]
 gi|71905557|gb|AAZ52756.1| hypothetical protein At5g18440 [Arabidopsis thaliana]
 gi|332005180|gb|AED92563.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332005181|gb|AED92564.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 470

 Score =  245 bits (626), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 185/440 (42%), Positives = 246/440 (55%), Gaps = 58/440 (13%)

Query: 69  NFMNASNPLLIAQSNHLGLPQLGHMFPVHGNFGNMPLNVSQGQLLGPNLSNLPQQFNQNM 128
           N MNA N L +     +  P +GH         NMP+         PN+   PQ FN NM
Sbjct: 43  NPMNA-NQLGMMNPQMMNNPMMGH---------NMPM---------PNMPIHPQFFN-NM 82

Query: 129 GFSNGQMCLQS-VNQLLSMQMSNPCLFGNPQFGLVNQNQQ-----NFVQPIKD---VSGN 179
                Q  + + +NQLL      P L GN QF + N N       NF QP  +    S  
Sbjct: 83  PQQLPQFAMPNHINQLL------PNLLGNLQFAVANSNLMGHSLPNFFQPSLEPHAFSSR 136

Query: 180 NTLKSCSS-------EPRQGQNLQPSAFM--RSQRNFIQNGQGSISKSNSKKVSGKNFIK 230
             L S +S        P Q     P  F   R Q   + N  GS    N  +     F K
Sbjct: 137 PQLNSFNSLPYPPVPNPHQNHQSGPPGFSEPRPQGQSVDNTNGSGPNGNDFR---NKFPK 193

Query: 231 NRT-KEFSQGFQKSHFHQMHNGKRKFGFYNDQKPKGHGNGKAIKFGL--ANQMNQPQEKK 287
           ++  K   QGFQ+   HQ  NGKRK GF  D +  G GN   +K GL  ++  N  +EKK
Sbjct: 194 HQNFKGPGQGFQRPQLHQADNGKRKSGFNKDHR--GKGNNNKMKTGLDGSDTGNIAKEKK 251

Query: 288 RKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILA 347
           R S AL YT +EV+QW E RRKNYPTK  + KK+ +K  ++ ++D+EAKMRR+QL+E+LA
Sbjct: 252 R-SYALMYTPREVQQWREARRKNYPTKFLVEKKV-KKNVSASILDEEAKMRRQQLREVLA 309

Query: 348 KQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYDKRGRYKRKDQF-SKE 406
           KQAELGVEVAE+PS+YL ++ +QV     N     ++GRFQNN   + R+ RKD+F +K+
Sbjct: 310 KQAELGVEVAEVPSHYLSNNDEQVNGDRGNNN--GRKGRFQNNRRNKRRHDRKDKFDNKK 367

Query: 407 QKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNF 466
            +L DK SS  SS   +KPTLL+KLLSAD+KRDKS LLQ FRFM +NS  K++PE+P+  
Sbjct: 368 PRLEDKKSSQDSSITTRKPTLLEKLLSADIKRDKSQLLQVFRFMVMNSLLKEFPEQPLKL 427

Query: 467 PLVMVKDGGAAGEVVEEKSL 486
           PL+ VK+ G   + +E+ S+
Sbjct: 428 PLITVKETGCE-DAMEDPSI 446


>gi|358347440|ref|XP_003637765.1| hypothetical protein MTR_101s0034 [Medicago truncatula]
 gi|355503700|gb|AES84903.1| hypothetical protein MTR_101s0034 [Medicago truncatula]
          Length = 553

 Score =  245 bits (625), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 203/543 (37%), Positives = 273/543 (50%), Gaps = 121/543 (22%)

Query: 93  MFPVHGNFGNMPLNVSQGQLLGPNLSNLPQQFNQNM---GFSNGQMC------LQSV--N 141
           MFPVHG  G  P      Q+L  N+ NL QQ N NM      N Q C      +Q++   
Sbjct: 60  MFPVHGQGG--PQITHNAQILAQNILNLLQQPNMNMPNMNVPNAQFCAPGPYPMQNMMNQ 117

Query: 142 QLLSMQMS--------------------------------NPCLFGNPQFGLVNQNQQNF 169
           QL  MQM                                 NP   GNPQFG+V  NQ   
Sbjct: 118 QLPPMQMQRQNPTQQPYGMQPVQQPMFGFPNQVPHAMVPQNPMFNGNPQFGMVPGNQ--- 174

Query: 170 VQPIKDVSGNNTLKSCSSE----------------------PRQGQNLQPSAFMRS--QR 205
           ++P  D++  N + + +                        P    N Q SAF  S  Q 
Sbjct: 175 IRPQIDLNEKNRVPAGNGNANGFVPGGPFPPQQLQGNNGPVPHNANNAQGSAFRNSHPQV 234

Query: 206 NFIQNGQGSISKSNSKKVSGKNFIKNRTKEFSQG-FQKSHFHQMHNGKRKFGFYNDQKPK 264
           N   N   + + SN K    KNF   + +  SQG FQKS F+  + GKR   F  + K K
Sbjct: 235 NPNSNMNTNFANSNWKGSPNKNFKNKQNRGGSQGGFQKSKFNDTNKGKR---FSKEHKGK 291

Query: 265 GHGNGKAIKFGLANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEK 324
           G  N +A  FGL    N  QE KR S +LTY+EQE++QW E RRKN+P++  I K+ +E+
Sbjct: 292 GPNNERAGHFGL----NSKQEPKR-SFSLTYSEQEIQQWREARRKNHPSREKIEKQQSEQ 346

Query: 325 QSNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKR 384
             +S VID+E   R  +LKE+LAKQAELG+EVAEIPSYYL++   Q     +       +
Sbjct: 347 SKDSKVIDRELLQR--ELKEVLAKQAELGIEVAEIPSYYLMNGTNQGLQSEDKKNSFTDK 404

Query: 385 GRFQN----NYDKRGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDK 440
            +F+N    N D+R R  +K      QK AD+D       NKKKPTLLQKLLSAD+KRDK
Sbjct: 405 RKFKNKLMRNPDRRSRNNKK------QKFADRDLLE----NKKKPTLLQKLLSADIKRDK 454

Query: 441 SHLLQTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKGRRM 500
           SHL Q FRF+  NSFFKD+P+KP+ +P V VK+ G+  EV   K  L G       G  +
Sbjct: 455 SHLFQVFRFITANSFFKDYPDKPLVYPPVSVKEMGS--EVYGGKKHLQG-------GEAV 505

Query: 501 VEHFEHDENRERMHHNIQGFEDENDDEEEDNDNGEQMKVYIAGKGNTNDEHVRLEEEEGE 560
           +EH      +E +   ++G ++ +D+E+E++D           +G+  D     +EEEGE
Sbjct: 506 LEH----GTKEIVQKFVKGSDNGHDNEDEESD-----------EGDNEDGINEFDEEEGE 550

Query: 561 IID 563
           II+
Sbjct: 551 IIE 553


>gi|297807865|ref|XP_002871816.1| hypothetical protein ARALYDRAFT_488722 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297317653|gb|EFH48075.1| hypothetical protein ARALYDRAFT_488722 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 454

 Score =  244 bits (624), Expect = 7e-62,   Method: Compositional matrix adjust.
 Identities = 163/416 (39%), Positives = 233/416 (56%), Gaps = 52/416 (12%)

Query: 74  SNPLLIAQSNHLGLPQLGHMFPVHGNF-GNMPLNVSQGQLLGPNLSNLPQQFNQNMGFSN 132
           SNP++   +N + +P +    P+H  F  NMP      Q   PN                
Sbjct: 59  SNPMMGHMNNPIPMPNM----PIHPQFFNNMPQQQQLHQFAMPN---------------- 98

Query: 133 GQMCLQSVNQLLSMQMSNPCLFGNPQFGLVNQNQQ-----NFVQPIKDVSGN------NT 181
                  +NQLL      P L GN QF + N N       NF QP  + S        N+
Sbjct: 99  ------HINQLL------PNLLGNLQFAVANSNLMGHSLPNFFQPNLEPSAFTSRPQLNS 146

Query: 182 LKSCSSEPRQGQNLQPSAFMRSQRNFIQNGQGSISKSNSKKVSGKNFIKNRT-KEFSQGF 240
             S    P    +L+P  F   +     + + + S SN      K F K++  K   QGF
Sbjct: 147 FNSLPYPPVPNHHLRPPGFSEPRPQVGIDDRTNGSGSNGNDFRNK-FTKHQNFKGPGQGF 205

Query: 241 QKSHFHQMHNGKRKFGFYNDQKPKGHGNGKAIKFGL-ANQMNQPQEKKRKSLALTYTEQE 299
           Q+   HQ  NGKRK GF  D +  G GN   +K GL  +  +   ++KR+S AL YT ++
Sbjct: 206 QRPQLHQADNGKRKSGFNKDHR--GKGNYNKMKNGLDGSDADNIAKEKRRSYALMYTPKD 263

Query: 300 VKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILAKQAELGVEVAEI 359
           V QW E RRKN+PT+ N+ KK+ +K  ++ ++D+EAKMRR+QL+E+LAKQAELG+EVA++
Sbjct: 264 VNQWREARRKNFPTRLNVEKKV-KKNVSASILDEEAKMRRQQLREVLAKQAELGIEVADV 322

Query: 360 PSYYLLDSKKQVRARVENTMPLNKRGRFQNNYDKRGRYKRKDQFSKEQKLADKDSSNTSS 419
           PS+YL ++ ++V    +N     ++ +FQNN  K+ R+ RKD+F K  +L DK+SS  S 
Sbjct: 323 PSHYLSNTDERVHG--DNGANDGQKRKFQNNRHKQRRHGRKDKFDKTPRLDDKNSSQESP 380

Query: 420 FNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPLVMVKDGG 475
              KKPTLL+KLLSA++KRDK HLLQ FRFM +NSF K++PE+P+  PL+ V++ G
Sbjct: 381 MTTKKPTLLEKLLSANIKRDKIHLLQVFRFMVMNSFLKEFPEQPLKLPLITVEETG 436


>gi|359487059|ref|XP_002271061.2| PREDICTED: uncharacterized protein LOC100252331 [Vitis vinifera]
          Length = 1031

 Score =  238 bits (608), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 133/223 (59%), Positives = 160/223 (71%), Gaps = 15/223 (6%)

Query: 278 NQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKM 337
           N   Q + +K++ L L YTEQE++ W EER+KNYP+K N+ KK  EK +NS+VI+ E K 
Sbjct: 751 NFTYQDKVEKKRPLPLNYTEQEIQNWREERKKNYPSKINLEKKSAEKLTNSEVIEAEVKS 810

Query: 338 RREQLKEILAKQAELGVEVAEIPSYYLLDS-KKQVRARVENT-MPLNKRGRFQNNYDKRG 395
           RR+QLKEILAKQAELGVEVAEIP +YL DS K+QV  R EN      K+ RFQN  +KR 
Sbjct: 811 RRQQLKEILAKQAELGVEVAEIPPHYLSDSEKQQVHGREENNKKAFGKKERFQNRGNKRR 870

Query: 396 RYKRK--------DQFSKEQKLADKDS-----SNTSSFNKKKPTLLQKLLSADVKRDKSH 442
           R+ RK        D F+K+Q+LA KDS     SN    NKKK TLLQKLLS D+KRDK H
Sbjct: 871 RHDRKQWQRHDQEDGFTKKQRLAGKDSGDMNASNQPPLNKKKQTLLQKLLSTDIKRDKRH 930

Query: 443 LLQTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKS 485
           LLQ FRFMA+NSFFKDWPEKP+ FPLV+VK+ G  GEVV+ KS
Sbjct: 931 LLQVFRFMAMNSFFKDWPEKPLKFPLVVVKETGCQGEVVDRKS 973


>gi|296085358|emb|CBI29090.3| unnamed protein product [Vitis vinifera]
          Length = 467

 Score =  179 bits (455), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 113/240 (47%), Positives = 149/240 (62%), Gaps = 12/240 (5%)

Query: 181 TLKSCSSEPRQ-GQNLQPSAFMRSQRNFIQNGQGSISKSNSKKVSGKNFIKN-RTKEFSQ 238
           T+ + S +P+Q   N QPS FM+SQ NF+++   + S  N K  S KNF+KN + K    
Sbjct: 220 TMATSSVQPQQTWNNQQPSNFMKSQGNFVKDAGVNNSNPNWKNSSRKNFMKNPKGKNSHW 279

Query: 239 GFQKSHFHQMHNGKRKFGFYNDQKPKGHGNGKAIKFGLANQMNQPQEKKRKSLALTYTEQ 298
           G +KS  H M N + K G  N+ + KG  N  A      N   Q + +K++ L L YTEQ
Sbjct: 280 GSRKSQLHHMQNARGKAGISNENRGKGLSNNMAGNLCRPNFTYQDKVEKKRPLPLNYTEQ 339

Query: 299 EVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILAKQAELGVEVAE 358
           E++ W EER+KNYP+K N+ KK  EK +NS+VI+ E K RR+QLKEILAKQAELGVEVAE
Sbjct: 340 EIQNWREERKKNYPSKINLEKKSAEKLTNSEVIEAEVKSRRQQLKEILAKQAELGVEVAE 399

Query: 359 IPSYYLLDS-KKQVRARVENT-MPLNKRGRFQNNYDKRGRYKRK--------DQFSKEQK 408
           IP +YL DS K+QV  R EN      K+ RFQN  +KR R+ RK        D F+K+Q+
Sbjct: 400 IPPHYLSDSEKQQVHGREENNKKAFGKKERFQNRGNKRRRHDRKQWQRHDQEDGFTKKQR 459


>gi|115440647|ref|NP_001044603.1| Os01g0814000 [Oryza sativa Japonica Group]
 gi|56785048|dbj|BAD82687.1| unknown protein [Oryza sativa Japonica Group]
 gi|113534134|dbj|BAF06517.1| Os01g0814000 [Oryza sativa Japonica Group]
          Length = 533

 Score =  169 bits (429), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 90/186 (48%), Positives = 121/186 (65%), Gaps = 2/186 (1%)

Query: 288 RKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILA 347
           RK   + Y   EVKQW E R+KNYPT  NINKKL+E + + +  D+EA+MRR++LKE+L 
Sbjct: 281 RKRPRIIYDANEVKQWLEARKKNYPTSVNINKKLSESRPDGEKKDEEAQMRRQELKEVLE 340

Query: 348 KQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYD-KRGRYKRKDQFSKE 406
           KQ ELG E+ E+P  YL + + Q   R  N     +  RF N  D KR RY R D  SK 
Sbjct: 341 KQKELGFELPELPPGYLSEHEDQGNGRRSNWKTQRRDCRFGNRADNKRSRYDRNDFQSKR 400

Query: 407 QKLADKDSSNTSSFNK-KKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVN 465
            K+ ++   +  +  K ++PTLLQKLLS+DVKRD+  LL TF+FM +N+FF D+P+KP+ 
Sbjct: 401 PKVRNQTHCDDGAVLKSREPTLLQKLLSSDVKRDRHRLLHTFKFMVLNNFFSDYPDKPLE 460

Query: 466 FPLVMV 471
           FP V V
Sbjct: 461 FPSVKV 466


>gi|242054757|ref|XP_002456524.1| hypothetical protein SORBIDRAFT_03g037800 [Sorghum bicolor]
 gi|241928499|gb|EES01644.1| hypothetical protein SORBIDRAFT_03g037800 [Sorghum bicolor]
          Length = 568

 Score =  163 bits (412), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 84/188 (44%), Positives = 122/188 (64%), Gaps = 3/188 (1%)

Query: 287 KRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEIL 346
           +RK   + Y   EVK+W E R+KNYPT  N++KKL+E  S+++  DK+A++RR++LKE+L
Sbjct: 295 RRKPPRIIYDANEVKRWVEARKKNYPTSVNVHKKLSEINSDNENKDKDAQLRRQELKEVL 354

Query: 347 AKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQN--NYDKRGRYKRKDQFS 404
           AKQ ELG ++ E+P  YL ++  Q      N     +   F N  N +KR RY+R    S
Sbjct: 355 AKQQELGFDLPELPPGYLSETGDQCIENKNNRKAQCRDSHFGNRSNNNKRPRYERGGFQS 414

Query: 405 KEQKLADKDSSNTSSFNK-KKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKP 463
           K  K+ ++      +  K ++PTLLQKLLS+D+KRD+  LL  F+FM +N+FFKDWP+KP
Sbjct: 415 KRSKVWNRTPCADDAMAKSREPTLLQKLLSSDIKRDRHRLLHVFKFMTLNNFFKDWPDKP 474

Query: 464 VNFPLVMV 471
           + FP V V
Sbjct: 475 LQFPSVKV 482


>gi|357125563|ref|XP_003564462.1| PREDICTED: uncharacterized protein LOC100829078 [Brachypodium
           distachyon]
          Length = 609

 Score =  162 bits (410), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 88/191 (46%), Positives = 123/191 (64%), Gaps = 7/191 (3%)

Query: 288 RKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILA 347
           RK   + Y   EVKQW E R+KNYPT  N+NKKL++ QS+    D+EA+MRR +LKE+LA
Sbjct: 333 RKRPPICYDTNEVKQWVEARKKNYPTSVNVNKKLSQSQSDDQKKDEEAQMRRLELKEVLA 392

Query: 348 KQAELGVEVAEIPSYYLLDSKKQVRA----RVENTMPLNKRG-RFQN--NYDKRGRYKRK 400
           KQ ELG E+ E+P  YL +++ Q +       E+     +R  RF N    +KR RY R 
Sbjct: 393 KQKELGFELPELPPGYLSETEGQPKGPQGNDKESKWKTQQRDCRFGNRGRGNKRQRYDRA 452

Query: 401 DQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWP 460
           D  SK  +  +    +  +  K++PTLLQKLLS+D+KRD+  LL TF+FMA+N+FF++ P
Sbjct: 453 DFQSKRPREWNNSRHDGGAVAKREPTLLQKLLSSDIKRDRHRLLHTFKFMALNNFFRNLP 512

Query: 461 EKPVNFPLVMV 471
           +KP+ FP V V
Sbjct: 513 DKPLEFPSVKV 523


>gi|218189267|gb|EEC71694.1| hypothetical protein OsI_04190 [Oryza sativa Indica Group]
          Length = 500

 Score =  162 bits (409), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 91/199 (45%), Positives = 122/199 (61%), Gaps = 15/199 (7%)

Query: 288 RKSLALTYTEQEVKQWCEERRKNYPTKANINK-------------KLTEKQSNSDVIDKE 334
           RK   + Y   EVKQW E R+KNYPT  NINK             KL+E + + +  D+E
Sbjct: 237 RKRPRIIYDANEVKQWLEARKKNYPTSVNINKTIREASSTDHLMQKLSESRPDGEKKDEE 296

Query: 335 AKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYD-K 393
           A+MRR++LKE+LAKQ ELG E+ E+P  YL + + Q   R  N     +  RF N  D K
Sbjct: 297 AQMRRQELKEVLAKQKELGFELPELPPGYLSEHEDQGNGRRSNWKTQRRDCRFGNRADNK 356

Query: 394 RGRYKRKDQFSKEQKLADKDSSNTSSFNK-KKPTLLQKLLSADVKRDKSHLLQTFRFMAI 452
           R RY R D  SK  K+ ++   +  +  K ++PTLLQKLLS+DVKRD+  LL TF+FM +
Sbjct: 357 RSRYDRNDFQSKRPKVRNQTHCDDGAVLKSREPTLLQKLLSSDVKRDRHRLLHTFKFMVL 416

Query: 453 NSFFKDWPEKPVNFPLVMV 471
           N+FF D+P+KP+ FP V V
Sbjct: 417 NNFFSDYPDKPLEFPSVKV 435


>gi|223943225|gb|ACN25696.1| unknown [Zea mays]
 gi|414880010|tpg|DAA57141.1| TPA: hypothetical protein ZEAMMB73_684772 [Zea mays]
          Length = 562

 Score =  162 bits (409), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 83/186 (44%), Positives = 119/186 (63%), Gaps = 3/186 (1%)

Query: 287 KRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEIL 346
           +RK   + Y   EVKQW E R+KNYPT  N+NKKL+E +S+++  DK+A++RR++LKE++
Sbjct: 290 RRKPPPIIYDANEVKQWVEARKKNYPTSVNVNKKLSEMKSDNENKDKDAQLRRQELKEVI 349

Query: 347 AKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQN--NYDKRGRYKRKDQFS 404
           AKQ ELG +  E+P  YL +   Q      N     +   F N  N +KR RY+R    S
Sbjct: 350 AKQQELGFDPPELPPGYLSEIGDQCIENKNNRKTQFRDSHFGNRFNNNKRSRYERGGFQS 409

Query: 405 KEQKLAD-KDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKP 463
           K  K+ +    ++      ++PTLLQKLLS+D+KRD+  LL  FRFM +N+FFKDWP++P
Sbjct: 410 KRSKVWNCTPHADHGMVKSREPTLLQKLLSSDIKRDRHRLLHVFRFMTLNNFFKDWPDRP 469

Query: 464 VNFPLV 469
           + FP V
Sbjct: 470 LQFPSV 475


>gi|20161477|dbj|BAB90401.1| P0432B10.22 [Oryza sativa Japonica Group]
          Length = 541

 Score =  157 bits (396), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 92/220 (41%), Positives = 124/220 (56%), Gaps = 24/220 (10%)

Query: 288 RKSLALTYTEQEVKQWCEERRKNYPTKANINK----------------------KLTEKQ 325
           RK   + Y   EVKQW E R+KNYPT  NINK                      KL+E +
Sbjct: 267 RKRPRIIYDANEVKQWLEARKKNYPTSVNINKASATLCPHQTIREASSTDHLMQKLSESR 326

Query: 326 SNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRG 385
            + +  D+EA+MRR++LKE+L KQ ELG E+ E+P  YL + + Q   R  N     +  
Sbjct: 327 PDGEKKDEEAQMRRQELKEVLEKQKELGFELPELPPGYLSEHEDQGNGRRSNWKTQRRDC 386

Query: 386 RFQNNYD-KRGRYKRKDQFSKEQKLADKDSSNTSSFNK-KKPTLLQKLLSADVKRDKSHL 443
           RF N  D KR RY R D  SK  K+ ++   +  +  K ++PTLLQKLLS+DVKRD+  L
Sbjct: 387 RFGNRADNKRSRYDRNDFQSKRPKVRNQTHCDDGAVLKSREPTLLQKLLSSDVKRDRHRL 446

Query: 444 LQTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEE 483
           L TF+FM +N+FF D+P+KP+ FP V V        + EE
Sbjct: 447 LHTFKFMVLNNFFSDYPDKPLEFPSVKVNQIELESNIAEE 486


>gi|356562267|ref|XP_003549393.1| PREDICTED: uncharacterized protein LOC100789443 [Glycine max]
          Length = 684

 Score =  157 bits (396), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 104/250 (41%), Positives = 153/250 (61%), Gaps = 21/250 (8%)

Query: 319 KKLTEKQSNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQ-VRARVEN 377
           +K +E   +S  I++E   R  +LKE+LAKQAELGVEVAEIPSYYL +S  Q +++  +N
Sbjct: 451 QKHSECPKDSKAINREVLQR--ELKEVLAKQAELGVEVAEIPSYYLKNSDNQGLQSEAKN 508

Query: 378 TMPLNKRGRFQNNYDKRGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVK 437
               +KR +FQN ++K+    RK +F+++QK ADKD S + S  K+KPTLLQKLLS+DVK
Sbjct: 509 KY-TDKR-KFQNKFNKKS--DRKGRFAEKQKFADKDFSESPSLKKRKPTLLQKLLSSDVK 564

Query: 438 RDKSHLLQTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKG 497
           RDKSHLLQ  RFM +NSFF+  P+K + +P V VK+ G+  EV  EK  L   +D   +G
Sbjct: 565 RDKSHLLQVLRFMVMNSFFRHIPDKALRYPSVEVKEKGS--EVSGEKKHLHTGKDVLNRG 622

Query: 498 R----RMVEHFEHDENRERMHHNIQGFEDENDDEEEDNDNGEQMKVYIAGKGNTNDEHVR 553
                + +  F +D        N    +D+ +D   DN+  +     +  + +  +   +
Sbjct: 623 SEETVQKIVIFNND--------NGHDCDDDENDSIVDNNLHKDPSSLVKRQCDGGEGIKK 674

Query: 554 LEEEEGEIID 563
            +EEEGEI++
Sbjct: 675 SDEEEGEILE 684



 Score = 75.5 bits (184), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 103/335 (30%), Positives = 131/335 (39%), Gaps = 111/335 (33%)

Query: 36  VPMQPQTGIMNPQVPFPLSNSNMPATAPLMNQNNFMNASNPLLIAQSNHLGL--PQLGH- 92
           +P QPQ G     +P        P   P MN   FMNA+N       NH  L  P +GH 
Sbjct: 27  MPPQPQLGAGQNLMP--------PFMQPHMNAAPFMNAAN------HNHFPLHRPHMGHQ 72

Query: 93  -------------------------MFPVHG----NFGNMPLNVSQGQLLGPNLSNLPQQ 123
                                    MFPV G    N   + L+  QGQ L  ++ N+ QQ
Sbjct: 73  QGQPHVVGGLGPQNSVVGNANYNNPMFPVQGQVMQNQAQLNLSPLQGQFLAQSILNMLQQ 132

Query: 124 FNQNMGFS--NGQMC----LQSVNQLLSMQMSNP-------------------------- 151
            N NM  S  NGQ C    +Q++NQ L MQMSNP                          
Sbjct: 133 PNMNMSMSMPNGQFCGPYPMQNMNQQLPMQMSNPPQGVPYGMHPSSRPVFRFPNQVPQAM 192

Query: 152 ----CLFG-NPQFGLV-------------------NQNQQNFVQ----PIKDVSGNNTLK 183
                +F  NPQ G V                   N N   FV     P + + GN T  
Sbjct: 193 VPQNSMFSTNPQLGFVPGNQVRPQIDPNEKILAPPNANANAFVSSSPFPSQQLQGN-TSG 251

Query: 184 SCSSEPRQGQNLQPSAFMRSQRNFIQNGQGSISKSNSKKVSGKNFIKNRTKE--FSQGFQ 241
           S +       N QP AFM+ Q     N + ++  SN K    KN +KN+     F  GFQ
Sbjct: 252 SVNPNLAHTNNSQPPAFMK-QETPNSNIKTNVPNSNWKGSPSKN-LKNKPNRGRFQGGFQ 309

Query: 242 KSHFHQMHNGKRKFGFYNDQKPKGHGNGKAIKFGL 276
           KS FH ++NGKR  GF  +   KG  +G+A  +GL
Sbjct: 310 KSKFHDVNNGKRGSGFPKEHNGKGPNSGRAGHYGL 344


>gi|326528619|dbj|BAJ97331.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 625

 Score =  152 bits (385), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 90/218 (41%), Positives = 128/218 (58%), Gaps = 20/218 (9%)

Query: 288 RKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILA 347
           RK   + Y + EVKQW + R+KNYPT+AN+NKKL + Q +    D+EA+MRR++LKE++A
Sbjct: 335 RKRPPIIYDKNEVKQWVQARKKNYPTRANVNKKLCQNQLDEQKKDEEAQMRRQELKEVIA 394

Query: 348 KQAELGVEVAEIPSYYLLDSKKQVR----------------ARVENTMPLNKRGRFQNNY 391
           KQ ELG+E+ E+P  YL D++ Q R                           RGR     
Sbjct: 395 KQKELGLELPELPPGYLSDNEGQPRDPQKESNWKTRQGGGRFGNRGRGRGQGRGRGSGRD 454

Query: 392 DKRGRY-KRKDQFSKEQKLADKDS---SNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTF 447
           +KR RY  R+D  SK  +  + +S      +    ++PTLLQKLL++D+KRD+  LL TF
Sbjct: 455 NKRQRYDSREDSPSKRPREWNNNSRCHDGGAVAKIREPTLLQKLLNSDIKRDRHRLLHTF 514

Query: 448 RFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKS 485
           +FMA+N+FFKDWP KP+ FP V +       ++ EE S
Sbjct: 515 KFMALNNFFKDWPAKPLEFPSVKLNQIELESDIAEEGS 552


>gi|293335934|ref|NP_001167960.1| uncharacterized protein LOC100381676 [Zea mays]
 gi|223945137|gb|ACN26652.1| unknown [Zea mays]
 gi|414880008|tpg|DAA57139.1| TPA: hypothetical protein ZEAMMB73_684772 [Zea mays]
          Length = 295

 Score =  151 bits (382), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 82/187 (43%), Positives = 119/187 (63%), Gaps = 4/187 (2%)

Query: 287 KRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRRE-QLKEI 345
           +RK   + Y   EVKQW E R+KNYPT  N+NKKL+E +S+++  DK+A++RR+ +LKE+
Sbjct: 22  RRKPPPIIYDANEVKQWVEARKKNYPTSVNVNKKLSEMKSDNENKDKDAQLRRQVELKEV 81

Query: 346 LAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYD--KRGRYKRKDQF 403
           +AKQ ELG +  E+P  YL +   Q      N     +   F N ++  KR RY+R    
Sbjct: 82  IAKQQELGFDPPELPPGYLSEIGDQCIENKNNRKTQFRDSHFGNRFNNNKRSRYERGGFQ 141

Query: 404 SKEQKLAD-KDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEK 462
           SK  K+ +    ++      ++PTLLQKLLS+D+KRD+  LL  FRFM +N+FFKDWP++
Sbjct: 142 SKRSKVWNCTPHADHGMVKSREPTLLQKLLSSDIKRDRHRLLHVFRFMTLNNFFKDWPDR 201

Query: 463 PVNFPLV 469
           P+ FP V
Sbjct: 202 PLQFPSV 208


>gi|222619440|gb|EEE55572.1| hypothetical protein OsJ_03849 [Oryza sativa Japonica Group]
          Length = 484

 Score = 88.2 bits (217), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 49/118 (41%), Positives = 65/118 (55%), Gaps = 13/118 (11%)

Query: 288 RKSLALTYTEQEVKQWCEERRKNYPTKANINK-------------KLTEKQSNSDVIDKE 334
           RK   + Y   EVKQW E R+KNYPT  NINK             KL+E + + +  D+E
Sbjct: 281 RKRPRIIYDANEVKQWLEARKKNYPTSVNINKTIREASSTDHLMQKLSESRPDGEKKDEE 340

Query: 335 AKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYD 392
           A+MRR++LKE+L KQ ELG E+ E+P  YL + + Q   R  N     +  RF N  D
Sbjct: 341 AQMRRQELKEVLEKQKELGFELPELPPGYLSEHEDQGNGRRSNWKTQRRDCRFGNRAD 398


>gi|168019512|ref|XP_001762288.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162686366|gb|EDQ72755.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 614

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 42/95 (44%), Positives = 67/95 (70%), Gaps = 6/95 (6%)

Query: 291 LALTYTEQ---EVKQWCEERRKNYPTKANINKK---LTEKQSNSDVIDKEAKMRREQLKE 344
           +AL  +E+   E++ W EER +NYPT++NI +K   L  K +  ++ID++ + R++++KE
Sbjct: 278 VALKASEEDDDEIRAWREERMRNYPTRSNIQRKKEALEGKVARGELIDEDTRKRQQRMKE 337

Query: 345 ILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTM 379
           ILAKQAELGV VAE+P++YL  S K+ R R   ++
Sbjct: 338 ILAKQAELGVPVAEVPAHYLSQSYKEDRGRGRKSL 372



 Score = 56.2 bits (134), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 40/129 (31%), Positives = 58/129 (44%), Gaps = 18/129 (13%)

Query: 380 PLNKRGRFQNNYDKRGRYKR--KDQFSKEQKLADKDSSNTSSFN---------KKKPTLL 428
           P +K G+ Q  + +RGR K+  K  F+       K   N +            K+ PTLL
Sbjct: 461 PSDKPGKTQCYFYQRGRCKKGFKCHFAHTGPRGKKGGDNEAGRGGPKGVGVEKKRPPTLL 520

Query: 429 QKLLSADVKRDKSHLLQTFRFMAINSFFKDWP-------EKPVNFPLVMVKDGGAAGEVV 481
            KLL  ++ RDKS++LQ  R+   N+F  +WP       E P            + GEVV
Sbjct: 521 SKLLQGEINRDKSYILQCLRYFVNNNFLLNWPNQLSESIELPEQSDDAHESSDYSEGEVV 580

Query: 482 EEKSLLLGE 490
           E   +LL +
Sbjct: 581 EAPDVLLAD 589


>gi|219363611|ref|NP_001136910.1| uncharacterized protein LOC100217067 [Zea mays]
 gi|194697570|gb|ACF82869.1| unknown [Zea mays]
          Length = 138

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 30/47 (63%), Positives = 40/47 (85%)

Query: 423 KKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPLV 469
           ++PTLLQKLLS+D+KRD+  LL  FRFM +N+FFKDWP++P+ FP V
Sbjct: 5   REPTLLQKLLSSDIKRDRHRLLHVFRFMTLNNFFKDWPDRPLQFPSV 51


>gi|255546771|ref|XP_002514444.1| conserved hypothetical protein [Ricinus communis]
 gi|223546440|gb|EEF47940.1| conserved hypothetical protein [Ricinus communis]
          Length = 123

 Score = 68.9 bits (167), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 55/129 (42%), Positives = 62/129 (48%), Gaps = 39/129 (30%)

Query: 341 QLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYDKRGRYKRK 400
           +LKE LAKQAELGVEVAEIPS YL DS+KQV+                            
Sbjct: 28  RLKETLAKQAELGVEVAEIPSRYLSDSEKQVK---------------------------- 59

Query: 401 DQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRD-KSHLLQTFRFMAINSFF--- 456
                   L +KDS+N SSF+  KPTL QKLLSA+ K    S  LQ         FF   
Sbjct: 60  ------DTLTNKDSTNGSSFSNTKPTLRQKLLSAEGKGAFLSECLQASSTFTCEQFFSSN 113

Query: 457 KDWP-EKPV 464
           K W   KPV
Sbjct: 114 KLWNFTKPV 122


>gi|328766403|gb|EGF76457.1| hypothetical protein BATDEDRAFT_28378 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 411

 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 49/185 (26%), Positives = 88/185 (47%), Gaps = 25/185 (13%)

Query: 296 TEQEVKQWCEERRKNYPTKANINKKLTE---KQSNSDVID---KEAKMRREQLKEILAK- 348
           T ++++QW +ER++ +PT++NI +K+ +   +Q   ++ID   K+ K R    ++++ K 
Sbjct: 217 TPKDIEQWIQERKRKFPTESNIKQKMEDEKQRQKQGNIIDVNGKKLKGRNADKRQLIRKN 276

Query: 349 ---QAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYDKRG--RYKRKDQF 403
              Q + G    +  + YL D+         +T+ L+  G+   +       R+    + 
Sbjct: 277 DTVQEKHGTSEQKDSTIYLDDASNAKLEASHDTLALDMTGKLNQHKGASSCCRHGSNCRH 336

Query: 404 SKE-QKLADKDSSNTSSFNKKKPT-----------LLQKLLSADVKRDKSHLLQTFRFMA 451
           S +  KLA+  S+   +     PT           LL  LL  D K+DK+ LLQ  RF+ 
Sbjct: 337 SHDLSKLAESRSA-PKTHTPLNPTAAPIKGTNRRPLLNMLLETDFKKDKNALLQCIRFII 395

Query: 452 INSFF 456
            N FF
Sbjct: 396 ENKFF 400


>gi|384254158|gb|EIE27632.1| hypothetical protein COCSUDRAFT_55624 [Coccomyxa subellipsoidea
           C-169]
          Length = 539

 Score = 52.8 bits (125), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 22/50 (44%), Positives = 33/50 (66%)

Query: 418 SSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFP 467
           S   +++PTLLQKLL+ D++ + S LLQ  RF+  N+F  D+   P+ FP
Sbjct: 313 SPVQQRQPTLLQKLLAPDIRAEHSRLLQCLRFLVTNNFLLDFGRAPLIFP 362


>gi|308800704|ref|XP_003075133.1| Polyadenylation factor I complex, subunit, Yth1 (CPSF subunit)
           (ISS) [Ostreococcus tauri]
 gi|116061687|emb|CAL52405.1| Polyadenylation factor I complex, subunit, Yth1 (CPSF subunit)
           (ISS) [Ostreococcus tauri]
          Length = 401

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 52/204 (25%), Positives = 84/204 (41%), Gaps = 60/204 (29%)

Query: 289 KSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILAK 348
           K + +    ++V+++ EER K++P+ +++                       Q KEILAK
Sbjct: 144 KPVDVASLPEDVRRYREERAKHWPSDSHV-----------------------QAKEILAK 180

Query: 349 QAELG--------VEVAEIPSYYLLDSKKQVRARVENTMPLN---KRGRF---------- 387
           Q E+G         E  +      + +K++ R   +   P     K  RF          
Sbjct: 181 QREMGHFEASQEIGEEGDGERTATMTTKREQRVGEQEKKPSGDSVKVCRFWLQGGCRKGD 240

Query: 388 --------------QNNYDKRGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLS 433
                         +  +  RGR K   +      + DK +S  SS N +  TLL+KLL 
Sbjct: 241 ACDFKHEAGPNSDQRCRFFARGRCKAGKRCPFRHDIVDKKTSGVSSGNPQ--TLLKKLLH 298

Query: 434 ADVKRDKSHLLQTFRFMAINSFFK 457
            ++KRD+  LLQ FRF+  N FF+
Sbjct: 299 KEIKRDEERLLQLFRFLVNNDFFE 322


>gi|145343958|ref|XP_001416510.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144576735|gb|ABO94803.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 291

 Score = 47.8 bits (112), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 62/223 (27%), Positives = 100/223 (44%), Gaps = 44/223 (19%)

Query: 298 QEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILAKQAELG---- 353
           ++V+++ EER K +P+  N+  K    +       +    RRE+L+EILAKQ E+G    
Sbjct: 28  EDVRRYREERAKYWPSDRNVRAKTEAGEEEE----RARAARRERLREILAKQREMGHFEA 83

Query: 354 -VEVAE----IPSYYLLDSKKQVRARVENTMPLNKRGRF--------------------- 387
             E+ E    + +   LD K  V AR  +    +K  RF                     
Sbjct: 84  SQEIGEEEGGVAAPAALDGK-GVEARAADG---SKVCRFWLQGGCRKGSACDFKHESAPN 139

Query: 388 ---QNNYDKRGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLL 444
              +  +  RGR K   +   + ++ ++ SS        + TLL+KLL  ++KRD+  LL
Sbjct: 140 KDQKCRFFARGRCKAGARCPFKHEVTERKSSAADGGGNPQ-TLLKKLLDKEIKRDEQRLL 198

Query: 445 QTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLL 487
           Q FRF   N FF     K V+ PL M     +  +V++ ++ L
Sbjct: 199 QLFRFFVNNDFFTG--TKGVDEPLWMFPWADSGKKVIDRRAAL 239


>gi|118358218|ref|XP_001012358.1| hypothetical protein TTHERM_00107150 [Tetrahymena thermophila]
 gi|89294124|gb|EAR92112.1| hypothetical protein TTHERM_00107150 [Tetrahymena thermophila
           SB210]
          Length = 762

 Score = 45.8 bits (107), Expect = 0.054,   Method: Compositional matrix adjust.
 Identities = 22/63 (34%), Positives = 41/63 (65%)

Query: 399 RKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKD 458
           ++D++S+E  L+    SN+S+F  K  TLL  L+  ++ R++++LLQ  R++  N+FF +
Sbjct: 574 KRDKYSQELDLSFNYKSNSSNFRYKANTLLTNLVLDEIYRERNYLLQAIRYIVSNNFFDE 633

Query: 459 WPE 461
             E
Sbjct: 634 QAE 636


>gi|307107626|gb|EFN55868.1| hypothetical protein CHLNCDRAFT_52119 [Chlorella variabilis]
          Length = 684

 Score = 43.9 bits (102), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 22/70 (31%), Positives = 41/70 (58%), Gaps = 2/70 (2%)

Query: 299 EVKQWCEERRKNYPTKANINKKLTEKQSN--SDVIDKEAKMRREQLKEILAKQAELGVEV 356
           EV+QW +ER++++PT   + +K  E ++      +D    +R+++L  ILA+Q+ +G+  
Sbjct: 388 EVEQWIKERKRSFPTSEAVARKEAEAEARRQRGELDAARTLRQQRLAAILAQQSAMGLSR 447

Query: 357 AEIPSYYLLD 366
           A      LLD
Sbjct: 448 AAGTLDMLLD 457


>gi|68073975|ref|XP_678902.1| hypothetical protein [Plasmodium berghei strain ANKA]
 gi|56499510|emb|CAH94055.1| conserved hypothetical protein [Plasmodium berghei]
          Length = 611

 Score = 43.5 bits (101), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 49/183 (26%), Positives = 90/183 (49%), Gaps = 18/183 (9%)

Query: 283 PQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQL 342
             EK  K+  +  +++E+++W EER+KNYPTK    KK+ + ++N++ I+ E K ++ + 
Sbjct: 425 SHEKNEKNENILESKEEIQKWIEERKKNYPTK----KKIMDIKNNNN-INSETKEKKNKE 479

Query: 343 KEILAKQAELGVE-----VAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNN--YDKRG 395
           K+I +   +L VE     +     Y     +K +   + N +  N       N  Y    
Sbjct: 480 KKITSLIEKLLVEKYSSAIGRNIYYQTQSQRKSIFVPILNKLIENNHKNVYENTYYSISD 539

Query: 396 RYKRKD-QFSKEQKLAD-KDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAIN 453
           +Y +K+ +F+K  K  +  DS N      K   L+ +L+  D+   +  L+Q   F+  N
Sbjct: 540 KYNKKNIKFNKRVKKHNLIDSLNVH----KNAPLIYQLMKNDIYVYEQKLMQCIEFIVNN 595

Query: 454 SFF 456
           +FF
Sbjct: 596 NFF 598


>gi|392567649|gb|EIW60824.1| hypothetical protein TRAVEDRAFT_71098 [Trametes versicolor
           FP-101664 SS1]
          Length = 590

 Score = 43.5 bits (101), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 34/121 (28%), Positives = 64/121 (52%), Gaps = 15/121 (12%)

Query: 297 EQEVKQWCEERRKNYPTKANINKKL----TEKQSNSDVIDKEAKMRREQLKEILAKQAEL 352
           EQ ++++ EE++ +   + N +  +    TE +S     D+ A  R E+  +I+AK  EL
Sbjct: 211 EQALQRFIEEQKTSALQRFNFHMAVLIWETEARSAKATGDEAA--RSERHADIIAKLVEL 268

Query: 353 GVEVAEIPSY-----YLLDSKKQVRARVENTMPLNKRGRFQNNYDKRGRYKRKDQFSKEQ 407
           G + +EIP+Y      LLD  +++  R+ NT+    R +     ++R R + +D F K+ 
Sbjct: 269 GYQSSEIPAYDHAFQRLLDQPRKLTPRIWNTI----RPKLVEMLEERRRAREQDAFKKKW 324

Query: 408 K 408
           +
Sbjct: 325 R 325


>gi|126337727|ref|XP_001369929.1| PREDICTED: nuclear fragile X mental retardation-interacting protein
           1-like [Monodelphis domestica]
          Length = 469

 Score = 42.7 bits (99), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 51/104 (49%), Gaps = 16/104 (15%)

Query: 354 VEVAEIPSYYLLDSKKQVR-ARVENTMPLNKRGRFQNNYDKRGRYKRKDQFSKEQKLADK 412
           V ++++P     D KKQ R + V  T  L K    +N   KRG Y++K            
Sbjct: 358 VILSDVPQNQSEDIKKQERNSPVAKTKSLRKNWMRKNCRYKRGLYRKKP----------- 406

Query: 413 DSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFF 456
                S   K++P LL+ LL+ D++ +++ +LQ  R++  N FF
Sbjct: 407 ----VSEPQKRRPHLLEMLLAQDIRHERNVILQCVRYIIQNDFF 446



 Score = 39.3 bits (90), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 16/25 (64%), Positives = 22/25 (88%)

Query: 296 TEQEVKQWCEERRKNYPTKANINKK 320
           T +E+++W EERRKN+PT+ANI KK
Sbjct: 178 TPEEIEKWREERRKNFPTRANIEKK 202


>gi|255088323|ref|XP_002506084.1| predicted protein [Micromonas sp. RCC299]
 gi|226521355|gb|ACO67342.1| predicted protein [Micromonas sp. RCC299]
          Length = 567

 Score = 42.4 bits (98), Expect = 0.75,   Method: Compositional matrix adjust.
 Identities = 17/32 (53%), Positives = 24/32 (75%)

Query: 426 TLLQKLLSADVKRDKSHLLQTFRFMAINSFFK 457
           TLL+KLL+ +V+ D+S LLQ FRF+  N F +
Sbjct: 458 TLLRKLLAREVRADRSRLLQVFRFLVNNDFLR 489


>gi|17540726|ref|NP_499991.1| Protein F56B3.11, isoform a [Caenorhabditis elegans]
 gi|351063723|emb|CCD71947.1| Protein F56B3.11, isoform a [Caenorhabditis elegans]
          Length = 213

 Score = 42.0 bits (97), Expect = 0.77,   Method: Compositional matrix adjust.
 Identities = 34/120 (28%), Positives = 57/120 (47%), Gaps = 8/120 (6%)

Query: 443 LLQTFRFMAINSFF-KDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKGRRMV 501
           L+  F ++    FF +D P+   N  L +V+D G   E+        GEE S G+ RR V
Sbjct: 93  LIGAFIYVLAGEFFAQDSPQTIFNKALALVRDDGRCQEIFGASIAGFGEETSRGR-RRHV 151

Query: 502 EHFEHDEN---RERMHHNIQGFEDEN--DDEEEDNDNGEQMK-VYIAGKGNTNDEHVRLE 555
            H +++++   R R+  +++G  DE     E E  D   Q + +Y+  K      HV ++
Sbjct: 152 AHHKYEKDGMQRIRVLFHVKGDRDEGIAQAEMEQRDGDWQWRFLYVENKRRPKTTHVLID 211


>gi|303286377|ref|XP_003062478.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226455995|gb|EEH53297.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 552

 Score = 42.0 bits (97), Expect = 0.85,   Method: Compositional matrix adjust.
 Identities = 17/35 (48%), Positives = 26/35 (74%)

Query: 424 KPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKD 458
           +PTLL+KL + +V+  +S LLQ FRF+  N+F +D
Sbjct: 460 QPTLLKKLFAKEVRVSRSRLLQVFRFLVNNNFLRD 494


>gi|313219540|emb|CBY30463.1| unnamed protein product [Oikopleura dioica]
 gi|313226347|emb|CBY21491.1| unnamed protein product [Oikopleura dioica]
          Length = 310

 Score = 42.0 bits (97), Expect = 0.97,   Method: Compositional matrix adjust.
 Identities = 20/49 (40%), Positives = 35/49 (71%), Gaps = 1/49 (2%)

Query: 296 TEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKE 344
           TE+++++W EERRKN+PTKA I +K   +++ + +  K+   +REQ +E
Sbjct: 96  TEEDIRKWREERRKNFPTKARIAEKEAARKAGTSIETKKFS-KREQERE 143


>gi|341888654|gb|EGT44589.1| hypothetical protein CAEBREN_21252 [Caenorhabditis brenneri]
          Length = 219

 Score = 41.2 bits (95), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 33/120 (27%), Positives = 57/120 (47%), Gaps = 8/120 (6%)

Query: 443 LLQTFRFMAINSFF-KDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKGRRMV 501
           L+  F ++    FF +D P+   N  L +V+D G   ++        GEE S G+ RR V
Sbjct: 99  LIGAFIYVLCGEFFAQDSPQTIFNKALAIVRDDGRCQDIFGATIAGFGEETSRGR-RRHV 157

Query: 502 EHFEHDEN---RERMHHNIQGFEDEN--DDEEEDNDNGEQMK-VYIAGKGNTNDEHVRLE 555
            H +++++   R R+  +++G  DE     E E  D   Q + +Y+  K      HV ++
Sbjct: 158 AHHKYEKDGMQRIRVLFHVKGDRDEGIAQAEMEQRDGDWQWRFLYVENKRRPKTTHVLID 217


>gi|295665248|ref|XP_002793175.1| CCCH zinc finger protein [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226278089|gb|EEH33655.1| CCCH zinc finger protein [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 764

 Score = 40.4 bits (93), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 20/44 (45%), Positives = 27/44 (61%), Gaps = 2/44 (4%)

Query: 299 EVKQWCEERRKNYPTKANINKKL--TEKQSNSDVIDKEAKMRRE 340
           ++  W EER+K YPTKA I ++L   EKQ  +    K+AK  RE
Sbjct: 356 DIAAWIEERKKRYPTKARIEERLKEAEKQKKASREAKDAKRARE 399


>gi|82594577|ref|XP_725484.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
 gi|23480507|gb|EAA17049.1| Zinc finger, C2H2 type, putative [Plasmodium yoelii yoelii]
          Length = 601

 Score = 40.4 bits (93), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 51/192 (26%), Positives = 91/192 (47%), Gaps = 37/192 (19%)

Query: 283 PQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQL 342
             EK  K+  +  +++E+++W EER+KNYPTK    KK+ E ++N+  I+ E K ++   
Sbjct: 418 SHEKNEKNENILESKEEIQKWIEERKKNYPTK----KKIMEMKNNN--INSEEKEKKNXQ 471

Query: 343 KEILAKQAELGVEVAEIP----SYYLLDSKKQ------VRARVENTMPLNKRGRFQNNYD 392
           K+I +   +L VE          YY   S+K+      +   +EN    N +  ++N Y 
Sbjct: 472 KKITSLIEKLLVEKYSSAIGRNIYYQTQSQKKSIFVPLLNKLIEN----NYKNVYENTY- 526

Query: 393 KRGRYKRKDQFSKEQKLADK--------DSSNTSSFNKKKPTLLQKLLSADVKRDKSHLL 444
               Y   D+++++    +K        DS N      K   L+ +L+  D+   +  L+
Sbjct: 527 ----YSISDKYNQKNIKFNKKIKKNNLIDSLNVH----KNAPLIYQLMKNDIYVYEQKLM 578

Query: 445 QTFRFMAINSFF 456
           Q   F+  N+FF
Sbjct: 579 QCIEFIVNNNFF 590


>gi|345325138|ref|XP_001514551.2| PREDICTED: nuclear fragile X mental retardation-interacting protein
           1-like [Ornithorhynchus anatinus]
          Length = 446

 Score = 40.4 bits (93), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 22/51 (43%), Positives = 29/51 (56%), Gaps = 12/51 (23%)

Query: 296 TEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEIL 346
           T +E+ +W EERRKN+PT ANI KK            K  ++ REQ  E+L
Sbjct: 125 TPEEIAKWREERRKNFPTMANIEKK------------KALQLDREQRGEVL 163


>gi|291235450|ref|XP_002737657.1| PREDICTED: nuclear fragile X mental retardation protein interacting
           protein-like [Saccoglossus kowalevskii]
          Length = 639

 Score = 40.0 bits (92), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 16/35 (45%), Positives = 26/35 (74%)

Query: 422 KKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFF 456
           KKKPTLL+ LL+ D++ +++ LLQ  R++   +FF
Sbjct: 458 KKKPTLLEMLLAGDIRHERNVLLQCVRYVKKKNFF 492


>gi|158297378|ref|XP_001237955.2| AGAP007875-PA [Anopheles gambiae str. PEST]
 gi|157015166|gb|EAU76388.2| AGAP007875-PA [Anopheles gambiae str. PEST]
          Length = 474

 Score = 39.7 bits (91), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 14/31 (45%), Positives = 24/31 (77%)

Query: 426 TLLQKLLSADVKRDKSHLLQTFRFMAINSFF 456
           T+L+KLL AD++ +++ LLQ  R++ +N FF
Sbjct: 416 TMLEKLLDADIRHERNVLLQCVRYVLVNKFF 446


>gi|67474753|ref|XP_653120.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
 gi|56470045|gb|EAL47734.1| hypothetical protein EHI_019110 [Entamoeba histolytica HM-1:IMSS]
 gi|449703311|gb|EMD43787.1| Hypothetical protein EHI5A_022200 [Entamoeba histolytica KU27]
          Length = 129

 Score = 39.7 bits (91), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 17/38 (44%), Positives = 29/38 (76%)

Query: 296 TEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDK 333
           TE+++++W EER+K +PTKA   K L+E++  +D I+K
Sbjct: 2   TEEQLQKWIEERKKRFPTKAKREKCLSEEKKKTDEIEK 39


>gi|407040593|gb|EKE40213.1| hypothetical protein ENU1_097960 [Entamoeba nuttalli P19]
          Length = 129

 Score = 39.7 bits (91), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 17/38 (44%), Positives = 29/38 (76%)

Query: 296 TEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDK 333
           TE+++++W EER+K +PTKA   K L+E++  +D I+K
Sbjct: 2   TEEQLQKWIEERKKRFPTKAKREKCLSEEKKKTDEIEK 39


>gi|344301187|gb|EGW31499.1| hypothetical protein SPAPADRAFT_154739 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 212

 Score = 39.7 bits (91), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 35/55 (63%), Gaps = 2/55 (3%)

Query: 296 TEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILAKQA 350
           TE+++ +W EER+K +P+K  + KKL +KQ  S++  KE + R  + KE   K+ 
Sbjct: 59  TEEDIAKWIEERKKKWPSKQVVAKKLEQKQ--SEIQQKEDRKRPAESKEAPGKRT 111


>gi|7305335|ref|NP_038773.1| nuclear fragile X mental retardation-interacting protein 1 [Mus
           musculus]
 gi|81917719|sp|Q9QXX8.1|NUFP1_MOUSE RecName: Full=Nuclear fragile X mental retardation-interacting
           protein 1; AltName: Full=Nuclear FMRP-interacting
           protein 1
 gi|6525073|gb|AAF15316.1| nuclear FMRP interacting protein 1 [Mus musculus]
 gi|26328149|dbj|BAC27815.1| unnamed protein product [Mus musculus]
 gi|33525186|gb|AAH56192.1| Nuclear fragile X mental retardation protein interacting protein 1
           [Mus musculus]
 gi|74220737|dbj|BAE33625.1| unnamed protein product [Mus musculus]
 gi|148703863|gb|EDL35810.1| nuclear fragile X mental retardation protein interacting protein 1
           [Mus musculus]
          Length = 484

 Score = 39.7 bits (91), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 24/51 (47%), Positives = 31/51 (60%), Gaps = 5/51 (9%)

Query: 270 KAIKFGLANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKK 320
           K ++F   N M+ P  KK K      T +E+ +W EERRKNYPT ANI +K
Sbjct: 202 KIVQFHWRN-MHAPGMKKIK----LDTPEEIARWREERRKNYPTLANIERK 247


>gi|327283872|ref|XP_003226664.1| PREDICTED: nuclear fragile X mental retardation-interacting protein
           1-like [Anolis carolinensis]
          Length = 451

 Score = 39.7 bits (91), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 22/50 (44%), Positives = 29/50 (58%), Gaps = 8/50 (16%)

Query: 296 TEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEI 345
           T +E+ +W EERRKN+PT ANI KK          + KE + R E LK +
Sbjct: 92  TPEEIAKWREERRKNFPTLANIEKK--------KALQKEKEQRGEVLKTL 133


>gi|156099300|ref|XP_001615652.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148804526|gb|EDL45925.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 589

 Score = 39.3 bits (90), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 47/208 (22%), Positives = 86/208 (41%), Gaps = 36/208 (17%)

Query: 286 KKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEI 345
           K  K   +    +E+++W +ER+KNYPT+  I   L   ++      K+     E+L   
Sbjct: 406 KNDKDEVILSDSKEIEKWIQERKKNYPTREKIADSLKNNENGKKKKKKKPNCLIEELL-F 464

Query: 346 LAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYDKR---------GR 396
            +  + +G  +     Y+    +K +   +   M  N    F N Y+K          G+
Sbjct: 465 ESYCSAVGRNI-----YFKSKWEKSIFIPLLTKMEQNN---FHNIYEKSYYPLFNNSMGK 516

Query: 397 YKRKDQFSKEQKLADKDSSNTSSFN-KKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSF 455
           +      +K+Q+   K+     S N  +KP LL +L+  ++   +  L++   F+  + F
Sbjct: 517 FFP----TKQQRRPFKERQGIDSLNIHRKPPLLYQLMKKEIYLYEKRLIKCIEFITRSGF 572

Query: 456 FKDWPEKPVNFPLVMVKDGGAAGEVVEE 483
           F D              +GGAAG  + E
Sbjct: 573 FDD-------------SEGGAAGTDILE 587


>gi|225679220|gb|EEH17504.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
          Length = 766

 Score = 39.3 bits (90), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 19/44 (43%), Positives = 27/44 (61%), Gaps = 2/44 (4%)

Query: 299 EVKQWCEERRKNYPTKANINKKL--TEKQSNSDVIDKEAKMRRE 340
           ++  W EER+K YPT+A I ++L   EKQ  +    K+AK  RE
Sbjct: 358 DIAAWIEERKKRYPTRARIEERLKEAEKQKKASREAKDAKRARE 401


>gi|255554857|ref|XP_002518466.1| conserved hypothetical protein [Ricinus communis]
 gi|223542311|gb|EEF43853.1| conserved hypothetical protein [Ricinus communis]
          Length = 228

 Score = 39.3 bits (90), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 43/112 (38%), Positives = 56/112 (50%), Gaps = 26/112 (23%)

Query: 44  IMNPQVPFPLSNSNMPATAPLMNQNNFMNASNPLLIAQSNHLGLPQLGHMFPVHGNFGN- 102
           IMNPQ+ FP +N++  + +P + QN F               GLPQLG     H N  N 
Sbjct: 65  IMNPQIQFPFNNNS--SVSPFITQNQF---------------GLPQLGFSPLNHLNHLNT 107

Query: 103 MPLNVSQ---GQLLGPNLS-NLPQQFNQNMGFSNGQMCLQSVNQLLSMQMSN 150
           +P+ V+Q    QL       N PQQ N NMGF N    LQ++N  L +QM N
Sbjct: 108 VPMFVNQFNPSQLQPQQFPFNFPQQLNHNMGFPN----LQNMNPPLPIQMPN 155


>gi|354543450|emb|CCE40169.1| hypothetical protein CPAR2_102070 [Candida parapsilosis]
          Length = 253

 Score = 39.3 bits (90), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 13/25 (52%), Positives = 22/25 (88%)

Query: 296 TEQEVKQWCEERRKNYPTKANINKK 320
           TE+++ +W EER+KN+PT+AN+ +K
Sbjct: 77  TEEDIAKWIEERKKNWPTRANVERK 101


>gi|395527571|ref|XP_003765917.1| PREDICTED: nuclear fragile X mental retardation-interacting protein
           1 [Sarcophilus harrisii]
          Length = 480

 Score = 39.3 bits (90), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 16/25 (64%), Positives = 22/25 (88%)

Query: 296 TEQEVKQWCEERRKNYPTKANINKK 320
           T +E+++W EERRKN+PT+ANI KK
Sbjct: 187 TPEEIEKWREERRKNFPTRANIEKK 211


>gi|448516074|ref|XP_003867484.1| hypothetical protein CORT_0B03380 [Candida orthopsilosis Co 90-125]
 gi|380351823|emb|CCG22046.1| hypothetical protein CORT_0B03380 [Candida orthopsilosis]
          Length = 243

 Score = 39.3 bits (90), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 13/25 (52%), Positives = 22/25 (88%)

Query: 296 TEQEVKQWCEERRKNYPTKANINKK 320
           TE+++ +W EER+KN+PT+AN+ +K
Sbjct: 69  TEEDIAKWIEERKKNWPTRANVERK 93


>gi|167387090|ref|XP_001738022.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165898944|gb|EDR25671.1| hypothetical protein EDI_290620 [Entamoeba dispar SAW760]
          Length = 137

 Score = 38.9 bits (89), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 18/45 (40%), Positives = 31/45 (68%)

Query: 289 KSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDK 333
           K +  + TE+++++W EER+K +PTKA   K L E++  +D I+K
Sbjct: 3   KFIKSSMTEEQLQKWIEERKKRFPTKAKREKCLFEEKKKTDEIEK 47


>gi|12843948|dbj|BAB26177.1| unnamed protein product [Mus musculus]
          Length = 249

 Score = 38.9 bits (89), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 24/51 (47%), Positives = 31/51 (60%), Gaps = 5/51 (9%)

Query: 270 KAIKFGLANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKK 320
           K ++F   N M+ P  KK K      T +E+ +W EERRKNYPT ANI +K
Sbjct: 202 KIVQFHWRN-MHAPGMKKIK----LDTPEEIARWREERRKNYPTLANIERK 247


>gi|448079221|ref|XP_004194343.1| Piso0_004831 [Millerozyma farinosa CBS 7064]
 gi|359375765|emb|CCE86347.1| Piso0_004831 [Millerozyma farinosa CBS 7064]
          Length = 234

 Score = 38.9 bits (89), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 14/42 (33%), Positives = 29/42 (69%)

Query: 296 TEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKM 337
           TE+++K+W EERR+N+P++ N+  K  ++ + +   D++ K 
Sbjct: 72  TEEDIKKWIEERRRNWPSRKNVEAKKEKESAGAPANDEQIKT 113


>gi|195995467|ref|XP_002107602.1| hypothetical protein TRIADDRAFT_51288 [Trichoplax adhaerens]
 gi|190588378|gb|EDV28400.1| hypothetical protein TRIADDRAFT_51288 [Trichoplax adhaerens]
          Length = 444

 Score = 38.5 bits (88), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 18/38 (47%), Positives = 26/38 (68%), Gaps = 3/38 (7%)

Query: 296 TEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDK 333
           + +++K W E+RRKNYPT AN+ KK   KQ   D I++
Sbjct: 205 SPEDIKAWREQRRKNYPTAANVEKK---KQEALDRIER 239


>gi|395834849|ref|XP_003790401.1| PREDICTED: nuclear fragile X mental retardation-interacting protein
           1 [Otolemur garnettii]
          Length = 514

 Score = 38.5 bits (88), Expect = 9.4,   Method: Compositional matrix adjust.
 Identities = 23/50 (46%), Positives = 30/50 (60%), Gaps = 5/50 (10%)

Query: 270 KAIKFGLANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINK 319
           K ++F   N M+ P  KK K      T +E+ +W EERRKNYPT ANI +
Sbjct: 211 KIVQFHWKN-MHAPGMKKIK----LDTPEEIARWREERRKNYPTLANIER 255


>gi|260817126|ref|XP_002603438.1| hypothetical protein BRAFLDRAFT_80413 [Branchiostoma floridae]
 gi|229288757|gb|EEN59449.1| hypothetical protein BRAFLDRAFT_80413 [Branchiostoma floridae]
          Length = 408

 Score = 38.5 bits (88), Expect = 9.6,   Method: Compositional matrix adjust.
 Identities = 16/32 (50%), Positives = 23/32 (71%)

Query: 289 KSLALTYTEQEVKQWCEERRKNYPTKANINKK 320
           K+L+   T + + +W E+RRKNYPT AN+ KK
Sbjct: 58  KALSKLDTPEAIAKWREDRRKNYPTVANVAKK 89


>gi|260945323|ref|XP_002616959.1| hypothetical protein CLUG_02403 [Clavispora lusitaniae ATCC 42720]
 gi|238848813|gb|EEQ38277.1| hypothetical protein CLUG_02403 [Clavispora lusitaniae ATCC 42720]
          Length = 221

 Score = 38.5 bits (88), Expect = 9.9,   Method: Compositional matrix adjust.
 Identities = 15/33 (45%), Positives = 24/33 (72%)

Query: 296 TEQEVKQWCEERRKNYPTKANINKKLTEKQSNS 328
           TE+++ +W EERRKN+PT+ NI  K  + +S +
Sbjct: 76  TEEDIAKWIEERRKNWPTRKNIENKKAQLESKA 108


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.312    0.129    0.367 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 9,084,025,908
Number of Sequences: 23463169
Number of extensions: 418842500
Number of successful extensions: 1537860
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 161
Number of HSP's successfully gapped in prelim test: 2296
Number of HSP's that attempted gapping in prelim test: 1514781
Number of HSP's gapped (non-prelim): 16671
length of query: 563
length of database: 8,064,228,071
effective HSP length: 148
effective length of query: 415
effective length of database: 8,886,646,355
effective search space: 3687958237325
effective search space used: 3687958237325
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 80 (35.4 bits)