BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 015654
         (403 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|224100559|ref|XP_002311923.1| predicted protein [Populus trichocarpa]
 gi|222851743|gb|EEE89290.1| predicted protein [Populus trichocarpa]
          Length = 659

 Score =  270 bits (691), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 176/393 (44%), Positives = 223/393 (56%), Gaps = 67/393 (17%)

Query: 44  QRNFIQNGQGSISKSNSKKVSGKNFIKNRTKEFSQ-GFQKSHFHQMHNGKRKFGFYNDQK 102
           Q N  +NGQ S  KS  +K SGKN   NR +  SQ G QKS F+ M NGKRK  F N+  
Sbjct: 301 QGNTARNGQSSTPKSKWEKPSGKNLKNNRNRGRSQSGHQKSDFNCMDNGKRKLEFSNEHG 360

Query: 103 PKGHGNGKAIKFGLANQMNQPQEKKRK-SLALTYTEQEVKQWCEERRKNYPTKANINKKL 161
            KG+GN +  KFG  +  +Q  E+K K S    YTEQE+KQW E RRK+YPTK +I KK 
Sbjct: 361 RKGNGNERVAKFGRTDLTDQATEEKSKPSRTFFYTEQEIKQWRESRRKHYPTKTSIEKKQ 420

Query: 162 TEKQSNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDS-------------- 207
            E      VID+EA  R +QLKEILAKQAELGVEVAEIP  YLLDS              
Sbjct: 421 ME------VIDREANFRHKQLKEILAKQAELGVEVAEIPPDYLLDSEKLGVEVAEIPPPQ 474

Query: 208 -----------------KKQVRARVENTMPLNKRGRFQNNYDKRGRYKRKDQFSKEQKLA 250
                            +KQ   R +N   L K+G+F N +DKRGR+ +K + +K+   A
Sbjct: 475 VLSSEKLGVEVAEIPPPQKQEHGREDNRRSLTKKGKFWNKHDKRGRFNKKGRSAKQVGSA 534

Query: 251 DKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPLVM 310
           +++         +KPTLL+KLLS D+KRDK  LLQ FRFM  NSFFKDWPEKP+ FP V+
Sbjct: 535 NEE---------RKPTLLEKLLSTDIKRDKRQLLQVFRFMVANSFFKDWPEKPLKFPSVV 585

Query: 311 VKDGGAAGEVVEEKSLLLGEEDSEGKGRRMVEHFEHDENRERMHHNIQGFEDENDDEEED 370
           VK+ G   E+VE+KS L+GEE SE +   + E+F                  + DD  E 
Sbjct: 586 VKEDGYEDEIVEKKSSLVGEEVSEDRNNTIAENF-----------------GDRDDNIEH 628

Query: 371 NDNGEQMKVYIAGKGNTNDEHVRLEEEEGEIID 403
           +   E    ++ GK +  DE  R+  EEGEI+D
Sbjct: 629 DAQVELGNCFVRGKCDIVDEVDRV--EEGEIVD 659


>gi|147775405|emb|CAN73830.1| hypothetical protein VITISV_043067 [Vitis vinifera]
          Length = 605

 Score =  270 bits (690), Expect = 8e-70,   Method: Compositional matrix adjust.
 Identities = 161/317 (50%), Positives = 200/317 (63%), Gaps = 16/317 (5%)

Query: 44  QRNFIQNGQGSISKSNSKKVSGKNFIKN-RTKEFSQGFQKSHFHQMHNGKRKFGFYNDQK 102
           Q NF+++   + S  N K  S KNF+ N + K    G +KS  H M NG+ K G  N+ +
Sbjct: 250 QGNFVKDAGVNNSNPNWKNSSRKNFMXNPKGKNSHWGSRKSQLHHMQNGRGKAGISNENR 309

Query: 103 PKGHGNGKAIKFGLANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLT 162
            KG  N  A      N   Q + +K++ L L YTEQE++ W EER+KNYP+K N+ KK  
Sbjct: 310 GKGLSNNMAGNLCRPNFTYQDKVEKKRPLPLNYTEQEIQNWREERKKNYPSKINLEKKSA 369

Query: 163 EKQSNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDS-KKQVRARVENT-MP 220
           EK +NS+VI+ E K RR+QLKEILAKQAELGVEVAEIP +YL DS K+QV  R EN    
Sbjct: 370 EKLTNSEVIEAEVKSRRQQLKEILAKQAELGVEVAEIPPHYLSDSEKQQVHGREENNKKA 429

Query: 221 LNKRGRFQNNYDKRGRYKRK--------DQFSKEQKLADKDS-----SNTSSFNKKKPTL 267
             K+ RFQN  +KR R+ RK        D F+K+Q+LA  DS     SN    NKKK TL
Sbjct: 430 FGKKERFQNRGNKRRRHDRKQWQRHDQEDGFTKKQRLAGTDSGDTNASNQPPLNKKKQTL 489

Query: 268 LQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLL 327
           LQKLLS D+KRDK HLLQ FRFMA+NSFFKDWPEKP+ FPLV VK+ G  GEVV+ KS  
Sbjct: 490 LQKLLSTDIKRDKRHLLQVFRFMAMNSFFKDWPEKPLKFPLVAVKETGCQGEVVDRKSSP 549

Query: 328 LGEEDSEGKGRRMVEHF 344
             +   +G  +   E F
Sbjct: 550 TSKGVPQGGRKTXAEEF 566


>gi|224113339|ref|XP_002316461.1| predicted protein [Populus trichocarpa]
 gi|222865501|gb|EEF02632.1| predicted protein [Populus trichocarpa]
          Length = 674

 Score =  259 bits (663), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 185/410 (45%), Positives = 223/410 (54%), Gaps = 81/410 (19%)

Query: 44  QRNFIQNGQGSISKSNSKKVSGKNFIKNRTKEFSQ-GFQKSHFHQMHNGKRKFGFYNDQK 102
           Q N  +NGQGS   S     SGKNF  N  +E SQ G QKS FH+M NGKRK GF N   
Sbjct: 296 QGNPAKNGQGSTPNSKQGIFSGKNFKNNPKREQSQSGHQKSEFHRMDNGKRKLGFSNKHG 355

Query: 103 PKGHGNGKAIKFGLANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLT 162
            KG GN +A KFG ++  NQ  E+KR  +   YTEQE+KQW E RRK++PTK NI KK T
Sbjct: 356 GKGKGNERAAKFGRSDPSNQVMEQKRTHI---YTEQEIKQWRESRRKHFPTKTNIEKKQT 412

Query: 163 EKQSNSDVIDKEAKMRRE-------------QLKEILAKQAELGVEVAEIPSYY------ 203
           EK  +S VIDKEA  RR+             QLKEILAKQAELGVEVAEIP  Y      
Sbjct: 413 EKLIDSGVIDKEANFRRKVLNALPRKLDDKLQLKEILAKQAELGVEVAEIPPEYMLDSEK 472

Query: 204 ------------------------------LLDSKKQVRARVENTMPLNKRGRFQNNYDK 233
                                         LLDS+KQ   R +N   L K+GRF N +D+
Sbjct: 473 LGVEVAETPLSYLLDSEKLGVEAAEIPPHHLLDSEKQEHGREDNRRSLTKKGRFWNKHDR 532

Query: 234 RGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAIN 293
           RGRYKRK +   +  L +++         +KPTLL+KLL AD+KRDK  LLQ FRFM  N
Sbjct: 533 RGRYKRKGRSDMQLGLENEE---------RKPTLLEKLLCADIKRDKHRLLQVFRFMVAN 583

Query: 294 SFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKGRRMVEHFEHDENRERM 353
           SFFKDW +KP+ FP V+VK+ G   E  EEK  L+GEE+SE      VE F         
Sbjct: 584 SFFKDWSDKPLKFPSVVVKEDGCKDEPQEEKPSLVGEEESEVPNNTTVEDF--------- 634

Query: 354 HHNIQGFEDENDDEEEDNDNGEQMKVYIAGKGNTNDEHVRLEEEEGEIID 403
                    + DD +E +   E     + GK +  DE  R+  EEGEIID
Sbjct: 635 --------GDRDDGDEHDAQVEPGNGSVMGKCDIVDEVYRV--EEGEIID 674


>gi|255554855|ref|XP_002518465.1| conserved hypothetical protein [Ricinus communis]
 gi|223542310|gb|EEF43852.1| conserved hypothetical protein [Ricinus communis]
          Length = 273

 Score =  249 bits (636), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 128/222 (57%), Positives = 169/222 (76%), Gaps = 4/222 (1%)

Query: 125 EKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKE 184
           +++ +S AL YTEQE+K W EERRKN+P+KANI K  +E+ +NS  IDKEAK+RRE+LKE
Sbjct: 5   QRENRSPALIYTEQEIKLWREERRKNFPSKANIEKMCSERVTNSGGIDKEAKLRRERLKE 64

Query: 185 ILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYDKRGRYKRKDQFS 244
           ILAKQAELGVEVAEIPS+YL DS+KQV+ + ++     K+GR ++ +D+RGRY +KD  +
Sbjct: 65  ILAKQAELGVEVAEIPSHYLSDSEKQVKVKKDSRRSAPKKGRSRHKHDRRGRYNKKDGLT 124

Query: 245 KEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPV 304
           ++  LA KDSSN SSF++ KPTLLQKLLSA+V++DK HLLQ FRFM +NSFF D PEKP+
Sbjct: 125 QQNTLAKKDSSNGSSFSRTKPTLLQKLLSAEVRKDKHHLLQVFRFMVMNSFFDDGPEKPL 184

Query: 305 NFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKGRRMVEHFEH 346
            FP V+VK+     EVV EK+ + G+  SE   +  VE+  H
Sbjct: 185 KFPSVVVKE----DEVVAEKTSITGKGISEVSNKTAVENSGH 222


>gi|359487059|ref|XP_002271061.2| PREDICTED: uncharacterized protein LOC100252331 [Vitis vinifera]
          Length = 1031

 Score =  239 bits (609), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 139/257 (54%), Positives = 170/257 (66%), Gaps = 15/257 (5%)

Query: 105 GHGNGKAIKFGLANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEK 164
           G  N  A      N   Q + +K++ L L YTEQE++ W EER+KNYP+K N+ KK  EK
Sbjct: 738 GLSNNMAGNLCRPNFTYQDKVEKKRPLPLNYTEQEIQNWREERKKNYPSKINLEKKSAEK 797

Query: 165 QSNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDS-KKQVRARVENT-MPLN 222
            +NS+VI+ E K RR+QLKEILAKQAELGVEVAEIP +YL DS K+QV  R EN      
Sbjct: 798 LTNSEVIEAEVKSRRQQLKEILAKQAELGVEVAEIPPHYLSDSEKQQVHGREENNKKAFG 857

Query: 223 KRGRFQNNYDKRGRYKRK--------DQFSKEQKLADKDS-----SNTSSFNKKKPTLLQ 269
           K+ RFQN  +KR R+ RK        D F+K+Q+LA KDS     SN    NKKK TLLQ
Sbjct: 858 KKERFQNRGNKRRRHDRKQWQRHDQEDGFTKKQRLAGKDSGDMNASNQPPLNKKKQTLLQ 917

Query: 270 KLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLG 329
           KLLS D+KRDK HLLQ FRFMA+NSFFKDWPEKP+ FPLV+VK+ G  GEVV+ KS    
Sbjct: 918 KLLSTDIKRDKRHLLQVFRFMAMNSFFKDWPEKPLKFPLVVVKETGCQGEVVDRKSSPTS 977

Query: 330 EEDSEGKGRRMVEHFEH 346
           +   +G  +   E F +
Sbjct: 978 KGVPQGGRKTAAEEFSN 994


>gi|449446325|ref|XP_004140922.1| PREDICTED: uncharacterized protein LOC101213190 [Cucumis sativus]
          Length = 599

 Score =  238 bits (606), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 156/352 (44%), Positives = 214/352 (60%), Gaps = 45/352 (12%)

Query: 66  KNFIKNRTKEFSQGFQKSHFHQMHNGKRKFGFYNDQKPKGHGNGKAIKFGLANQMNQPQE 125
           +NF++N  K    GFQK+  H + N K+KFGF   QK KG  N +  KF   N  +Q +E
Sbjct: 279 RNFMRNSKK----GFQKNQTHHLKNEKKKFGFPGGQKEKGFHNERRNKFCGTNPTDQVKE 334

Query: 126 KKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEI 185
           +KR SL+L YT+QE++QW E RRKNYP+  NI KKLT KQ+N  ++DKEAK+ R++LKEI
Sbjct: 335 QKR-SLSLVYTDQEIRQWREARRKNYPSSTNIQKKLTGKQTNCTLVDKEAKLLRQELKEI 393

Query: 186 LAKQAELGVEVAEIPSYYLL-----DSKKQVRARV-------ENTMPLNKRGRFQNNYDK 233
           LAKQAELGVEVAEIP  YL      D++KQ   R        E ++   ++   QN  +K
Sbjct: 394 LAKQAELGVEVAEIPPEYLSYSEKHDNRKQRGGRSTLGEEAEEASI---EKENSQNRLNK 450

Query: 234 RGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAIN 293
           RGR K+K++  K+ K  +K  SN     K++PTLLQKLL ADV++DKS LLQ  RF  +N
Sbjct: 451 RGRCKKKNRPRKKGKF-EKHLSNKPPLKKREPTLLQKLLKADVRKDKSQLLQALRFTVMN 509

Query: 294 SFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEED-SEGKGRRMVEH-FEHDENRE 351
           SFFK+WP KP+ FP V VK+      VV+E SL  G  +  E     +VE+   HD    
Sbjct: 510 SFFKEWPNKPLKFPSVTVKENEGETNVVDETSLSTGNFNLQETNNNSLVENDGSHD---- 565

Query: 352 RMHHNIQGFEDENDDEEEDNDNGEQMKVYIAGKGNTNDEHVRLEEEEGEIID 403
                    + +N+++ +D++  E++K    G G      +++ EEEGEIID
Sbjct: 566 --------IDSDNENDIKDSNKDEKLK----GDG------IQVLEEEGEIID 599


>gi|449494110|ref|XP_004159451.1| PREDICTED: uncharacterized LOC101213190 [Cucumis sativus]
          Length = 552

 Score =  238 bits (606), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 156/349 (44%), Positives = 212/349 (60%), Gaps = 39/349 (11%)

Query: 66  KNFIKNRTKEFSQGFQKSHFHQMHNGKRKFGFYNDQKPKGHGNGKAIKFGLANQMNQPQE 125
           +NF++N  K    GFQK+  H + N K+KFGF   QK KG  N +  KF   N  +Q +E
Sbjct: 232 RNFMRNSKK----GFQKNQTHHLKNEKKKFGFPGGQKEKGFHNERRNKFCGTNPTDQVKE 287

Query: 126 KKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEI 185
           +KR SL+L YT+QE++QW E RRKNYP+  NI KKLT KQ+N  ++DKEAK+ R++LKEI
Sbjct: 288 QKR-SLSLVYTDQEIRQWREARRKNYPSSTNIQKKLTGKQTNCTLVDKEAKLLRQELKEI 346

Query: 186 LAKQAELGVEVAEIPSYYLL-----DSKKQVRARV----ENTMPLNKRGRFQNNYDKRGR 236
           LAKQAELGVEVAEIP  YL      D++KQ   R     E      ++   QN  +KRGR
Sbjct: 347 LAKQAELGVEVAEIPPEYLSYSEKHDNRKQRGGRSTLGEEAEEASIEKENSQNRLNKRGR 406

Query: 237 YKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFF 296
            K+K++  K+ K  +K  SN     K++PTLLQKLL ADV++DKS LLQ  RF  +NSFF
Sbjct: 407 CKKKNRPRKKGKF-EKHLSNKPPLKKREPTLLQKLLKADVRKDKSQLLQALRFTVMNSFF 465

Query: 297 KDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEED-SEGKGRRMVEH-FEHDENRERMH 354
           K+WP KP+ FP V VK+      VV+E SL  G  +  E     +VE+   HD       
Sbjct: 466 KEWPNKPLKFPSVTVKENEGETNVVDETSLSTGNFNLQETNNNSLVENDGSHD------- 518

Query: 355 HNIQGFEDENDDEEEDNDNGEQMKVYIAGKGNTNDEHVRLEEEEGEIID 403
                 + +N+++ +D++  E++K    G G      +++ EEEGEIID
Sbjct: 519 -----IDSDNENDIKDSNKDEKLK----GDG------IQVLEEEGEIID 552


>gi|145334515|ref|NP_001078603.1| uncharacterized protein [Arabidopsis thaliana]
 gi|145358174|ref|NP_197345.2| uncharacterized protein [Arabidopsis thaliana]
 gi|60547897|gb|AAX23912.1| hypothetical protein At5g18440 [Arabidopsis thaliana]
 gi|71905555|gb|AAZ52755.1| hypothetical protein At5g18440 [Arabidopsis thaliana]
 gi|71905557|gb|AAZ52756.1| hypothetical protein At5g18440 [Arabidopsis thaliana]
 gi|332005180|gb|AED92563.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332005181|gb|AED92564.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 470

 Score =  233 bits (595), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 150/347 (43%), Positives = 207/347 (59%), Gaps = 30/347 (8%)

Query: 2   LTIFHDSLANFAVKTFFEIVI-VHEFSIDLQLPLVNLCLLLLLQRNFIQNGQG------- 53
             + + +L   ++  FF+  +  H FS   QL   N  L      N  QN Q        
Sbjct: 108 FAVANSNLMGHSLPNFFQPSLEPHAFSSRPQLNSFN-SLPYPPVPNPHQNHQSGPPGFSE 166

Query: 54  ------SISKSNSKKVSGKNFIKNRTKE-----FSQGFQKSHFHQMHNGKRKFGFYNDQK 102
                 S+  +N    +G +F     K        QGFQ+   HQ  NGKRK GF  D +
Sbjct: 167 PRPQGQSVDNTNGSGPNGNDFRNKFPKHQNFKGPGQGFQRPQLHQADNGKRKSGFNKDHR 226

Query: 103 PKGHGNGKAIKFGL--ANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKK 160
             G GN   +K GL  ++  N  +EKKR S AL YT +EV+QW E RRKNYPTK  + KK
Sbjct: 227 --GKGNNNKMKTGLDGSDTGNIAKEKKR-SYALMYTPREVQQWREARRKNYPTKFLVEKK 283

Query: 161 LTEKQSNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMP 220
           + +K  ++ ++D+EAKMRR+QL+E+LAKQAELGVEVAE+PS+YL ++ +QV     N   
Sbjct: 284 V-KKNVSASILDEEAKMRRQQLREVLAKQAELGVEVAEVPSHYLSNNDEQVNGDRGNNN- 341

Query: 221 LNKRGRFQNNYDKRGRYKRKDQF-SKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRD 279
             ++GRFQNN   + R+ RKD+F +K+ +L DK SS  SS   +KPTLL+KLLSAD+KRD
Sbjct: 342 -GRKGRFQNNRRNKRRHDRKDKFDNKKPRLEDKKSSQDSSITTRKPTLLEKLLSADIKRD 400

Query: 280 KSHLLQTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSL 326
           KS LLQ FRFM +NS  K++PE+P+  PL+ VK+ G   + +E+ S+
Sbjct: 401 KSQLLQVFRFMVMNSLLKEFPEQPLKLPLITVKETGCE-DAMEDPSI 446


>gi|297807865|ref|XP_002871816.1| hypothetical protein ARALYDRAFT_488722 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297317653|gb|EFH48075.1| hypothetical protein ARALYDRAFT_488722 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 454

 Score =  226 bits (576), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 119/239 (49%), Positives = 170/239 (71%), Gaps = 6/239 (2%)

Query: 78  QGFQKSHFHQMHNGKRKFGFYNDQKPKGHGNGKAIKFGL-ANQMNQPQEKKRKSLALTYT 136
           QGFQ+   HQ  NGKRK GF  D +  G GN   +K GL  +  +   ++KR+S AL YT
Sbjct: 203 QGFQRPQLHQADNGKRKSGFNKDHR--GKGNYNKMKNGLDGSDADNIAKEKRRSYALMYT 260

Query: 137 EQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILAKQAELGVEV 196
            ++V QW E RRKN+PT+ N+ KK+ +K  ++ ++D+EAKMRR+QL+E+LAKQAELG+EV
Sbjct: 261 PKDVNQWREARRKNFPTRLNVEKKV-KKNVSASILDEEAKMRRQQLREVLAKQAELGIEV 319

Query: 197 AEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYDKRGRYKRKDQFSKEQKLADKDSSN 256
           A++PS+YL ++ ++V    +N     ++ +FQNN  K+ R+ RKD+F K  +L DK+SS 
Sbjct: 320 ADVPSHYLSNTDERVHG--DNGANDGQKRKFQNNRHKQRRHGRKDKFDKTPRLDDKNSSQ 377

Query: 257 TSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPLVMVKDGG 315
            S    KKPTLL+KLLSA++KRDK HLLQ FRFM +NSF K++PE+P+  PL+ V++ G
Sbjct: 378 ESPMTTKKPTLLEKLLSANIKRDKIHLLQVFRFMVMNSFLKEFPEQPLKLPLITVEETG 436


>gi|356553962|ref|XP_003545319.1| PREDICTED: uncharacterized protein LOC100786384 [Glycine max]
          Length = 822

 Score =  219 bits (557), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 148/354 (41%), Positives = 212/354 (59%), Gaps = 17/354 (4%)

Query: 55  ISKSNSKKVSGKNFIKNRTKE--FSQGFQKSHFHQMHNGKRKFGFYNDQKPKGHGNGKAI 112
           +  SN K    KNF KN+     F  GFQKS FH ++NGK+  GF  +   KG  +G+  
Sbjct: 481 VPNSNWKGSPSKNF-KNKPNRGGFQAGFQKSKFHDVNNGKKGSGFPIEHNGKGPNSGRGG 539

Query: 113 KFGLANQMNQPQEKKR---KSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSD 169
            +GL     +P+E K+   +SL++TYT QE++QW E R+KN+P   NI KK +E   +  
Sbjct: 540 HYGL-----KPKEHKQQPERSLSVTYTVQEIQQWREARKKNHPFNNNIQKKHSEHPKDRK 594

Query: 170 VIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQN 229
            I++E   R  +LKE+LAKQAELGVEVAEIPSYYL +S  Q   + E       + +FQN
Sbjct: 595 AINREVLQR--ELKEVLAKQAELGVEVAEIPSYYLKNSDNQA-LQSEGKNKFTDKRKFQN 651

Query: 230 NYDKRGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRF 289
            ++K+    RK +F+K QK  DKD S + S  K+KPTLLQKLLS+DVKRDKSHL+Q FRF
Sbjct: 652 KFNKKS--DRKGRFAKRQKFDDKDFSESPSLKKRKPTLLQKLLSSDVKRDKSHLIQVFRF 709

Query: 290 MAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKGRRMVEHFEHDEN 349
           M +NSFFK   +KP+ +PLV+VK+ G+  +  EEK L  G++  +G     V+      N
Sbjct: 710 MVMNSFFKHCLDKPLRYPLVVVKEKGSEVD-GEEKYLHTGKDVLKGGNEETVQKIVTFNN 768

Query: 350 RERMHHNIQGFEDENDDEEEDNDNGEQMKVYIAGKGNTNDEHVRLEEEEGEIID 403
                   +  +D+ +D   DN+  +     +  + ++ +   + +EEEGEII+
Sbjct: 769 DNSHDCEDEDSDDDENDSIVDNNLHKDPSSLVKRQCDSGEGIGKFDEEEGEIIE 822


>gi|358347440|ref|XP_003637765.1| hypothetical protein MTR_101s0034 [Medicago truncatula]
 gi|355503700|gb|AES84903.1| hypothetical protein MTR_101s0034 [Medicago truncatula]
          Length = 553

 Score =  216 bits (551), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 152/355 (42%), Positives = 209/355 (58%), Gaps = 49/355 (13%)

Query: 54  SISKSNSKKVSGKNFIKNRTKEFSQG-FQKSHFHQMHNGKRKFGFYNDQKPKGHGNGKAI 112
           + + SN K    KNF   + +  SQG FQKS F+  + GKR   F  + K KG  N +A 
Sbjct: 243 NFANSNWKGSPNKNFKNKQNRGGSQGGFQKSKFNDTNKGKR---FSKEHKGKGPNNERAG 299

Query: 113 KFGLANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVID 172
            FGL    N  QE KR S +LTY+EQE++QW E RRKN+P++  I K+ +E+  +S VID
Sbjct: 300 HFGL----NSKQEPKR-SFSLTYSEQEIQQWREARRKNHPSREKIEKQQSEQSKDSKVID 354

Query: 173 KEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQN--- 229
           +E   R  +LKE+LAKQAELG+EVAEIPSYYL++   Q     +       + +F+N   
Sbjct: 355 RELLQR--ELKEVLAKQAELGIEVAEIPSYYLMNGTNQGLQSEDKKNSFTDKRKFKNKLM 412

Query: 230 -NYDKRGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFR 288
            N D+R R  +K      QK AD+D       NKKKPTLLQKLLSAD+KRDKSHL Q FR
Sbjct: 413 RNPDRRSRNNKK------QKFADRDLLE----NKKKPTLLQKLLSADIKRDKSHLFQVFR 462

Query: 289 FMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKGRRMVEHFEHDE 348
           F+  NSFFKD+P+KP+ +P V VK+ G+  EV   K  L G       G  ++EH     
Sbjct: 463 FITANSFFKDYPDKPLVYPPVSVKEMGS--EVYGGKKHLQG-------GEAVLEH----G 509

Query: 349 NRERMHHNIQGFEDENDDEEEDNDNGEQMKVYIAGKGNTNDEHVRLEEEEGEIID 403
            +E +   ++G ++ +D+E+E++D           +G+  D     +EEEGEII+
Sbjct: 510 TKEIVQKFVKGSDNGHDNEDEESD-----------EGDNEDGINEFDEEEGEIIE 553


>gi|115440647|ref|NP_001044603.1| Os01g0814000 [Oryza sativa Japonica Group]
 gi|56785048|dbj|BAD82687.1| unknown protein [Oryza sativa Japonica Group]
 gi|113534134|dbj|BAF06517.1| Os01g0814000 [Oryza sativa Japonica Group]
          Length = 533

 Score =  167 bits (422), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 90/186 (48%), Positives = 121/186 (65%), Gaps = 2/186 (1%)

Query: 128 RKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILA 187
           RK   + Y   EVKQW E R+KNYPT  NINKKL+E + + +  D+EA+MRR++LKE+L 
Sbjct: 281 RKRPRIIYDANEVKQWLEARKKNYPTSVNINKKLSESRPDGEKKDEEAQMRRQELKEVLE 340

Query: 188 KQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYD-KRGRYKRKDQFSKE 246
           KQ ELG E+ E+P  YL + + Q   R  N     +  RF N  D KR RY R D  SK 
Sbjct: 341 KQKELGFELPELPPGYLSEHEDQGNGRRSNWKTQRRDCRFGNRADNKRSRYDRNDFQSKR 400

Query: 247 QKLADKDSSNTSSFNK-KKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVN 305
            K+ ++   +  +  K ++PTLLQKLLS+DVKRD+  LL TF+FM +N+FF D+P+KP+ 
Sbjct: 401 PKVRNQTHCDDGAVLKSREPTLLQKLLSSDVKRDRHRLLHTFKFMVLNNFFSDYPDKPLE 460

Query: 306 FPLVMV 311
           FP V V
Sbjct: 461 FPSVKV 466


>gi|242054757|ref|XP_002456524.1| hypothetical protein SORBIDRAFT_03g037800 [Sorghum bicolor]
 gi|241928499|gb|EES01644.1| hypothetical protein SORBIDRAFT_03g037800 [Sorghum bicolor]
          Length = 568

 Score =  162 bits (409), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 84/188 (44%), Positives = 122/188 (64%), Gaps = 3/188 (1%)

Query: 127 KRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEIL 186
           +RK   + Y   EVK+W E R+KNYPT  N++KKL+E  S+++  DK+A++RR++LKE+L
Sbjct: 295 RRKPPRIIYDANEVKRWVEARKKNYPTSVNVHKKLSEINSDNENKDKDAQLRRQELKEVL 354

Query: 187 AKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQN--NYDKRGRYKRKDQFS 244
           AKQ ELG ++ E+P  YL ++  Q      N     +   F N  N +KR RY+R    S
Sbjct: 355 AKQQELGFDLPELPPGYLSETGDQCIENKNNRKAQCRDSHFGNRSNNNKRPRYERGGFQS 414

Query: 245 KEQKLADKDSSNTSSFNK-KKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKP 303
           K  K+ ++      +  K ++PTLLQKLLS+D+KRD+  LL  F+FM +N+FFKDWP+KP
Sbjct: 415 KRSKVWNRTPCADDAMAKSREPTLLQKLLSSDIKRDRHRLLHVFKFMTLNNFFKDWPDKP 474

Query: 304 VNFPLVMV 311
           + FP V V
Sbjct: 475 LQFPSVKV 482


>gi|357125563|ref|XP_003564462.1| PREDICTED: uncharacterized protein LOC100829078 [Brachypodium
           distachyon]
          Length = 609

 Score =  161 bits (407), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 87/195 (44%), Positives = 120/195 (61%), Gaps = 15/195 (7%)

Query: 128 RKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILA 187
           RK   + Y   EVKQW E R+KNYPT  N+NKKL++ QS+    D+EA+MRR +LKE+LA
Sbjct: 333 RKRPPICYDTNEVKQWVEARKKNYPTSVNVNKKLSQSQSDDQKKDEEAQMRRLELKEVLA 392

Query: 188 KQAELGVEVAEIPSYYLLDSKKQVRARVEN-----------TMPLNKRGRFQNNYDKRGR 236
           KQ ELG E+ E+P  YL +++ Q +    N                 RGR     +KR R
Sbjct: 393 KQKELGFELPELPPGYLSETEGQPKGPQGNDKESKWKTQQRDCRFGNRGR----GNKRQR 448

Query: 237 YKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFF 296
           Y R D  SK  +  +    +  +  K++PTLLQKLLS+D+KRD+  LL TF+FMA+N+FF
Sbjct: 449 YDRADFQSKRPREWNNSRHDGGAVAKREPTLLQKLLSSDIKRDRHRLLHTFKFMALNNFF 508

Query: 297 KDWPEKPVNFPLVMV 311
           ++ P+KP+ FP V V
Sbjct: 509 RNLPDKPLEFPSVKV 523


>gi|223943225|gb|ACN25696.1| unknown [Zea mays]
 gi|414880010|tpg|DAA57141.1| TPA: hypothetical protein ZEAMMB73_684772 [Zea mays]
          Length = 562

 Score =  160 bits (406), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 84/191 (43%), Positives = 125/191 (65%), Gaps = 13/191 (6%)

Query: 127 KRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEIL 186
           +RK   + Y   EVKQW E R+KNYPT  N+NKKL+E +S+++  DK+A++RR++LKE++
Sbjct: 290 RRKPPPIIYDANEVKQWVEARKKNYPTSVNVNKKLSEMKSDNENKDKDAQLRRQELKEVI 349

Query: 187 AKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQN-------NYDKRGRYKR 239
           AKQ ELG +  E+P  YL +   Q    +EN    N++ +F++       N +KR RY+R
Sbjct: 350 AKQQELGFDPPELPPGYLSEIGDQC---IENKN--NRKTQFRDSHFGNRFNNNKRSRYER 404

Query: 240 KDQFSKEQKLAD-KDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKD 298
               SK  K+ +    ++      ++PTLLQKLLS+D+KRD+  LL  FRFM +N+FFKD
Sbjct: 405 GGFQSKRSKVWNCTPHADHGMVKSREPTLLQKLLSSDIKRDRHRLLHVFRFMTLNNFFKD 464

Query: 299 WPEKPVNFPLV 309
           WP++P+ FP V
Sbjct: 465 WPDRPLQFPSV 475


>gi|218189267|gb|EEC71694.1| hypothetical protein OsI_04190 [Oryza sativa Indica Group]
          Length = 500

 Score =  160 bits (404), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 91/199 (45%), Positives = 122/199 (61%), Gaps = 15/199 (7%)

Query: 128 RKSLALTYTEQEVKQWCEERRKNYPTKANINK-------------KLTEKQSNSDVIDKE 174
           RK   + Y   EVKQW E R+KNYPT  NINK             KL+E + + +  D+E
Sbjct: 237 RKRPRIIYDANEVKQWLEARKKNYPTSVNINKTIREASSTDHLMQKLSESRPDGEKKDEE 296

Query: 175 AKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYD-K 233
           A+MRR++LKE+LAKQ ELG E+ E+P  YL + + Q   R  N     +  RF N  D K
Sbjct: 297 AQMRRQELKEVLAKQKELGFELPELPPGYLSEHEDQGNGRRSNWKTQRRDCRFGNRADNK 356

Query: 234 RGRYKRKDQFSKEQKLADKDSSNTSSFNK-KKPTLLQKLLSADVKRDKSHLLQTFRFMAI 292
           R RY R D  SK  K+ ++   +  +  K ++PTLLQKLLS+DVKRD+  LL TF+FM +
Sbjct: 357 RSRYDRNDFQSKRPKVRNQTHCDDGAVLKSREPTLLQKLLSSDVKRDRHRLLHTFKFMVL 416

Query: 293 NSFFKDWPEKPVNFPLVMV 311
           N+FF D+P+KP+ FP V V
Sbjct: 417 NNFFSDYPDKPLEFPSVKV 435


>gi|296085358|emb|CBI29090.3| unnamed protein product [Vitis vinifera]
          Length = 467

 Score =  159 bits (401), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 102/216 (47%), Positives = 133/216 (61%), Gaps = 11/216 (5%)

Query: 44  QRNFIQNGQGSISKSNSKKVSGKNFIKN-RTKEFSQGFQKSHFHQMHNGKRKFGFYNDQK 102
           Q NF+++   + S  N K  S KNF+KN + K    G +KS  H M N + K G  N+ +
Sbjct: 244 QGNFVKDAGVNNSNPNWKNSSRKNFMKNPKGKNSHWGSRKSQLHHMQNARGKAGISNENR 303

Query: 103 PKGHGNGKAIKFGLANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLT 162
            KG  N  A      N   Q + +K++ L L YTEQE++ W EER+KNYP+K N+ KK  
Sbjct: 304 GKGLSNNMAGNLCRPNFTYQDKVEKKRPLPLNYTEQEIQNWREERKKNYPSKINLEKKSA 363

Query: 163 EKQSNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDS-KKQVRARVENT-MP 220
           EK +NS+VI+ E K RR+QLKEILAKQAELGVEVAEIP +YL DS K+QV  R EN    
Sbjct: 364 EKLTNSEVIEAEVKSRRQQLKEILAKQAELGVEVAEIPPHYLSDSEKQQVHGREENNKKA 423

Query: 221 LNKRGRFQNNYDKRGRYKRK--------DQFSKEQK 248
             K+ RFQN  +KR R+ RK        D F+K+Q+
Sbjct: 424 FGKKERFQNRGNKRRRHDRKQWQRHDQEDGFTKKQR 459


>gi|293335934|ref|NP_001167960.1| uncharacterized protein LOC100381676 [Zea mays]
 gi|223945137|gb|ACN26652.1| unknown [Zea mays]
 gi|414880008|tpg|DAA57139.1| TPA: hypothetical protein ZEAMMB73_684772 [Zea mays]
          Length = 295

 Score =  154 bits (388), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 83/187 (44%), Positives = 119/187 (63%), Gaps = 4/187 (2%)

Query: 127 KRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRRE-QLKEI 185
           +RK   + Y   EVKQW E R+KNYPT  N+NKKL+E +S+++  DK+A++RR+ +LKE+
Sbjct: 22  RRKPPPIIYDANEVKQWVEARKKNYPTSVNVNKKLSEMKSDNENKDKDAQLRRQVELKEV 81

Query: 186 LAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQN--NYDKRGRYKRKDQF 243
           +AKQ ELG +  E+P  YL +   Q      N     +   F N  N +KR RY+R    
Sbjct: 82  IAKQQELGFDPPELPPGYLSEIGDQCIENKNNRKTQFRDSHFGNRFNNNKRSRYERGGFQ 141

Query: 244 SKEQKLAD-KDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEK 302
           SK  K+ +    ++      ++PTLLQKLLS+D+KRD+  LL  FRFM +N+FFKDWP++
Sbjct: 142 SKRSKVWNCTPHADHGMVKSREPTLLQKLLSSDIKRDRHRLLHVFRFMTLNNFFKDWPDR 201

Query: 303 PVNFPLV 309
           P+ FP V
Sbjct: 202 PLQFPSV 208


>gi|20161477|dbj|BAB90401.1| P0432B10.22 [Oryza sativa Japonica Group]
          Length = 541

 Score =  154 bits (388), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 90/208 (43%), Positives = 121/208 (58%), Gaps = 24/208 (11%)

Query: 128 RKSLALTYTEQEVKQWCEERRKNYPTKANINK----------------------KLTEKQ 165
           RK   + Y   EVKQW E R+KNYPT  NINK                      KL+E +
Sbjct: 267 RKRPRIIYDANEVKQWLEARKKNYPTSVNINKASATLCPHQTIREASSTDHLMQKLSESR 326

Query: 166 SNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRG 225
            + +  D+EA+MRR++LKE+L KQ ELG E+ E+P  YL + + Q   R  N     +  
Sbjct: 327 PDGEKKDEEAQMRRQELKEVLEKQKELGFELPELPPGYLSEHEDQGNGRRSNWKTQRRDC 386

Query: 226 RFQNNYD-KRGRYKRKDQFSKEQKLADKDSSNTSSFNK-KKPTLLQKLLSADVKRDKSHL 283
           RF N  D KR RY R D  SK  K+ ++   +  +  K ++PTLLQKLLS+DVKRD+  L
Sbjct: 387 RFGNRADNKRSRYDRNDFQSKRPKVRNQTHCDDGAVLKSREPTLLQKLLSSDVKRDRHRL 446

Query: 284 LQTFRFMAINSFFKDWPEKPVNFPLVMV 311
           L TF+FM +N+FF D+P+KP+ FP V V
Sbjct: 447 LHTFKFMVLNNFFSDYPDKPLEFPSVKV 474


>gi|326528619|dbj|BAJ97331.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 625

 Score =  151 bits (382), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 90/218 (41%), Positives = 128/218 (58%), Gaps = 20/218 (9%)

Query: 128 RKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILA 187
           RK   + Y + EVKQW + R+KNYPT+AN+NKKL + Q +    D+EA+MRR++LKE++A
Sbjct: 335 RKRPPIIYDKNEVKQWVQARKKNYPTRANVNKKLCQNQLDEQKKDEEAQMRRQELKEVIA 394

Query: 188 KQAELGVEVAEIPSYYLLDSKKQVR----------------ARVENTMPLNKRGRFQNNY 231
           KQ ELG+E+ E+P  YL D++ Q R                           RGR     
Sbjct: 395 KQKELGLELPELPPGYLSDNEGQPRDPQKESNWKTRQGGGRFGNRGRGRGQGRGRGSGRD 454

Query: 232 DKRGRY-KRKDQFSKEQKLADKDS---SNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTF 287
           +KR RY  R+D  SK  +  + +S      +    ++PTLLQKLL++D+KRD+  LL TF
Sbjct: 455 NKRQRYDSREDSPSKRPREWNNNSRCHDGGAVAKIREPTLLQKLLNSDIKRDRHRLLHTF 514

Query: 288 RFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKS 325
           +FMA+N+FFKDWP KP+ FP V +       ++ EE S
Sbjct: 515 KFMALNNFFKDWPAKPLEFPSVKLNQIELESDIAEEGS 552


>gi|356562267|ref|XP_003549393.1| PREDICTED: uncharacterized protein LOC100789443 [Glycine max]
          Length = 684

 Score =  149 bits (375), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 103/250 (41%), Positives = 153/250 (61%), Gaps = 21/250 (8%)

Query: 159 KKLTEKQSNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQ-VRARVEN 217
           +K +E   +S  I++E   R  +LKE+LAKQAELGVEVAEIPSYYL +S  Q +++  +N
Sbjct: 451 QKHSECPKDSKAINREVLQR--ELKEVLAKQAELGVEVAEIPSYYLKNSDNQGLQSEAKN 508

Query: 218 TMPLNKRGRFQNNYDKRGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVK 277
               +KR +FQN ++K+    RK +F+++QK ADKD S + S  K+KPTLLQKLLS+DVK
Sbjct: 509 KY-TDKR-KFQNKFNKKS--DRKGRFAEKQKFADKDFSESPSLKKRKPTLLQKLLSSDVK 564

Query: 278 RDKSHLLQTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKG 337
           RDKSHLLQ  RFM +NSFF+  P+K + +P V VK+ G+  EV  EK  L   +D   +G
Sbjct: 565 RDKSHLLQVLRFMVMNSFFRHIPDKALRYPSVEVKEKGS--EVSGEKKHLHTGKDVLNRG 622

Query: 338 R----RMVEHFEHDENRERMHHNIQGFEDENDDEEEDNDNGEQMKVYIAGKGNTNDEHVR 393
                + +  F +D   +         +D+ +D   DN+  +     +  + +  +   +
Sbjct: 623 SEETVQKIVIFNNDNGHD--------CDDDENDSIVDNNLHKDPSSLVKRQCDGGEGIKK 674

Query: 394 LEEEEGEIID 403
            +EEEGEI++
Sbjct: 675 SDEEEGEILE 684



 Score = 43.5 bits (101), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 27/64 (42%), Positives = 36/64 (56%), Gaps = 3/64 (4%)

Query: 55  ISKSNSKKVSGKNFIKNRTKE--FSQGFQKSHFHQMHNGKRKFGFYNDQKPKGHGNGKAI 112
           +  SN K    KN +KN+     F  GFQKS FH ++NGKR  GF  +   KG  +G+A 
Sbjct: 282 VPNSNWKGSPSKN-LKNKPNRGRFQGGFQKSKFHDVNNGKRGSGFPKEHNGKGPNSGRAG 340

Query: 113 KFGL 116
            +GL
Sbjct: 341 HYGL 344


>gi|222619440|gb|EEE55572.1| hypothetical protein OsJ_03849 [Oryza sativa Japonica Group]
          Length = 484

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 49/120 (40%), Positives = 66/120 (55%), Gaps = 13/120 (10%)

Query: 128 RKSLALTYTEQEVKQWCEERRKNYPTKANINK-------------KLTEKQSNSDVIDKE 174
           RK   + Y   EVKQW E R+KNYPT  NINK             KL+E + + +  D+E
Sbjct: 281 RKRPRIIYDANEVKQWLEARKKNYPTSVNINKTIREASSTDHLMQKLSESRPDGEKKDEE 340

Query: 175 AKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYDKR 234
           A+MRR++LKE+L KQ ELG E+ E+P  YL + + Q   R  N     +  RF N  D +
Sbjct: 341 AQMRRQELKEVLEKQKELGFELPELPPGYLSEHEDQGNGRRSNWKTQRRDCRFGNRADNK 400


>gi|168019512|ref|XP_001762288.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162686366|gb|EDQ72755.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 614

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 42/90 (46%), Positives = 65/90 (72%), Gaps = 6/90 (6%)

Query: 131 LALTYTEQ---EVKQWCEERRKNYPTKANINKK---LTEKQSNSDVIDKEAKMRREQLKE 184
           +AL  +E+   E++ W EER +NYPT++NI +K   L  K +  ++ID++ + R++++KE
Sbjct: 278 VALKASEEDDDEIRAWREERMRNYPTRSNIQRKKEALEGKVARGELIDEDTRKRQQRMKE 337

Query: 185 ILAKQAELGVEVAEIPSYYLLDSKKQVRAR 214
           ILAKQAELGV VAE+P++YL  S K+ R R
Sbjct: 338 ILAKQAELGVPVAEVPAHYLSQSYKEDRGR 367



 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 40/129 (31%), Positives = 58/129 (44%), Gaps = 18/129 (13%)

Query: 220 PLNKRGRFQNNYDKRGRYKR--KDQFSKEQKLADKDSSNTSSFN---------KKKPTLL 268
           P +K G+ Q  + +RGR K+  K  F+       K   N +            K+ PTLL
Sbjct: 461 PSDKPGKTQCYFYQRGRCKKGFKCHFAHTGPRGKKGGDNEAGRGGPKGVGVEKKRPPTLL 520

Query: 269 QKLLSADVKRDKSHLLQTFRFMAINSFFKDWP-------EKPVNFPLVMVKDGGAAGEVV 321
            KLL  ++ RDKS++LQ  R+   N+F  +WP       E P            + GEVV
Sbjct: 521 SKLLQGEINRDKSYILQCLRYFVNNNFLLNWPNQLSESIELPEQSDDAHESSDYSEGEVV 580

Query: 322 EEKSLLLGE 330
           E   +LL +
Sbjct: 581 EAPDVLLAD 589


>gi|219363611|ref|NP_001136910.1| uncharacterized protein LOC100217067 [Zea mays]
 gi|194697570|gb|ACF82869.1| unknown [Zea mays]
          Length = 138

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 30/47 (63%), Positives = 40/47 (85%)

Query: 263 KKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPLV 309
           ++PTLLQKLLS+D+KRD+  LL  FRFM +N+FFKDWP++P+ FP V
Sbjct: 5   REPTLLQKLLSSDIKRDRHRLLHVFRFMTLNNFFKDWPDRPLQFPSV 51


>gi|255546771|ref|XP_002514444.1| conserved hypothetical protein [Ricinus communis]
 gi|223546440|gb|EEF47940.1| conserved hypothetical protein [Ricinus communis]
          Length = 123

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 55/129 (42%), Positives = 62/129 (48%), Gaps = 39/129 (30%)

Query: 181 QLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYDKRGRYKRK 240
           +LKE LAKQAELGVEVAEIPS YL DS+KQV+                            
Sbjct: 28  RLKETLAKQAELGVEVAEIPSRYLSDSEKQVK---------------------------- 59

Query: 241 DQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRD-KSHLLQTFRFMAINSFF--- 296
                   L +KDS+N SSF+  KPTL QKLLSA+ K    S  LQ         FF   
Sbjct: 60  ------DTLTNKDSTNGSSFSNTKPTLRQKLLSAEGKGAFLSECLQASSTFTCEQFFSSN 113

Query: 297 KDWP-EKPV 304
           K W   KPV
Sbjct: 114 KLWNFTKPV 122


>gi|384254158|gb|EIE27632.1| hypothetical protein COCSUDRAFT_55624 [Coccomyxa subellipsoidea
           C-169]
          Length = 539

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 22/50 (44%), Positives = 33/50 (66%)

Query: 258 SSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFP 307
           S   +++PTLLQKLL+ D++ + S LLQ  RF+  N+F  D+   P+ FP
Sbjct: 313 SPVQQRQPTLLQKLLAPDIRAEHSRLLQCLRFLVTNNFLLDFGRAPLIFP 362


>gi|328766403|gb|EGF76457.1| hypothetical protein BATDEDRAFT_28378 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 411

 Score = 51.2 bits (121), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 49/185 (26%), Positives = 88/185 (47%), Gaps = 25/185 (13%)

Query: 136 TEQEVKQWCEERRKNYPTKANINKKL---TEKQSNSDVID---KEAKMRREQLKEILAK- 188
           T ++++QW +ER++ +PT++NI +K+    ++Q   ++ID   K+ K R    ++++ K 
Sbjct: 217 TPKDIEQWIQERKRKFPTESNIKQKMEDEKQRQKQGNIIDVNGKKLKGRNADKRQLIRKN 276

Query: 189 ---QAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNN--YDKRGRYKRKDQF 243
              Q + G    +  + YL D+         +T+ L+  G+   +       R+    + 
Sbjct: 277 DTVQEKHGTSEQKDSTIYLDDASNAKLEASHDTLALDMTGKLNQHKGASSCCRHGSNCRH 336

Query: 244 SKE-QKLADKDSSNTSSFNKKKPT-----------LLQKLLSADVKRDKSHLLQTFRFMA 291
           S +  KLA+  S+   +     PT           LL  LL  D K+DK+ LLQ  RF+ 
Sbjct: 337 SHDLSKLAESRSA-PKTHTPLNPTAAPIKGTNRRPLLNMLLETDFKKDKNALLQCIRFII 395

Query: 292 INSFF 296
            N FF
Sbjct: 396 ENKFF 400


>gi|308800704|ref|XP_003075133.1| Polyadenylation factor I complex, subunit, Yth1 (CPSF subunit)
           (ISS) [Ostreococcus tauri]
 gi|116061687|emb|CAL52405.1| Polyadenylation factor I complex, subunit, Yth1 (CPSF subunit)
           (ISS) [Ostreococcus tauri]
          Length = 401

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 28/64 (43%), Positives = 38/64 (59%), Gaps = 2/64 (3%)

Query: 234 RGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAIN 293
           RGR K   +      + DK +S  SS N +  TLL+KLL  ++KRD+  LLQ FRF+  N
Sbjct: 261 RGRCKAGKRCPFRHDIVDKKTSGVSSGNPQ--TLLKKLLHKEIKRDEERLLQLFRFLVNN 318

Query: 294 SFFK 297
            FF+
Sbjct: 319 DFFE 322


>gi|145343958|ref|XP_001416510.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144576735|gb|ABO94803.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 291

 Score = 46.6 bits (109), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 62/223 (27%), Positives = 100/223 (44%), Gaps = 44/223 (19%)

Query: 138 QEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILAKQAELG---- 193
           ++V+++ EER K +P+  N+  K    +       +    RRE+L+EILAKQ E+G    
Sbjct: 28  EDVRRYREERAKYWPSDRNVRAKTEAGEEEE----RARAARRERLREILAKQREMGHFEA 83

Query: 194 -VEVAE----IPSYYLLDSKKQVRARVENTMPLNKRGRF--------------------- 227
             E+ E    + +   LD K  V AR  +    +K  RF                     
Sbjct: 84  SQEIGEEEGGVAAPAALDGKG-VEARAADG---SKVCRFWLQGGCRKGSACDFKHESAPN 139

Query: 228 ---QNNYDKRGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLL 284
              +  +  RGR K   +   + ++ ++ SS        + TLL+KLL  ++KRD+  LL
Sbjct: 140 KDQKCRFFARGRCKAGARCPFKHEVTERKSSAADGGGNPQ-TLLKKLLDKEIKRDEQRLL 198

Query: 285 QTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLL 327
           Q FRF   N FF     K V+ PL M     +  +V++ ++ L
Sbjct: 199 QLFRFFVNNDFFTG--TKGVDEPLWMFPWADSGKKVIDRRAAL 239


>gi|118358218|ref|XP_001012358.1| hypothetical protein TTHERM_00107150 [Tetrahymena thermophila]
 gi|89294124|gb|EAR92112.1| hypothetical protein TTHERM_00107150 [Tetrahymena thermophila
           SB210]
          Length = 762

 Score = 44.3 bits (103), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 22/63 (34%), Positives = 41/63 (65%)

Query: 239 RKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKD 298
           ++D++S+E  L+    SN+S+F  K  TLL  L+  ++ R++++LLQ  R++  N+FF +
Sbjct: 574 KRDKYSQELDLSFNYKSNSSNFRYKANTLLTNLVLDEIYRERNYLLQAIRYIVSNNFFDE 633

Query: 299 WPE 301
             E
Sbjct: 634 QAE 636



 Score = 39.3 bits (90), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 37/123 (30%), Positives = 63/123 (51%), Gaps = 21/123 (17%)

Query: 139 EVKQWCEERRKNYPTKANINKKLTEK---QSNSDVIDKE-AKMRREQLKEILAKQAELGV 194
           E+++W E+R+KNYPT  NI KK TE+   Q N  V  +E +K+ ++  K+I+   + L  
Sbjct: 289 EIRKWIEQRKKNYPTSENIKKKQTEEYRNQQNGQVNREELSKLEQKLRKKIMISNSSLKK 348

Query: 195 EVAEIPSYYLL----------DSKKQVRARVENTMPLNKRGRFQNNYDKRGRYKRKDQFS 244
              +   Y +L             K++R  +E     NK+ + QN  D   R+K ++ F 
Sbjct: 349 LEIQQKHYEMLRKLYTEGDKAPKTKKMREEIE-----NKKDKRQNRKD--DRHKNQENFK 401

Query: 245 KEQ 247
           K++
Sbjct: 402 KQE 404


>gi|392567649|gb|EIW60824.1| hypothetical protein TRAVEDRAFT_71098 [Trametes versicolor
           FP-101664 SS1]
          Length = 590

 Score = 42.0 bits (97), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 34/121 (28%), Positives = 64/121 (52%), Gaps = 15/121 (12%)

Query: 137 EQEVKQWCEERRKNYPTKANINKKL----TEKQSNSDVIDKEAKMRREQLKEILAKQAEL 192
           EQ ++++ EE++ +   + N +  +    TE +S     D+ A  R E+  +I+AK  EL
Sbjct: 211 EQALQRFIEEQKTSALQRFNFHMAVLIWETEARSAKATGDEAA--RSERHADIIAKLVEL 268

Query: 193 GVEVAEIPSY-----YLLDSKKQVRARVENTMPLNKRGRFQNNYDKRGRYKRKDQFSKEQ 247
           G + +EIP+Y      LLD  +++  R+ NT+    R +     ++R R + +D F K+ 
Sbjct: 269 GYQSSEIPAYDHAFQRLLDQPRKLTPRIWNTI----RPKLVEMLEERRRAREQDAFKKKW 324

Query: 248 K 248
           +
Sbjct: 325 R 325


>gi|126337727|ref|XP_001369929.1| PREDICTED: nuclear fragile X mental retardation-interacting protein
           1-like [Monodelphis domestica]
          Length = 469

 Score = 42.0 bits (97), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 51/104 (49%), Gaps = 16/104 (15%)

Query: 194 VEVAEIPSYYLLDSKKQVR-ARVENTMPLNKRGRFQNNYDKRGRYKRKDQFSKEQKLADK 252
           V ++++P     D KKQ R + V  T  L K    +N   KRG Y++K            
Sbjct: 358 VILSDVPQNQSEDIKKQERNSPVAKTKSLRKNWMRKNCRYKRGLYRKKP----------- 406

Query: 253 DSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFF 296
                S   K++P LL+ LL+ D++ +++ +LQ  R++  N FF
Sbjct: 407 ----VSEPQKRRPHLLEMLLAQDIRHERNVILQCVRYIIQNDFF 446



 Score = 38.5 bits (88), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 16/25 (64%), Positives = 22/25 (88%)

Query: 136 TEQEVKQWCEERRKNYPTKANINKK 160
           T +E+++W EERRKN+PT+ANI KK
Sbjct: 178 TPEEIEKWREERRKNFPTRANIEKK 202


>gi|68073975|ref|XP_678902.1| hypothetical protein [Plasmodium berghei strain ANKA]
 gi|56499510|emb|CAH94055.1| conserved hypothetical protein [Plasmodium berghei]
          Length = 611

 Score = 42.0 bits (97), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 49/182 (26%), Positives = 90/182 (49%), Gaps = 18/182 (9%)

Query: 124 QEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLK 183
            EK  K+  +  +++E+++W EER+KNYPTK    KK+ + ++N++ I+ E K ++ + K
Sbjct: 426 HEKNEKNENILESKEEIQKWIEERKKNYPTK----KKIMDIKNNNN-INSETKEKKNKEK 480

Query: 184 EILAKQAELGVE-----VAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNN--YDKRGR 236
           +I +   +L VE     +     Y     +K +   + N +  N       N  Y    +
Sbjct: 481 KITSLIEKLLVEKYSSAIGRNIYYQTQSQRKSIFVPILNKLIENNHKNVYENTYYSISDK 540

Query: 237 YKRKD-QFSKEQKLAD-KDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINS 294
           Y +K+ +F+K  K  +  DS N      K   L+ +L+  D+   +  L+Q   F+  N+
Sbjct: 541 YNKKNIKFNKRVKKHNLIDSLNVH----KNAPLIYQLMKNDIYVYEQKLMQCIEFIVNNN 596

Query: 295 FF 296
           FF
Sbjct: 597 FF 598


>gi|303286377|ref|XP_003062478.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226455995|gb|EEH53297.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 552

 Score = 41.6 bits (96), Expect = 0.66,   Method: Compositional matrix adjust.
 Identities = 17/35 (48%), Positives = 26/35 (74%)

Query: 264 KPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKD 298
           +PTLL+KL + +V+  +S LLQ FRF+  N+F +D
Sbjct: 460 QPTLLKKLFAKEVRVSRSRLLQVFRFLVNNNFLRD 494


>gi|307107626|gb|EFN55868.1| hypothetical protein CHLNCDRAFT_52119 [Chlorella variabilis]
          Length = 684

 Score = 41.6 bits (96), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 19/61 (31%), Positives = 38/61 (62%), Gaps = 2/61 (3%)

Query: 139 EVKQWCEERRKNYPTKANINKKLTEKQS--NSDVIDKEAKMRREQLKEILAKQAELGVEV 196
           EV+QW +ER++++PT   + +K  E ++      +D    +R+++L  ILA+Q+ +G+  
Sbjct: 388 EVEQWIKERKRSFPTSEAVARKEAEAEARRQRGELDAARTLRQQRLAAILAQQSAMGLSR 447

Query: 197 A 197
           A
Sbjct: 448 A 448


>gi|255088323|ref|XP_002506084.1| predicted protein [Micromonas sp. RCC299]
 gi|226521355|gb|ACO67342.1| predicted protein [Micromonas sp. RCC299]
          Length = 567

 Score = 41.2 bits (95), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 17/32 (53%), Positives = 24/32 (75%)

Query: 266 TLLQKLLSADVKRDKSHLLQTFRFMAINSFFK 297
           TLL+KLL+ +V+ D+S LLQ FRF+  N F +
Sbjct: 458 TLLRKLLAREVRADRSRLLQVFRFLVNNDFLR 489


>gi|17540726|ref|NP_499991.1| Protein F56B3.11, isoform a [Caenorhabditis elegans]
 gi|351063723|emb|CCD71947.1| Protein F56B3.11, isoform a [Caenorhabditis elegans]
          Length = 213

 Score = 40.4 bits (93), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 34/120 (28%), Positives = 57/120 (47%), Gaps = 8/120 (6%)

Query: 283 LLQTFRFMAINSFF-KDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKGRRMV 341
           L+  F ++    FF +D P+   N  L +V+D G   E+        GEE S G+ RR V
Sbjct: 93  LIGAFIYVLAGEFFAQDSPQTIFNKALALVRDDGRCQEIFGASIAGFGEETSRGR-RRHV 151

Query: 342 EHFEHDEN---RERMHHNIQGFEDEN--DDEEEDNDNGEQMK-VYIAGKGNTNDEHVRLE 395
            H +++++   R R+  +++G  DE     E E  D   Q + +Y+  K      HV ++
Sbjct: 152 AHHKYEKDGMQRIRVLFHVKGDRDEGIAQAEMEQRDGDWQWRFLYVENKRRPKTTHVLID 211


>gi|313219540|emb|CBY30463.1| unnamed protein product [Oikopleura dioica]
 gi|313226347|emb|CBY21491.1| unnamed protein product [Oikopleura dioica]
          Length = 310

 Score = 40.4 bits (93), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 20/49 (40%), Positives = 35/49 (71%), Gaps = 1/49 (2%)

Query: 136 TEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKE 184
           TE+++++W EERRKN+PTKA I +K   +++ + +  K+   +REQ +E
Sbjct: 96  TEEDIRKWREERRKNFPTKARIAEKEAARKAGTSIETKKFS-KREQERE 143


>gi|345325138|ref|XP_001514551.2| PREDICTED: nuclear fragile X mental retardation-interacting protein
           1-like [Ornithorhynchus anatinus]
          Length = 446

 Score = 39.7 bits (91), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 22/51 (43%), Positives = 29/51 (56%), Gaps = 12/51 (23%)

Query: 136 TEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEIL 186
           T +E+ +W EERRKN+PT ANI KK            K  ++ REQ  E+L
Sbjct: 125 TPEEIAKWREERRKNFPTMANIEKK------------KALQLDREQRGEVL 163


>gi|82594577|ref|XP_725484.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
 gi|23480507|gb|EAA17049.1| Zinc finger, C2H2 type, putative [Plasmodium yoelii yoelii]
          Length = 601

 Score = 39.7 bits (91), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 49/187 (26%), Positives = 89/187 (47%), Gaps = 29/187 (15%)

Query: 124 QEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLK 183
            EK  K+  +  +++E+++W EER+KNYPTK    KK+ E ++N+  I+ E K ++   K
Sbjct: 419 HEKNEKNENILESKEEIQKWIEERKKNYPTK----KKIMEMKNNN--INSEEKEKKNXQK 472

Query: 184 EILAKQAELGVE-----VAEIPSYYLLDSKKQVRARVENTMPLNK-RGRFQNNYDKRGRY 237
           +I +   +L VE     +     Y     KK +   + N +  N  +  ++N Y     Y
Sbjct: 473 KITSLIEKLLVEKYSSAIGRNIYYQTQSQKKSIFVPLLNKLIENNYKNVYENTY-----Y 527

Query: 238 KRKDQFSKEQKLADK--------DSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRF 289
              D+++++    +K        DS N      K   L+ +L+  D+   +  L+Q   F
Sbjct: 528 SISDKYNQKNIKFNKKIKKNNLIDSLNVH----KNAPLIYQLMKNDIYVYEQKLMQCIEF 583

Query: 290 MAINSFF 296
           +  N+FF
Sbjct: 584 IVNNNFF 590


>gi|341888654|gb|EGT44589.1| hypothetical protein CAEBREN_21252 [Caenorhabditis brenneri]
          Length = 219

 Score = 39.3 bits (90), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 33/120 (27%), Positives = 57/120 (47%), Gaps = 8/120 (6%)

Query: 283 LLQTFRFMAINSFF-KDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKGRRMV 341
           L+  F ++    FF +D P+   N  L +V+D G   ++        GEE S G+ RR V
Sbjct: 99  LIGAFIYVLCGEFFAQDSPQTIFNKALAIVRDDGRCQDIFGATIAGFGEETSRGR-RRHV 157

Query: 342 EHFEHDEN---RERMHHNIQGFEDEN--DDEEEDNDNGEQMK-VYIAGKGNTNDEHVRLE 395
            H +++++   R R+  +++G  DE     E E  D   Q + +Y+  K      HV ++
Sbjct: 158 AHHKYEKDGMQRIRVLFHVKGDRDEGIAQAEMEQRDGDWQWRFLYVENKRRPKTTHVLID 217


>gi|47218661|emb|CAG04990.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 350

 Score = 39.3 bits (90), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 15/34 (44%), Positives = 25/34 (73%)

Query: 263 KKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFF 296
           ++PTLL+ LL+ D++ +++ LLQ  RF+  N FF
Sbjct: 306 RRPTLLEMLLAPDIRHERNVLLQCVRFVVRNHFF 339



 Score = 38.1 bits (87), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 17/25 (68%), Positives = 19/25 (76%)

Query: 136 TEQEVKQWCEERRKNYPTKANINKK 160
           T +EV +W EERRKNYPT  NI KK
Sbjct: 69  TPEEVAKWREERRKNYPTLQNIEKK 93


>gi|291235450|ref|XP_002737657.1| PREDICTED: nuclear fragile X mental retardation protein interacting
           protein-like [Saccoglossus kowalevskii]
          Length = 639

 Score = 38.9 bits (89), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 16/35 (45%), Positives = 26/35 (74%)

Query: 262 KKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFF 296
           KKKPTLL+ LL+ D++ +++ LLQ  R++   +FF
Sbjct: 458 KKKPTLLEMLLAGDIRHERNVLLQCVRYVKKKNFF 492


>gi|67474753|ref|XP_653120.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
 gi|56470045|gb|EAL47734.1| hypothetical protein EHI_019110 [Entamoeba histolytica HM-1:IMSS]
 gi|449703311|gb|EMD43787.1| Hypothetical protein EHI5A_022200 [Entamoeba histolytica KU27]
          Length = 129

 Score = 38.9 bits (89), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 17/38 (44%), Positives = 29/38 (76%)

Query: 136 TEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDK 173
           TE+++++W EER+K +PTKA   K L+E++  +D I+K
Sbjct: 2   TEEQLQKWIEERKKRFPTKAKREKCLSEEKKKTDEIEK 39


>gi|158297378|ref|XP_001237955.2| AGAP007875-PA [Anopheles gambiae str. PEST]
 gi|157015166|gb|EAU76388.2| AGAP007875-PA [Anopheles gambiae str. PEST]
          Length = 474

 Score = 38.9 bits (89), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 14/31 (45%), Positives = 24/31 (77%)

Query: 266 TLLQKLLSADVKRDKSHLLQTFRFMAINSFF 296
           T+L+KLL AD++ +++ LLQ  R++ +N FF
Sbjct: 416 TMLEKLLDADIRHERNVLLQCVRYVLVNKFF 446


>gi|407040593|gb|EKE40213.1| hypothetical protein ENU1_097960 [Entamoeba nuttalli P19]
          Length = 129

 Score = 38.9 bits (89), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 17/39 (43%), Positives = 29/39 (74%)

Query: 135 YTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDK 173
            TE+++++W EER+K +PTKA   K L+E++  +D I+K
Sbjct: 1   MTEEQLQKWIEERKKRFPTKAKREKCLSEEKKKTDEIEK 39


>gi|327283872|ref|XP_003226664.1| PREDICTED: nuclear fragile X mental retardation-interacting protein
           1-like [Anolis carolinensis]
          Length = 451

 Score = 38.5 bits (88), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 22/50 (44%), Positives = 29/50 (58%), Gaps = 8/50 (16%)

Query: 136 TEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEI 185
           T +E+ +W EERRKN+PT ANI KK          + KE + R E LK +
Sbjct: 92  TPEEIAKWREERRKNFPTLANIEKK--------KALQKEKEQRGEVLKTL 133


>gi|395527571|ref|XP_003765917.1| PREDICTED: nuclear fragile X mental retardation-interacting protein
           1 [Sarcophilus harrisii]
          Length = 480

 Score = 38.5 bits (88), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 16/25 (64%), Positives = 22/25 (88%)

Query: 136 TEQEVKQWCEERRKNYPTKANINKK 160
           T +E+++W EERRKN+PT+ANI KK
Sbjct: 187 TPEEIEKWREERRKNFPTRANIEKK 211


>gi|156353142|ref|XP_001622934.1| predicted protein [Nematostella vectensis]
 gi|156209570|gb|EDO30834.1| predicted protein [Nematostella vectensis]
          Length = 401

 Score = 38.5 bits (88), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 17/47 (36%), Positives = 31/47 (65%), Gaps = 2/47 (4%)

Query: 250 ADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFF 296
           A++   NT S   + PTLL+ LL+ +++ +++ +LQ  R++  NSFF
Sbjct: 355 ANRTKQNTKS--NRNPTLLEMLLAPEIRHERNAILQCLRYIVRNSFF 399


>gi|344301187|gb|EGW31499.1| hypothetical protein SPAPADRAFT_154739 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 212

 Score = 38.5 bits (88), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 35/55 (63%), Gaps = 2/55 (3%)

Query: 136 TEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILAKQA 190
           TE+++ +W EER+K +P+K  + KKL +KQ  S++  KE + R  + KE   K+ 
Sbjct: 59  TEEDIAKWIEERKKKWPSKQVVAKKLEQKQ--SEIQQKEDRKRPAESKEAPGKRT 111


>gi|7305335|ref|NP_038773.1| nuclear fragile X mental retardation-interacting protein 1 [Mus
           musculus]
 gi|81917719|sp|Q9QXX8.1|NUFP1_MOUSE RecName: Full=Nuclear fragile X mental retardation-interacting
           protein 1; AltName: Full=Nuclear FMRP-interacting
           protein 1
 gi|6525073|gb|AAF15316.1| nuclear FMRP interacting protein 1 [Mus musculus]
 gi|26328149|dbj|BAC27815.1| unnamed protein product [Mus musculus]
 gi|33525186|gb|AAH56192.1| Nuclear fragile X mental retardation protein interacting protein 1
           [Mus musculus]
 gi|74220737|dbj|BAE33625.1| unnamed protein product [Mus musculus]
 gi|148703863|gb|EDL35810.1| nuclear fragile X mental retardation protein interacting protein 1
           [Mus musculus]
          Length = 484

 Score = 38.5 bits (88), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 16/25 (64%), Positives = 20/25 (80%)

Query: 136 TEQEVKQWCEERRKNYPTKANINKK 160
           T +E+ +W EERRKNYPT ANI +K
Sbjct: 223 TPEEIARWREERRKNYPTLANIERK 247


>gi|270007929|gb|EFA04377.1| hypothetical protein TcasGA2_TC014675 [Tribolium castaneum]
          Length = 327

 Score = 38.1 bits (87), Expect = 7.1,   Method: Compositional matrix adjust.
 Identities = 22/63 (34%), Positives = 39/63 (61%), Gaps = 3/63 (4%)

Query: 238 KRKDQFSKEQKLADKDSSNTSS---FNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINS 294
           +R+    K+QKL +K      S   F K+K TLL+KLL ++++ +++ LLQ  + +  N+
Sbjct: 265 RRRLSSGKKQKLNNKPEPGGFSRMKFVKRKVTLLEKLLDSEIRHERNVLLQCVQHVVTNN 324

Query: 295 FFK 297
           FF+
Sbjct: 325 FFR 327


>gi|350646629|emb|CCD58749.1| hypothetical protein Smp_099560.1 [Schistosoma mansoni]
          Length = 336

 Score = 38.1 bits (87), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 13/34 (38%), Positives = 26/34 (76%)

Query: 264 KPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFK 297
           +PTLLQ LL+ D++++++ L+Q  R++   +FF+
Sbjct: 303 RPTLLQMLLAEDIRKERNQLMQCIRYIVKENFFQ 336


>gi|256053307|ref|XP_002570139.1| hypothetical protein [Schistosoma mansoni]
          Length = 371

 Score = 38.1 bits (87), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 13/34 (38%), Positives = 26/34 (76%)

Query: 264 KPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFK 297
           +PTLLQ LL+ D++++++ L+Q  R++   +FF+
Sbjct: 338 RPTLLQMLLAEDIRKERNQLMQCIRYIVKENFFQ 371


>gi|256053309|ref|XP_002570140.1| hypothetical protein [Schistosoma mansoni]
          Length = 356

 Score = 38.1 bits (87), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 13/34 (38%), Positives = 26/34 (76%)

Query: 264 KPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFK 297
           +PTLLQ LL+ D++++++ L+Q  R++   +FF+
Sbjct: 323 RPTLLQMLLAEDIRKERNQLMQCIRYIVKENFFQ 356


>gi|350646628|emb|CCD58748.1| hypothetical protein Smp_099560.2 [Schistosoma mansoni]
          Length = 351

 Score = 38.1 bits (87), Expect = 7.8,   Method: Compositional matrix adjust.
 Identities = 13/34 (38%), Positives = 26/34 (76%)

Query: 264 KPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFK 297
           +PTLLQ LL+ D++++++ L+Q  R++   +FF+
Sbjct: 318 RPTLLQMLLAEDIRKERNQLMQCIRYIVKENFFQ 351


>gi|448516074|ref|XP_003867484.1| hypothetical protein CORT_0B03380 [Candida orthopsilosis Co 90-125]
 gi|380351823|emb|CCG22046.1| hypothetical protein CORT_0B03380 [Candida orthopsilosis]
          Length = 243

 Score = 38.1 bits (87), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 13/25 (52%), Positives = 22/25 (88%)

Query: 136 TEQEVKQWCEERRKNYPTKANINKK 160
           TE+++ +W EER+KN+PT+AN+ +K
Sbjct: 69  TEEDIAKWIEERKKNWPTRANVERK 93


>gi|167387090|ref|XP_001738022.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165898944|gb|EDR25671.1| hypothetical protein EDI_290620 [Entamoeba dispar SAW760]
          Length = 137

 Score = 38.1 bits (87), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 17/40 (42%), Positives = 29/40 (72%)

Query: 134 TYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDK 173
           + TE+++++W EER+K +PTKA   K L E++  +D I+K
Sbjct: 8   SMTEEQLQKWIEERKKRFPTKAKREKCLFEEKKKTDEIEK 47


>gi|195995467|ref|XP_002107602.1| hypothetical protein TRIADDRAFT_51288 [Trichoplax adhaerens]
 gi|190588378|gb|EDV28400.1| hypothetical protein TRIADDRAFT_51288 [Trichoplax adhaerens]
          Length = 444

 Score = 38.1 bits (87), Expect = 8.1,   Method: Compositional matrix adjust.
 Identities = 15/28 (53%), Positives = 21/28 (75%)

Query: 136 TEQEVKQWCEERRKNYPTKANINKKLTE 163
           + +++K W E+RRKNYPT AN+ KK  E
Sbjct: 205 SPEDIKAWREQRRKNYPTAANVEKKKQE 232


>gi|354543450|emb|CCE40169.1| hypothetical protein CPAR2_102070 [Candida parapsilosis]
          Length = 253

 Score = 38.1 bits (87), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 13/25 (52%), Positives = 22/25 (88%)

Query: 136 TEQEVKQWCEERRKNYPTKANINKK 160
           TE+++ +W EER+KN+PT+AN+ +K
Sbjct: 77  TEEDIAKWIEERKKNWPTRANVERK 101


>gi|448079221|ref|XP_004194343.1| Piso0_004831 [Millerozyma farinosa CBS 7064]
 gi|359375765|emb|CCE86347.1| Piso0_004831 [Millerozyma farinosa CBS 7064]
          Length = 234

 Score = 38.1 bits (87), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 14/42 (33%), Positives = 29/42 (69%)

Query: 136 TEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKM 177
           TE+++K+W EERR+N+P++ N+  K  ++ + +   D++ K 
Sbjct: 72  TEEDIKKWIEERRRNWPSRKNVEAKKEKESAGAPANDEQIKT 113


>gi|12843948|dbj|BAB26177.1| unnamed protein product [Mus musculus]
          Length = 249

 Score = 38.1 bits (87), Expect = 8.5,   Method: Compositional matrix adjust.
 Identities = 16/25 (64%), Positives = 20/25 (80%)

Query: 136 TEQEVKQWCEERRKNYPTKANINKK 160
           T +E+ +W EERRKNYPT ANI +K
Sbjct: 223 TPEEIARWREERRKNYPTLANIERK 247


>gi|348545180|ref|XP_003460058.1| PREDICTED: nuclear fragile X mental retardation-interacting protein
           1-like [Oreochromis niloticus]
          Length = 544

 Score = 37.7 bits (86), Expect = 9.3,   Method: Compositional matrix adjust.
 Identities = 16/25 (64%), Positives = 19/25 (76%)

Query: 136 TEQEVKQWCEERRKNYPTKANINKK 160
           T +E+ +W EERRKNYPT  NI KK
Sbjct: 152 TPEEITKWREERRKNYPTLVNIEKK 176


>gi|295665248|ref|XP_002793175.1| CCCH zinc finger protein [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226278089|gb|EEH33655.1| CCCH zinc finger protein [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 764

 Score = 37.7 bits (86), Expect = 9.4,   Method: Compositional matrix adjust.
 Identities = 20/47 (42%), Positives = 28/47 (59%), Gaps = 2/47 (4%)

Query: 136 TEQEVKQWCEERRKNYPTKANINKKL--TEKQSNSDVIDKEAKMRRE 180
           +  ++  W EER+K YPTKA I ++L   EKQ  +    K+AK  RE
Sbjct: 353 SPSDIAAWIEERKKRYPTKARIEERLKEAEKQKKASREAKDAKRARE 399


>gi|312375641|gb|EFR22971.1| hypothetical protein AND_13906 [Anopheles darlingi]
          Length = 575

 Score = 37.7 bits (86), Expect = 9.5,   Method: Compositional matrix adjust.
 Identities = 14/31 (45%), Positives = 24/31 (77%)

Query: 266 TLLQKLLSADVKRDKSHLLQTFRFMAINSFF 296
           T+L+KLL AD++ +++ LLQ  R++  N+FF
Sbjct: 459 TMLEKLLDADIRHERNVLLQCVRYVVSNNFF 489


>gi|260817126|ref|XP_002603438.1| hypothetical protein BRAFLDRAFT_80413 [Branchiostoma floridae]
 gi|229288757|gb|EEN59449.1| hypothetical protein BRAFLDRAFT_80413 [Branchiostoma floridae]
          Length = 408

 Score = 37.7 bits (86), Expect = 9.8,   Method: Compositional matrix adjust.
 Identities = 16/32 (50%), Positives = 23/32 (71%)

Query: 129 KSLALTYTEQEVKQWCEERRKNYPTKANINKK 160
           K+L+   T + + +W E+RRKNYPT AN+ KK
Sbjct: 58  KALSKLDTPEAIAKWREDRRKNYPTVANVAKK 89


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.314    0.132    0.369 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,407,488,096
Number of Sequences: 23463169
Number of extensions: 292132639
Number of successful extensions: 1295094
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 171
Number of HSP's successfully gapped in prelim test: 1716
Number of HSP's that attempted gapping in prelim test: 1271493
Number of HSP's gapped (non-prelim): 16023
length of query: 403
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 258
effective length of database: 8,957,035,862
effective search space: 2310915252396
effective search space used: 2310915252396
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 78 (34.7 bits)