BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 005351
         (701 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|225426838|ref|XP_002276704.1| PREDICTED: uncharacterized protein LOC100266763 [Vitis vinifera]
          Length = 747

 Score = 1133 bits (2930), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 544/714 (76%), Positives = 616/714 (86%), Gaps = 16/714 (2%)

Query: 1   MRNQTKSLKLKHFLLILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSDNLNCTTKIVL 60
           M++Q    + +   LI+  I   ++   V GVQILSKSKLEKCEK ++SDNLNCT KI+L
Sbjct: 1   MKDQKPRTRRRPLALIITIIFLSINGGSVYGVQILSKSKLEKCEKVSESDNLNCTKKIIL 60

Query: 61  NMAVPSGSSGGEASIVAEVVEVEENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPY 120
           +MAVPSGSSGGEASIVAEVVEVEENST KM+T+R+PP +TVNK+++YAVYE+TYIRDVPY
Sbjct: 61  DMAVPSGSSGGEASIVAEVVEVEENSTHKMQTLRVPPTITVNKSSAYAVYEITYIRDVPY 120

Query: 121 KPQEFYMKTRKCEPDAGADVVKICER------------QPICCPCGPQRRIPSSCGNVFD 168
           KPQE+++KTRKCEPDA A VVKICER            QPICCPCG  RR+PSSCGN FD
Sbjct: 121 KPQEYFVKTRKCEPDASAKVVKICERLQDENGHIIEHTQPICCPCGTHRRVPSSCGNFFD 180

Query: 169 KLLKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVKTGSKVSEVTVGPENKTATSAD 228
           KL+KGKANTAHCLRFPGDWFHVFGIGQRS+GFSV IEVKTGSK+SEV VGPEN+T  S D
Sbjct: 181 KLMKGKANTAHCLRFPGDWFHVFGIGQRSLGFSVHIEVKTGSKISEVIVGPENRTVMSND 240

Query: 229 NFLKVNLIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLE 288
           NFLKVNLIGDF GYTNIPSFE+FYLV PRQGGPGQPQ+LG NFSMWMLLER RFTLDGLE
Sbjct: 241 NFLKVNLIGDFAGYTNIPSFEDFYLVTPRQGGPGQPQNLGVNFSMWMLLERVRFTLDGLE 300

Query: 289 CNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQ 348
           CNKIGVSYEAFNGQP+FCSSPFW+CLHNQLWN+READQNRI+R+QLPLYGVEGRFER+NQ
Sbjct: 301 CNKIGVSYEAFNGQPNFCSSPFWNCLHNQLWNFREADQNRIDRHQLPLYGVEGRFERINQ 360

Query: 349 HPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFGVATI 408
           HPNAG+ SFSIG+TEVLN+NLLIEL ADDIEYVYQRSPGKI+SV IPTFEALTQFG ATI
Sbjct: 361 HPNAGTRSFSIGITEVLNTNLLIELSADDIEYVYQRSPGKILSVTIPTFEALTQFGTATI 420

Query: 409 TTQNTGEVEASYSLTFDCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAIL 468
           TT+N G+VEASYSLTFDCS GVTLMEEQ+FI+KP E  IRSFK+YPTT+QAAKY CSAIL
Sbjct: 421 TTKNVGKVEASYSLTFDCSRGVTLMEEQFFIMKPNENIIRSFKLYPTTDQAAKYVCSAIL 480

Query: 469 KDSDFSEVDRAECQFSTMATVLDNGSQI---TPFQPPKSSINDFFESIESIGKKLWEGLR 525
           KDSD+SEVDRAECQF+T ATV DNGSQ+   TPFQPPK+SIN FFESIESI  K W+G  
Sbjct: 481 KDSDYSEVDRAECQFTTTATVFDNGSQLLQTTPFQPPKTSINGFFESIESIWNKFWDGFV 540

Query: 526 DFITGKACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAIFPTVLVLLWLLHQKGLFDPLY 585
           DFITGK CRRKCS FFDFSCHIQYIC+SW+V+FGL+LAIFPTVLVLLWLLHQKGLFDPLY
Sbjct: 541 DFITGKTCRRKCSRFFDFSCHIQYICMSWMVMFGLLLAIFPTVLVLLWLLHQKGLFDPLY 600

Query: 586 DWWDDHFQSDNQRIRDFRSRRIDVDHPHVHVRKHHKQEGRHHKLEARRRRCGIHSDHKHK 645
           DWW+D F +DNQ I D R  RIDVD+PH+H+ KHHKQE RH++ +A+ +R  IH   +HK
Sbjct: 601 DWWEDRFWADNQSIGDTRRHRIDVDNPHIHL-KHHKQEARHYRHDAQSKRRSIHDKRRHK 659

Query: 646 HSDRDTDYYYYLHHVQKDKHKHGRSKNSSVMQQLYLDTGKNDHIGHHRRRKFRE 699
           HS +D+DYYYYLHHV K+KHK GRSKNSS+M Q+Y D  ++D IG  R RK RE
Sbjct: 660 HSLQDSDYYYYLHHVHKNKHKQGRSKNSSIMHQVYSDRREDDGIGQRRCRKERE 713


>gi|357481707|ref|XP_003611139.1| HAP2 [Medicago truncatula]
 gi|355512474|gb|AES94097.1| HAP2 [Medicago truncatula]
          Length = 739

 Score = 1053 bits (2724), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 498/710 (70%), Positives = 593/710 (83%), Gaps = 21/710 (2%)

Query: 7   SLKLKHFLLILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSD-NLNCTTKIVLNMAVP 65
           S ++   + I F + + L+   V GVQI+SKSKLEKCEK ++SD NLNCTTKIVL+MAVP
Sbjct: 5   SPRITLIIFIFFTVSSFLTCH-VTGVQIISKSKLEKCEKNSNSDDNLNCTTKIVLSMAVP 63

Query: 66  SGSSGGEASIVAEVVEVEENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEF 125
           SGSSGGEASIVAE+VEVEENST KM+T+R+PPV+TVNKT++YAVYELTYIRDVPYKP+EF
Sbjct: 64  SGSSGGEASIVAELVEVEENSTTKMQTLRVPPVITVNKTSAYAVYELTYIRDVPYKPEEF 123

Query: 126 YMKTRKCEPDAGADVVKICER------------QPICCPCGPQRRIPSSCGNVFDKLLKG 173
           Y++TRKCEPDAGA+VVKICER            QP CCPCGPQRR+PSSCGN FDKL KG
Sbjct: 124 YVQTRKCEPDAGANVVKICERLRDEDGHIIENTQPTCCPCGPQRRMPSSCGNFFDKLTKG 183

Query: 174 KANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKV 233
           KANTAHC+RFPGDWFHVFGIG+R++GFSVRI++K+G+KVSEV VGPEN+T TS D FL+V
Sbjct: 184 KANTAHCVRFPGDWFHVFGIGRRTLGFSVRIQIKSGTKVSEVVVGPENRTVTSDDKFLRV 243

Query: 234 NLIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIG 293
           NLIGDFVGYTNIPSFE+FYLV+PRQG PGQP DLG N SMWMLLER RFTLDG+ECNKIG
Sbjct: 244 NLIGDFVGYTNIPSFEDFYLVVPRQGDPGQPHDLGRNISMWMLLERVRFTLDGIECNKIG 303

Query: 294 VSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAG 353
           VSYEAFNGQP+FC+SPFWSCLHNQLWN+ EAD NRI+RNQ+PLYG+EGRFER+NQHPNAG
Sbjct: 304 VSYEAFNGQPNFCASPFWSCLHNQLWNFHEADLNRISRNQVPLYGLEGRFERINQHPNAG 363

Query: 354 SHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNT 413
           S SFSIG+TEVLN+N++IEL A+D++YVYQRSPGKIISV +PTFEALTQFGVATITT+NT
Sbjct: 364 SFSFSIGITEVLNTNIVIELSANDVDYVYQRSPGKIISVSVPTFEALTQFGVATITTKNT 423

Query: 414 GEVEASYSLTFDCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDF 473
           GEVEASYSLTFDCS  +TLMEEQ+ I+KP E + RSFKIYP+T+QA+KY+C+AILKDSD+
Sbjct: 424 GEVEASYSLTFDCSKEITLMEEQFLIMKPNEITTRSFKIYPSTDQASKYSCAAILKDSDY 483

Query: 474 SEVDRAECQFSTMATVLDNGSQITPFQPPKSSINDFFESIESIGKKLWEGLRDFITGKAC 533
            EVDRAECQF+T  TVLDNG+Q  PFQPP++ IN FF+SIES+  KLW G  +FITGK C
Sbjct: 484 GEVDRAECQFTTTGTVLDNGTQGMPFQPPETGINGFFDSIESMWNKLWTGFIEFITGKNC 543

Query: 534 RRKCSSFFDFSCHIQYICLSWLVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQ 593
           R+KC+ FFDF CHIQY+CLSW+++FGL LAIFPTVLVLLWLLHQKGLFDPLYDWW+D   
Sbjct: 544 RQKCAGFFDFKCHIQYVCLSWIMMFGLFLAIFPTVLVLLWLLHQKGLFDPLYDWWEDICG 603

Query: 594 SD-NQRIRDFRSRRIDVDHPHVHVRKHHKQEGRHHKLEA-RRRRCGIHSDHKHKHSDRDT 651
           +D  Q I D    +I+  H H+H  KH KQE RH    A  RR+     +HKHKHS+ ++
Sbjct: 604 ADEKQFIMDRHRVKINQTHHHIHDNKHRKQEVRHLNHRAPNRRKTSYEHNHKHKHSEGNS 663

Query: 652 DYYYYLHHVQKDKHKHGRSKNSSVMQQLYLDTGKNDHIGHHRRRKFRESS 701
           DY+ +LHHVQK+ HKH   K+   +Q +      ++H  HH+ RK ++ S
Sbjct: 664 DYFNHLHHVQKETHKHRHRKHVDNLQNI-----DDNHPAHHKHRKEQDPS 708


>gi|356532878|ref|XP_003534996.1| PREDICTED: uncharacterized protein LOC100818339 [Glycine max]
          Length = 711

 Score = 1030 bits (2662), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 498/706 (70%), Positives = 592/706 (83%), Gaps = 27/706 (3%)

Query: 16  ILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDS-DNLNCTTKIVLNMAVPSGSSGGEAS 74
           I   I+ +LS   VVG+QI+SKSKLEKCEK ++S DNLNCTTKIVLNMAVPSGSSGGEAS
Sbjct: 7   ITLIIIFILSSFHVVGIQIISKSKLEKCEKNSNSEDNLNCTTKIVLNMAVPSGSSGGEAS 66

Query: 75  IVAEVVEVEENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKTRKCEP 134
           IVAE+VEVEENS++KM+T+RIPPV+TVNKT++YA+Y+LTYIRDVPYKP+E+Y+KTRKCEP
Sbjct: 67  IVAELVEVEENSSRKMQTLRIPPVITVNKTSAYALYQLTYIRDVPYKPEEYYVKTRKCEP 126

Query: 135 DAGADVVKICER------------QPICCPCGPQRRIPSSCGNVFDKLLKGKANTAHCLR 182
           DAGA+VVKICER            QPICCPCGPQRR+PSSCGN FDKL KGKANTAHC+R
Sbjct: 127 DAGANVVKICERLRDEEGHIIEYTQPICCPCGPQRRMPSSCGNFFDKLTKGKANTAHCVR 186

Query: 183 FPGDWFHVFGIGQRSIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGY 242
           FPGDWFHVFGIG+R++GFSVRI+VK+G+KVSEV VGPEN+T  S D FL+VNLIGDFVGY
Sbjct: 187 FPGDWFHVFGIGRRTLGFSVRIQVKSGTKVSEVFVGPENRTVISDDKFLRVNLIGDFVGY 246

Query: 243 TNIPSFEEFYLVIPRQGG-PGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNG 301
           TNIPSFE+FYLV+PRQ   PGQPQDLG N SMWMLLER RFTLDG+ECNKIGVSYEAFN 
Sbjct: 247 TNIPSFEDFYLVVPRQVCFPGQPQDLGRNISMWMLLERVRFTLDGIECNKIGVSYEAFNQ 306

Query: 302 QPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGV 361
           QP+FCSSPFW+CLHNQLWN+READ NRI+RNQ+PLYG+EGRFER+NQHP+AGS+SFSIG+
Sbjct: 307 QPNFCSSPFWTCLHNQLWNFREADLNRISRNQVPLYGLEGRFERINQHPSAGSYSFSIGI 366

Query: 362 TEVLNSNLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYS 421
           TEVL++NL++EL A+D+EYVYQRSPGKIISV +PTFEALTQFGVATITT+NTGEVEASYS
Sbjct: 367 TEVLSTNLVLELSANDVEYVYQRSPGKIISVSVPTFEALTQFGVATITTKNTGEVEASYS 426

Query: 422 LTFDCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAEC 481
           LTF+CS  +TLMEEQ+ I+KP E + +S KIYP+T+QA+KY C+ +LKDSD++EVDRAEC
Sbjct: 427 LTFNCSKDITLMEEQFLIMKPNEVTTQSCKIYPSTDQASKYFCAVVLKDSDYNEVDRAEC 486

Query: 482 QFSTMATVLDNGSQI-----TPFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACRRK 536
           QF+T ATVLDN + +      PFQPP++SIN FF+SIESI  K+W  L +FITGK CR K
Sbjct: 487 QFATTATVLDNDTHVCSFLGMPFQPPEASINSFFDSIESIWNKIWRSLTEFITGKTCREK 546

Query: 537 CSSFFDFSCHIQYICLSWLVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDN 596
           CS FFDF CHIQY+CLSW+++FGL L IFPTVLVLLWLLHQKGLFDPLYDWW+D   +D 
Sbjct: 547 CSGFFDFKCHIQYVCLSWVMMFGLFLTIFPTVLVLLWLLHQKGLFDPLYDWWEDILGADE 606

Query: 597 QRIRDFRSRRIDVDHPHVHVRKHHKQEGRHHKLEARRRRCGIHSDHKHKHSDRDTDYYYY 656
           Q I D R  +ID  H H+H  KHHKQE RH    A+ RR   + +H HKHS+R++DY+  
Sbjct: 607 QIIMDKRRFKIDKGHHHIHDNKHHKQELRHSNYSAQNRRRTTY-EHMHKHSERNSDYFDD 665

Query: 657 LHHVQKDKHKHGRSK-NSSVMQQLYLDTGKNDHIGHHRRRKFRESS 701
           LHHV K+ HK+G  K N  ++Q +       DH  HH+ RK R+SS
Sbjct: 666 LHHVHKEMHKYGHKKQNMDIVQHIV------DHPAHHKHRKKRDSS 705


>gi|356555070|ref|XP_003545862.1| PREDICTED: uncharacterized protein LOC100780334 [Glycine max]
          Length = 711

 Score = 1026 bits (2653), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 497/706 (70%), Positives = 589/706 (83%), Gaps = 27/706 (3%)

Query: 16  ILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSD-NLNCTTKIVLNMAVPSGSSGGEAS 74
           I   I+ +LS   VVG+QI+SKSKLEKCEK ++SD NLNCTTKIVLNMAVPSGSSGGEAS
Sbjct: 7   ITLIIIFILSSFYVVGIQIISKSKLEKCEKNSNSDDNLNCTTKIVLNMAVPSGSSGGEAS 66

Query: 75  IVAEVVEVEENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKTRKCEP 134
           IVAE+VEVEENS++KM+T+RIPPV+TVNKT++YA+Y+LTYIRDVPYKP+E+Y+KTRKCEP
Sbjct: 67  IVAELVEVEENSSRKMQTLRIPPVITVNKTSAYALYQLTYIRDVPYKPEEYYVKTRKCEP 126

Query: 135 DAGADVVKICER------------QPICCPCGPQRRIPSSCGNVFDKLLKGKANTAHCLR 182
           DAGA+VVK CER            QPICCPCGPQRR+PSSCGN FDKL KGKANTAHC+R
Sbjct: 127 DAGANVVKTCERLRDEEGHIIEYTQPICCPCGPQRRMPSSCGNFFDKLTKGKANTAHCVR 186

Query: 183 FPGDWFHVFGIGQRSIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGY 242
           FPGDWFHVFGIG+R++GFSVRI+VK+G+KVSEV VGPEN+T  S D FL+VNLIGDFVGY
Sbjct: 187 FPGDWFHVFGIGRRTLGFSVRIQVKSGTKVSEVVVGPENRTVISDDKFLRVNLIGDFVGY 246

Query: 243 TNIPSFEEFYLVIPRQGG-PGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNG 301
           TNIPSFE+FYLV+PRQ   P QPQDLG N SMWMLLER RFTLDG+ECNKIGVSYEAFN 
Sbjct: 247 TNIPSFEDFYLVVPRQVCFPAQPQDLGRNISMWMLLERVRFTLDGIECNKIGVSYEAFNQ 306

Query: 302 QPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGV 361
           QP+FC+SPFW+CLHNQLWN+READ NRI+RNQ+PLYG+EGRFER+NQHP+AGS+SFSIG+
Sbjct: 307 QPNFCASPFWTCLHNQLWNFREADLNRISRNQVPLYGLEGRFERINQHPSAGSYSFSIGI 366

Query: 362 TEVLNSNLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYS 421
           TEVLN+NL++EL A+D+EYVYQRSPGKIISV +PTFEALTQFGVATITT+NTGEVEASYS
Sbjct: 367 TEVLNTNLVLELSANDVEYVYQRSPGKIISVSVPTFEALTQFGVATITTKNTGEVEASYS 426

Query: 422 LTFDCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAEC 481
           LTF+CS  +TLMEEQ+ I+KP E + +S KIYPTT+QA+KY C+A+LKDSD++EVDRAEC
Sbjct: 427 LTFNCSRDITLMEEQFLIMKPNEVTTQSCKIYPTTDQASKYFCAAVLKDSDYNEVDRAEC 486

Query: 482 QFSTMATVLDNGSQI-----TPFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACRRK 536
           QF+T ATVLDN +Q+      PFQP ++SIN FF+SIESI  K+W  L +FITGK CR K
Sbjct: 487 QFATTATVLDNDTQVCSFLGMPFQPQETSINSFFDSIESIWNKIWTSLTEFITGKTCREK 546

Query: 537 CSSFFDFSCHIQYICLSWLVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDN 596
           CS FFDF CHIQY+CLSW+++FGL L IFPTVLV+LWLLHQKGLFDPLYDWW+D   +D 
Sbjct: 547 CSGFFDFKCHIQYVCLSWVMMFGLFLTIFPTVLVVLWLLHQKGLFDPLYDWWEDILGADE 606

Query: 597 QRIRDFRSRRIDVDHPHVHVRKHHKQEGRHHKLEARRRRCGIHSDHKHKHSDRDTDYYYY 656
           Q I D R  +ID  H H+H  KHHKQE RH    A  RR   + +H HKHS+R++DY+  
Sbjct: 607 QIIMDKRKFKIDKGHHHIHDNKHHKQEHRHSNYSAENRRRTTY-EHMHKHSERNSDYFDD 665

Query: 657 LHHVQKDKHKHGRSK-NSSVMQQLYLDTGKNDHIGHHRRRKFRESS 701
           LHHV K+ HK+G  K N   +Q +       DH  HH+ RK R+SS
Sbjct: 666 LHHVHKEMHKYGHKKQNMDNVQHIV------DHPVHHKHRKKRDSS 705


>gi|449452486|ref|XP_004143990.1| PREDICTED: protein HAPLESS 2-like [Cucumis sativus]
          Length = 667

 Score = 1003 bits (2593), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 474/655 (72%), Positives = 545/655 (83%), Gaps = 20/655 (3%)

Query: 29  VVGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIVAEVVEVEENSTQ 88
           + GVQILSKSKLEKCE+ + SD LNCT KIVLNMAVPSGSSGGEASI+AE+VEVEENST 
Sbjct: 20  ISGVQILSKSKLEKCERNSGSDTLNCTKKIVLNMAVPSGSSGGEASIIAEIVEVEENSTN 79

Query: 89  KMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKTRKCEPDAGADVVKICER-- 146
           KM+T+R PPVLTV+K+ +Y +YELTYIRDVPYKP+EFY+ TRKCEPDA A VV+ICER  
Sbjct: 80  KMQTLRTPPVLTVSKSPAYVLYELTYIRDVPYKPEEFYVPTRKCEPDASARVVQICERLR 139

Query: 147 ----------QPICCPCGPQRRIPSSCGNVFDKLLKGKANTAHCLRFPGDWFHVFGIGQR 196
                     QPICCPCG +RR+P+SCGN FDK++KGKANTAHCLRFPGDWFHVF IGQ 
Sbjct: 140 DESGHIILSTQPICCPCGAKRRMPTSCGNFFDKMIKGKANTAHCLRFPGDWFHVFSIGQW 199

Query: 197 SIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIP 256
           ++GFSV+I VK+GSKVSEV+VGPEN+T  S DNFL+ NLIGD VGYTNIPSFE+FYLVIP
Sbjct: 200 TLGFSVQIHVKSGSKVSEVSVGPENRTVVSNDNFLRANLIGDLVGYTNIPSFEDFYLVIP 259

Query: 257 RQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHN 316
           RQGGPGQPQ+LG NFSMWMLLER RFTLDGLECNKIGV YE FNGQP FC+SPFWSCLHN
Sbjct: 260 RQGGPGQPQNLGTNFSMWMLLERVRFTLDGLECNKIGVGYETFNGQPDFCTSPFWSCLHN 319

Query: 317 QLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRAD 376
           QLWN+READ +RI R QLPLYGVEGRFER+NQHPNAG+HSFSIGVTEVLN+NL+IELRAD
Sbjct: 320 QLWNFREADLSRIGRKQLPLYGVEGRFERINQHPNAGTHSFSIGVTEVLNTNLVIELRAD 379

Query: 377 DIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLMEEQ 436
           D+EYVYQRSPGKI+S+ IPTFEALTQFGVAT+ T+NTGEVEASYSLTF CS  V+LMEEQ
Sbjct: 380 DVEYVYQRSPGKIMSISIPTFEALTQFGVATVATKNTGEVEASYSLTFTCSKEVSLMEEQ 439

Query: 437 YFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQI 496
           Y+I+KP E + RSFK+YPTT+QAAKY C+AILKD+DFSEVDRAECQF+T ATVLDNGSQI
Sbjct: 440 YYIMKPNEIASRSFKLYPTTDQAAKYVCAAILKDADFSEVDRAECQFATTATVLDNGSQI 499

Query: 497 TPFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSWLV 556
           TPF+ PK   N F  SI+   K+ W  + DF+TGK+CR+ CS FFDFSCHIQYICLSWLV
Sbjct: 500 TPFELPKKKENGFIHSIKLAWKQFWGSVIDFVTGKSCRKVCSGFFDFSCHIQYICLSWLV 559

Query: 557 LFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRDFRSRRIDVDHPHVHV 616
           LFGL LA FP VLV+LW+LHQKGLFDPLYDWW+D F   ++  R     R +  H H H 
Sbjct: 560 LFGLFLATFPAVLVILWVLHQKGLFDPLYDWWEDMFCHKSEPTRSTWKYRGERKHYHRHG 619

Query: 617 RKHHKQEGRHHKLEARRRRCGIHSDHKHKHSDRDTDYYYYLHHVQKDKHKHGRSK 671
            +HH+  G  +K    RR   +H   KHKHS+RDTD  Y+LHHV + K K G ++
Sbjct: 620 SRHHQNHGSGYK----RRSHELHK--KHKHSERDTD--YFLHHVHRKKGKRGHNR 666


>gi|449495900|ref|XP_004159979.1| PREDICTED: protein HAPLESS 2-like [Cucumis sativus]
          Length = 833

 Score = 1003 bits (2592), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 472/650 (72%), Positives = 542/650 (83%), Gaps = 20/650 (3%)

Query: 27  RCVVGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIVAEVVEVEENS 86
           + + GVQILSKSKLEKCE+ + SD LNCT KIVLNMAVPSGSSGGEASI+AE+VEVEENS
Sbjct: 18  QTISGVQILSKSKLEKCERNSGSDTLNCTKKIVLNMAVPSGSSGGEASIIAEIVEVEENS 77

Query: 87  TQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKTRKCEPDAGADVVKICER 146
           T KM+T+R PPVLTV+K+ +Y +YELTYIRDVPYKP+EFY+ TRKCEPDA A VV+ICER
Sbjct: 78  TNKMQTLRTPPVLTVSKSPAYVLYELTYIRDVPYKPEEFYVPTRKCEPDASARVVQICER 137

Query: 147 ------------QPICCPCGPQRRIPSSCGNVFDKLLKGKANTAHCLRFPGDWFHVFGIG 194
                       QPICCPCG +RR+P+SCGN FDK++KGKANTAHCLRFPGDWFHVF IG
Sbjct: 138 LRDESGHIILSTQPICCPCGAKRRMPTSCGNFFDKMIKGKANTAHCLRFPGDWFHVFSIG 197

Query: 195 QRSIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLV 254
           Q ++GFSV+I VK+GSKVSEV+VGPEN+T  S DNFL+ NLIGD VGYTNIPSFE+FYLV
Sbjct: 198 QWTLGFSVQIHVKSGSKVSEVSVGPENRTVVSNDNFLRANLIGDLVGYTNIPSFEDFYLV 257

Query: 255 IPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCL 314
           IPRQGGPGQPQ+LG NFSMWMLLER RFTLDGLECNKIGV YE FNGQP FC+SPFWSCL
Sbjct: 258 IPRQGGPGQPQNLGTNFSMWMLLERVRFTLDGLECNKIGVGYETFNGQPDFCTSPFWSCL 317

Query: 315 HNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELR 374
           HNQLWN+READ +RI R QLPLYGVEGRFER+NQHPNAG+HSFSIGVTEVLN+NL+IELR
Sbjct: 318 HNQLWNFREADLSRIGRKQLPLYGVEGRFERINQHPNAGTHSFSIGVTEVLNTNLVIELR 377

Query: 375 ADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLME 434
           ADD+EYVYQRSPGKI+S+ IPTFEALTQFGVAT+ T+NTGEVEASYSLTF CS  V+LME
Sbjct: 378 ADDVEYVYQRSPGKIMSISIPTFEALTQFGVATVATKNTGEVEASYSLTFTCSKEVSLME 437

Query: 435 EQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGS 494
           EQY+I+KP E + RSFK+YPTT+QAAKY C+AILKD+DFSEVDRAECQF+T ATVLDNGS
Sbjct: 438 EQYYIMKPNEIASRSFKLYPTTDQAAKYVCAAILKDADFSEVDRAECQFATTATVLDNGS 497

Query: 495 QITPFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSW 554
           QITPF+ PK   N F  SI+   K+ W  + DF+TGK+CR+ CS FFDFSCHIQYICLSW
Sbjct: 498 QITPFELPKKKENGFIHSIKLAWKQFWGSVIDFVTGKSCRKVCSGFFDFSCHIQYICLSW 557

Query: 555 LVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRDFRSRRIDVDHPHV 614
           LVLFGL LA FP VLV+LW+LHQKGLFDPLYDWW+D F   ++  R     R +  H H 
Sbjct: 558 LVLFGLFLATFPAVLVILWVLHQKGLFDPLYDWWEDMFCHKSEPTRSTWKYRGERKHYHR 617

Query: 615 HVRKHHKQEGRHHKLEARRRRCGIHSDHKHKHSDRDTDYYYYLHHVQKDK 664
           H  +HH+  G  +K    RR   +H   KHKHS+RDTD  Y+LHHV + K
Sbjct: 618 HGSRHHQNHGSGYK----RRSHELHK--KHKHSERDTD--YFLHHVHRKK 659


>gi|297809457|ref|XP_002872612.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297318449|gb|EFH48871.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 710

 Score =  971 bits (2510), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 473/703 (67%), Positives = 553/703 (78%), Gaps = 45/703 (6%)

Query: 29  VVGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIVAEVVEVEENSTQ 88
           V G+QILSKSKLEKCEK +DS NLNC+TKIVLN+AVPSGSSGGEASIVAE+VEVE+NS+ 
Sbjct: 22  VDGIQILSKSKLEKCEKTSDSGNLNCSTKIVLNLAVPSGSSGGEASIVAEIVEVEDNSSS 81

Query: 89  KMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKTRKCEPDAGADVVKICER-- 146
            M+TVRIPPV+TVNK+A+YA+Y+LTYIRDVPYKPQE+ + TRKCEPDAG D+V+ICER  
Sbjct: 82  NMQTVRIPPVITVNKSAAYALYDLTYIRDVPYKPQEYSVTTRKCEPDAGPDIVQICERLR 141

Query: 147 ----------QPICCPCGPQRRIPSSCGNVFDKLLKGKANTAHCLRFPGDWFHVFGIGQR 196
                     QPICCPCGPQRR+PSSCG++FDK++KGKANTAHCLRFPGDWFHVF IGQR
Sbjct: 142 DEKGNVLEQTQPICCPCGPQRRMPSSCGDIFDKMIKGKANTAHCLRFPGDWFHVFSIGQR 201

Query: 197 SIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIP 256
           S+GFSVR+E+KTG++VSEV +GPEN+TAT+ DNFLKVNLIGDF GYTNIPSFE+FYLVIP
Sbjct: 202 SLGFSVRVELKTGTRVSEVIIGPENRTATANDNFLKVNLIGDFAGYTNIPSFEDFYLVIP 261

Query: 257 RQGG-PGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLH 315
           R+    GQP +LG N+SMWMLLER RFTLDGLECNKIGV YEAFN QP+FCSSP+WSCLH
Sbjct: 262 REAAVAGQPGNLGANYSMWMLLERLRFTLDGLECNKIGVGYEAFNSQPNFCSSPYWSCLH 321

Query: 316 NQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRA 375
           NQLWN+READ NRINRNQLPLYG+EGRFER+NQHPNAG HSFSIGVTE LN+NL+IELRA
Sbjct: 322 NQLWNFREADINRINRNQLPLYGLEGRFERINQHPNAGPHSFSIGVTETLNTNLMIELRA 381

Query: 376 DDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLMEE 435
           DDIEYV+QRSPGKII++ IPTFEALTQFGVA +TT+NTGEVEASYSLTFDCS GV  +EE
Sbjct: 382 DDIEYVFQRSPGKIINIAIPTFEALTQFGVAAVTTKNTGEVEASYSLTFDCSKGVAFVEE 441

Query: 436 QYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQ 495
           Q+FIIKPK  + RSFK+YPT +QAAKY C+AILKDS FSEVDRAECQFST ATVLDNG+Q
Sbjct: 442 QFFIIKPKAVTTRSFKLYPTKDQAAKYICTAILKDSLFSEVDRAECQFSTTATVLDNGTQ 501

Query: 496 IT-PFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSW 554
           +T PFQ P++    FF+SI  +  KL  GL DFITG  CR KCSSFFDFSCHIQY+CLSW
Sbjct: 502 VTNPFQIPETRPKGFFDSIRIMWTKLINGLVDFITGDTCRNKCSSFFDFSCHIQYVCLSW 561

Query: 555 LVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRDFRSRRIDVDHPHV 614
           +V+FGL+LA+ PT  VLLWLLHQKGLFDP YDWW+DHF  D+ R R     R D  +   
Sbjct: 562 MVMFGLLLALIPTTCVLLWLLHQKGLFDPFYDWWEDHFDLDHHR-RLLPPTREDAINRRH 620

Query: 615 HVRKHHKQEGRHHKLEARRRRCG---------------IHSDHKHKHSDRDTDYYYYLHH 659
           H      + G       RR                   +  DH   H      YY+ LH 
Sbjct: 621 HHHHRQHRHGVKTHNHHRRTHKRHKHHHNQDDDVLQNMLERDHNESH------YYHQLHR 674

Query: 660 VQKD--KHKHGRSKNSSVMQQLYLDTGKNDHIGHHRRRKFRES 700
           V KD  + +  R+K+  V+        ++ H+   +R++ RES
Sbjct: 675 VHKDSKQKQRRRAKHGIVLP-------RDVHVDRRKRQRLRES 710


>gi|297809471|ref|XP_002872619.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297318456|gb|EFH48878.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 708

 Score =  969 bits (2504), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 463/668 (69%), Positives = 543/668 (81%), Gaps = 30/668 (4%)

Query: 29  VVGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIVAEVVEVEENSTQ 88
           V G+QILSKSKLEKCEK +DS NLNC+TKIVLN+AVPSGSSGGEASIVAE+VEVE+NS+ 
Sbjct: 22  VDGIQILSKSKLEKCEKTSDSGNLNCSTKIVLNLAVPSGSSGGEASIVAEIVEVEDNSSS 81

Query: 89  KMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKTRKCEPDAGADVVKICER-- 146
            M+TVRIPPV+TVNK+A+YA+Y+LTYIRDVPYKPQE+++ TRKCEPDAG D+V+ICER  
Sbjct: 82  NMQTVRIPPVITVNKSAAYALYDLTYIRDVPYKPQEYHVTTRKCEPDAGPDIVQICERLR 141

Query: 147 ----------QPICCPCGPQRRIPSSCGNVFDKLLKGKANTAHCLRFPGDWFHVFGIGQR 196
                     QPICCPCGPQRR+PSSCG++FDK++KGKANTAHCLRFPGDWFHVF IGQR
Sbjct: 142 DEKGNVLEQTQPICCPCGPQRRMPSSCGDIFDKMIKGKANTAHCLRFPGDWFHVFSIGQR 201

Query: 197 SIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIP 256
           S+GFSVR+E+KTG++VSEV +GPEN+TAT+ DNFLKVNLIGDF GYT+IPSFE+FYLVIP
Sbjct: 202 SLGFSVRVELKTGTRVSEVIIGPENRTATANDNFLKVNLIGDFGGYTSIPSFEDFYLVIP 261

Query: 257 RQGGP-GQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLH 315
           R+    GQP  LG N+SMWMLLER RFTLDGLECNKIGV YEAFN QP+FCSSP+WSCLH
Sbjct: 262 REAAAAGQPGSLGANYSMWMLLERVRFTLDGLECNKIGVGYEAFNTQPNFCSSPYWSCLH 321

Query: 316 NQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRA 375
           NQLWN+READ NRI+R+QLPLYG+EGRFER+NQHPNAG HSFSIGVTE LN+NL+IELRA
Sbjct: 322 NQLWNFREADINRISRHQLPLYGLEGRFERINQHPNAGPHSFSIGVTETLNTNLMIELRA 381

Query: 376 DDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLMEE 435
           DDIEYV+QRSPGKII++ IPTFEALTQFGVA +T +NTGEVEASYSLTFDCS GV  +EE
Sbjct: 382 DDIEYVFQRSPGKIINIAIPTFEALTQFGVAAVTIKNTGEVEASYSLTFDCSKGVAFVEE 441

Query: 436 QYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQ 495
           Q+FIIKPK  + R+FK+YPT +QAAKY C+AILKDS FSEVDRAECQFST ATVLDNG+Q
Sbjct: 442 QFFIIKPKAVTTRAFKLYPTKDQAAKYICTAILKDSQFSEVDRAECQFSTTATVLDNGTQ 501

Query: 496 IT-PFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSW 554
           +T PFQ P++    FF+SI  +G K+  GL DFITG  CR KCSSFFDFSCHIQY+CLSW
Sbjct: 502 VTNPFQIPETHPKGFFDSIRILGTKIINGLVDFITGDTCRNKCSSFFDFSCHIQYVCLSW 561

Query: 555 LVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQR----IRDFRSRRIDVD 610
           +V+FGL+LA+FPT  +LLWLLHQKGLFDP Y+WW+DHF  D+ R     R+  + R    
Sbjct: 562 MVMFGLLLALFPTTCLLLWLLHQKGLFDPCYNWWEDHFDLDHHRRLLPTRENIANRHHHH 621

Query: 611 -------HPHVHVRKHHKQEGRHHKLEARRRRCGIHSDHKHKHSDRDTDYYYYLHHVQKD 663
                    H H R+ H Q  +HH  E       +  D  H     D  YY+ LH V KD
Sbjct: 622 HKHHHGVKTHNHHRRTH-QRHKHHHGENHDVLQKMMLDRDHS----DAHYYHQLHRVHKD 676

Query: 664 KHKHGRSK 671
             +  R +
Sbjct: 677 SKQKQRRR 684


>gi|66731629|gb|AAY51998.1| HAP2 [Arabidopsis thaliana]
 gi|66731631|gb|AAY51999.1| HAP2 [Arabidopsis thaliana]
 gi|154425503|dbj|BAE71143.2| generative cell specific-1 [Arabidopsis thaliana]
          Length = 705

 Score =  966 bits (2496), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 467/692 (67%), Positives = 555/692 (80%), Gaps = 28/692 (4%)

Query: 29  VVGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIVAEVVEVEENSTQ 88
           V G+QILSKSKLEKCEK +DS NLNC+TKIVLN+AVPSGSSGGEASIVAE+VEVE+NS+ 
Sbjct: 22  VDGIQILSKSKLEKCEKTSDSGNLNCSTKIVLNLAVPSGSSGGEASIVAEIVEVEDNSSS 81

Query: 89  KMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKTRKCEPDAGADVVKICER-- 146
            M+TVRIPPV+TVNK+A+YA+Y+LTYIRDVPYKPQE+++ TRKCEPDAG D+V+ICER  
Sbjct: 82  NMQTVRIPPVITVNKSAAYALYDLTYIRDVPYKPQEYHVTTRKCEPDAGPDIVQICERLR 141

Query: 147 ----------QPICCPCGPQRRIPSSCGNVFDKLLKGKANTAHCLRFPGDWFHVFGIGQR 196
                     QPICCPCGPQRR+PSSCG++FDK++KGKANTAHCLRFPGDWFHVFGIGQR
Sbjct: 142 DEKGNVLEQTQPICCPCGPQRRMPSSCGDIFDKMIKGKANTAHCLRFPGDWFHVFGIGQR 201

Query: 197 SIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIP 256
           S+GFSVR+E+KTG++VSEV +GPEN+TAT+ DNFLKVNLIGDF GYT+IPSFE+FYLVIP
Sbjct: 202 SLGFSVRVELKTGTRVSEVIIGPENRTATANDNFLKVNLIGDFGGYTSIPSFEDFYLVIP 261

Query: 257 RQGG-PGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLH 315
           R+    GQP  LG N+SMWMLLER RFTLDGLECNKIGV YEAFN QP+FCSSP+WSCLH
Sbjct: 262 REAAEAGQPGSLGANYSMWMLLERVRFTLDGLECNKIGVGYEAFNTQPNFCSSPYWSCLH 321

Query: 316 NQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRA 375
           NQLWN+RE+D NRI+R+QLPLYG+EGRFER+NQHPNAG HSFSIGVTE LN+NL+IELRA
Sbjct: 322 NQLWNFRESDINRIDRHQLPLYGLEGRFERINQHPNAGPHSFSIGVTETLNTNLMIELRA 381

Query: 376 DDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLMEE 435
           DDIEYV+QRSPGKII++ IPTFEALTQFGVA +  +NTGEVEASYSLTFDCS GV  +EE
Sbjct: 382 DDIEYVFQRSPGKIINIAIPTFEALTQFGVAAVIIKNTGEVEASYSLTFDCSKGVAFVEE 441

Query: 436 QYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQ 495
           Q+FIIKPK  + RSFK+YPT +QAAKY C+AILKDS FSEVDRAECQFST ATVLDNG+Q
Sbjct: 442 QFFIIKPKAVTTRSFKLYPTKDQAAKYICTAILKDSQFSEVDRAECQFSTTATVLDNGTQ 501

Query: 496 IT-PFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSW 554
           +T PFQ P++    FF+SI  +  K+  GL DFITG  CR KCSSFFDFSCHIQY+CLSW
Sbjct: 502 VTNPFQIPETQPKGFFDSIRILWTKIINGLVDFITGDTCRNKCSSFFDFSCHIQYVCLSW 561

Query: 555 LVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRDFRSRRIDVDHPHV 614
           +V+FGL+LA+FP   +LLWLLHQKGLFDP YDWW+DHF  D+ R R   SR   V+  H 
Sbjct: 562 MVMFGLLLALFPITCLLLWLLHQKGLFDPCYDWWEDHFDLDHHR-RLLPSRADVVNRHHH 620

Query: 615 HVRKHHKQEGRHHKLEARRRRCGIHSDHKHK----HSDRDTDYYYYLHHVQKD--KHKHG 668
           H +  H         +  +   G   D   K        D+ YY+ LH V KD  + +  
Sbjct: 621 HHKHRHHHNHHRRTHQRHKHHHGQDDDVLQKMMLERDHSDSHYYHQLHRVHKDSKQKQRR 680

Query: 669 RSKNSSVMQQLYLDTGKNDHIGHHRRRKFRES 700
           R+K+  V+        ++ H+   R+++ RES
Sbjct: 681 RAKHGIVLP-------RDVHVERQRKQRLRES 705


>gi|145340119|ref|NP_192909.2| protein hapless 2 [Arabidopsis thaliana]
 gi|385178638|sp|F4JP36.1|HAP2_ARATH RecName: Full=Protein HAPLESS 2; AltName: Full=GENERATIVE CELL
           SPECIFIC 1; Flags: Precursor
 gi|332657641|gb|AEE83041.1| protein hapless 2 [Arabidopsis thaliana]
          Length = 705

 Score =  961 bits (2485), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 466/692 (67%), Positives = 554/692 (80%), Gaps = 28/692 (4%)

Query: 29  VVGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIVAEVVEVEENSTQ 88
           V G+QILSKSKLEKCEK +DS NLNC+TKIVLN+AVPSGSSGGEASIVAE+VEVE+NS+ 
Sbjct: 22  VDGIQILSKSKLEKCEKTSDSGNLNCSTKIVLNLAVPSGSSGGEASIVAEIVEVEDNSSS 81

Query: 89  KMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKTRKCEPDAGADVVKICER-- 146
            M+TVRIPPV+TVNK+A+YA+Y+LTYIRDVPYKPQE+++ TRKCE DAG D+V+ICER  
Sbjct: 82  NMQTVRIPPVITVNKSAAYALYDLTYIRDVPYKPQEYHVTTRKCEHDAGPDIVQICERLR 141

Query: 147 ----------QPICCPCGPQRRIPSSCGNVFDKLLKGKANTAHCLRFPGDWFHVFGIGQR 196
                     QPICCPCGPQRR+PSSCG++FDK++KGKANTAHCLRFPGDWFHVFGIGQR
Sbjct: 142 DEKGNVLEQTQPICCPCGPQRRMPSSCGDIFDKMIKGKANTAHCLRFPGDWFHVFGIGQR 201

Query: 197 SIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIP 256
           S+GFSVR+E+KTG++VSEV +GPEN+TAT+ DNFLKVNLIGDF GYT+IPSFE+FYLVIP
Sbjct: 202 SLGFSVRVELKTGTRVSEVIIGPENRTATANDNFLKVNLIGDFGGYTSIPSFEDFYLVIP 261

Query: 257 RQGG-PGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLH 315
           R+    GQP  LG N+SMWMLLER RFTLDGLECNKIGV YEAFN QP+FCSSP+WSCLH
Sbjct: 262 REAAEAGQPGSLGANYSMWMLLERVRFTLDGLECNKIGVGYEAFNTQPNFCSSPYWSCLH 321

Query: 316 NQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRA 375
           NQLWN+RE+D NRI+R+QLPLYG+EGRFER+NQHPNAG HSFSIGVTE LN+NL+IELRA
Sbjct: 322 NQLWNFRESDINRIDRHQLPLYGLEGRFERINQHPNAGPHSFSIGVTETLNTNLMIELRA 381

Query: 376 DDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLMEE 435
           DDIEYV+QRSPGKII++ IPTFEALTQFGVA +  +NTGEVEASYSLTFDCS GV  +EE
Sbjct: 382 DDIEYVFQRSPGKIINIAIPTFEALTQFGVAAVIIKNTGEVEASYSLTFDCSKGVAFVEE 441

Query: 436 QYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQ 495
           Q+FIIKPK  + RSFK+YPT +QAAKY C+AILKDS FSEVDRAECQFST ATVLDNG+Q
Sbjct: 442 QFFIIKPKAVTTRSFKLYPTKDQAAKYICTAILKDSQFSEVDRAECQFSTTATVLDNGTQ 501

Query: 496 IT-PFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSW 554
           +T PFQ P++    FF+SI  +  K+  GL DFITG  CR KCSSFFDFSCHIQY+CLSW
Sbjct: 502 VTNPFQIPETQPKGFFDSIRILWTKIINGLVDFITGDTCRNKCSSFFDFSCHIQYVCLSW 561

Query: 555 LVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRDFRSRRIDVDHPHV 614
           +V+FGL+LA+FP   +LLWLLHQKGLFDP YDWW+DHF  D+ R R   SR   V+  H 
Sbjct: 562 MVMFGLLLALFPITCLLLWLLHQKGLFDPCYDWWEDHFDLDHHR-RLLPSRADVVNRHHH 620

Query: 615 HVRKHHKQEGRHHKLEARRRRCGIHSDHKHK----HSDRDTDYYYYLHHVQKD--KHKHG 668
           H +  H         +  +   G   D   K        D+ YY+ LH V KD  + +  
Sbjct: 621 HHKHRHHHNHHRRTHQRHKHHHGQDDDVLQKMMLERDHSDSHYYHQLHRVHKDSKQKQRR 680

Query: 669 RSKNSSVMQQLYLDTGKNDHIGHHRRRKFRES 700
           R+K+  V+        ++ H+   R+++ RES
Sbjct: 681 RAKHGIVLP-------RDVHVERQRKQRLRES 705


>gi|297742571|emb|CBI34720.3| unnamed protein product [Vitis vinifera]
          Length = 818

 Score =  953 bits (2463), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 462/653 (70%), Positives = 523/653 (80%), Gaps = 26/653 (3%)

Query: 72  EASIVAEVVEVEENSTQKMRTVRIPPVLTVNKTAS-----------YAVYE-LTYIRDVP 119
           E    A+VV++ E     +  ++      VNK  S           + VYE +T    + 
Sbjct: 133 EPDASAKVVKICERHVSLLSAIKCQFSGFVNKDISKEVMQKPCCLTWMVYEGMTLPSQLT 192

Query: 120 YKPQEFYMKTRKCEPDAGADVVKICER-QPICCPCGPQRRIPSSCGNVFDKLLKGKANTA 178
            K   F    RK + +   +   I E  QPICCPCG  RR+PSSCGN FDKL+KGKANTA
Sbjct: 193 QKMLVFRRSKRKWKEERKYENGHIIEHTQPICCPCGTHRRVPSSCGNFFDKLMKGKANTA 252

Query: 179 HCLRFPGDWFHVFGIGQRSIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLIGD 238
           HCLRFPGDWFHVFGIGQRS+GFSV IEVKTGSK+SEV VGPEN+T  S DNFLKVNLIGD
Sbjct: 253 HCLRFPGDWFHVFGIGQRSLGFSVHIEVKTGSKISEVIVGPENRTVMSNDNFLKVNLIGD 312

Query: 239 FVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEA 298
           F GYTNIPSFE+FYLV PRQGGPGQPQ+LG NFSMWMLLER RFTLDGLECNKIGVSYEA
Sbjct: 313 FAGYTNIPSFEDFYLVTPRQGGPGQPQNLGVNFSMWMLLERVRFTLDGLECNKIGVSYEA 372

Query: 299 FNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFS 358
           FNGQP+FCSSPFW+CLHNQLWN+READQNRI+R+QLPLYGVEGRFER+NQHPNAG+ SFS
Sbjct: 373 FNGQPNFCSSPFWNCLHNQLWNFREADQNRIDRHQLPLYGVEGRFERINQHPNAGTRSFS 432

Query: 359 IGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEA 418
           IG+TEVLN+NLLIEL ADDIEYVYQRSPGKI+SV IPTFEALTQFG ATITT+N G+VEA
Sbjct: 433 IGITEVLNTNLLIELSADDIEYVYQRSPGKILSVTIPTFEALTQFGTATITTKNVGKVEA 492

Query: 419 SYSLT------------FDCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSA 466
           SYSLT            FDCS GVTLMEEQ+FI+KP E  IRSFK+YPTT+QAAKY CSA
Sbjct: 493 SYSLTALYVREDSVLYFFDCSRGVTLMEEQFFIMKPNENIIRSFKLYPTTDQAAKYVCSA 552

Query: 467 ILKDSDFSEVDRAECQFSTMATVLDNGSQITPFQPPKSSINDFFESIESIGKKLWEGLRD 526
           ILKDSD+SEVDRAECQF+T ATV DNGSQ TPFQPPK+SIN FFESIESI  K W+G  D
Sbjct: 553 ILKDSDYSEVDRAECQFTTTATVFDNGSQTTPFQPPKTSINGFFESIESIWNKFWDGFVD 612

Query: 527 FITGKACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYD 586
           FITGK CRRKCS FFDFSCHIQYIC+SW+V+FGL+LAIFPTVLVLLWLLHQKGLFDPLYD
Sbjct: 613 FITGKTCRRKCSRFFDFSCHIQYICMSWMVMFGLLLAIFPTVLVLLWLLHQKGLFDPLYD 672

Query: 587 WWDDHFQSDNQRIRDFRSRRIDVDHPHVHVRKHHKQEGRHHKLEARRRRCGIHSDHKHKH 646
           WW+D F +DNQ I D R  RIDVD+PH+H+ KHHKQE RH++ +A+ +R  IH   +HKH
Sbjct: 673 WWEDRFWADNQSIGDTRRHRIDVDNPHIHL-KHHKQEARHYRHDAQSKRRSIHDKRRHKH 731

Query: 647 SDRDTDYYYYLHHVQKDKHKHGRSKNSSVMQQLYLDTGKNDHIGHHRRRKFRE 699
           S +D+DYYYYLHHV K+KHK GRSKNSS+M Q+Y D  ++D IG  R RK RE
Sbjct: 732 SLQDSDYYYYLHHVHKNKHKQGRSKNSSIMHQVYSDRREDDGIGQRRCRKERE 784



 Score =  191 bits (484), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 101/147 (68%), Positives = 123/147 (83%)

Query: 1   MRNQTKSLKLKHFLLILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSDNLNCTTKIVL 60
           M++Q    + +   LI+  I   ++   V GVQILSKSKLEKCEK ++SDNLNCT KI+L
Sbjct: 1   MKDQKPRTRRRPLALIITIIFLSINGGSVYGVQILSKSKLEKCEKVSESDNLNCTKKIIL 60

Query: 61  NMAVPSGSSGGEASIVAEVVEVEENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPY 120
           +MAVPSGSSGGEASIVAEVVEVEENST KM+T+R+PP +TVNK+++YAVYE+TYIRDVPY
Sbjct: 61  DMAVPSGSSGGEASIVAEVVEVEENSTHKMQTLRVPPTITVNKSSAYAVYEITYIRDVPY 120

Query: 121 KPQEFYMKTRKCEPDAGADVVKICERQ 147
           KPQE+++KTRKCEPDA A VVKICER 
Sbjct: 121 KPQEYFVKTRKCEPDASAKVVKICERH 147


>gi|255537305|ref|XP_002509719.1| conserved hypothetical protein [Ricinus communis]
 gi|223549618|gb|EEF51106.1| conserved hypothetical protein [Ricinus communis]
          Length = 661

 Score =  891 bits (2303), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 464/715 (64%), Positives = 546/715 (76%), Gaps = 82/715 (11%)

Query: 8   LKLKHFLLILFCILNL-LSPRCVVGVQILSKSKLEKCEKRTDSDN--LNCTTKIVLNMAV 64
           ++ +  ++IL C+++  L    V GV+ILSKSKLEKCEK +DSD+  LNCT KIVLNMAV
Sbjct: 1   MEKQAIVIILCCLVSYYLLVANVNGVEILSKSKLEKCEKASDSDSDSLNCTAKIVLNMAV 60

Query: 65  PSGSSGGEASIVAEVVEVEENST-QKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQ 123
           PSGSSGGEASIVAE+VEVEENST   M+T+RIPPV+TVNK+A+YA+YELTYIRDV YKPQ
Sbjct: 61  PSGSSGGEASIVAEIVEVEENSTSNNMQTLRIPPVITVNKSATYALYELTYIRDVAYKPQ 120

Query: 124 EFYMKTRKCEPDAGADVVKICER-------------------QPICCPCGPQRRIPSSCG 164
           E+Y+KTRKCE DAG +VV+ICER                   +P CCPCGPQRR+PSSCG
Sbjct: 121 EYYVKTRKCERDAGTNVVQICERRVILLLLLRDEKGHIIEHTEPTCCPCGPQRRVPSSCG 180

Query: 165 NVFDKLLKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVKTGSKVSEVTVGPENKTA 224
           N FDKL+KGKANTAHC+RFPGDWFHVFGIGQRSIGFS+RIEVKT  KVSEV VGPEN+TA
Sbjct: 181 NFFDKLMKGKANTAHCVRFPGDWFHVFGIGQRSIGFSIRIEVKTRYKVSEVIVGPENRTA 240

Query: 225 TSADNFLKVNLIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTL 284
           TS DNFL+VNLIGDFVGY+++PSFE+FYLVIPRQ                    R RFTL
Sbjct: 241 TSKDNFLRVNLIGDFVGYSSLPSFEDFYLVIPRQ--------------------RVRFTL 280

Query: 285 DGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFE 344
           DG+ECNKIGVSYEAFN QP+FC+SPFWSCLHNQLWNYR+                     
Sbjct: 281 DGIECNKIGVSYEAFNQQPNFCASPFWSCLHNQLWNYRD--------------------- 319

Query: 345 RMNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFG 404
                 N G+HSFSIG+TEVLN+NLLIEL ADDIE+VYQRSPGKI++V IPTFEALTQFG
Sbjct: 320 ------NGGTHSFSIGITEVLNTNLLIELSADDIEFVYQRSPGKILNVTIPTFEALTQFG 373

Query: 405 VATITTQNTGEVEASYSLTFDCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTC 464
           V TITT NTG+VEASYSLT           EQ+FI+KP E +IRSFK+YPTT+QAAKY C
Sbjct: 374 VGTITTMNTGKVEASYSLT-----------EQFFIMKPNEIAIRSFKLYPTTDQAAKYIC 422

Query: 465 SAILKDSDFSEVDRAECQFSTMATVLDNGSQITPFQPPKSSINDFFESIESIGKKLWEGL 524
           +AILKDS+F+EVDRAECQFST+AT+LDNGSQITPFQPPK+S N F +S+ESI   LW+GL
Sbjct: 423 AAILKDSNFNEVDRAECQFSTIATILDNGSQITPFQPPKNSKNGFLDSVESIWNTLWKGL 482

Query: 525 RDFITGKACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAIFPTVLVLLWLLHQKGLFDPL 584
            DFITGK CRRKC+SFFDFSCHIQYIC+ W+V+FGL+LAI P VLVLLWLLHQKG FDPL
Sbjct: 483 VDFITGKTCRRKCTSFFDFSCHIQYICMGWMVMFGLLLAIIPLVLVLLWLLHQKGFFDPL 542

Query: 585 YDWWDDHFQSDNQRIRDFRSRRIDVDHPHVHVRKHHKQEGRHHKLEARRRRCGIHSDHKH 644
           YDWW+DH  +D QR    +   ID+ H H+HV++HH+   R  K +A+ +R   H +HKH
Sbjct: 543 YDWWEDHVCADKQRHGYIQRHNIDIHHHHIHVKQHHELGARRRKHDAQYKR-STHREHKH 601

Query: 645 KHSDRDTDYYYYLHHVQKDKHKHGRSKNSSVMQQLYLDTGKNDHIGHHRRRKFRE 699
            HS  DTDY YYLHHV KD+ K   +K S + Q   LD  ++D+I HHR RK +E
Sbjct: 602 NHSGGDTDYNYYLHHVYKDRSKRRSAKKSRIKQHGLLDEMEDDNIKHHRHRKEKE 656


>gi|356540460|ref|XP_003538707.1| PREDICTED: uncharacterized protein LOC100794070 [Glycine max]
          Length = 667

 Score =  885 bits (2286), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 460/701 (65%), Positives = 545/701 (77%), Gaps = 61/701 (8%)

Query: 16  ILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSD-NLNCTTKIVLNMAVPSGSSGGEAS 74
           I   I+ +LS   VVG+QI+SKSKLEKCEK ++SD NLNCTTKIVLNMAVPSGSSGGEAS
Sbjct: 7   ITLMIIFILSSFHVVGIQIISKSKLEKCEKNSNSDDNLNCTTKIVLNMAVPSGSSGGEAS 66

Query: 75  IVAEVVEVEENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKTRKCEP 134
           IVAE+VEVEENS++KM+T+RI PV+TVNKT++YA+Y+LTYIRDVPYKP+E+Y+KTRKCEP
Sbjct: 67  IVAELVEVEENSSRKMQTLRITPVITVNKTSAYALYQLTYIRDVPYKPEEYYVKTRKCEP 126

Query: 135 DAGADVVKICER------------QPICCPCGPQRRIPSSCGNVFDKLLKGKANTAHCLR 182
           DAGA+VVKICER            QPICCPCGPQR +PSSCGN FDKL KGKANTAHC+ 
Sbjct: 127 DAGANVVKICERLRDEEGHIIEYTQPICCPCGPQRWMPSSCGNFFDKLTKGKANTAHCVH 186

Query: 183 FPGDWFHVFGIGQRSIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGY 242
           FPGDWFHVFGIGQR++GFSV+I+VK+G+KVSEV VGP+N+T  S D F +VNLIGDFVGY
Sbjct: 187 FPGDWFHVFGIGQRTLGFSVQIQVKSGTKVSEVVVGPQNRTVISDDKFFRVNLIGDFVGY 246

Query: 243 TNIPSFEEFYLVIPRQGG-PGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNG 301
           TNIPSFE+FYLV+PRQ   PGQPQDLG N SMWMLLER RFTLDG+ECNKIGV+YEAFN 
Sbjct: 247 TNIPSFEDFYLVVPRQVCFPGQPQDLGRNISMWMLLERVRFTLDGIECNKIGVNYEAFNQ 306

Query: 302 QPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGV 361
           QP+FC SPFW+CLHNQLWN+READ NRI+RNQ+PLYG+EGRFER+NQHP+AGS+SFSIG+
Sbjct: 307 QPNFCPSPFWTCLHNQLWNFREADLNRISRNQVPLYGLEGRFERINQHPSAGSYSFSIGI 366

Query: 362 TEVLNSNLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYS 421
           TEVLN+NL++EL A+D+EYVYQRSPGKIISV +PTF ALTQFGVATITT+NTGEVEASYS
Sbjct: 367 TEVLNTNLVLELSANDVEYVYQRSPGKIISVSVPTFAALTQFGVATITTKNTGEVEASYS 426

Query: 422 LTFDCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAEC 481
           LTF+CS  +TLME  Y+                    A  Y   ++LKDSD++EVDRAEC
Sbjct: 427 LTFNCSKDITLME--YY--------------------AKTYARVSVLKDSDYNEVDRAEC 464

Query: 482 QFSTMATVLDNGSQITPFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACRRKCSSFF 541
           QF+T ATVLDN +Q+  F  P+               KL+ G + FI  K  R KCS FF
Sbjct: 465 QFATTATVLDNDTQVCSFLVPEF--------------KLFPGNK-FI--KKNREKCSGFF 507

Query: 542 DFSCHIQYICLSWLVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRD 601
           DF CHIQY+CLSW+++FGL L IF TVLVLLWLLHQKGLFDPLYDWW+D   +D Q I D
Sbjct: 508 DFKCHIQYVCLSWVMMFGLFLTIFLTVLVLLWLLHQKGLFDPLYDWWEDILGADEQIIMD 567

Query: 602 FRSRRIDVDHPHVHVRKHHKQEGRHHKLEARRRRCGIHSDHKHKHSDRDTDYYYYLHHVQ 661
            R  +ID  H H+H  KHHKQE RH    A  RR   + +H HKHS+R++DY+  LHHV 
Sbjct: 568 KRKFKIDKGHHHIHENKHHKQEHRHSNYSAENRRRTTY-EHMHKHSERNSDYFDDLHHVH 626

Query: 662 KDKHKHGRSK-NSSVMQQLYLDTGKNDHIGHHRRRKFRESS 701
           K+ HK+   K N   +Q +       DH  HH+ RK R+SS
Sbjct: 627 KEMHKYEHKKQNMDNVQHIV------DHPAHHKHRKKRDSS 661


>gi|84453079|dbj|BAE71142.1| generative cell specific-1 [Lilium longiflorum]
          Length = 698

 Score =  840 bits (2171), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 404/670 (60%), Positives = 520/670 (77%), Gaps = 33/670 (4%)

Query: 30  VGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIVAEVVEVE--ENST 87
             V+ILSKS++E+C K +DSD L+C  KIV+++AVPSGSSGGEASIVA++VEVE  EN+T
Sbjct: 23  TAVEILSKSRVERCTKTSDSDKLDCNNKIVVDLAVPSGSSGGEASIVAQLVEVEQRENAT 82

Query: 88  QKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKTRKCEPDAGADVVKICE-- 145
           +KM T+R PPV+T+NK+A+YA+Y+L Y+RDV YKP+EF+++TR+CEPDA  +++  C+  
Sbjct: 83  RKMHTLREPPVITINKSAAYALYKLIYLRDVAYKPEEFHVETRRCEPDAPYEILGECQGL 142

Query: 146 ----------RQPICCPCGPQRRIPSSCGNVFDKLLKGKANTAHCLRFPGDWFHVFGIGQ 195
                      QP+CCPCGP+ R P++CG++F ++ KGK NTAHCL+FPGDWFHVF IG+
Sbjct: 143 RDQNGNIIENTQPVCCPCGPEGRYPTTCGSIF-QVFKGKTNTAHCLKFPGDWFHVFAIGK 201

Query: 196 RSIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVI 255
           RS+GFSVR+EV+ GS  SE  VGP+N+   S DNFL+VNLIGDFVGYT+IPSFE+FYLV 
Sbjct: 202 RSLGFSVRVEVRKGSSQSEAIVGPDNRAVLSEDNFLRVNLIGDFVGYTSIPSFEDFYLVT 261

Query: 256 PRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLH 315
           PR G  GQP DLGG++S WMLLER RFTLDGLECNKIGVSY+A+  QP+FCSSP WSCLH
Sbjct: 262 PRLGAAGQPTDLGGDYSKWMLLERERFTLDGLECNKIGVSYDAYRSQPNFCSSPLWSCLH 321

Query: 316 NQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRA 375
           NQLW++ EADQN+I RNQ P Y VEGRF+R+NQHPNAG+HSFS+G+TE LN+NLLIELRA
Sbjct: 322 NQLWHFWEADQNQIRRNQPPEYVVEGRFKRINQHPNAGTHSFSMGITEALNTNLLIELRA 381

Query: 376 DDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLMEE 435
           DDI+YVYQRSPGK++++ IPTFEALTQFG AT+TT+NTG++EASYSLTF C +GV+ +EE
Sbjct: 382 DDIDYVYQRSPGKVLAINIPTFEALTQFGTATVTTKNTGKLEASYSLTFRCRSGVSYLEE 441

Query: 436 QYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQ 495
           Q++I+KP+E   RSF++Y T++ AA Y C+AILK SDFSEVDRA+CQF+T AT+LD+GSQ
Sbjct: 442 QFYIMKPEEEVSRSFRLYLTSDLAATYECAAILKASDFSEVDRADCQFTTTATILDDGSQ 501

Query: 496 ITPFQPPKS-SINDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSW 554
           I P    K   IN  F+SI+SI   +WEGL +F +GK CR KCSSFF+F CH+QYIC+SW
Sbjct: 502 IVPANELKEKGINGIFKSIKSIWGNIWEGLLEFFSGKTCRSKCSSFFNFRCHMQYICMSW 561

Query: 555 LVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQR------IRDFRSRRID 608
           ++L  L+LA+FPT +VLLWLLHQ+GLFDP+YDWW D +    QR      +RD RS R  
Sbjct: 562 ILLLSLLLAVFPTGVVLLWLLHQQGLFDPIYDWWYDRYGEGFQRSSSLFSLRDSRSARHR 621

Query: 609 VD-HPHVHVRKHHKQEGRHHKLEARRRRCGIHSDHKHKHSD-RDTDYYYYLH----HVQK 662
            D +  +  RKH   E +  K     R       H+  HS+    D+Y++ H    HV K
Sbjct: 622 GDNNARLRDRKHSFYEEKKRKRSHTSRML-----HERSHSEIAAGDHYHHRHESHLHVHK 676

Query: 663 DKHKHGRSKN 672
           ++HK+  SK+
Sbjct: 677 ERHKYKHSKD 686


>gi|224053957|ref|XP_002298057.1| predicted protein [Populus trichocarpa]
 gi|222845315|gb|EEE82862.1| predicted protein [Populus trichocarpa]
          Length = 622

 Score =  830 bits (2144), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 428/709 (60%), Positives = 495/709 (69%), Gaps = 128/709 (18%)

Query: 14  LLILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSDN-LNCTTKIVLNMAVPSGSSGGE 72
           + ++FCI   LS   V  ++ILSKSKLE+CEK +DSDN LNCT KIVLNMAVPSGSSGGE
Sbjct: 8   IFLIFCIF--LSYFTVQSIEILSKSKLERCEKASDSDNDLNCTRKIVLNMAVPSGSSGGE 65

Query: 73  ASIVAEVVEVEENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKTRKC 132
           ASIVAE+ EVEEN+T  M TVR+PP                   DV YKP+E+Y+KTRKC
Sbjct: 66  ASIVAEIAEVEENATDLMETVRVPP-------------------DVAYKPEEYYVKTRKC 106

Query: 133 EPDAGADVVKICERQPICCPCGPQRRIPSSCGNVFDKLLKGKANTAHCLRFPGDWFHVFG 192
           + DAGA+VVKICE           R+     G              HC       FHVFG
Sbjct: 107 DRDAGANVVKICE----------SRQTDEREGQ-------------HCTLRQISRFHVFG 143

Query: 193 IGQRSIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFY 252
           IGQRS+GFSVRIEVKTGSKVSEVTVGPEN+T TS DNFL+VNLIGDFVGY+NIPSFE+FY
Sbjct: 144 IGQRSMGFSVRIEVKTGSKVSEVTVGPENRTVTSKDNFLRVNLIGDFVGYSNIPSFEDFY 203

Query: 253 LVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWS 312
           LVIPRQG PGQPQDLG NFSMWMLLER RFTLDG+ECNKIGVSYEAF+GQP+FC+SPFWS
Sbjct: 204 LVIPRQGEPGQPQDLGRNFSMWMLLERVRFTLDGVECNKIGVSYEAFSGQPNFCASPFWS 263

Query: 313 CLHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIE 372
           CLHNQLWN+                           H NAG+HSFSIG+TEVLN+NLLIE
Sbjct: 264 CLHNQLWNF---------------------------HDNAGTHSFSIGITEVLNTNLLIE 296

Query: 373 LRADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLT--------- 423
           L ADDIEYVYQRSPGK++S  IPTFEALTQFGVAT++ +N GEVEASYSLT         
Sbjct: 297 LTADDIEYVYQRSPGKLLSFTIPTFEALTQFGVATVSAENIGEVEASYSLTYGVVDVDSI 356

Query: 424 -------------FDCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKD 470
                        FDCS GV+LMEEQ+FI+KP E +IRSFKIYPTT++AA+Y C+AILKD
Sbjct: 357 KICEKVEGQDLSHFDCSRGVSLMEEQFFILKPNEITIRSFKIYPTTDKAARYVCAAILKD 416

Query: 471 SDFSEVDRAECQFSTMATVLDNGSQITPFQPPKSSINDFFESIESIGKKLWEGLRDFITG 530
           S F+E+DRAECQF T AT+LDNGSQI PF PPK+S+N FFESIE+I  ++WEGL DFITG
Sbjct: 417 SGFNEIDRAECQFFTTATILDNGSQIAPFLPPKTSVNGFFESIENIWNRIWEGLVDFITG 476

Query: 531 KACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWWDD 590
           K CR+KCSSFFDFSCHIQY                            KGLFDPLYDWW+D
Sbjct: 477 KTCRQKCSSFFDFSCHIQY----------------------------KGLFDPLYDWWED 508

Query: 591 HFQSDNQRIRDFRSRRIDVDHPHVHVRKHHKQEGRHHKLEARRRRCGIHSDHKHKHSDRD 650
           H  +D QRIRD R    D     +HV +HH+   R HK  A ++R  IH +H+H+HS RD
Sbjct: 509 HLWTDEQRIRDTRRHNKD-----IHVNRHHELGARQHKHNAHKKRT-IHQEHRHRHSGRD 562

Query: 651 TDYYYYLHHVQKDKHKHGRSKNSSVMQQLYLDTGKNDHIGHHRRRKFRE 699
           T+YY+YLHHV KDK KH  SK +SVMQQ+YLD   N  +GHH  RK R+
Sbjct: 563 TEYYHYLHHVHKDKSKHRGSKKTSVMQQVYLDGVGNTKVGHHGHRKERD 611


>gi|357154351|ref|XP_003576754.1| PREDICTED: uncharacterized protein LOC100833308 [Brachypodium
           distachyon]
          Length = 740

 Score =  823 bits (2127), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 381/578 (65%), Positives = 472/578 (81%), Gaps = 16/578 (2%)

Query: 31  GVQILSKSKLEKCEKRTDSD-NLNCTTKIVLNMAVPSGSSGGEASIVAEVVEVEENSTQK 89
           GV+ILSKS++E+C + + +  +L C  KI+LN+AVP+GS+GGEAS+VA+VVEVEEN TQ 
Sbjct: 29  GVEILSKSRVERCARDSGAGGHLACDRKIILNVAVPTGSTGGEASMVAQVVEVEENDTQA 88

Query: 90  MRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKTRKCEPDAGADVVKICER--- 146
           M+T+R PPV+T+NK+A+YAVY L YIRDV Y+P+E +++TRKCE DAGA+VV+ CER   
Sbjct: 89  MQTIRDPPVITINKSATYAVYALNYIRDVAYRPEEQFVRTRKCESDAGAEVVRECERLRD 148

Query: 147 ---------QPICCPCGPQRRIPSSCGNVFDKLLKGKANTAHCLRFPGDWFHVFGI-GQR 196
                    +P+CCPCG Q R+PSSCG  FDK++KGKANTAHC+RFPGDWFHVFGI    
Sbjct: 149 QNGHVIEHTEPVCCPCGSQHRVPSSCGTFFDKMVKGKANTAHCVRFPGDWFHVFGIETSY 208

Query: 197 SIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIP 256
           S+GFS+R++VK GS V+E+ VGPENKT  S DNFL+VNLIGDFVGY ++P+FE FYLV P
Sbjct: 209 SLGFSIRVQVKKGSSVTEIIVGPENKTVVSKDNFLRVNLIGDFVGYKSVPTFENFYLVTP 268

Query: 257 RQG-GPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLH 315
           R+G G GQPQ LG  FS WMLLER RFTLDGLECNKIGV YEA+  QPSFCS+PFWSCL+
Sbjct: 269 RKGDGGGQPQVLGDEFSRWMLLERVRFTLDGLECNKIGVGYEAYRNQPSFCSNPFWSCLY 328

Query: 316 NQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRA 375
           NQLWN+ E+D NRINR Q P Y V+GRFER+NQHP+AG H+FS+G+TE +N+NLLIEL A
Sbjct: 329 NQLWNFWESDNNRINRKQQPQYVVQGRFERINQHPHAGVHTFSVGITESVNTNLLIELSA 388

Query: 376 DDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLMEE 435
           DDI+YVYQRSPGKIIS+ +PTFEAL+Q G A +T +N G++EASYSLTFDC +G+T +EE
Sbjct: 389 DDIDYVYQRSPGKIISINVPTFEALSQVGTAQVTVRNIGKLEASYSLTFDCLSGITYVEE 448

Query: 436 QYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQ 495
           QYFI+KP E  IRSF ++ +T+QA+KY C+AILK SDFSE+DRAECQFST ATVLDNG+Q
Sbjct: 449 QYFILKPDEVLIRSFYLHSSTDQASKYRCAAILKASDFSELDRAECQFSTAATVLDNGTQ 508

Query: 496 ITPF-QPPKSSINDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSW 554
           I P  Q  K  I  FFE+I+++ +  W+ + DF TG++C  +CSSFFD SCHIQYIC+ W
Sbjct: 509 IGPTNQHAKGGIRGFFEAIKALFRNTWDTVIDFFTGRSCSTRCSSFFDLSCHIQYICIGW 568

Query: 555 LVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHF 592
           LV+FGL+LAI P V VLLWLLHQ GLFDPLYD W+D F
Sbjct: 569 LVMFGLLLAILPAVAVLLWLLHQNGLFDPLYDCWEDVF 606


>gi|291620044|gb|ADE20442.1| HAP2 [Sisymbrium irio]
          Length = 504

 Score =  821 bits (2120), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 377/504 (74%), Positives = 444/504 (88%), Gaps = 14/504 (2%)

Query: 63  AVPSGSSGGEASIVAEVVEVEENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKP 122
           AVPSGSSGGEASIVAE+VEVE+NS+  M+TVRIPPV+TVNK+A+YA+Y+LTYIRDVPYKP
Sbjct: 1   AVPSGSSGGEASIVAEIVEVEDNSSSNMQTVRIPPVITVNKSAAYALYDLTYIRDVPYKP 60

Query: 123 QEFYMKTRKCEPDAGADVVKICER------------QPICCPCGPQRRIPSSCGNVFDKL 170
           QEF++ TRKCEPD+G D+V ICER            QP+CCPCGP+RR+PSSCG++F+++
Sbjct: 61  QEFHVTTRKCEPDSGPDIVDICERLRDDTGNVLEQTQPVCCPCGPERRLPSSCGDIFERM 120

Query: 171 LKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVKTGSKVSEVTVGPENKTATSADNF 230
           +KGKANTAHCLRFPGDW+HVF IGQRS+GFSVR+E+KTG++VSEV +GPEN+TAT+ DNF
Sbjct: 121 VKGKANTAHCLRFPGDWYHVFSIGQRSLGFSVRVELKTGTRVSEVIIGPENRTATANDNF 180

Query: 231 LKVNLIGDFVGYTNIPSFEEFYLVIPRQGG-PGQPQDLGGNFSMWMLLERTRFTLDGLEC 289
           LKVNLIGDF GYTNIPSFE+FYLVIPR+    GQP +LGGN+SMWMLLER RFTLDG+EC
Sbjct: 181 LKVNLIGDFAGYTNIPSFEDFYLVIPREAAVEGQPGNLGGNYSMWMLLERVRFTLDGIEC 240

Query: 290 NKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQH 349
           +KIGV YEAFN QP+FCS+P+WSCLHNQLWN+READ NR+NR+QLPLYG+EGRFER+NQH
Sbjct: 241 DKIGVGYEAFNNQPNFCSAPYWSCLHNQLWNFREADVNRMNRHQLPLYGLEGRFERINQH 300

Query: 350 PNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFGVATIT 409
           PN+G HSFSIGVTE LN+NL+IELRADDIEYV+Q+SPGKII++ IPTFEALTQFGVA +T
Sbjct: 301 PNSGPHSFSIGVTETLNTNLMIELRADDIEYVFQKSPGKIINIAIPTFEALTQFGVAAVT 360

Query: 410 TQNTGEVEASYSLTFDCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILK 469
           T+NTGEVEASYSLTFDCS GV  +EEQ+FIIKP E + RSFK+YPT +QAAKY C+AILK
Sbjct: 361 TKNTGEVEASYSLTFDCSKGVAFVEEQFFIIKPNEATTRSFKLYPTKDQAAKYICTAILK 420

Query: 470 DSDFSEVDRAECQFSTMATVLDNGSQIT-PFQPPKSSINDFFESIESIGKKLWEGLRDFI 528
           DS FSEVDRAECQFST ATVLDNG+Q+T PFQ P++    FFESI  +   L  GL DFI
Sbjct: 421 DSQFSEVDRAECQFSTTATVLDNGTQVTNPFQIPETRPKGFFESIRLMWTNLVNGLVDFI 480

Query: 529 TGKACRRKCSSFFDFSCHIQYICL 552
           TG +CR KCSSFFDFSCHIQY+CL
Sbjct: 481 TGDSCRNKCSSFFDFSCHIQYVCL 504


>gi|4539463|emb|CAB39943.1| putative protein [Arabidopsis thaliana]
 gi|7267872|emb|CAB78215.1| putative protein [Arabidopsis thaliana]
          Length = 658

 Score =  700 bits (1807), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 389/746 (52%), Positives = 462/746 (61%), Gaps = 183/746 (24%)

Query: 29  VVGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGS-------------------- 68
           V G+QILSKSKLEKCEK +DS NLNC+TKIVLN+AVPSGS                    
Sbjct: 22  VDGIQILSKSKLEKCEKTSDSGNLNCSTKIVLNLAVPSGSVRFFFFSKTHIYTCFGFVFI 81

Query: 69  --------------SGGEASIVAEVVEVEENSTQKMRTVRIPPVLTVNKTASYAVYELTY 114
                         SGGEASIVAE+VEVE+NS+  M+TVRIPPV+TVNK+A+YA+Y+LTY
Sbjct: 82  NFVFTCFGFVDETKSGGEASIVAEIVEVEDNSSSNMQTVRIPPVITVNKSAAYALYDLTY 141

Query: 115 IRDVPYKPQEFYMKTRKCEPDAGADVVKICER------------QPICCPCGPQRRIPSS 162
           IRDVPYKPQE+++ TRKCE DAG D+V+ICER            QPICCPCGPQRR+PSS
Sbjct: 142 IRDVPYKPQEYHVTTRKCEHDAGPDIVQICERLRDEKGNVLEQTQPICCPCGPQRRMPSS 201

Query: 163 CGNV-----------FDKLLKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVKTGSK 211
           CG++           FDK++KGKANTAHCLRFPGDW                        
Sbjct: 202 CGDICMCFSFVTFKEFDKMIKGKANTAHCLRFPGDW------------------------ 237

Query: 212 VSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNF 271
                      TAT+ DNFLKVNLIGDF GYT+IPSFE+FYLVIPR              
Sbjct: 238 -----------TATANDNFLKVNLIGDFGGYTSIPSFEDFYLVIPR-------------- 272

Query: 272 SMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINR 331
                 ER RFTLDGLECNKIGV YEAFN QP+FCSSP+WSCLHNQLWN+RE        
Sbjct: 273 ------ERVRFTLDGLECNKIGVGYEAFNTQPNFCSSPYWSCLHNQLWNFRE-------- 318

Query: 332 NQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIIS 391
                              NAG HSFSIGVTE LN+NL+IELRADDIEYV+QRSPGKII+
Sbjct: 319 -------------------NAGPHSFSIGVTETLNTNLMIELRADDIEYVFQRSPGKIIN 359

Query: 392 VIIPTFEALTQFGVATITTQNTGEVEASYSLT----------FDCSTGVTLMEEQYFIIK 441
           + IPTFEALTQFGVA +  +NTGEVEASYSLT          FDCS GV  +EEQ+FIIK
Sbjct: 360 IAIPTFEALTQFGVAAVIIKNTGEVEASYSLTVISKTESYLIFDCSKGVAFVEEQFFIIK 419

Query: 442 PKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQIT-PFQ 500
           PK  + RSFK+YPT +QAAKY C+AILKDS FSEVDRAECQFST ATVLDNG+Q+T PFQ
Sbjct: 420 PKAVTTRSFKLYPTKDQAAKYICTAILKDSQFSEVDRAECQFSTTATVLDNGTQVTNPFQ 479

Query: 501 PPKSSINDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSWLVLFGL 560
            P++    FF+SI  +  K+  GL DFITG  C                   SW+V+FGL
Sbjct: 480 IPETQPKGFFDSIRILWTKIINGLVDFITGDTC-------------------SWMVMFGL 520

Query: 561 VLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRDFRSRRIDVDHPHVHVRKHH 620
           +LA+FP   +LLWLLHQKGLFDP YDWW+DHF  D+ R R   SR   V+  H H +  H
Sbjct: 521 LLALFPITCLLLWLLHQKGLFDPCYDWWEDHFDLDHHR-RLLPSRADVVNRHHHHHKHRH 579

Query: 621 KQEGRHHKLEARRRRCGIHSDHKHK----HSDRDTDYYYYLHHVQKD--KHKHGRSKNSS 674
                    +  +   G   D   K        D+ YY+ LH V KD  + +  R+K+  
Sbjct: 580 HHNHHRRTHQRHKHHHGQDDDVLQKMMLERDHSDSHYYHQLHRVHKDSKQKQRRRAKHGI 639

Query: 675 VMQQLYLDTGKNDHIGHHRRRKFRES 700
           V+        ++ H+   R+++ RES
Sbjct: 640 VLP-------RDVHVERQRKQRLRES 658


>gi|115462909|ref|NP_001055054.1| Os05g0269500 [Oryza sativa Japonica Group]
 gi|75110629|sp|Q5W6B9.1|HAP2A_ORYSJ RecName: Full=Protein HAPLESS 2-A; Flags: Precursor
 gi|55168095|gb|AAV43963.1| unknown protein [Oryza sativa Japonica Group]
 gi|113578605|dbj|BAF16968.1| Os05g0269500 [Oryza sativa Japonica Group]
 gi|215706325|dbj|BAG93181.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 722

 Score =  699 bits (1803), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 341/585 (58%), Positives = 437/585 (74%), Gaps = 32/585 (5%)

Query: 31  GVQILSKSKLEKCEKRTDSDN-LNCTTKIVLNMAVPSGSSGGEASIVAEVVEVEE--NST 87
           G +ILSKS+LE C   +D+   L C  K+V+++AVPSG+SGGEAS+VA V  VEE  ++ 
Sbjct: 24  GTEILSKSRLESCSHDSDAGGRLKCDRKLVVDLAVPSGASGGEASLVARVAGVEEENDTP 83

Query: 88  QKMRTVRIPPVLTVNKTASYAVYELTYI-RDVPYKPQEFYMKTRKCEPDAGADVVKICER 146
              +++R PPV+TV+K+A+YA+Y LTY+ RDV Y+P E Y+KT KCEP AGA VV  CER
Sbjct: 84  SATKSIRDPPVITVSKSATYALYALTYLDRDVAYRPDEKYVKTHKCEPYAGAKVVGECER 143

Query: 147 ------------QPICCPCGPQRRIPSSCGNVFDKLLKGKANTAHCLRFPGDWFHVFGIG 194
                       +PICCPCGP R + S CG+++ KL KGKANTAHC+RFPGDWFHVFGIG
Sbjct: 144 LWDEKGNVIKQTEPICCPCGPHR-VQSKCGDIWSKLTKGKANTAHCVRFPGDWFHVFGIG 202

Query: 195 QRSIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLV 254
             S+ FS+R++VK GS V +V VGPENKT  S DNFL+V ++GD+ GYT+IPSFE+ YLV
Sbjct: 203 AWSLRFSIRVQVKKGSSVWDVVVGPENKTVVSGDNFLRVKVVGDYTGYTSIPSFEDNYLV 262

Query: 255 IPRQG-GPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSC 313
            PR+G G  QPQDLG   S WM+L+R RFTLDGLEC+KIGV YEA+  QP+FCS+P+ SC
Sbjct: 263 TPRKGTGSSQPQDLGNEHSKWMILDRVRFTLDGLECDKIGVGYEAYRNQPNFCSAPYGSC 322

Query: 314 LHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIEL 373
           L NQLWN+ E D+ RI+ +QLPLY VEGRF+R+NQHPNAG+H+FS+GVTE LN+NLLIEL
Sbjct: 323 LGNQLWNFWEYDKRRIDNSQLPLYIVEGRFQRINQHPNAGAHTFSVGVTEDLNTNLLIEL 382

Query: 374 RADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLM 433
            ADDIEYVYQRSP KII + +PTFEAL+Q G+A +TT+N G++E+SYSLTF CS+G++ +
Sbjct: 383 MADDIEYVYQRSPAKIIDIRVPTFEALSQVGIANVTTKNIGKLESSYSLTFKCSSGISPV 442

Query: 434 EEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNG 493
           EEQ + +KP E   RSF++  TT+QAA + C AILK SDFSE+DR   +FST ATV +NG
Sbjct: 443 EEQLYTMKPDEVIARSFELRSTTDQAAMHQCEAILKASDFSELDREGYRFSTAATVYNNG 502

Query: 494 SQITPFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLS 553
           +QI P    K     F++SI    K LW  L DF+TG+ C  KC   FDF CHIQY+C+ 
Sbjct: 503 AQIGPTNDHKK--GGFWDSI----KALWRNLIDFLTGRLCWTKCPRLFDFGCHIQYVCIG 556

Query: 554 WLVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWW----DDHFQS 594
           W++    +L + P  +V LWLLHQ+GLFDPLYDWW    DD +++
Sbjct: 557 WIL----LLLLIPAAVVFLWLLHQEGLFDPLYDWWGLEPDDDYRA 597


>gi|222630910|gb|EEE63042.1| hypothetical protein OsJ_17850 [Oryza sativa Japonica Group]
          Length = 722

 Score =  699 bits (1803), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 341/585 (58%), Positives = 437/585 (74%), Gaps = 32/585 (5%)

Query: 31  GVQILSKSKLEKCEKRTDSDN-LNCTTKIVLNMAVPSGSSGGEASIVAEVVEVEE--NST 87
           G +ILSKS+LE C   +D+   L C  K+V+++AVPSG+SGGEAS+VA V  VEE  ++ 
Sbjct: 24  GTEILSKSRLESCSHDSDAGGRLKCDRKLVVDLAVPSGASGGEASLVARVAGVEEENDTP 83

Query: 88  QKMRTVRIPPVLTVNKTASYAVYELTYI-RDVPYKPQEFYMKTRKCEPDAGADVVKICER 146
              +++R PPV+TV+K+A+YA+Y LTY+ RDV Y+P E Y+KT KCEP AGA VV  CER
Sbjct: 84  SATKSIRDPPVITVSKSATYALYALTYLDRDVAYRPDEKYVKTHKCEPYAGAKVVGECER 143

Query: 147 ------------QPICCPCGPQRRIPSSCGNVFDKLLKGKANTAHCLRFPGDWFHVFGIG 194
                       +PICCPCGP R + S CG+++ KL KGKANTAHC+RFPGDWFHVFGIG
Sbjct: 144 LWDEKGNVIKQTEPICCPCGPHR-VQSKCGDIWSKLTKGKANTAHCVRFPGDWFHVFGIG 202

Query: 195 QRSIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLV 254
             S+ FS+R++VK GS V +V VGPENKT  S DNFL+V ++GD+ GYT+IPSFE+ YLV
Sbjct: 203 AWSLRFSIRVQVKKGSSVWDVVVGPENKTVVSGDNFLRVKVVGDYTGYTSIPSFEDNYLV 262

Query: 255 IPRQG-GPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSC 313
            PR+G G  QPQDLG   S WM+L+R RFTLDGLEC+KIGV YEA+  QP+FCS+P+ SC
Sbjct: 263 TPRKGTGSSQPQDLGNEHSKWMILDRVRFTLDGLECDKIGVGYEAYRNQPNFCSAPYGSC 322

Query: 314 LHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIEL 373
           L NQLWN+ E D+ RI+ +QLPLY VEGRF+R+NQHPNAG+H+FS+GVTE LN+NLLIEL
Sbjct: 323 LGNQLWNFWEYDKRRIDNSQLPLYIVEGRFQRINQHPNAGAHTFSVGVTEDLNTNLLIEL 382

Query: 374 RADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLM 433
            ADDIEYVYQRSP KII + +PTFEAL+Q G+A +TT+N G++E+SYSLTF CS+G++ +
Sbjct: 383 MADDIEYVYQRSPAKIIDIRVPTFEALSQVGIANVTTKNIGKLESSYSLTFKCSSGISPV 442

Query: 434 EEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNG 493
           EEQ + +KP E   RSF++  TT+QAA + C AILK SDFSE+DR   +FST ATV +NG
Sbjct: 443 EEQLYTMKPDEVIARSFELRSTTDQAAMHQCEAILKASDFSELDREGYRFSTAATVYNNG 502

Query: 494 SQITPFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLS 553
           +QI P    K     F++SI    K LW  L DF+TG+ C  KC   FDF CHIQY+C+ 
Sbjct: 503 AQIGPTNDHKK--GGFWDSI----KALWRNLIDFLTGRLCWTKCPRLFDFGCHIQYVCIG 556

Query: 554 WLVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWW----DDHFQS 594
           W++    +L + P  +V LWLLHQ+GLFDPLYDWW    DD +++
Sbjct: 557 WIL----LLLLIPAAVVFLWLLHQEGLFDPLYDWWGLEPDDDYRA 597


>gi|385178637|sp|B9G4M9.1|HAP2B_ORYSJ RecName: Full=Protein HAPLESS 2-B; Flags: Precursor
 gi|222641945|gb|EEE70077.1| hypothetical protein OsJ_30063 [Oryza sativa Japonica Group]
          Length = 714

 Score =  657 bits (1696), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 333/585 (56%), Positives = 420/585 (71%), Gaps = 40/585 (6%)

Query: 31  GVQILSKSKLEKCEKRTDSDN---LNCTTKIVLNMAVPSGSSGGEASIVAEVVEVEENST 87
           GV++L+KS+LE C +    D    L C +KIV+++AVPSGS    AS+VA V EVEEN T
Sbjct: 33  GVEVLAKSRLESCARGGSDDGRDRLTCDSKIVVDLAVPSGS----ASLVARVAEVEENGT 88

Query: 88  QKMRT-VRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKTRKCEPDAGADVVKICER 146
           +     +R P ++T+NK+  YA+Y+LTY+RDV YKP+E ++KTRKCEP+AGA+VVK CER
Sbjct: 89  EAGEMPIRDPLIITINKSEVYALYDLTYLRDVAYKPEEKFVKTRKCEPEAGANVVKSCER 148

Query: 147 ------------QPICCPCGPQRRIPSSCGNVFDKLLKGKANTAHCLRFPGDWFHVFGIG 194
                       +P+CCPCGP RR+PSSCGN+ DK+ KGKANTAHCLRFP DWFHVF IG
Sbjct: 149 LRDEKGSIIEHTEPVCCPCGPHRRVPSSCGNILDKVAKGKANTAHCLRFPDDWFHVFDIG 208

Query: 195 QRSIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLV 254
           +RS+ FS+R++VK GS  SEV VGPEN+T  S D+ L+VNL+GDF GYT++PS E FYLV
Sbjct: 209 RRSLWFSIRVQVKKGSSESEVIVGPENRTVVSEDSSLRVNLVGDFAGYTSLPSLENFYLV 268

Query: 255 IPRQG-GPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSC 313
            PR+G G GQ + LG +FS WMLLER  FTLDGLECNKIGV YEAF  QP+FCSSP  SC
Sbjct: 269 TPRKGVGGGQLEVLGDDFSRWMLLERVLFTLDGLECNKIGVGYEAFRSQPNFCSSPLDSC 328

Query: 314 LHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIEL 373
           L +QL  + E D+NR+N +Q P Y V G+FER+NQ+PNAG H+FS+G+ EVLN+NL+IEL
Sbjct: 329 LGDQLSKFWEIDKNRVNNSQPPQYVVLGKFERINQYPNAGVHTFSVGIPEVLNTNLMIEL 388

Query: 374 RADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLM 433
            ADDIEYVYQRS GKIIS+ I +FEAL+Q G A + T+N G +EASYSLTFDC +G+  +
Sbjct: 389 SADDIEYVYQRSSGKIISINISSFEALSQVGSARVKTKNIGRLEASYSLTFDCLSGINPV 448

Query: 434 EEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNG 493
           EEQYFI+KP E  IR+F +  +T+QA+ YTC AILK SDFSE+DR E QFST ATVL+NG
Sbjct: 449 EEQYFIMKPDEKLIRTFDLRSSTDQASNYTCQAILKASDFSELDRKESQFSTTATVLNNG 508

Query: 494 SQITPFQP-PKSSINDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICL 552
           +QI   +   K  I  FFE+I++   K+W  L +F TG  C  +C SF  F  H      
Sbjct: 509 TQIGSSENHTKGGIWGFFEAIKAWCAKMWHMLINFFTGTTCSTRCWSFLKFVIHGL---- 564

Query: 553 SWLVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQ 597
                          ++ +LWLLH+KGLFDPLY WWD    S+ Q
Sbjct: 565 --------------LLVAVLWLLHRKGLFDPLYYWWDGVVGSEAQ 595


>gi|218202482|gb|EEC84909.1| hypothetical protein OsI_32102 [Oryza sativa Indica Group]
          Length = 718

 Score =  640 bits (1652), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 328/585 (56%), Positives = 411/585 (70%), Gaps = 37/585 (6%)

Query: 31  GVQILSKSKLEKCEKRTDSDNLNC---TTKIVLNMAVPSGSSGGEASIVAEVVEVEENST 87
           GV++L+KS+LE C +    D       T +       P  + GGEAS+VA V EVEEN T
Sbjct: 36  GVEVLAKSRLESCARGGSDDGATASPATARSSSTWPCPV-ARGGEASLVARVAEVEENGT 94

Query: 88  QKMRTVRIPP-VLTVNKTASYAVYELTYIRDVPYKPQEFYMKTRKCEPDAGADVVKICER 146
           +      + P ++T+NK+  YA+Y+LTY+RDV Y P+E Y+KTRKCEP+AGA+VVK CER
Sbjct: 95  EAGEMPILDPLIITINKSEVYALYDLTYLRDVAYIPEEKYVKTRKCEPEAGANVVKSCER 154

Query: 147 ------------QPICCPCGPQRRIPSSCGNVFDKLLKGKANTAHCLRFPGDWFHVFGIG 194
                       +P+CCPCGP RR+PSSCGN+FDK+ KGKANTAHCLRFP DWFHVF IG
Sbjct: 155 LRDEKGSIIEHTEPVCCPCGPHRRVPSSCGNIFDKVAKGKANTAHCLRFPDDWFHVFDIG 214

Query: 195 QRSIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLV 254
           +RS+ FS+R++VK GS  SEV VGPEN+T  S D+ L+VNL+GDF GYT++PS E FYLV
Sbjct: 215 RRSLWFSIRVQVKKGSSESEVIVGPENRTVVSEDSSLRVNLVGDFAGYTSLPSLENFYLV 274

Query: 255 IPRQG-GPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSC 313
            PR+G G GQ Q LG +FS WMLLER  FTLDGLECNKIGV YEAF  QP+FCSSP  SC
Sbjct: 275 TPRKGVGGGQLQVLGDDFSRWMLLERVLFTLDGLECNKIGVGYEAFRSQPNFCSSPLDSC 334

Query: 314 LHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIEL 373
           L +QL  + E D+NR+N +Q P Y V G+FER+NQ+PNAG H+FS+G+ EVLN+NL+IEL
Sbjct: 335 LGDQLSKFWEIDKNRVNNSQPPQYVVLGKFERINQYPNAGVHTFSVGIPEVLNTNLMIEL 394

Query: 374 RADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLM 433
            ADDIEYVYQRS GKIIS+ I +FEAL+Q G A + T+N G +EASYSLTFDC +G+  +
Sbjct: 395 SADDIEYVYQRSSGKIISINISSFEALSQVGSARVKTKNIGRLEASYSLTFDCLSGINPV 454

Query: 434 EEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNG 493
           EEQYFI+KP E  IR+F +  +T+QA+ YTC AILK SDFSE+DR E QFST ATVL+NG
Sbjct: 455 EEQYFIMKPDEKLIRTFDLRSSTDQASNYTCQAILKASDFSELDRKESQFSTTATVLNNG 514

Query: 494 SQITPFQP-PKSSINDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICL 552
           +QI   +   K  I  FFE+I++   K+W  L +F TG  C  +C SF  F  H      
Sbjct: 515 TQIGSSENHTKGGIWGFFEAIKAWCAKMWHMLINFFTGTTCSTRCWSFLKFVIHGL---- 570

Query: 553 SWLVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQ 597
                          ++ +LWLLH+KGLFDPLY WWD    S+ Q
Sbjct: 571 --------------LLVAVLWLLHRKGLFDPLYYWWDGVVGSEAQ 601


>gi|218196451|gb|EEC78878.1| hypothetical protein OsI_19239 [Oryza sativa Indica Group]
          Length = 532

 Score =  514 bits (1325), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 247/412 (59%), Positives = 313/412 (75%), Gaps = 15/412 (3%)

Query: 188 FHVFGIGQRSIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPS 247
           FHVFGIG  S+ FS+R++VK GS V +V VGPENKT  S DNFL+V ++GD+ GYT+IPS
Sbjct: 6   FHVFGIGAWSLRFSIRVQVKKGSSVWDVVVGPENKTVVSGDNFLRVKVVGDYTGYTSIPS 65

Query: 248 FEEFYLVIPRQG-GPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFC 306
           FEE YLV PR+G G  QPQDLG   S WM+L+R RFTLDGLEC+KIGV YEA+  QP+FC
Sbjct: 66  FEENYLVTPRKGTGSSQPQDLGNEHSKWMILDRVRFTLDGLECDKIGVGYEAYRNQPNFC 125

Query: 307 SSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLN 366
           S+P+ SCL NQLWN+ E D+ RI+ +QLPLY VEGRF+R+NQHPNAG+H+FS+GVTE LN
Sbjct: 126 SAPYGSCLGNQLWNFWEYDKRRIDNSQLPLYIVEGRFQRINQHPNAGAHTFSVGVTEDLN 185

Query: 367 SNLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTFDC 426
           +NLLIEL ADDIEYVYQRSP KII + +PTFEAL+Q G+A +TT+N G++E+SYSLTF C
Sbjct: 186 TNLLIELMADDIEYVYQRSPAKIIDIRVPTFEALSQVGIANVTTKNIGKLESSYSLTFKC 245

Query: 427 STGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTM 486
           S+G++ +EEQ + +KP E   RSF++  TT+QAA + C AILK SDFSE+DR   +FST 
Sbjct: 246 SSGISPVEEQLYTMKPDEVIARSFELRSTTDQAAMHQCEAILKASDFSELDREGYRFSTA 305

Query: 487 ATVLDNGSQITPFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCH 546
           ATV +NG+QI P    K     F++SI    K LW  L DF+TG+ C  KC   FDF CH
Sbjct: 306 ATVYNNGAQIGPTNDHKKG--GFWDSI----KALWRNLIDFLTGRLCWTKCPRLFDFGCH 359

Query: 547 IQYICLSWLVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWW----DDHFQS 594
           IQY+C+ W++    +L + P  +V LWLLHQ+GLFDPLYDWW    DD +++
Sbjct: 360 IQYVCIGWIL----LLLLIPAAVVFLWLLHQEGLFDPLYDWWGLEPDDDYRA 407


>gi|414591383|tpg|DAA41954.1| TPA: hypothetical protein ZEAMMB73_607847 [Zea mays]
          Length = 536

 Score =  488 bits (1256), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 234/384 (60%), Positives = 295/384 (76%), Gaps = 6/384 (1%)

Query: 214 EVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIPRQ-GGPGQPQDLGGNFS 272
           EV VGPEN+T  S DNFL+VNLIGDF GYT+IP+FE+FYLV PR+  G G+PQ+LG  + 
Sbjct: 21  EVVVGPENRTVVSKDNFLRVNLIGDFGGYTSIPAFEDFYLVTPRKSAGSGEPQNLGAEYR 80

Query: 273 MWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRN 332
            WMLLER RFT DG+ECNKIGV YEAF  QP+FC+SPF SCL+NQLW + E+D+NRI+ +
Sbjct: 81  KWMLLERVRFT-DGVECNKIGVGYEAFQNQPNFCASPFESCLNNQLWTFLESDKNRISMS 139

Query: 333 QLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISV 392
           + P Y V+GRF+R+NQHP+A  HSFSIGVTEV+NSNL IEL ADDIEY+YQRSPG I  +
Sbjct: 140 RQPQYVVQGRFQRINQHPDASVHSFSIGVTEVINSNLRIELSADDIEYMYQRSPGNITDI 199

Query: 393 IIPTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLMEEQYFIIKPKETSIRSFKI 452
            +P FE L+Q+G A +TT+N G +EASY+LTF CS+G++ MEEQY+I+KP E S R F +
Sbjct: 200 SVPAFEVLSQYGTAKVTTKNIGTLEASYTLTFHCSSGISFMEEQYYILKPNEESTRLFYL 259

Query: 453 YPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQITPFQPPKSSINDFFES 512
           + +T+QAAKY C+AILK SD SE+DR EC FST ATVLDNG+QI      K     FF++
Sbjct: 260 HASTDQAAKYQCTAILKASDSSELDRQECVFSTTATVLDNGTQIIGSNGYKLG---FFDT 316

Query: 513 IESIGKKLWEGLRDFITGKACR-RKCSSFFDFSCHIQYICLSWLVLFGLVLAIFPTVLVL 571
           I+      W+ L D I+GK+CR  KC SFFDFSCH QY C++WLV+  L+L + P   ++
Sbjct: 317 IKGYLVSFWDFLIDLISGKSCRLNKCRSFFDFSCHAQYRCITWLVMLVLLLFMLPAGAIV 376

Query: 572 LWLLHQKGLFDPLYDWWDDHFQSD 595
           L+LLHQKG FDP+YDWWDD   +D
Sbjct: 377 LYLLHQKGFFDPVYDWWDDLLGAD 400


>gi|110430669|gb|ABG73459.1| histidine rich-like protein [Oryza brachyantha]
          Length = 634

 Score =  484 bits (1245), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 280/619 (45%), Positives = 366/619 (59%), Gaps = 119/619 (19%)

Query: 37  KSKLEKCEKRTDSDN--LNCTTKIVLNMAVPSGSSGGEASIVAEVVEVEENSTQK--MRT 92
           KS+LE C + TD     L C +K+VL++AVPS SSGGEAS+VA+V +VEEN T+   MR 
Sbjct: 42  KSRLESCVRDTDDGGRRLTCDSKLVLDVAVPSDSSGGEASLVAKVADVEENDTEATPMR- 100

Query: 93  VRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKTRKCEPDAGADVVKICERQPICCP 152
           +R PPV+T+NK+  +A+Y LTY+RDV YKP+E ++KTRKCEPDAG++VVK CE      P
Sbjct: 101 IRDPPVITINKSEVFALYALTYLRDVSYKPEEKFVKTRKCEPDAGSEVVKFCESL-FVVP 159

Query: 153 CGPQR------RIPSSCGNVFDKLLKGKANTAHCLRF---------PGDW--FHVFGIGQ 195
            G            S   N    +L G     H L++         P  +  FHVF IG+
Sbjct: 160 VGLTAVHLHPVETYSCLENYSHDILFG--FYIHVLKYMWRKITLWRPSVFARFHVFEIGR 217

Query: 196 RSIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVI 255
           RS+GFS+ ++VK  S VS+V VGP+N+T  S DNFL+V L+GDFVGYT+IPSFE+FYLV 
Sbjct: 218 RSLGFSISVQVKKASSVSKVIVGPDNRTVVSKDNFLRVKLVGDFVGYTSIPSFEDFYLVT 277

Query: 256 PRQG-GPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCL 314
           PR+G G G+PQ +G +FS WMLLER RFTLDGLECNKIGV YEA++ QP+FCSSP  SCL
Sbjct: 278 PRKGVGGGEPQ-VGDDFSRWMLLERVRFTLDGLECNKIGVGYEAYSSQPNFCSSPLQSCL 336

Query: 315 HNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELR 374
            +QLWN+ E+D+ R+N +Q P Y V+G                                 
Sbjct: 337 GDQLWNFWESDKIRVNNSQPPQYLVQG--------------------------------- 363

Query: 375 ADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLME 434
                    RSPGKIIS+ + TFEAL+Q G A + T+N G++EASYSLTF CS+G+  +E
Sbjct: 364 ---------RSPGKIISINVSTFEALSQVGTAQVKTKNIGKLEASYSLTFGCSSGINPVE 414

Query: 435 EQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGS 494
           EQ FI+KP E  IRSF ++ +T QA+ YTC AILK S+FSE+DR ECQFST ATVL+NG+
Sbjct: 415 EQSFIMKPDEEIIRSFDLHSSTVQASNYTCKAILKGSNFSELDRKECQFSTTATVLNNGT 474

Query: 495 QITPFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSW 554
           Q   F+            +  +G  +W  +         RR                   
Sbjct: 475 QYKMFK------------LFQVG-HVWHAI--------PRR------------------- 494

Query: 555 LVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRDFRSRRIDVDHPHV 614
                    IF +V   +W LHQKGLFDP+YDWWDD F     R      R   + + H 
Sbjct: 495 -------YGIFSSV---MWFLHQKGLFDPIYDWWDDVFGLSEARSHQRHKRSHSLRNYHH 544

Query: 615 HVRKHHKQEGRHHKLEARR 633
           H ++H  +    H+  + R
Sbjct: 545 HHKRHKSEPVSGHRHHSHR 563


>gi|242049910|ref|XP_002462699.1| hypothetical protein SORBIDRAFT_02g030440 [Sorghum bicolor]
 gi|241926076|gb|EER99220.1| hypothetical protein SORBIDRAFT_02g030440 [Sorghum bicolor]
          Length = 607

 Score =  465 bits (1196), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 247/501 (49%), Positives = 333/501 (66%), Gaps = 58/501 (11%)

Query: 31  GVQILSKSKLEKCEKRTDSDN-LNCTTKIVLNMAVPSGSSGGEASIVAEVVEVEENSTQK 89
           G ++L+KS LE C   + +   L+C  K+V++MAVPS SSGGEAS+VA+V  V  N T++
Sbjct: 27  GAEVLAKSLLESCVDDSGAGGRLSCDRKVVVDMAVPSESSGGEASLVAQVAHV--NDTEQ 84

Query: 90  MRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKTRKCEPDAGADVVKICERQPI 149
            +T+R PPV+TVNK A YA+Y L YIRDV YKP+E +++TRKCEPDAGADVV  CE    
Sbjct: 85  TKTIRNPPVITVNKGAVYALYALNYIRDVAYKPEEQFVETRKCEPDAGADVVGACESL-- 142

Query: 150 CCPCGPQRRIPSSCGNVFDKLLKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVK-- 207
                    +P     V+  L++        ++F  +      +    +  S+ +E++  
Sbjct: 143 -------FAVPVVLTAVYLHLVE-----TFLIKFSKEKLIQLTVYDFQVTGSMFLELEKD 190

Query: 208 -----TGSKV-SEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIPRQG-G 260
                +G K+  EV VGPEN+T  S DNFL+VNLIGDF GYT+IP+FE FYLV PR+G G
Sbjct: 191 YLGSTSGYKLRKEVVVGPENRTVVSKDNFLRVNLIGDFSGYTSIPTFENFYLVTPRKGAG 250

Query: 261 PGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWN 320
            G+PQ+LG  +S WMLLER RFT +G+EC+KIGV Y+AF  QP+FC+S F SCL+NQL  
Sbjct: 251 SGEPQNLGAEYSKWMLLERVRFT-EGIECDKIGVGYQAFQNQPNFCASAFGSCLYNQLST 309

Query: 321 YREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEY 380
           + E                           NA  H+FSIGVTEV NSNL IEL ADDIEY
Sbjct: 310 FLE---------------------------NATVHTFSIGVTEVRNSNLRIELSADDIEY 342

Query: 381 VYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLMEEQYFII 440
           +YQRSPGKI ++ +PTFEAL+Q+G A +TT+N G++EASY+LTF+C +G++ +EEQY+++
Sbjct: 343 MYQRSPGKITNISVPTFEALSQYGTAKVTTKNIGKLEASYTLTFNCLSGISFVEEQYYVL 402

Query: 441 KPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQITPFQ 500
           KP E S R F +  +T++AAKY C+AILK SDFSE+DR EC FSTMATVLDNG+Q   F 
Sbjct: 403 KPDEASTRLFYLRASTDKAAKYQCTAILKASDFSELDRQECLFSTMATVLDNGTQKGFFD 462

Query: 501 PPKSSINDFFESIESIGKKLW 521
           P    + D++E +  +  + +
Sbjct: 463 P----VYDWWEDLLGLDDRTY 479



 Score = 40.8 bits (94), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 24/72 (33%), Positives = 35/72 (48%), Gaps = 15/72 (20%)

Query: 529 TGKACRRKCSSFF---DFSCHIQYICLSWLVLFGLVLAIFPTVLVLLWLLHQKGLFDPLY 585
           T KA + +C++     DFS   +  CL            F T+  +L    QKG FDP+Y
Sbjct: 418 TDKAAKYQCTAILKASDFSELDRQECL------------FSTMATVLDNGTQKGFFDPVY 465

Query: 586 DWWDDHFQSDNQ 597
           DWW+D    D++
Sbjct: 466 DWWEDLLGLDDR 477


>gi|226492062|ref|NP_001141873.1| hypothetical protein [Zea mays]
 gi|223944697|gb|ACN26432.1| unknown [Zea mays]
 gi|414591385|tpg|DAA41956.1| TPA: hypothetical protein ZEAMMB73_607847 [Zea mays]
          Length = 454

 Score =  401 bits (1030), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 194/322 (60%), Positives = 246/322 (76%), Gaps = 5/322 (1%)

Query: 275 MLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQL 334
           MLLER RFT DG+ECNKIGV YEAF  QP+FC+SPF SCL+NQLW + E+D+NRI+ ++ 
Sbjct: 1   MLLERVRFT-DGVECNKIGVGYEAFQNQPNFCASPFESCLNNQLWTFLESDKNRISMSRQ 59

Query: 335 PLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVII 394
           P Y V+GRF+R+NQHP+A  HSFSIGVTEV+NSNL IEL ADDIEY+YQRSPG I  + +
Sbjct: 60  PQYVVQGRFQRINQHPDASVHSFSIGVTEVINSNLRIELSADDIEYMYQRSPGNITDISV 119

Query: 395 PTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLMEEQYFIIKPKETSIRSFKIYP 454
           P FE L+Q+G A +TT+N G +EASY+LTF CS+G++ MEEQY+I+KP E S R F ++ 
Sbjct: 120 PAFEVLSQYGTAKVTTKNIGTLEASYTLTFHCSSGISFMEEQYYILKPNEESTRLFYLHA 179

Query: 455 TTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQITPFQPPKSSINDFFESIE 514
           +T+QAAKY C+AILK SD SE+DR EC FST ATVLDNG+QI      K     FF++I+
Sbjct: 180 STDQAAKYQCTAILKASDSSELDRQECVFSTTATVLDNGTQIIGSNGYKLG---FFDTIK 236

Query: 515 SIGKKLWEGLRDFITGKACR-RKCSSFFDFSCHIQYICLSWLVLFGLVLAIFPTVLVLLW 573
                 W+ L D I+GK+CR  KC SFFDFSCH QY C++WLV+  L+L + P   ++L+
Sbjct: 237 GYLVSFWDFLIDLISGKSCRLNKCRSFFDFSCHAQYRCITWLVMLVLLLFMLPAGAIVLY 296

Query: 574 LLHQKGLFDPLYDWWDDHFQSD 595
           LLHQKG FDP+YDWWDD   +D
Sbjct: 297 LLHQKGFFDPVYDWWDDLLGAD 318


>gi|194706256|gb|ACF87212.1| unknown [Zea mays]
          Length = 454

 Score =  398 bits (1023), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 193/322 (59%), Positives = 245/322 (76%), Gaps = 5/322 (1%)

Query: 275 MLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQL 334
           MLLER RFT DG+ECNKIGV YEAF  QP+FC+SPF SCL+NQLW + E+D+NRI+ ++ 
Sbjct: 1   MLLERVRFT-DGVECNKIGVGYEAFQNQPNFCASPFESCLNNQLWTFLESDKNRISMSRQ 59

Query: 335 PLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVII 394
           P Y V+GRF+R+NQHP+A  HSFSIGVTEV+NSNL IEL ADDIEY+YQRSPG I  + +
Sbjct: 60  PQYVVQGRFQRINQHPDASVHSFSIGVTEVINSNLRIELSADDIEYMYQRSPGNITDISV 119

Query: 395 PTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLMEEQYFIIKPKETSIRSFKIYP 454
           P FE L+Q+G A +TT+N G +EASY+LTF CS+G++ MEEQY+I+KP E S R F ++ 
Sbjct: 120 PAFEVLSQYGTAKVTTKNIGTLEASYTLTFHCSSGISFMEEQYYILKPNEESTRLFYLHA 179

Query: 455 TTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQITPFQPPKSSINDFFESIE 514
           +T+QAAKY C+AILK SD SE+DR  C FST ATVLDNG+QI      K     FF++I+
Sbjct: 180 STDQAAKYQCTAILKASDSSELDRQGCVFSTTATVLDNGTQIIGSNGYKLG---FFDTIK 236

Query: 515 SIGKKLWEGLRDFITGKACR-RKCSSFFDFSCHIQYICLSWLVLFGLVLAIFPTVLVLLW 573
                 W+ L D I+GK+CR  KC SFFDFSCH QY C++WLV+  L+L + P   ++L+
Sbjct: 237 GYLVSFWDFLIDLISGKSCRLNKCRSFFDFSCHAQYRCITWLVMLVLLLFMLPAGAIVLY 296

Query: 574 LLHQKGLFDPLYDWWDDHFQSD 595
           LLHQKG FDP+YDWWDD   +D
Sbjct: 297 LLHQKGFFDPVYDWWDDLLGAD 318


>gi|115480257|ref|NP_001063722.1| Os09g0525700 [Oryza sativa Japonica Group]
 gi|52076043|dbj|BAD46496.1| unknown protein [Oryza sativa Japonica Group]
 gi|52077311|dbj|BAD46352.1| unknown protein [Oryza sativa Japonica Group]
 gi|113631955|dbj|BAF25636.1| Os09g0525700 [Oryza sativa Japonica Group]
          Length = 425

 Score =  348 bits (893), Expect = 5e-93,   Method: Compositional matrix adjust.
 Identities = 182/324 (56%), Positives = 226/324 (69%), Gaps = 19/324 (5%)

Query: 275 MLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQL 334
           MLLER  FTLDGLECNKIGV YEAF  QP+FCSSP  SCL +QL  + E D+NR+N +Q 
Sbjct: 1   MLLERVLFTLDGLECNKIGVGYEAFRSQPNFCSSPLDSCLGDQLSKFWEIDKNRVNNSQP 60

Query: 335 PLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVII 394
           P Y V G+FER+NQ+PNAG H+FS+G+ EVLN+NL+IEL ADDIEYVYQRS GKIIS+ I
Sbjct: 61  PQYVVLGKFERINQYPNAGVHTFSVGIPEVLNTNLMIELSADDIEYVYQRSSGKIISINI 120

Query: 395 PTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLMEEQYFIIKPKETSIRSFKIYP 454
            +FEAL+Q G A + T+N G +EASYSLTFDC +G+  +EEQYFI+KP E  IR+F +  
Sbjct: 121 SSFEALSQVGSARVKTKNIGRLEASYSLTFDCLSGINPVEEQYFIMKPDEKLIRTFDLRS 180

Query: 455 TTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQITPFQP-PKSSINDFFESI 513
           +T+QA+ YTC AILK SDFSE+DR E QFST ATVL+NG+QI   +   K  I  FFE+I
Sbjct: 181 STDQASNYTCQAILKASDFSELDRKESQFSTTATVLNNGTQIGSSENHTKGGIWGFFEAI 240

Query: 514 ESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAIFPTVLVLLW 573
           ++   K+W  L +F TG  C  +C SF  F  H                     ++ +LW
Sbjct: 241 KAWCAKMWHMLINFFTGTTCSTRCWSFLKFVIHGL------------------LLVAVLW 282

Query: 574 LLHQKGLFDPLYDWWDDHFQSDNQ 597
           LLH+KGLFDPLY WWD    S+ Q
Sbjct: 283 LLHRKGLFDPLYYWWDGVVGSEAQ 306


>gi|414591386|tpg|DAA41957.1| TPA: hypothetical protein ZEAMMB73_607847, partial [Zea mays]
          Length = 224

 Score =  315 bits (808), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 145/222 (65%), Positives = 182/222 (81%), Gaps = 1/222 (0%)

Query: 275 MLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQL 334
           MLLER RFT DG+ECNKIGV YEAF  QP+FC+SPF SCL+NQLW + E+D+NRI+ ++ 
Sbjct: 1   MLLERVRFT-DGVECNKIGVGYEAFQNQPNFCASPFESCLNNQLWTFLESDKNRISMSRQ 59

Query: 335 PLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVII 394
           P Y V+GRF+R+NQHP+A  HSFSIGVTEV+NSNL IEL ADDIEY+YQRSPG I  + +
Sbjct: 60  PQYVVQGRFQRINQHPDASVHSFSIGVTEVINSNLRIELSADDIEYMYQRSPGNITDISV 119

Query: 395 PTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLMEEQYFIIKPKETSIRSFKIYP 454
           P FE L+Q+G A +TT+N G +EASY+LTF CS+G++ MEEQY+I+KP E S R F ++ 
Sbjct: 120 PAFEVLSQYGTAKVTTKNIGTLEASYTLTFHCSSGISFMEEQYYILKPNEESTRLFYLHA 179

Query: 455 TTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQI 496
           +T+QAAKY C+AILK SD SE+DR EC FST ATVLDNG+Q+
Sbjct: 180 STDQAAKYQCTAILKASDSSELDRQECVFSTTATVLDNGTQV 221


>gi|414591381|tpg|DAA41952.1| TPA: hypothetical protein ZEAMMB73_607847, partial [Zea mays]
          Length = 283

 Score =  295 bits (754), Expect = 8e-77,   Method: Compositional matrix adjust.
 Identities = 137/199 (68%), Positives = 167/199 (83%), Gaps = 2/199 (1%)

Query: 187 WFHVFGIGQRSIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIP 246
           WFHVFGIG RS+GF++R++VK GS VSEV VGPEN+T  S DNFL+VNLIGDF GYT+IP
Sbjct: 78  WFHVFGIGTRSLGFNIRVQVKKGSSVSEVVVGPENRTVVSKDNFLRVNLIGDFGGYTSIP 137

Query: 247 SFEEFYLVIPRQ-GGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSF 305
           +FE+FYLV PR+  G G+PQ+LG  +  WMLLER RFT DG+ECNKIGV YEAF  QP+F
Sbjct: 138 AFEDFYLVTPRKSAGSGEPQNLGAEYRKWMLLERVRFT-DGVECNKIGVGYEAFQNQPNF 196

Query: 306 CSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVL 365
           C+SPF SCL+NQLW + E+D+NRI+ ++ P Y V+GRF+R+NQHP+A  HSFSIGVTEV+
Sbjct: 197 CASPFESCLNNQLWTFLESDKNRISMSRQPQYVVQGRFQRINQHPDASVHSFSIGVTEVI 256

Query: 366 NSNLLIELRADDIEYVYQR 384
           NSNL IEL ADDIEY+YQR
Sbjct: 257 NSNLRIELSADDIEYMYQR 275


>gi|224074881|ref|XP_002304473.1| predicted protein [Populus trichocarpa]
 gi|222841905|gb|EEE79452.1| predicted protein [Populus trichocarpa]
          Length = 239

 Score =  217 bits (552), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 126/280 (45%), Positives = 159/280 (56%), Gaps = 51/280 (18%)

Query: 421 SLTFDCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAE 480
           SL FDCS GV +ME         E +IRSFKIYP T++AA+Y C+AILKDS F+E D AE
Sbjct: 5   SLQFDCSKGVAVMELS-------EVTIRSFKIYPATDKAARYVCAAILKDSSFNETDPAE 57

Query: 481 CQFSTMATVLDNGSQITPFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACRRKCSSF 540
           CQ  T AT+L+NG++  PF+PPK SIN FFESIE I  ++WEGL   ITGK      ++ 
Sbjct: 58  CQLFTTATILENGARFAPFRPPKISINGFFESIEDIWNRIWEGLVASITGKVGSACAATA 117

Query: 541 FDFSCHIQYICLSWLVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIR 600
           F                                    +  F PLYDWW+DH   D Q IR
Sbjct: 118 FA----------------------------------SERTFPPLYDWWEDHLWDDEQGIR 143

Query: 601 DFRSRRIDVDHPHVHVRKHHKQEGRHHKLEARRRRCGIHSDHKHKHSDRDTDYYYYLHHV 660
           D    + DV+          ++ G   +  AR+RR  I+ +H+  HS RD DYY+YLHHV
Sbjct: 144 DTLRHKKDVNGD--------RELGPRQQHNARKRRS-IYQEHRPGHSGRDADYYHYLHHV 194

Query: 661 QKDKHKHGRSKNSSVMQQLYLDTGKNDHI-GHHRRRKFRE 699
           QKDK KH  SK S+V QQ+YLD  +N +I GHHR RK R+
Sbjct: 195 QKDKSKHRGSKKSNVPQQVYLDGPENSNIGGHHRHRKERD 234


>gi|66819323|ref|XP_643321.1| hypothetical protein DDB_G0276069 [Dictyostelium discoideum AX4]
 gi|60471374|gb|EAL69334.1| hypothetical protein DDB_G0276069 [Dictyostelium discoideum AX4]
          Length = 572

 Score =  196 bits (497), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 137/473 (28%), Positives = 229/473 (48%), Gaps = 37/473 (7%)

Query: 48  DSDNLNCTTKIVLNMAVPSGSSGGEASIVAEVVEVEENSTQKMRTVRIPPVLTVNKTASY 107
           D  NL C  K+V+++ + S     E +   +V E+++ +  K++T+ IP  +   K+ ++
Sbjct: 44  DKTNLKCDKKLVVSLYIDSQKENSE-TFNFQVSEIKDENG-KLKTLVIPISVKFKKSETF 101

Query: 108 AVYELTYIRDVPYKPQEF------YMKTRKCEP-----------DAGADVVKICERQPIC 150
             Y L Y+++V Y+P+E       Y+ T  C+            DA   +++  + Q  C
Sbjct: 102 INYPLVYVQNVAYQPKETVIYKTDYVLTSGCKDKPTDHTCPGAIDANGKLIR--DSQGFC 159

Query: 151 CPCGPQRRIPS---SCGNVFDKLLKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVK 207
           C C     + +   S  N+   LL  K+++AHCL F    + V+ + +  + +++   + 
Sbjct: 160 CSCSFSDYVGADQNSRANLGCSLLGSKSSSAHCLSFSSVKYDVYNVAKTQVEYTITATLT 219

Query: 208 TGSKVSEVT--VGPENKTATSADNFLK--VNLIGDFVGYTNIPSFEEFYLVIPRQGGPGQ 263
                + +T  +   N      D F +  V +IGDF   T I  F +  +V P      Q
Sbjct: 220 YSYNQNPITQDIILSNSAPMGMDTFSQAIVRIIGDFQSSTQINQFTDKKVVFPY----NQ 275

Query: 264 PQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYRE 323
           P  +    +  M+L++  F L GL CNKIGVSY AF  QP+ C++ F SCL NQ+ +Y  
Sbjct: 276 PNSI----NTCMVLDQNFFDLSGLTCNKIGVSYSAFQNQPNSCAALFGSCLQNQIADYYN 331

Query: 324 ADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQ 383
           AD   I+  +   Y       ++    N  S S  I   E   + L I L+AD ++Y+  
Sbjct: 332 ADVALISSGKKGNYIASQLGTKVQIAGNQDSRSLKIRFDESHRTMLTITLKADSLQYIVN 391

Query: 384 RSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLT-FDCSTGVTLMEEQYFIIKP 442
            SPGKII+  I  FE++++ GV  +  QNTG + A Y++T  +C+  +  +  Q   IK 
Sbjct: 392 ISPGKIINYQIDRFESMSKNGVLRVNVQNTGTINADYTMTIINCTGDINPINNQQVTIKS 451

Query: 443 KETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQ 495
           KE     F++Y T+   + Y C   L +     +D     F+T  T +DNG+Q
Sbjct: 452 KEIYSFVFQVYTTSKLDSSYHCFGDLYNEVAQVIDSIRINFNTSDTEIDNGAQ 504


>gi|147794121|emb|CAN62356.1| hypothetical protein VITISV_001267 [Vitis vinifera]
          Length = 933

 Score =  189 bits (481), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 105/150 (70%), Positives = 122/150 (81%), Gaps = 10/150 (6%)

Query: 8   LKLKHFLLILF----------CILNLLSPRCVVGVQILSKSKLEKCEKRTDSDNLNCTTK 57
           L  KH L+I F           IL   + R + GVQILSKSKLEKCEK ++SDNLNCT K
Sbjct: 480 LDSKHALVIGFDGSELINXYPLILTGFNTRRLYGVQILSKSKLEKCEKVSESDNLNCTKK 539

Query: 58  IVLNMAVPSGSSGGEASIVAEVVEVEENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRD 117
           I+L+MAVPSGSSGGEASIVAEVVEVEENST KM+T+R+PP +TVNK+A+YAVYE+TYIRD
Sbjct: 540 IILDMAVPSGSSGGEASIVAEVVEVEENSTHKMQTLRVPPTITVNKSAAYAVYEITYIRD 599

Query: 118 VPYKPQEFYMKTRKCEPDAGADVVKICERQ 147
           VPYKPQE+++KTRKCEPDA A VVKICER 
Sbjct: 600 VPYKPQEYFVKTRKCEPDASAKVVKICERH 629



 Score =  176 bits (445), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 80/93 (86%), Positives = 85/93 (91%)

Query: 168 DKLLKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVKTGSKVSEVTVGPENKTATSA 227
           DKL+KGKANTAHCLRFPGDWFHVFGIGQRS+GFSV IEVKTGSK+SEV VGPEN+T  S 
Sbjct: 805 DKLMKGKANTAHCLRFPGDWFHVFGIGQRSLGFSVHIEVKTGSKISEVIVGPENRTVMSN 864

Query: 228 DNFLKVNLIGDFVGYTNIPSFEEFYLVIPRQGG 260
           DNFLKVNLIGDF GYTNIPSFE+FYLV PRQ G
Sbjct: 865 DNFLKVNLIGDFAGYTNIPSFEDFYLVTPRQBG 897


>gi|384253026|gb|EIE26501.1| hypothetical protein COCSUDRAFT_39583 [Coccomyxa subellipsoidea
           C-169]
          Length = 1085

 Score =  188 bits (477), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 139/520 (26%), Positives = 237/520 (45%), Gaps = 61/520 (11%)

Query: 34  ILSKSKLEKCEKRTDSDNL--NCTTKIVLNMAVPSGSSGGEASIVAEVVEVEENSTQ--- 88
           +LS S+L+ C +   ++ L   C+ K++L +AV +G+S    S+   V  +   S+    
Sbjct: 27  VLSSSQLQTCIQDGSAEALLLQCSKKLILTLAVENGASLATQSLQFSVPCINSGSSGCPC 86

Query: 89  ----------KMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKT------RKC 132
                       R +R    +T+ K A YA Y L Y +    +P E  ++T        C
Sbjct: 87  TCNYATDPGCTCRDLRDTLNVTITKGAVYASYPLIYQQAFNNRPTEAIIRTGANFPISSC 146

Query: 133 E-------PDAG----ADVVKICERQPICCPCGPQRRIPSSCGNVFDKLLKGKAN----- 176
                   P  G    A+  +I   Q  CC C       ++ G+  D+  +   +     
Sbjct: 147 NDGPLSDTPTCGWATDANGARIPASQGFCCSCTSSALAAATLGSGTDQYTRASLDCDLFH 206

Query: 177 -------TAHCLRFPGDWFHVFGIGQRSIGFSVRIEVKTG--SKVSEVTVGPENKTATSA 227
                  +AHCLR    ++  + +    + F+++I +++   S    +++ P      + 
Sbjct: 207 TWLRTPGSAHCLRMDDLYYQGYQVDPARLDFNIQISIQSANTSVTQTLSLNPTQPFVVND 266

Query: 228 DNFLKVNLIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGL 287
            N +   L+GD   Y ++P F  +YL+IP        Q L  N   WM+++++  + DG 
Sbjct: 267 ANTVAAKLLGDLATYQSMPDFSSYYLMIPSPADSSPQQVLSSNTDKWMMVDKSMVSTDGT 326

Query: 288 ECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVE---GRFE 344
            CNK+G SY AF  Q   C  P  +CL NQL++  +AD  RI++   P+  V    G   
Sbjct: 327 TCNKVGTSYFAFQYQSGSCQQPQGTCLGNQLYDLYQADVKRISQGTTPVNFVSRWGGGQP 386

Query: 345 RMNQ--HPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVII-------- 394
             NQ  + ++GS  F++ +T +LNS + + + AD +  +   SPGKI+S  +        
Sbjct: 387 GANQASYSSSGSLRFALPITNILNSVVTLTVNADAVMLIDNVSPGKILSAQVCQFNNATC 446

Query: 395 PTFEALTQFGVATITTQNTGEVEASYSLT-FDCSTGVTLMEEQYFIIKPKETSIRSFKIY 453
            +F+ALTQ G  T T QN G + A++ ++  +C+  VT +  Q   +  K T   +F I+
Sbjct: 447 GSFQALTQRGYLTATVQNAGSIAATFIVSVVNCTASVTPIVAQSATLASKATKALTFDIF 506

Query: 454 PTTNQA-AKYTCSAILKDSDFSEVDRAECQFSTMATVLDN 492
            T+N+A A  TC   L DS  +     +   S  A +  N
Sbjct: 507 LTSNKADAAITCDVGLTDSQVNGAGAPQTGPSDCAALCPN 546


>gi|302776592|ref|XP_002971451.1| hypothetical protein SELMODRAFT_451371 [Selaginella moellendorffii]
 gi|300160583|gb|EFJ27200.1| hypothetical protein SELMODRAFT_451371 [Selaginella moellendorffii]
          Length = 565

 Score =  185 bits (469), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 156/588 (26%), Positives = 255/588 (43%), Gaps = 93/588 (15%)

Query: 30  VGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIVAEVVEVEENSTQK 89
           + +  +SKS L+ C    D + + C  KI++ +A+PSG  G   +++AEV +      ++
Sbjct: 20  INMTTISKSDLDVCVNTGDPNAIQCKKKILVTVAIPSGDGGNGEALIAEVKDPTSRDGKQ 79

Query: 90  MRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKTRK-CEP------------DA 136
           +    I   + + KT S   Y L Y+++V     E  +K +K C              D+
Sbjct: 80  VLEKSIS--VNIAKTDSIVKYALEYLKNVAGDLNERVIKKKKGCNTKLNDKATCGVLGDS 137

Query: 137 GADVVKICERQPICCPCGPQRRI--------PSSCGNVFDKLLKGKANTAHCLRFPGDWF 188
             +VV        CC C P ++I        P  CG + D      A  A CLRF   W 
Sbjct: 138 KGNVVP--GSSGFCCTCKPLKQIKHFRGMPKPGHCG-ISD------AGYAFCLRFGQMWC 188

Query: 189 HVFGIGQRSIGFSVRIEV--KTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIP 246
            +F I   +I F + I +  + G+K S   +G            L++ L+ +        
Sbjct: 189 VMFRIRTGTISFEITITLTDQNGNKASSRIIG----------FVLRLTLLAE----KPDG 234

Query: 247 SFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFC 306
           ++++F    PR  G            +WML++  R TL G  C+KIG+S   +  QP  C
Sbjct: 235 NWQQFTRGSPRADG-----------RLWMLVDEARVTLTGSACDKIGLSCLGYAQQPRTC 283

Query: 307 SSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLN 366
                 C+  QL ++ + D   + + +L ++G      R   + +    +  I V+   N
Sbjct: 284 DGALGMCIGEQLIDFIKEDLAALGKGRLAIHG----LFRYGSYRSLVPDALQIAVSPT-N 338

Query: 367 SNLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTFDC 426
           S + IE+ AD++ +   +S GKI+ V +P FEA++  G  T+T  N G +EASY +  +C
Sbjct: 339 SLITIEIAADNVSFRRNKSTGKIVKVEVPPFEAMSTGGTLTLTVVNDGSLEASYGVYVEC 398

Query: 427 STGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTM 486
           S  +  +E +   + P       + +   +  A K +C   L+DS+    D  E +FST 
Sbjct: 399 SANINPLEGKRVSMIPNVPQTFLYTLITRSTDATKNSCIVTLRDSEGENCDVKEAKFSTT 458

Query: 487 ATVLDNGSQITPFQPPKSSINDFF--------------ESIESIGKKLWE---------- 522
           ATV +NGSQ+   Q   S  ND F                I+ I K  +           
Sbjct: 459 ATVFNNGSQVGGVQIAGSK-NDTFAKGLGGLGFFGKIGAGIKGIAKGAFNVVTSPFRKMF 517

Query: 523 GLRDFITGKACRRKCSSFFDFSCHIQYICLSWLVLF--GLVLAIFPTV 568
           GL + + GK     C   FD  C I + C+  ++ F  G+V A    V
Sbjct: 518 GLFNNLLGKC--DNCPGAFDIGCFIAHFCVKKILFFVGGIVAAALGKV 563


>gi|224031573|gb|ACN34862.1| unknown [Zea mays]
          Length = 297

 Score =  178 bits (451), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 91/164 (55%), Positives = 116/164 (70%), Gaps = 4/164 (2%)

Query: 433 MEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDN 492
           MEEQY+I+KP E S R F ++ +T+QAAKY C+AILK SD SE+DR EC FST ATVLDN
Sbjct: 1   MEEQYYILKPNEESTRLFYLHASTDQAAKYQCTAILKASDSSELDRQECVFSTTATVLDN 60

Query: 493 GSQITPFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACR-RKCSSFFDFSCHIQYIC 551
           G+QI      K     FF++I+      W+ L D I+GK+CR  KC SFFDFSCH QY C
Sbjct: 61  GTQIIGSNGYKLG---FFDTIKGYLVSFWDFLIDLISGKSCRLNKCRSFFDFSCHAQYRC 117

Query: 552 LSWLVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSD 595
           ++WLV+  L+L + P   ++L+LLHQKG FDP+YDWWDD   +D
Sbjct: 118 ITWLVMLVLLLFMLPAGAIVLYLLHQKGFFDPVYDWWDDLLGAD 161


>gi|84453083|dbj|BAE71144.1| generative cell specific-1 [Physarum polycephalum]
          Length = 808

 Score =  175 bits (444), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 151/605 (24%), Positives = 257/605 (42%), Gaps = 77/605 (12%)

Query: 16  ILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSDN--LNCTTKIVLNMAVPSGSSGGEA 73
           +  CIL L      +   +++ S++  C     S++  LNC  K V++++V +G +  EA
Sbjct: 4   VFLCILFLFYLFSTLHADLIASSQITNCVLDGSSEDTILNCQKKFVVSLSVDNGQNKTEA 63

Query: 74  SIVAEVVEVEENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKTR--- 130
                    + N+T +      P  +T++K+     Y ++Y++ V   P E  + TR   
Sbjct: 64  VQFTISSATDGNTTLQFVN---PWTITLSKSPVAIYYPISYLQTVNADPSEAVIYTRDWI 120

Query: 131 ---KCEPDAGADVV-----------KICERQPICCPCG---------PQRRIPSSCGNVF 167
               C+  A +D              I + Q  CC C           Q R   +C    
Sbjct: 121 VVSSCQSGAYSDNPTCGWYKDSNGNNIPDSQGFCCSCNLAEYLGISDDQTRAGLTC---- 176

Query: 168 DKLLKGKANTAHCLRFPGD-WFHVFGIGQRSIGFSVRIEVKTGSKVSEVTVGPENKTATS 226
                G +++AHCLRF  + W+ +F I      +++ I++  G   +  TV     T  S
Sbjct: 177 -SFFSGSSSSAHCLRFDDNGWYDIFQIANAQDMYTIDIDISQGGG-TNTTVTLSPSTTIS 234

Query: 227 ADNFLKVNLIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDG 286
           + + +   L+GDF  +  +P +   YL +P  G P +   +      WM+++   F L G
Sbjct: 235 SSSSVIARLLGDFSPFQQLPVYSTKYLAVPSSGNPRETDGM----DTWMMIDTDLFDLSG 290

Query: 287 LECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREAD-----QNRINRNQLPLYGVEG 341
             CNKIGVS+  FN + S C     SCL  Q+ +Y ++D      NRI    L  +G  G
Sbjct: 291 TVCNKIGVSFAGFNSEASHCKLLVNSCLGYQIEDYYQSDLQLQKANRIGNYFLSFFG--G 348

Query: 342 RFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVIIPTFEALT 401
            +       +  +   +  +T + +S + +   ADDI +V   SPG+I+S  +  FEAL+
Sbjct: 349 LYYAETYTSSLTNRFLAFDLTGLQSSVITLTFSADDIRFVTNESPGQIVSAYVEEFEALS 408

Query: 402 QFGVATITTQNTGEVEASYSLTF-DCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAA 460
           + G   +   N G + A Y +T   CSTG+  ++ Q   + P++ +   F I        
Sbjct: 409 KDGRMHVVVVNNGTINAQYEITVTQCSTGIATIQAQEPTLVPRKQTEFIFNIQSENALQK 468

Query: 461 KYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQITPFQPPKSSINDFFESIESIGKKL 520
            Y C   L DS    +D     F+T AT                     F++    G   
Sbjct: 469 SYQCKVSLLDSQAVLLDYRIVYFNTSATN--------------------FQTTAQGGDTS 508

Query: 521 WEGLRDFITGK--ACRRKCSSFFDFSCHIQYICLSWLVLF---GLVLAIFPTVLVLLWLL 575
            +   D  + K  +C + CS+F+D  C + + C  W  +F   G ++ I   + +L  L 
Sbjct: 509 GDSGDDLKSDKHSSCSQACSAFYDIICFLSHKC--WKNVFSFLGTIIGIAAGLFILYKLK 566

Query: 576 HQKGL 580
              G+
Sbjct: 567 QHFGM 571


>gi|224034879|gb|ACN36515.1| unknown [Zea mays]
          Length = 366

 Score =  172 bits (437), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 89/167 (53%), Positives = 116/167 (69%), Gaps = 4/167 (2%)

Query: 430 VTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATV 489
           +   +EQY+I+KP E S R F ++ +T+QAAKY C+AILK SD SE+DR EC FST ATV
Sbjct: 67  IAGFQEQYYILKPNEESTRLFYLHASTDQAAKYQCTAILKASDSSELDRQECVFSTTATV 126

Query: 490 LDNGSQITPFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACR-RKCSSFFDFSCHIQ 548
           LDNG+QI      K     FF++I+      W+ L D I+GK+CR  KC SFFDFSCH Q
Sbjct: 127 LDNGTQIIGSNGYKLG---FFDTIKGYLVSFWDFLIDLISGKSCRLNKCRSFFDFSCHAQ 183

Query: 549 YICLSWLVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSD 595
           Y C++WLV+  L+L + P   ++L+LLHQKG FDP+YDWWDD   +D
Sbjct: 184 YRCITWLVMLVLLLFMLPAGAIVLYLLHQKGFFDPVYDWWDDLLGAD 230


>gi|118396406|ref|XP_001030543.1| hypothetical protein TTHERM_01075640 [Tetrahymena thermophila]
 gi|89284850|gb|EAR82880.1| hypothetical protein TTHERM_01075640 [Tetrahymena thermophila
           SB210]
          Length = 715

 Score =  167 bits (423), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 160/603 (26%), Positives = 255/603 (42%), Gaps = 86/603 (14%)

Query: 8   LKLKHFLLILF--CILNLLSPRCVVGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVP 65
           +K   F LI F  CILN    RC    + ++ S ++KC   ++  N NC+ K V+ +++ 
Sbjct: 1   MKFLAFGLIYFHFCILN----RC----EYITSSTIQKCYNSSNEPN-NCSQKAVIVLSLE 51

Query: 66  SGSSGGEASIVAEVVEVEENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQE- 124
           +G       +VA + ++ ++   K   ++   +  V K+   A++ L Y++D   +P E 
Sbjct: 52  NGQIANTEQVVATLNQLSDSGVNKQ--LQNSFIFEVTKSPVTALFPLIYLQDFNSQPLEQ 109

Query: 125 ------------FYMKTRKCEPDAGADVVKICERQPICCPCGPQRRIPS----SCGNVFD 168
                       FY  +  C+    +   KI + Q  CC C     +      S G V  
Sbjct: 110 VIATTLFSCKDGFYDSSPTCKFQYDSKGQKILDSQGYCCYCSLSDILGMGNDLSRGKVCY 169

Query: 169 KL-LKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVKTGSKVSE------VTVGPEN 221
            L L   + TAHCL+F   W+  F I Q  + F V I + T    ++      + +   N
Sbjct: 170 ALNLGAGSATAHCLKFSPLWYSAFKIQQYQLYFEVNINIYTVDSQNQKNLKQTLKLSTSN 229

Query: 222 KTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTR 281
            T  S+DN     +IG F           +YLV P       P+ L G  S WM +++T 
Sbjct: 230 PTMKSSDNSTISKIIGTFTPTQPPADLSSYYLVKPSFPAT-DPRVLQG-ISSWMFVDKTM 287

Query: 282 FTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEG 341
           FTLDG +CNKIGVSY  F  Q S CS P  SCL NQL N  ++D          L  +  
Sbjct: 288 FTLDGTQCNKIGVSYSGFRQQSSSCSQPVGSCLQNQLENLYQSD----------LILLSQ 337

Query: 342 RFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVIIPTFEALT 401
           R         +GS S  I           IE+ A  I++V     G I    I  FE+ +
Sbjct: 338 RL--------SGSASTLI----------TIEIDAAQIKFVTNLGIGCISQCSINNFESHS 379

Query: 402 QFGVATITTQNTGEVEASYSLTFDCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAK 461
             G      QN G   A + L F+CS+ V  ++ Q              K++ T NQ   
Sbjct: 380 GNGKLVALVQNQGNYSAEFVLGFNCSSNVQPIQGQ--------------KLFLTANQLYN 425

Query: 462 YTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQITPFQPPKSSINDFFESIESIGKKLW 521
           + CS  + +SD S ++   C  +    +   G+Q+       ++ +    S +       
Sbjct: 426 FNCSVSV-NSDISAINN-NCTINLYDAI---GNQLDSKNILFNTTSTNHTSNQGNNTGQQ 480

Query: 522 EGLRDFITGKACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAIFPTVLVLLWLLHQKGLF 581
           +  +++ + ++C  KCSSF+ F C+    C+         +A   + L L+  L + G  
Sbjct: 481 QSSQEYKSSQSCSDKCSSFWSFWCYFSAGCIKEAFKSIASIAGVASALALVIFLAKNGYL 540

Query: 582 DPL 584
            P+
Sbjct: 541 VPI 543


>gi|440798371|gb|ELR19439.1| hypothetical protein ACA1_266960 [Acanthamoeba castellanii str.
           Neff]
          Length = 927

 Score =  167 bits (422), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 148/576 (25%), Positives = 244/576 (42%), Gaps = 83/576 (14%)

Query: 25  SPRCVVGVQ--ILSKSKLEKC--EKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIVAEVV 80
           +P+ +V ++  +L+ S++E+C  +  TD   ++C  ++++ + V SG +  E   +  V+
Sbjct: 23  APQLLVSIEGSLLASSRVERCVQDGATDVPTISCDRRMIVTLTVDSGQNNTEQ--LELVL 80

Query: 81  EVEENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYM------------- 127
           +  ++    +RT+  P  +   KT    +Y +TY+     KP E                
Sbjct: 81  DSTQDEDGVLRTLEHPVQIQWAKTIPRLLYPITYVGRTNNKPYETITYKDDILFLFDECN 140

Query: 128 -KTRKCEPDAG----ADVVKICERQPICCPC---------GPQRRIPSSCGNVFDKLLKG 173
                  P  G    AD   + + Q  CC C           Q R   +C ++F   + G
Sbjct: 141 DSPSSSSPTCGWFYNADGTVVRDSQGFCCSCDLSEVLWLSNEQTRAGLTC-SLFAFGVDG 199

Query: 174 KANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVK------------TGSKVSE-VTVGPE 220
             ++AHCLRF   W+ VF IG   + F V + VK            TG  V+E + + P 
Sbjct: 200 --SSAHCLRFDQLWYDVFSIGAAQVSFEVVLSVKKYQTMTDMYGNTTGGYVTETLRLSPS 257

Query: 221 NKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERT 280
             T T+A   +   L GDF  +++ P   E YL +P       P+ + G    WMLL+R+
Sbjct: 258 QTTGTAAGGDIFAKLQGDFAPWSDNPVLSEKYLFVPSSPST-HPRVVAGT-DYWMLLDRS 315

Query: 281 RFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVE 340
                GL CNKIGVS+ AF  Q   C +   +CL NQL +Y   D  R  + QL  Y V 
Sbjct: 316 SADFSGLTCNKIGVSFSAFRYQGGACGNWLQACLGNQLDHYHREDLARWEQGQLGRYFVR 375

Query: 341 --GRFERMNQHPNAGSHSF-SIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVIIPTF 397
             G F             + S  + ++  +   + L ADDI Y   RSPG+I+   I  F
Sbjct: 376 FWGDFVGNQAVVQTNDQRYLSFALDQIRATVTTLTLNADDIIYTINRSPGRIVVANITGF 435

Query: 398 EALTQFGVATITTQNTGEVEASYSLTF-DCSTGVTLMEEQYFIIKPKETSIRSFKIYPTT 456
           E L   G   +   N G ++A Y++T  +C   +  ++ +   I   +++  +F +Y   
Sbjct: 436 EGLATQGELDVVVMNNGTIQADYTITVTECGDRIQAVQAKMRSISAYQSANLTFALY--- 492

Query: 457 NQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQITPFQPPKSSINDFFESIESI 516
              + Y    ++ DS +  V       + +     +GS                      
Sbjct: 493 --MSLYDSLGVIVDSVWVNVTVFATNITCLGGQCSDGS--------------------GG 530

Query: 517 GKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICL 552
           GK    G +  IT  +C   C+S FD +C++   C+
Sbjct: 531 GKPADGGYKYAIT--SC-SACNSIFDIACYVDNSCM 563


>gi|159475573|ref|XP_001695893.1| gamete-specific protein [Chlamydomonas reinhardtii]
 gi|158275453|gb|EDP01230.1| gamete-specific protein [Chlamydomonas reinhardtii]
          Length = 813

 Score =  167 bits (422), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 158/641 (24%), Positives = 254/641 (39%), Gaps = 134/641 (20%)

Query: 32  VQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIVAEVVEVE-------- 83
            ++++  +LEKC     ++ L+C  K+V+ + V +G S     +  E +E          
Sbjct: 22  AEVIASGRLEKCVVDGVTEELDCQEKVVVTLTVGNGQS-----LQTEALEFSLSCLNSPD 76

Query: 84  ---------ENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMK--TRKC 132
                     + T   R +  P  +++ K+  +A Y L Y+    +KP E  ++   + C
Sbjct: 77  GRCPCSCSAADPTCACRDLAAPLRVSLTKSPLWASYPLQYLSSFNWKPLEVILRPSNKVC 136

Query: 133 E-------PDAG---ADVVKICERQPICCPCGPQRRIPSSCG----------------NV 166
           +       P  G      V++ + Q  CC C   +    + G                + 
Sbjct: 137 KDGDWEDSPTCGWFSQGGVRVADSQGFCCECSSSQVWDDTFGSSKERTRANLDCDFWSDP 196

Query: 167 FDKLLKGKANTAHCLRFPGDWFHVFGIGQRSIGF--SVRIEVKTGSKVSEVT-------- 216
            D L+  K  +AHCL F   W+  + +G  S+ F  ++ +EV T    +  T        
Sbjct: 197 LDILIGRKPVSAHCLTFDPQWYSGYELGAASLQFEIAITVEVPTAPSPTPATTSATPRTT 256

Query: 217 -------------------------------VGPENKTATSADNFLKVNLIGDFVGYTNI 245
                                          +GP    A+SA   L   L+GD   YT +
Sbjct: 257 NNSSANSTNSTNSPAPQFLSPPAPSTREVLHLGPSVPLASSASRLLSAKLLGDLAMYTQL 316

Query: 246 PSFEEFYLVIPRQGG----PGQPQD--LGGNFSMWMLLERTRFTLDGLECNKIGVSYEAF 299
           P+     L++P+        G P D  L  N S WMLL++T  ++DGL C+K+G  + AF
Sbjct: 317 PAISNQVLMVPQPPAAAAATGSPLDATLATNRSAWMLLDKTMLSMDGLACDKVGTGFSAF 376

Query: 300 NGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGV---EGRFERMNQHPNAGSHS 356
             QPS C     +CL  QL +  EAD  RI   ++PLY +    G  +   Q  + G  S
Sbjct: 377 RYQPSGCGRAPQTCLSGQLKDLWEADLARIADGRVPLYMITRFTGGSDTTLQSFSGGPLS 436

Query: 357 FSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVII--------PTFEALTQFGVATI 408
           F++ VT    S + + + AD +  V  RSPGKI    +          FEA+   G   +
Sbjct: 437 FALPVTSQSQSLVTLSVAADGVRLVTNRSPGKITGAAVCRFAGTSCGGFEAVAARGYIYV 496

Query: 409 TTQNTGEVEASYSLTF-DCSTGVTLMEEQYFIIKPKETSI--RSFKIYPTTNQAAKY-TC 464
              NTG +++ Y+LT  +CS+ V  +E +   ++    +      ++Y     AA   TC
Sbjct: 497 NITNTGRLDSDYTLTVSNCSSNVRPIEARALAVRAGSAASLDPPMELYVEDQAAAAARTC 556

Query: 465 SAILKDSDFSEVDRAECQFSTMATVLDNGSQITPFQPPKSSINDFFESIESIGKKLWEGL 524
           +  L DS  +  D     F T AT L           P    N    + + +G K     
Sbjct: 557 TVSLYDSVGAVTDSLTLSFYTNATQL--------VVKPSGGYNG---TGDGVGVKR---- 601

Query: 525 RDFITGKACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAIF 565
                G  C   C++  D  C +   C S    FG +L I 
Sbjct: 602 ----NGTDCSTACTNPIDVLCFVTKKCWS---KFGRLLGII 635


>gi|414591384|tpg|DAA41955.1| TPA: hypothetical protein ZEAMMB73_607847, partial [Zea mays]
          Length = 124

 Score =  166 bits (421), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 78/110 (70%), Positives = 94/110 (85%), Gaps = 1/110 (0%)

Query: 275 MLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQL 334
           MLLER RFT DG+ECNKIGV YEAF  QP+FC+SPF SCL+NQLW + E+D+NRI+ ++ 
Sbjct: 1   MLLERVRFT-DGVECNKIGVGYEAFQNQPNFCASPFESCLNNQLWTFLESDKNRISMSRQ 59

Query: 335 PLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQR 384
           P Y V+GRF+R+NQHP+A  HSFSIGVTEV+NSNL IEL ADDIEY+YQR
Sbjct: 60  PQYVVQGRFQRINQHPDASVHSFSIGVTEVINSNLRIELSADDIEYMYQR 109


>gi|288563868|gb|ABO29824.2| fusion protein HAP2/GCS1 [Chlamydomonas reinhardtii]
          Length = 1139

 Score =  165 bits (418), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 158/641 (24%), Positives = 251/641 (39%), Gaps = 134/641 (20%)

Query: 32  VQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIVAEVVEVE-------- 83
            ++++  +LEKC     ++ L+C  K+V+ + V +G S     +  E +E          
Sbjct: 22  AEVIASGRLEKCVVDGVTEELDCQEKVVVTLTVGNGQS-----LQTEALEFSLSCLNSPD 76

Query: 84  ---------ENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMK--TRKC 132
                     + T   R +  P  +++ K+  +A Y L Y+    +KP E  ++   + C
Sbjct: 77  GRCPCSCSAADPTCACRDLAAPLRVSLTKSPLWASYPLQYLSSFNWKPLEVILRPSNKVC 136

Query: 133 E-------PDAG---ADVVKICERQPICCPCGPQRRIPSSCG----------------NV 166
           +       P  G      V++ + Q  CC C   +    + G                + 
Sbjct: 137 KDGDWEDSPTCGWFSQGGVRVADSQGFCCECSSSQVWDDTFGSSKERTRANLDCDFWSDP 196

Query: 167 FDKLLKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVKT------------------ 208
            D L+  K  +AHCL F   W+  + +G  S+ F + I V+                   
Sbjct: 197 LDILIGRKPVSAHCLTFDPQWYSGYELGAASLQFEIAITVEVPTAPSPTTATTSATPRTN 256

Query: 209 ----------------------GSKVSEVT-VGPENKTATSADNFLKVNLIGDFVGYTNI 245
                                      EV  +GP    A+SA   L   L+GD   YT +
Sbjct: 257 NSSSANSTNSTNSPAPQFLSPPAPSTREVLHLGPSVPLASSASRLLSAKLLGDLAMYTQL 316

Query: 246 PSFEEFYLVIPRQGG----PGQPQD--LGGNFSMWMLLERTRFTLDGLECNKIGVSYEAF 299
           P+     L++P+        G P D  L  N S WMLL++T  ++DGL C+K+G  + AF
Sbjct: 317 PAISNQVLMVPQPPAAAAATGSPLDATLATNRSAWMLLDKTMLSMDGLACDKVGTGFSAF 376

Query: 300 NGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGV---EGRFERMNQHPNAGSHS 356
             QPS C     +CL  QL +  EAD  RI   ++PLY +    G  +   Q  + G  S
Sbjct: 377 RYQPSGCGRAPQACLSGQLKDLWEADLARIADGRVPLYMITRFTGGSDTTLQSFSGGPLS 436

Query: 357 FSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVII--------PTFEALTQFGVATI 408
           F++ VT    S + + + AD +  V  RSPGKI    +          FEA+   G   +
Sbjct: 437 FALPVTSHSQSLVTLSVAADGVRLVTNRSPGKITGAAVCRFAGTSCGGFEAVAARGYIYV 496

Query: 409 TTQNTGEVEASYSLTF-DCSTGVTLMEEQYFIIKPKETSI--RSFKIYPTTNQAAKY-TC 464
              NTG +++ Y+LT  +CS+ V  +E +   ++    +      ++Y     AA   TC
Sbjct: 497 NITNTGRLDSDYTLTVSNCSSNVRPIEARTLAVRAGSAASLDPPMELYVEDQAAAAARTC 556

Query: 465 SAILKDSDFSEVDRAECQFSTMATVLDNGSQITPFQPPKSSINDFFESIESIGKKLWEGL 524
           +  L DS  +  D     F T AT L           P    N         G     G+
Sbjct: 557 TVSLYDSVGAVTDSLTLSFYTNATQL--------VVKPSGGYN---------GTGDGAGV 599

Query: 525 RDFITGKACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAIF 565
           +    G  C   C++  D  C +   C S    FG +L I 
Sbjct: 600 KR--NGTDCSTACTNPIDVLCFVTKKCWS---KFGRLLGII 635


>gi|261333213|emb|CBH16208.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
           DAL972]
          Length = 618

 Score =  165 bits (418), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 136/524 (25%), Positives = 230/524 (43%), Gaps = 54/524 (10%)

Query: 13  FLLILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGE 72
           F+L++    + L PR      +++ S +E CE+ +  +   C  K+V+ ++V  G   G 
Sbjct: 10  FVLVVLLPTSGLFPR--TEAALVASSSIEYCERSSKLEPFPCEKKMVVTLSVGGGQKAGV 67

Query: 73  ASIVAEVVEVEENSTQKMRTVRIPPVLTVNKTASYAV-YELTYIRDVPYKPQEFYMKTRK 131
             +V     V++   +K + V   PV  V   +     Y + YIR+   KP E  ++T  
Sbjct: 68  EEVVLLREAVDKTGDEKGKRVEFEPVRMVTTESPVRYRYPIYYIRNFNAKPYEQRLRTSA 127

Query: 132 ---CEP-------------DAGADVVKICERQPICCPCG---------PQRRIPSSCGNV 166
              C+              D   DV+     Q  CC CG         P  R   +C   
Sbjct: 128 SSWCDDSSNPGSATCGVARDRRGDVIPY--SQGFCCLCGACALSGICNPTSRSVGTCS-- 183

Query: 167 FDKLLKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVK----------TGSKVSEVT 216
               + G    A CLRF   W+  + IG+  + + +++++           TGSK   ++
Sbjct: 184 ----VTGDTGMASCLRFSDLWYGGYTIGRGVVWYELQVKLSSGNNSTGGGSTGSKEFTMS 239

Query: 217 VGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWML 276
           +GP+  TATS +      LIGDF             L IP +  P   + +G  ++ W++
Sbjct: 240 LGPDKLTATSTEFGASARLIGDFAPPEMPLDLSGKMLFIPSE--PRGHERVGAGYNEWII 297

Query: 277 LERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPL 336
           ++    ++ G ECNK+GVSYE F  Q S C +   +CL NQL +YR+ D     + +   
Sbjct: 298 VDTHLVSIRGTECNKVGVSYEGFATQGSRCDAYPGACLANQLEDYRDRDLEAETKGERGK 357

Query: 337 YGVEGRFERMNQHP--NAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVII 394
           Y +   F  +   P  NA + + S   +  L++ + I + AD + +V   S G I+   +
Sbjct: 358 Y-MARFFAPLGFDPLANASAPAVSYQASGTLSTIVTITISADKLNFVLSVSSGVIVGATV 416

Query: 395 P--TFEALTQFGVATITTQNTGEVEASYSLTF-DCSTGVTLMEEQYFIIKPKETSIRSFK 451
                 + ++    T+T  NTG++EA Y++   +C+  V  M  Q   I PK ++ R F 
Sbjct: 417 SGKVVHSYSRGSTITVTVLNTGDIEAQYTVVVGECTVNVQPMVAQTVYIPPKGSAQRRFT 476

Query: 452 IYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQ 495
           +    +   +  C+A L+++    VD     F   A    NGSQ
Sbjct: 477 LIVQDSIEGEAKCNATLRNARGDVVDTRAISFGVKALKPSNGSQ 520


>gi|145046216|dbj|BAE71145.2| generative cell specific-1 [Chlamydomonas reinhardtii]
          Length = 748

 Score =  164 bits (414), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 143/566 (25%), Positives = 230/566 (40%), Gaps = 112/566 (19%)

Query: 32  VQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIVAEVVEVE-------- 83
            ++++  +LEKC     ++ L+C  K+V+ + V +G S     +  E +E          
Sbjct: 22  AEVIASGRLEKCVVDGVTEELDCQEKVVVTLTVGNGQS-----LQTEALEFSLSCLNSPD 76

Query: 84  ---------ENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMK--TRKC 132
                     + T   R +  P  +++ K+  +A Y L Y+    +KP E  ++   + C
Sbjct: 77  GRCPCSCSAADPTCACRDLAAPLRVSLTKSPLWASYPLQYLSSFNWKPLEVILRPSNKVC 136

Query: 133 E-------PDAG---ADVVKICERQPICCPCGPQRRIPSSCG----------------NV 166
           +       P  G      V++ + Q  CC C   +    + G                + 
Sbjct: 137 KDGDWEDSPTCGWFSQGGVRVADSQGFCCECSSSQVWDDTFGSSKERTRANLDCDFWSDP 196

Query: 167 FDKLLKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVKT------------------ 208
            D L+  K  +AHCL F   W+  + +G  S+ F + I V+                   
Sbjct: 197 LDILIGRKPVSAHCLTFDPQWYSGYELGAASLQFEIAITVEVPTAPSPTTATTSATPRTN 256

Query: 209 ----------------------GSKVSEVT-VGPENKTATSADNFLKVNLIGDFVGYTNI 245
                                      EV  +GP    A+SA   L   L+GD   YT +
Sbjct: 257 NSSSANSTNSTNSPAPQFLSPPAPSTREVLHLGPSVPLASSASRLLSAKLLGDLAMYTQL 316

Query: 246 PSFEEFYLVIPRQGG----PGQPQD--LGGNFSMWMLLERTRFTLDGLECNKIGVSYEAF 299
           P+     L++P+        G P D  L  N S WMLL++T  ++DGL C+K+G  + AF
Sbjct: 317 PAISNQVLMVPQPPAAAAATGSPLDATLATNRSAWMLLDKTMLSMDGLACDKVGTGFSAF 376

Query: 300 NGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGV---EGRFERMNQHPNAGSHS 356
             QPS C     +CL  QL +  EAD  RI   ++PLY +    G  +   Q  + G  S
Sbjct: 377 RYQPSGCGRAPQACLSGQLKDLWEADLARIADGRVPLYMITRFTGGSDTTLQSFSGGPLS 436

Query: 357 FSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVII--------PTFEALTQFGVATI 408
           F++ VT    S + + + AD +  V  RSPGKI    +          FEA+   G   +
Sbjct: 437 FALPVTSHSQSLVTLSVAADGVRLVTNRSPGKITGAAVCRFAGTSCGGFEAVAARGYIYV 496

Query: 409 TTQNTGEVEASYSLTF-DCSTGVTLMEEQYFIIKPKETSIRS--FKIYPTTNQAAKY-TC 464
              NTG +++ Y+LT  +CS+ V  +E +   ++    +      ++Y     AA   TC
Sbjct: 497 NITNTGRLDSDYTLTVSNCSSNVRPIEARTLAVRAGSAASLDPPMELYVEDQAAAAARTC 556

Query: 465 SAILKDSDFSEVDRAECQFSTMATVL 490
           +  L DS  +  D     F T AT L
Sbjct: 557 TVSLYDSVGAVTDSLTLSFYTNATQL 582


>gi|330819085|ref|XP_003291595.1| hypothetical protein DICPUDRAFT_156210 [Dictyostelium purpureum]
 gi|325078197|gb|EGC31861.1| hypothetical protein DICPUDRAFT_156210 [Dictyostelium purpureum]
          Length = 651

 Score =  162 bits (409), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 141/503 (28%), Positives = 229/503 (45%), Gaps = 61/503 (12%)

Query: 76  VAEVVEVEENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEF------YMKT 129
           ++ VV+++     K +T+  P V+  +K+ ++ VY L Y++ V +KP E       Y+  
Sbjct: 11  ISNVVDID----GKNKTLLEPIVVRFSKSETFVVYPLEYLQTVAFKPVEKVIYKTDYLIG 66

Query: 130 RKC-----EPDAGADVVKIC-----ERQPICCPCGPQRRIPS---SCGNVFDKLLKGKAN 176
             C     +   G  V  +      + Q  CC C       +   S GN+   L   K++
Sbjct: 67  TGCKDLPTDSTCGYAVNSVTGEAIRDSQGFCCSCSMSDYFGADQNSRGNLGCSLFGSKSS 126

Query: 177 TAHCLRFPGDWFHVFGIGQRSIGFSVRIEVKTGSKVSEVT--VGPENKTATSADNFLKVN 234
           +AHCL F    + VF I +  + + +   V++      +   V   N   T   + + + 
Sbjct: 127 SAHCLSFSELKYDVFDISETRVQYQINATVQSFYNQLPIVDVVKLSNDVTTGKTSQVIIR 186

Query: 235 LIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGV 294
           ++GD    T I  +    +V PR      P       +  +LL+   F L G  CNKIGV
Sbjct: 187 IVGDLSTSTQIKQYPNKKIVFPR--ASSDPISSLPIINTSLLLDDDFFDLSGAGCNKIGV 244

Query: 295 SYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRF---------ER 345
            Y AF  Q + C++ F SCL NQ+ +Y   D   IN       G +GR+         + 
Sbjct: 245 GYSAFQNQANRCAAVFQSCLQNQISDYYANDLKLIND------GKKGRYIISQLGTSVKV 298

Query: 346 MNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFGV 405
           ++   N  S SF++   E+  + L + L AD +++V   SP KIIS  I TFE+++  GV
Sbjct: 299 ISSAANKNSRSFAVRFDEIQRTILTLTLSADSLQFVVNISPAKIISYNIETFESMSNNGV 358

Query: 406 ATITTQNTGEVEASYSLTF-DCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTC 464
             I+ QNTG + A Y L   +CS  +  M  Q   I+PKE  + SF+IY TT   + Y C
Sbjct: 359 LKISVQNTGALNADYLLQVHNCSGDIIQMPNQIATIQPKEIYVFSFQIYTTTMLQSYYYC 418

Query: 465 SAILKDSDFSEVDRAECQFSTMATVLDNGSQITPFQPPKSSINDFFESIESIGKKLWEGL 524
            A L +   + +      F+T  T+++ G+Q +   P   S     +S+ +IG +L    
Sbjct: 419 FADLVNEQSTLLQSIRINFNTSKTIIEQGAQ-SGDNPNNQS-----DSL-NIGYEL---- 467

Query: 525 RDFITGKACRRKCSSFFDFSCHI 547
                   C   C +FF+  C++
Sbjct: 468 -------TCDLVCPNFFNIICYL 483


>gi|307111056|gb|EFN59291.1| hypothetical protein CHLNCDRAFT_137637 [Chlorella variabilis]
          Length = 1084

 Score =  161 bits (407), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 140/486 (28%), Positives = 217/486 (44%), Gaps = 64/486 (13%)

Query: 135 DAGADVVKICERQPICCPCGPQRRIPS-----SCGNVFDKLLKGKANTAHCLRFPGDWFH 189
           D G DV    + Q  CC CG            S GN+         ++AHCLRF   W+H
Sbjct: 137 DGGQDVA---DSQGFCCDCGSLINFGGDDGQLSRGNLDCGGFIQTQDSAHCLRFDNTWWH 193

Query: 190 V-FGIGQRSIGFSVRIEVKTGSKVSE----------VTVGPENKTATSADNFLKVNLIGD 238
             + IG+ S+ F++ + + T +  +           VT+ P       +   L   L+GD
Sbjct: 194 AGYVIGEYSLDFTINLNITTVTTNATTNATAAASELVTLTPSAPFRRDSSRRLSAKLLGD 253

Query: 239 FVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEA 298
              Y   P  +  +L+IP + G G  +    +F+ WM+++    T  GLEC+KIGVSY  
Sbjct: 254 LESYQQAPQLDGKWLLIPTKPGEGPQEWYTRHFNEWMVVDGNLVTTTGLECDKIGVSYSG 313

Query: 299 F-NGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFERMN---------- 347
           F N QP+ C++P  SCL NQ+ +   AD NRIN    PLY V GR+              
Sbjct: 314 FRNSQPNKCTTPQGSCLRNQIVDLYAADLNRINTGVDPLYFV-GRYGGGTLNSDQLTGEL 372

Query: 348 QHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVIIPT--------FEA 399
           Q   +   + ++ ++ +  S L + + ADD+ +V  RSP +I SV + T        FEA
Sbjct: 373 QEDGSFKLALNLPISAIKVSLLTLMVAADDVAFVVNRSPAQITSVQVCTYDGIICGGFEA 432

Query: 400 LTQFGVATITTQNTGEVEASYSLTF-DCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQ 458
           +T  G   +T +N+G V + Y++   +C+TGV  +  Q   I P+ +++  F++   ++ 
Sbjct: 433 MTARGYLRVTVRNSGYVASDYTVQVTNCTTGVRNVLAQRAGIAPQSSTVFQFELQMESDA 492

Query: 459 AAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQITPFQPPKSSINDFFESIESIGK 518
           A++ +C   + DS    V      F T AT           QPP+ S          IG 
Sbjct: 493 ASESSCMVSVVDSLGDTVATMGISFYTDATDYT--------QPPEQS---------DIGD 535

Query: 519 KLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAIFPT----VLVLLWL 574
           ++     D  T   C++ C    DF C I   C   L   GL   + P      L ++W 
Sbjct: 536 QVTGPNED--TPDWCQQVCPRLTDFKCAINKGCYGRLAK-GLSAIVAPVAGLGALFMIWK 592

Query: 575 LHQKGL 580
               GL
Sbjct: 593 TGHLGL 598


>gi|71652476|ref|XP_814894.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70879906|gb|EAN93043.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 588

 Score =  160 bits (406), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 140/516 (27%), Positives = 237/516 (45%), Gaps = 31/516 (6%)

Query: 7   SLKLKHFLLILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPS 66
           SL    F L+LF ++   +P    G+ +L+ S +E+C++    ++L C  K+V+ ++V S
Sbjct: 4   SLSRMLFSLLLFALMVATTPFAAEGL-LLASSSIEQCDRVGTDNSLPCEKKLVVTLSVDS 62

Query: 67  GSSGGEASIVAEVVEVEENSTQKMRTVRIPPV-LTVNKTASYAVYELTYIRDVPYKPQEF 125
             +      V     V++        V   P+ LT +K+     Y L Y R+   KP E 
Sbjct: 63  DQAEDVEEFVILRDAVDKTKGTGEEHVEFQPIRLTTSKSRVQYSYPLFYERNFNAKPYEE 122

Query: 126 YMKTR--KCE----PDAGADVVKICERQPI------CCPCGPQRRIP----SSCGNVFDK 169
            + T    C+    P A   +      +PI      CC CGP + +      S G     
Sbjct: 123 EITTELVGCDDTFSPKATCGLAMDTAGRPIPYSQGFCCRCGPCQLLGLCPVGSRGLQVCD 182

Query: 170 LLKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVKTGSKVSEVT------VGPENKT 223
           + +G A  A CLRF   W+  + +G  +I + + +++ T S+ +  T      +GP+  +
Sbjct: 183 IFRGAA-LASCLRFGELWYSGYSMGSATIWYRLSVKLTTDSQNNSKTKEAVFELGPDVLS 241

Query: 224 ATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFT 283
            +SA+    V+LIGDFV            L IP    P   + +      W++L++   +
Sbjct: 242 GSSAEFGAWVSLIGDFVPAELPLVLSNKMLFIPSS--PRIHERVLAGQKEWLILDKHHVS 299

Query: 284 LDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRF 343
           + G +CNK+GVSYEAF+GQ S C     SCL +QL +YR +D     R     Y      
Sbjct: 300 MQGRDCNKVGVSYEAFSGQGSRCQLIRGSCLADQLEDYRSSDLAVEARGGRGKYLARFFG 359

Query: 344 ERMNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVII--PTFEALT 401
           + +  + N      S  +   L + L + + AD ++Y+   SPG+I+S ++   T E  +
Sbjct: 360 DFVVNNVNNSRTRLSYWMRGSLATMLTVVISADRLQYLVSVSPGEIVSAVMSKSTVEESS 419

Query: 402 QFGVATITTQNTGEVEASYSLTF-DCSTGVTLMEEQYFIIKPKETSIRSFKI-YPTTNQA 459
           + G  ++  +N G V A Y+L   +CS  V  +  Q   ++P+ T IRSF +      + 
Sbjct: 420 RDGSVSVIVRNIGHVTAQYTLGVGNCSGNVFPIMAQTLSLRPRGTVIRSFDLNIQDVAEE 479

Query: 460 AKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQ 495
               C   L+D+  +  D+   +F   + VL N +Q
Sbjct: 480 RIVQCDVTLRDAKGAITDKKILKFRVTSKVLTNDTQ 515


>gi|407849348|gb|EKG04115.1| hypothetical protein TCSYLVIO_004826 [Trypanosoma cruzi]
          Length = 588

 Score =  159 bits (403), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 138/516 (26%), Positives = 230/516 (44%), Gaps = 31/516 (6%)

Query: 7   SLKLKHFLLILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPS 66
           SL    F L+LF ++   +P    G+ +L+ S +E+C++    ++L C  K+V+ ++V S
Sbjct: 4   SLSRMLFSLLLFALMVATTPFAAEGL-LLASSSIEQCDRVGTDNSLPCDKKLVVTLSVDS 62

Query: 67  GSSGGEASIVAEVVEVEENSTQKMRTVRIPPV-LTVNKTASYAVYELTYIRDVPYKPQEF 125
             +      V     V++        V   P+ LT +K+     Y L Y R+   KP E 
Sbjct: 63  DQAEDVEEFVILRDAVDKTKGTGEERVEFQPIRLTTSKSRVQYTYPLFYERNFNAKPYEE 122

Query: 126 YMKTRKCEPDAGADVVKICE------------RQPICCPCGPQRRIP----SSCGNVFDK 169
            + T     D        C              Q  CC CGP + +      S G     
Sbjct: 123 EITTELVGCDDTFSSKATCGLATDTAGRPIPYSQGFCCRCGPCQLLGLCPVGSRGLQVCD 182

Query: 170 LLKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVKT----GSKVSEVT--VGPENKT 223
           + +G A  A CLRF   W+  + +G  +I + + +++ T     SK  E    +GP+  +
Sbjct: 183 IFRGAA-LASCLRFGELWYSGYSMGSATIWYRLSVKLTTDSQNNSKAKEAVFELGPDVLS 241

Query: 224 ATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFT 283
            +SA+    V+LIGDFV            L IP    P   + +      W++L++   +
Sbjct: 242 GSSAEFGAWVSLIGDFVPAELPLVLSNKMLFIPSS--PRIHERVLAGQKEWLILDKHHVS 299

Query: 284 LDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRF 343
           + G +CNK+GVSYEAF+GQ S C     SCL +QL +YR +D     R     Y      
Sbjct: 300 MQGRDCNKVGVSYEAFSGQGSRCQLIRGSCLADQLEDYRSSDLAVEARGGRGKYLARSFG 359

Query: 344 ERMNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVII--PTFEALT 401
           + +    N      S  +   L + L + + AD ++Y+   S G+I+S ++   T E  +
Sbjct: 360 DFVVNSVNNSRTRLSYWMRGSLATMLTVVISADRLQYLVSVSQGEIVSAVMSKSTIEESS 419

Query: 402 QFGVATITTQNTGEVEASYSLTF-DCSTGVTLMEEQYFIIKPKETSIRSFKI-YPTTNQA 459
           + G  ++  +N G V A Y+L   +CS  V  +  Q   ++P+ET +RSF +      + 
Sbjct: 420 RDGSVSVIVRNIGHVTAKYTLGVGNCSGNVFPIMAQTLSLRPRETVVRSFDLNIQDVTEE 479

Query: 460 AKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQ 495
               C   L+D+  +  D+   +F   + VL N +Q
Sbjct: 480 RIVQCDVTLRDAKGAITDKKVLKFRVTSKVLTNDTQ 515


>gi|156370880|ref|XP_001628495.1| predicted protein [Nematostella vectensis]
 gi|156215473|gb|EDO36432.1| predicted protein [Nematostella vectensis]
          Length = 853

 Score =  158 bits (400), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 142/532 (26%), Positives = 226/532 (42%), Gaps = 65/532 (12%)

Query: 16  ILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSDN-------LNCTTKIVLNMAVPSGS 68
           I+  ++ LL         +++KS L+ CE   +SD+         C  K+++ ++V SG 
Sbjct: 6   IIMILVGLLCLANESYSDVIAKSSLQMCENTGNSDDPYNVVDQKACEKKLIVTLSVRSGQ 65

Query: 69  SGGE-ASIVAEVVEVEENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYM 127
           +G E    V  V +V + + ++M  +  P ++T+ KT     Y   Y+  V  KP E  +
Sbjct: 66  NGTEFLKAVTNVSKVYDQTEKEMARLYNPFIITLAKTPVKLTYPYYYLAMVNNKPTERVV 125

Query: 128 KT----------RKCEPDAGADVVKIC------ERQPI------CCPCGPQRRI------ 159
            +            C  DA  D   +C      E +PI      CC C  Q +       
Sbjct: 126 ISDSKWHASGSYHACS-DAWDDEDALCGFYTDAEGKPIWDSQGFCCRCTEQEKWRGSFND 184

Query: 160 --PSSCGNVFDKLLKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEV------KTGSK 211
             P S   +  KL  G    AHC+ F   W+ V  +G   + FS+ ++       K G+K
Sbjct: 185 KNPYSRAGINCKLF-GTQAAAHCMTFDDLWYTVNEVGLWQMDFSIHVKAYDLVVEKVGNK 243

Query: 212 V-------SEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIP----RQGG 260
                    E+ +GP  ++       L    IG+F  +   P     YL+IP    +   
Sbjct: 244 TQSKWVDGGEIVIGPTIRSGVGVHGRLHATFIGEFQSHKQFPVLTTKYLLIPYVSEKVDP 303

Query: 261 PGQPQDLGGNFSMWMLLERTRFTLDGL---ECNKIGVSYEAFNGQ-PSFCSSPFWSCLHN 316
              PQ   G    +ML+++           EC+KIGVS+ AF  Q P  CS     CLHN
Sbjct: 304 KTHPQFRNGPHD-YMLIDKHEVNYKSSGPHECDKIGVSFSAFRAQAPMGCSQKQGDCLHN 362

Query: 317 QLWNYREADQNRINRNQLPLYGVE--GRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELR 374
           Q  +Y E D  R    + P Y  +  G+   +NQ  +      +  V EV+ S + +++ 
Sbjct: 363 QPKDYFEEDTKRRASGKTPYYFPQKFGKLLGVNQRKDNNHFVLTYEVDEVMTSMVTLQIS 422

Query: 375 ADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEAS-YSLTFDCSTGVTLM 433
           ADD+  +Y R+ GKI+      FEAL++ G   +  QN G V A  Y +  +CS G+  +
Sbjct: 423 ADDVILIYNRAEGKILRAYAQDFEALSRDGNLYVIVQNIGLVTADFYVVIKECSVGIGKL 482

Query: 434 EEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFST 485
            E+   I P++T   +F +     +     C   L D+    VD +   F T
Sbjct: 483 LEKAASINPQQTHSFTFSVKAQQWKGGDNFCIVQLYDARRKMVDSSNVTFRT 534


>gi|71748482|ref|XP_823296.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|70832964|gb|EAN78468.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 618

 Score =  158 bits (400), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 149/588 (25%), Positives = 255/588 (43%), Gaps = 91/588 (15%)

Query: 35  LSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIVAEVV-----EVEENSTQK 89
           ++ S +E CE+ ++ +   C  K+V+ ++V     G E +I AE V      V++   +K
Sbjct: 30  VASSSIEYCERSSNGEPFPCEKKMVVGLSV-----GSEQTIEAEEVVLLREAVDKTGDEK 84

Query: 90  MRTVRIPPV-LTVNKTASYAVYELTYIRDVPYKPQEFYMKTRK---CEP----------- 134
            + V   P+ L   K+     Y + YIR+   KP E  ++T     C+            
Sbjct: 85  GKRVEFEPIRLVTTKSPVQYRYPIYYIRNFNAKPYEQRLRTSASSWCDDSSNPGSATCGV 144

Query: 135 --DAGADVVKICERQPICCPCG---------PQRRIPSSCGNVFDKLLKGKANTAHCLRF 183
             D   DV+     Q  CC CG         P  R   +C       + G    A CLRF
Sbjct: 145 ARDRRGDVIPY--SQGFCCLCGACALSGICNPTSRSVGTCS------VTGDTGMASCLRF 196

Query: 184 PGDWFHVFGIGQRSIGFSVRIEVK----------TGSKVSEVTVGPENKTATSADNFLKV 233
              W+  + IG+  + + +++++           TGSK   +++GP+  TATS +     
Sbjct: 197 SDLWYGGYTIGRGVVWYELQVKLSSGNNSTGGGSTGSKEFTMSLGPDKLTATSTEFGASA 256

Query: 234 NLIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIG 293
            LIGDF             L IP +  P   + +G  ++ W++++    ++ G ECNK+G
Sbjct: 257 RLIGDFAPPEMPLDLSGKMLFIPSE--PRGHERVGAGYNEWIIVDTHLVSIRGTECNKVG 314

Query: 294 VSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHP--N 351
           VSYE F  Q S C +   +CL NQL +YR+ D     + Q   Y +   F      P  N
Sbjct: 315 VSYEGFATQGSRCDAYPGACLANQLEDYRDRDLEAETKGQQGKY-MARFFAPFGFDPLAN 373

Query: 352 AGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVIIP--TFEALTQFGVATIT 409
           A + + +  VT  L++ + I + AD + +V   S G I+   +      + ++    T+T
Sbjct: 374 ASAPAVAYQVTGTLSTMVTITISADKLNFVLSVSSGVIVGATVSGKVVHSYSRGSTITVT 433

Query: 410 TQNTGEVEASYSLTF-DCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAIL 468
             NTG++EA Y++   +C+  V  M  Q   I  + ++ R F +    +   +  C+A L
Sbjct: 434 VLNTGDIEAQYTVVVGECTVNVQPMVAQTVYIPLQGSAQRRFTLIVQDSIEGEAKCNATL 493

Query: 469 KDSDFSEVDRAECQFSTMATVLDNGSQITPFQPPKSSINDFFESIESIGKKLWEGLR--D 526
           +++    VD     F   A    NGSQ                     G   +E  R  +
Sbjct: 494 RNARGDVVDTRAISFGVKALKPSNGSQ---------------------GGSTFENGRYSE 532

Query: 527 FITGKACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAIFPTVLVLLWL 574
              G++  ++C S+F+  C +++ C  W  L    + + P+V +L+ L
Sbjct: 533 EAKGESQCQQC-SWFNLLCFLRHRCW-WQPL----VYVLPSVTLLMLL 574


>gi|340508314|gb|EGR34043.1| hypothetical protein IMG5_026080 [Ichthyophthirius multifiliis]
          Length = 525

 Score =  154 bits (389), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 150/528 (28%), Positives = 220/528 (41%), Gaps = 79/528 (14%)

Query: 99  LTVNKTASYAVYELTYIRDVPYKPQE-------------FYMKTRKCEPDAGADVVKICE 145
           + V K+   AVY L Y+RD    PQE             F   +  C         KI +
Sbjct: 14  IEVTKSPVVAVYPLKYMRDYESMPQEKVISKSVFTCQDGFNEDSPTCGFQRDEKGEKIFD 73

Query: 146 RQPICCPCGPQ------RRIPSSCGNVFDKLLKGKANTAHCLRFPGDWFHVFGIGQRSIG 199
            Q  CC CG        + +      +   L  G A +AHCLRFPG W+  + I Q  I 
Sbjct: 74  SQGFCCKCGAADFFGLGKEVMRGVDCLPFNLNSGSA-SAHCLRFPGRWYSGYEILQYYIY 132

Query: 200 FSVRIEV--------KTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEF 251
           + +++EV        K      ++T    ++   S DN   V +IGDF      P +   
Sbjct: 133 YEIKVEVYELEGNNNKKRKLKYKLTTSTTDRIKKSPDNKFLVKIIGDFFPTQPPPVYNNV 192

Query: 252 YLVIPRQGGPGQPQDLG----GNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCS 307
           YLV P    P    +L        S WML+E+ +FTLDG ECNKIGVSY AF  +   CS
Sbjct: 193 YLVRPTPNRPQANNELRVRVLEGISNWMLIEKNQFTLDGTECNKIGVSYAAFRRENGSCS 252

Query: 308 SPFWSCLHNQLWNYREADQNRINRNQLP--LYGVEGRF-ERMNQHPNAGSHSFSIGVTEV 364
               SCL NQ+ ++   D  RI + Q    L   +G F E  ++  N     F  G    
Sbjct: 253 KQIGSCLKNQIEHFYLRDIERIKKGQPTQNLLLPKGDFQESWDKQNNTQMILFIEGSMST 312

Query: 365 LNSNLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTF 424
           L   + IE+ + +I+++     GK I V I  FE+ +  G       N     A ++L F
Sbjct: 313 L---ITIEMDSAEIQFLTMLGQGKFILVKINNFESHSGSGKFEAHILNKSSFAAEFNLGF 369

Query: 425 DCSTGVT--------LMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEV 476
            C   V         L ++Q FI K   +S+        TN      C+  L D+  + +
Sbjct: 370 SCDQNVLPISGQKLFLNQDQLFIFK---SSVNVVSDLGKTNNL----CNVTLSDAVNNVL 422

Query: 477 DRAECQFSTMATVLDNGSQITPFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACRRK 536
           D A+  F+T   V     +I+    P+ +   + E+  ++ K L E          C +K
Sbjct: 423 DFAQITFNTTDVV-----RIS----PQGNGTYYNENNSTLKKPLIE--------VTCNQK 465

Query: 537 CSSFFDFSCHIQYICL--------SWLVLFGLVLAIFPTVLVLLWLLH 576
           C  F+D  CH    CL        + L +  + L +F  V+ L  +LH
Sbjct: 466 CPDFWDIFCHFSTKCLNNGFKTLGTGLGILVIFLELF-DVVALFVVLH 512


>gi|146100443|ref|XP_001468864.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|134073233|emb|CAM71954.1| conserved hypothetical protein [Leishmania infantum JPCM5]
          Length = 917

 Score =  153 bits (387), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 153/609 (25%), Positives = 253/609 (41%), Gaps = 65/609 (10%)

Query: 15  LILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEAS 74
           + + C+  L+   C      +S S +  C    D +N++CT K+V+ + V      GE S
Sbjct: 111 IAVLCVSLLVRLACPARAAFVSSSLISYCSDSGD-ENISCTKKMVVTVTVEGEQLPGEES 169

Query: 75  IV----AEVVEVEENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMK-- 128
           ++    A  + V   +  +   +RI    T +++A    Y L Y+++   KP E  +K  
Sbjct: 170 LLFLNSATDMTVNNGTAVQFSPLRI----TTSRSAVRYRYPLFYVQNYNAKPYEATVKGS 225

Query: 129 -TRKCEPDAGADVV-----------KICERQPICCPCG---------PQRRIPSSCGNVF 167
              +C  D  AD              I   Q  CC C          P  R  ++C N+F
Sbjct: 226 LLNQCNADFNADTATCGLAYDAAGKAIPYSQGFCCDCSMCQTLGLCQPDARANAAC-NIF 284

Query: 168 DKLLKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVK------TGSKVSEVTV---G 218
           DK       TA CLRF   W+  + IG     ++V + +        G+  +E  V    
Sbjct: 285 DKY-----TTASCLRFAQRWYSGYTIGGYMTWYTVNLTLSRNVSGSGGAGAAEKVVMHLS 339

Query: 219 PENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQD---LGGNFSMWM 275
           P N   T+ + +   +++   VG T  P  +   L       P  P +   +    + W+
Sbjct: 340 PSNNGETAGEGW---DVMARIVG-TYAPVDQPLDLTSRMLFAPAIPPNDARVQAGAAEWL 395

Query: 276 LLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLP 335
           LL     TLDG EC+K+GVSYEAF  Q + C+    SCL +QL +YR AD  RI      
Sbjct: 396 LLPTNLVTLDGRECDKVGVSYEAFASQGNKCNLRPGSCLSSQLEDYRTADLQRIAAGNKG 455

Query: 336 LYGVEGRFERMNQHPNAGSHSF-SIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVII 394
            Y +   F   N   +A +  + S        + + I + ADD+EY    + GKI+S  +
Sbjct: 456 QY-MATSFGDFNLENDAATSPYISYLAASPAATMISITVSADDLEYTVGLASGKIVSADL 514

Query: 395 --PTFEALTQFGVATITTQNTGEVEASYSL-TFDCSTGVTLMEEQYFIIKPKETSIRSFK 451
             PT +A T  GV T+  +NT  V     +   +CS GV  M  Q   +  ++ +  +FK
Sbjct: 515 NKPTLQAGTADGVMTVMVRNTAAVTGRLVVGMLNCSDGVFPMTAQKLSLAAQQQAAVTFK 574

Query: 452 IYPTTNQAA-KYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQITPFQPPKSSINDFF 510
           +Y   + A+ K +C+ +++++  +  D     +   +T   NG+Q          +    
Sbjct: 575 VYVQNSYASGKASCTVVVRNAHEAITDLRVVSWKVSSTNFHNGTQGGSADDGSGGV---- 630

Query: 511 ESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAIFPTVLV 570
            S E          R      A RR+C         +  + ++ ++       +F   L 
Sbjct: 631 -STEESSAASCLNCRTLDIACAVRRRCWQLILLDLFVYLLIIAVVLCVIFFWRVFCCCLY 689

Query: 571 LLWLLHQKG 579
           LL   H++G
Sbjct: 690 LLGRQHRRG 698


>gi|407409949|gb|EKF32579.1| hypothetical protein MOQ_003566 [Trypanosoma cruzi marinkellei]
          Length = 589

 Score =  152 bits (385), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 135/511 (26%), Positives = 231/511 (45%), Gaps = 32/511 (6%)

Query: 13  FLLILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGE 72
           F  +LF ++   +P    G+ +L+ S +E+C++      L C  K+V+ ++V S  +   
Sbjct: 10  FSSLLFALVVATTPFAAEGL-LLASSSIEQCDRVETDKLLPCEKKLVVTLSVDSAQADNV 68

Query: 73  ASIVAEVVEVEENSTQKMRTVRIPPV-LTVNKTASYAVYELTYIRDVPYKP--QEFYMKT 129
              V     V++        V   P+ LT +K+     Y L Y R+   KP  +E   + 
Sbjct: 69  EEFVILRDAVDKTKGTGEERVEFEPIRLTTSKSRVQYRYPLFYERNFNAKPYEEEITTEL 128

Query: 130 RKCE----PDAGADVVKICERQPI------CCPCGPQRRIP----SSCGNVFDKLLKGKA 175
             C+    P A   + K    +PI      CC CG  + +      S G     +  G A
Sbjct: 129 TGCDDTFSPTATCGLAKDTAGRPIPYSQGFCCRCGACQLLGLCPVGSRGLQVCDIFNGAA 188

Query: 176 NTAHCLRFPGDWFHVFGIGQRSIGFSVRIEV-------KTGSKVSEVTVGPENKTATSAD 228
             A CLRF   W+  + IG  +I + + +++        T +K +   +GPE  + +S +
Sbjct: 189 -LAACLRFGKLWYSGYSIGPATIWYRLLVKLTADAENNSTKAKEAVFELGPEVLSGSSPE 247

Query: 229 NFLKVNLIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLE 288
               V+LIGDFV         +  L IP    P + + +      W++L++   ++ G +
Sbjct: 248 FGAWVSLIGDFVPAELPLVLSDKMLFIPSS--PRKHERVLAGQKEWIILDKHHVSMQGRD 305

Query: 289 CNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQ 348
           CNK+GVSYEAF+ Q S C     SCL +QL +YR +D     R     Y      E +  
Sbjct: 306 CNKVGVSYEAFSAQGSRCQLIQGSCLADQLEDYRASDLAVEARGGKGKYMARFFGEFVVN 365

Query: 349 HPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVII--PTFEALTQFGVA 406
             N+     S  +   L + + + + AD ++Y+   SPG+I+S ++   T E  ++ G  
Sbjct: 366 TANSSRTRVSYWMRGSLATMITVVISADRLQYLISVSPGEIVSAVMSKSTIEESSRDGSI 425

Query: 407 TITTQNTGEVEASYSLTF-DCSTGVTLMEEQYFIIKPKETSIRSFKI-YPTTNQAAKYTC 464
           ++  +N G + A Y+L   +CS  V  +  Q   ++P+ET IRSF +      +     C
Sbjct: 426 SVMVRNIGNLTAEYTLGVGNCSGNVFPIMAQTLSLRPQETLIRSFDVNIQDVTEERIVQC 485

Query: 465 SAILKDSDFSEVDRAECQFSTMATVLDNGSQ 495
              L+D+  +  D+   +F  +  VL N +Q
Sbjct: 486 DVTLRDAKDAITDKKVVKFRVIRKVLTNNTQ 516


>gi|398022953|ref|XP_003864638.1| hypothetical protein, conserved [Leishmania donovani]
 gi|322502874|emb|CBZ37956.1| hypothetical protein, conserved [Leishmania donovani]
          Length = 917

 Score =  152 bits (385), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 153/609 (25%), Positives = 252/609 (41%), Gaps = 65/609 (10%)

Query: 15  LILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEAS 74
           + + C+  L+   C      +S S +  C    D +N++CT K+V+ + V      GE S
Sbjct: 111 IAVLCVSLLVRLACPARAAFVSSSLISYCSDSGD-ENISCTKKMVVTVTVEGEQLPGEES 169

Query: 75  IV----AEVVEVEENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMK-- 128
           ++    A  + V   +  +   +RI    T +++A    Y L Y+++   KP E  +K  
Sbjct: 170 LLFLNSATDMTVNNGTAVQFSPLRI----TTSRSAVRYRYPLFYVQNYNAKPYEATVKGS 225

Query: 129 -TRKCEPDAGADVV-----------KICERQPICCPCG---------PQRRIPSSCGNVF 167
              +C  D  AD              I   Q  CC C          P  R  ++C N+F
Sbjct: 226 LLNQCNADFNADTATCGLAYDAAGKAIPYSQGFCCDCSMCQTLGLCQPDARANAAC-NIF 284

Query: 168 DKLLKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVK------TGSKVSEVTV---G 218
           DK       TA CLRF   W+  + IG     ++V + +        G+  +E  V    
Sbjct: 285 DKY-----TTASCLRFAQRWYSGYTIGGYMTWYTVNLTLSRNVSGSGGAGAAEKVVMHLS 339

Query: 219 PENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQD---LGGNFSMWM 275
           P N   T+ + +   +++   VG T  P  +   L       P  P +   +    + W+
Sbjct: 340 PSNNGETAGEGW---DVMARIVG-TYAPVDQPLDLTSRMLFAPAIPPNDARVQAGAAEWL 395

Query: 276 LLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLP 335
           LL     TLDG EC+K+GVSYEAF  Q + C+    SCL +QL +YR AD  RI      
Sbjct: 396 LLPTNLVTLDGRECDKVGVSYEAFASQGNKCNLRPGSCLSSQLEDYRTADLQRIAAGNKG 455

Query: 336 LYGVEGRFERMNQHPNAGSHSF-SIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVII 394
            Y +   F   N   +A +  + S        + + I + ADD+EY      GKI+S  +
Sbjct: 456 QY-MATSFGDFNLENDAATSPYISYLAASPAATMISITVSADDLEYTVGLVSGKIVSADL 514

Query: 395 --PTFEALTQFGVATITTQNTGEVEASYSL-TFDCSTGVTLMEEQYFIIKPKETSIRSFK 451
             PT +A T  GV T+  +NT  V     +   +CS GV  M  Q   +  ++ +  +FK
Sbjct: 515 NKPTLQAGTADGVMTVMVRNTAAVTGRLVVGMLNCSDGVFPMTAQKLSLAAQQQAAVTFK 574

Query: 452 IYPTTNQAA-KYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQITPFQPPKSSINDFF 510
           +Y   + A+ K +C+ +++++  +  D     +   +T   NG+Q          +    
Sbjct: 575 VYVQNSYASGKASCTVVVRNAHEAITDLRVVSWKVSSTNFHNGTQGGSADDGSGGV---- 630

Query: 511 ESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAIFPTVLV 570
            S E          R      A RR+C         +  + ++ ++       +F   L 
Sbjct: 631 -STEESSAASCLNCRTLDIACAVRRRCWQLILLDLFVYLLIIAVVLCVIFFWRVFCCCLY 689

Query: 571 LLWLLHQKG 579
           LL   H++G
Sbjct: 690 LLGRQHRRG 698


>gi|342184647|emb|CCC94129.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
          Length = 622

 Score =  152 bits (383), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 134/529 (25%), Positives = 231/529 (43%), Gaps = 57/529 (10%)

Query: 10  LKHFLLILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSS 69
           L  FL +    +   SP  +    I++ S +E CE+   ++   C  K+V+ ++V S  +
Sbjct: 6   LVPFLTVAALAVVYYSP--ITEGAIVASSSVEHCERDGRTETFPCERKLVVTLSVDSEQT 63

Query: 70  GGEASIVAEVVEVEENSTQKMRTVRIPPV-LTVNKTASYAVYELTYIRDVPYKPQEFYMK 128
            G   ++     +++   +K + V + P+ L   K+A +  Y + Y+++   KP E  + 
Sbjct: 64  AGAEEVIFLREALDKTGNRKEKRVFVEPIRLVTIKSAVHYRYPVYYVQNFNAKPYEQQLT 123

Query: 129 TRKCE----------PDAG----ADVVKICERQPICCPCG---------PQRRIPSSCGN 165
           T   E          P  G    +    I   Q  CC CG         P+ R  S C  
Sbjct: 124 TTAMEWCKDYNESASPTCGLARDSSGRVIPYSQGFCCSCGACELSGICRPKSRGASKCSI 183

Query: 166 VFDKLLKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVKTGSKVSE----------V 215
           +      G    A CLRF   W+  + IG+ ++ + +++ + T   VS           +
Sbjct: 184 I------GNTGKASCLRFGNMWYSGYNIGRGTVWYRLQVGLTTQGAVSGDGVVKPNQHML 237

Query: 216 TVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIPRQGGPGQP---QDLGGNFS 272
           ++GP+  TA+SA+  +   LIGDF      PS     L       P  P   + +    +
Sbjct: 238 SLGPDTITASSAEFGVSARLIGDFA-----PSEMPLDLTNKMLFAPAVPRTHERVRAGHN 292

Query: 273 MWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRN 332
            W+ L++   ++ G ECN++GVSYE F  Q   CS+   +CL NQL +YR  D    +  
Sbjct: 293 EWIFLDKHLVSVHGRECNRVGVSYEGFATQGGRCSALPGACLANQLDDYRGLDLKSESEG 352

Query: 333 QLPLYGVE--GRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKII 390
           +   Y     G F + + H N+ +   +      L + + I + AD +++V   SPG I+
Sbjct: 353 RKGHYMARFFGEF-KTDSHSNSSAPRITYQTRNSLATMVTITILADKLKFVLSVSPGTIV 411

Query: 391 SVIIP--TFEALTQFGVATITTQNTGEVEASYSLTF-DCSTGVTLMEEQYFIIKPKETSI 447
           +V +      + ++    TI   NTG+VEA Y++   +C+     M  Q   I P  +  
Sbjct: 412 NVTVSGTNVASYSRGNTVTINVLNTGDVEAQYTVGVGNCTIDAHPMVAQVAFIPPLHSVQ 471

Query: 448 RSFKIYPTTNQ-AAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQ 495
           R+F +   ++    K +C+A L+++    VD     F        NGSQ
Sbjct: 472 RNFSLVSQSDSLVEKASCTASLQNARGDVVDTYTFYFDVKPVGWTNGSQ 520


>gi|389594441|ref|XP_003722443.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|323363671|emb|CBZ12676.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 917

 Score =  149 bits (377), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 160/639 (25%), Positives = 262/639 (41%), Gaps = 71/639 (11%)

Query: 35  LSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIV----AEVVEVEENSTQKM 90
           +S S +  C    D +N++CT K+V+ + V      GE S++    A  + V   ++ + 
Sbjct: 131 VSSSLISYCSDSGD-ENISCTKKMVVTVTVEGEQLPGEESLLFLNSATDMTVNNGTSVQF 189

Query: 91  RTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMK---TRKCEPDAGADVV------ 141
             +RI    T +++A    Y L Y+++   KP E  +K     +C  D  AD        
Sbjct: 190 SPLRI----TTSRSAVRYRYPLFYVQNYNAKPYEATVKGSLLNQCNADFNADTATCGLAY 245

Query: 142 -----KICERQPICCPCG---------PQRRIPSSCGNVFDKLLKGKANTAHCLRFPGDW 187
                 I   Q  CC C          P  R  ++C NVF     GK  TA CLRF   W
Sbjct: 246 DAAGKAIPYSQGFCCDCSMCQTLGLCQPDARANAAC-NVF-----GKYTTASCLRFAQRW 299

Query: 188 FHVFGIGQRSIGFSVRIEVK------TGSKVSEVTV---GPENKTATSADNFLKVNLIGD 238
           +  + IG     ++V + +        G+  +E  V    P N    + + +   +++  
Sbjct: 300 YSGYTIGGYMTWYTVNLTLSRNVSDSGGAGAAEKVVMRLSPSNNGEVAGEGW---DVMAR 356

Query: 239 FVGYTNIPSFEEFYLVIPRQGGPGQPQD---LGGNFSMWMLLERTRFTLDGLECNKIGVS 295
            VG T  P  +   L       P  P +   +    + W+LL     TLDG EC+K+GVS
Sbjct: 357 IVG-TYAPVDQPLDLTSRMLFAPAIPPNDARVQAGAAEWLLLPTNLVTLDGRECDKVGVS 415

Query: 296 YEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSH 355
           YEAF  Q + C+    SCL +QL +YR AD  RI       Y +   F   N   +A + 
Sbjct: 416 YEAFASQGNKCNLRPGSCLSSQLEDYRTADLQRIAAGNKGQY-MATSFGDFNLENDAATS 474

Query: 356 SF-SIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVII--PTFEALTQFGVATITTQN 412
            + S        + + I + ADD+EY    + GKIIS  +  PT +A T  GV T+  +N
Sbjct: 475 PYISYLAASPAATMISITVSADDLEYTVGLASGKIISTDMNKPTLQAGTADGVMTVMVRN 534

Query: 413 TGEVEASYSL-TFDCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAA-KYTCSAILKD 470
           T  V     + T +CS GV  M  Q   +  ++ S  +FK+Y  ++ A+   +C+ ++++
Sbjct: 535 TAAVTGRLVVGTLNCSDGVFPMTAQKLSLAAQQQSAVTFKVYVQSSHASGNASCTVVVRN 594

Query: 471 SDFSEVDRAECQFSTMATVLDNGSQITPFQPPKSSINDFFESIESIGKKLWEGLRDFITG 530
           +     D     +   +T   NG+Q        ++      S E          R     
Sbjct: 595 AHEVITDLRVVSWKVSSTNFHNGTQGG-----SAADGSGGGSTEESSAASCLNCRTLDIA 649

Query: 531 KACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWWDD 590
            A RR+C         +  + ++ ++       +F   L LL   H++G         + 
Sbjct: 650 CAVRRRCWQLILLDLFVYLLIIAVILCVIFFWRVFCCCLYLLGRQHRRGSAG------EA 703

Query: 591 HFQSDNQRIRDFRSRRIDVDHPHVHVRKHHKQEGRHHKL 629
             +++  R   +  RR + D      +  HK  G    L
Sbjct: 704 EPKNEASRWGAYWKRRGESDATSSSRQTDHKNSGSSDVL 742


>gi|145490447|ref|XP_001431224.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124398327|emb|CAK63826.1| unnamed protein product [Paramecium tetraurelia]
          Length = 685

 Score =  147 bits (372), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 144/564 (25%), Positives = 242/564 (42%), Gaps = 59/564 (10%)

Query: 32  VQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIVAEVVEVEE----NST 87
            +I+S+S++ KC   + ++N  C+ K+++++ V +  +      V E +++ E    N T
Sbjct: 2   AEIISQSQINKCYSNS-TNNTECSEKMLISLTVENAQNT-----VTEYIKISETTIDNQT 55

Query: 88  QKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMK--TRKCE-------PDAGA 138
            +++T   P ++++ KT  YA Y L Y  D   +P E  +      C+       P  G 
Sbjct: 56  SQLKT---PIIISITKTPVYAFYPLKYTEDYNSQPYEVKIAGAILSCDDSWYSNSPTCGF 112

Query: 139 DVVK---ICERQPICCPCGPQRRIPSSC----GNVFDKLLKGKANTAHCLRFPGDWFHVF 191
              K   I + Q  CC CG    I  S     GN+  K     A  A CLR+   W+  +
Sbjct: 113 QYEKKEKIFDSQGFCCSCGILDLIGLSDEFARGNICHKAGLTTATMAFCLRYSTLWYSAY 172

Query: 192 GIGQRSIGFSVRIEVK-TGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEE 250
            I   SI +++ I +  +  +  E+ +G E K        L   +IGDF      PS E 
Sbjct: 173 EISTYSIYYNITISITYSNQEQEELQLGSEVKVVQGKT--LIGRIIGDFTPLNPPPSLES 230

Query: 251 FYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPF 310
           FY + P    P     +    + +M++ + +  +   ECNKIGVSY AF  +   C    
Sbjct: 231 FYFMRP--SSPNSHARVQAGSAAFMIVSKDQ--VGRGECNKIGVSYSAFRTEAERCKKQV 286

Query: 311 WSCLHNQLWNYREADQNRINRNQLPLYGVE--GRFERMNQHPNAGSHSFSIGVTEVLNSN 368
            SCL NQL ++   DQ  I  N  P Y +   G+F+ +  + N  ++     V   + + 
Sbjct: 287 KSCLKNQLEDFYIEDQALIANNSQPKYLLSRYGKFKSI--YLNNETY-LQYSVEGSMQTM 343

Query: 369 LLIELRADD-IEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTFDCS 427
           + +E+     I YV     GKI    I  FEA +  G+      N G++E+ ++   +CS
Sbjct: 344 ITLEITTTGLISYVVNLGKGKIDLAEIQDFEAKSGNGLLYAQITNVGDIESEFNTYLNCS 403

Query: 428 TGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMA 487
             V  +      +KP E+ I    +   ++      C+  L ++  + +D+ + +F+T  
Sbjct: 404 INVIPINSAALYLKPLESYIVKKDVNVLSDMNKSNICTFSLLNNKGTLLDQKQIEFNT-T 462

Query: 488 TVLDNGSQITPFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHI 547
            +     Q    Q  K   N+   S ES                 C   CS F D +C+I
Sbjct: 463 EIQHESEQNHEEQNIKD--NEVLASDES--------------QDNCYSDCSVFLDITCYI 506

Query: 548 QYICLSWLVLFGLVLAIFPTVLVL 571
              C S ++ F  VL I    L++
Sbjct: 507 FNDCNSQIITFFTVLGITFIFLII 530


>gi|302842682|ref|XP_002952884.1| hypothetical protein VOLCADRAFT_105708 [Volvox carteri f.
           nagariensis]
 gi|300261924|gb|EFJ46134.1| hypothetical protein VOLCADRAFT_105708 [Volvox carteri f.
           nagariensis]
          Length = 1181

 Score =  147 bits (370), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 160/648 (24%), Positives = 256/648 (39%), Gaps = 146/648 (22%)

Query: 17  LFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIV 76
           L  IL+LL    V G ++L+  KLEKC +   ++ + C+ K+V+ + V +G +     + 
Sbjct: 115 LCVILSLLWASKVYG-EVLAAGKLEKCVRDGVTEVVQCSDKLVITVTVANGQTLKTEELD 173

Query: 77  AEVVEVEENSTQ-------------KMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQ 123
             V+ V   + +               R +  P  +T+ K+  +A Y LT+++   +KP 
Sbjct: 174 LTVLCVNSPTGECPCPCNAAVDEDCSCRDLAAPMKVTITKSLLWASYPLTFVQQFNWKPV 233

Query: 124 EF--YMKTRKCEPDA------------GADVVKICERQPICCPCGP-------------Q 156
           E   Y  ++KC                G D  K+ + Q  CC C               +
Sbjct: 234 EIIQYTNSKKCRDGDYEQYPTCPYYYDGKD--KVPDSQGFCCQCSSGEVWDDTFGDLKYR 291

Query: 157 RRIPSSCGNVFDKLLKGKANTAHCLRFPGDWFHV-------FGIGQRSIGFS--VRIEVK 207
            R   +C      L+      AHC++     F V       + +G  S+ F   V IE+ 
Sbjct: 292 TRANLNCDFRLGMLIGIYPAAAHCVQLDRFNFAVSTRVGLGYNVGPPSLNFEIYVNIEIP 351

Query: 208 T-----------GSKVS-----------------------EVTVGPENKTATSADNFLKV 233
           T            S VS                        +T+ P    A S    + V
Sbjct: 352 TIPAGWSPRVNGTSSVSVNATTLSNGTLNTSQNTFVMRYETLTLSPSIPLAVSKTKMVSV 411

Query: 234 NLIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIG 293
            L+GD   YT +P+F    L++P         D+G N S WML++++  +LDG  C+KIG
Sbjct: 412 KLLGDLAMYTMLPTFGHQMLMLPLY-------DIG-NRSTWMLVDKSLISLDGRTCDKIG 463

Query: 294 VSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAG 353
            S+ AF  QPS C     +CL  QL +  + D +RI + + PLY V       +Q+P   
Sbjct: 464 TSFSAFRYQPSGCHRAVSTCLKGQLKDLYDEDMDRIKKGRAPLYMV-------SQYPGYE 516

Query: 354 SHSFSIG-----------VTEVLNSNLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQ 402
             SF+ G           VT    S L + + AD +  +  RSPGKI  V +  F   + 
Sbjct: 517 QASFTAGKFGNETVFLLPVTSQSQSVLTLTVSADKLRLITNRSPGKISDVQLCRFGNASH 576

Query: 403 FGV--------ATITTQNTGEVEASYSLTF-DCSTGVTLMEEQYFIIKPKETS--IRSFK 451
            G           +   NTG ++A Y++   +CS+ +  +E +   +    T+      +
Sbjct: 577 CGFFEAGNRGYIRLNVTNTGRLDADYTVAVTNCSSNIRPIEARMIAVSAGRTAPLWPPIE 636

Query: 452 IYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQITPFQP-PKSSINDFF 510
           +Y    +    +CS +L DS     D+ E  FST           T F P P    N   
Sbjct: 637 VYVEDTENKTRSCSVLLYDSTGGIADQTEMSFST---------NQTDFGPTPTGGFNGTG 687

Query: 511 ESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSWLVLF 558
           +S+  + K L            C   C++  +  C +   C  W  LF
Sbjct: 688 DSLARLEKDL-----------TCDEACTNPINVWCIVVKRC--WSKLF 722


>gi|125505600|gb|ABN45755.1| gamete fusion-like protein [Hydra magnipapillata]
          Length = 673

 Score =  143 bits (360), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 142/566 (25%), Positives = 241/566 (42%), Gaps = 92/566 (16%)

Query: 8   LKLKHFLLILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSDNLN----------CTTK 57
            K+K  LL  F   N+      VG  ILSKS +E CE    S++L           C  K
Sbjct: 5   FKMKKQLLSSF--FNITVNIIFVGGLILSKSSIEFCENTGSSNDLKDPTNVVTQSACEKK 62

Query: 58  IVLNMAVPSGSSGGEASIVAEVVEVEENS-TQKMRTVRIPPVLTVNKTASYAVYELTY-- 114
           +V+ ++V  G+  GE   +  VV V +NS T +   +  P ++TV+K+  Y  +   +  
Sbjct: 63  MVVLLSV--GNKQGETEKLQAVVSVVQNSATNEFARLYNPFMITVSKSPVYLNFPFFFNG 120

Query: 115 --IRDVPYK-----PQEFYMK--TRKC--------------------------EPDAGAD 139
             + + PY+        +Y+   +R+C                          + D    
Sbjct: 121 ITVNNQPYEEIILSKNRWYVSDSSRQCLDQWQVEEEDDEHPTCGYQYTNSTQKQTDGTWK 180

Query: 140 VVK--ICERQPICCPCGPQRR---------IPSSCGNVFDKLLKGKANTAHCLRFPGDWF 188
            VK  I + Q  CC C    +           +  G +   L      +AHC+R    W+
Sbjct: 181 TVKTRIWDSQGFCCYCTQDLKNYYIKKDIQDANRAGIICKPLTNSPQASAHCMRMSNLWY 240

Query: 189 HVFGIGQRSIGFSVRIE--------VKTGSKV-----SEVTVGPENKTATSADNFLKVNL 235
            +    +    FS+ ++        V+  S +      E+ + P  K+AT + N +  N 
Sbjct: 241 TLNEFTESYRDFSIYVKAFDQITKVVQNKSYIDYVNGGEILLSPSQKSATGSYNRITGNY 300

Query: 236 IGDFVGYTNIPSFEEFYLVIPRQG---GPGQPQDLGGNFSMWMLLERTRFTLDGLECNKI 292
           +GD     + P     Y +IP       P +   L    S WM++ R   + D  +C+ I
Sbjct: 301 VGDLQPIKSYPVLTNNYFLIPFSSTNVDPKKEPQLKSGISKWMIIPRDLVSTDAKQCDMI 360

Query: 293 GVSYEAFNGQPSF-----CSSPFWSCLHNQLWNYREADQNRINRNQLPLY--GVEGRFER 345
           GV Y AF  Q ++     C +   SCL NQ +N    D++R+ + ++P Y     G+   
Sbjct: 361 GVGYSAFRNQAAYGTGYGCRAKKGSCLANQPYNKFMDDEDRLEKGKMPWYFPARYGKLAG 420

Query: 346 MNQHPNAGSHSFSIGVTEVLN---SNLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQ 402
           + Q  N G +   +   E+ +   S + +++ ADD+  VY R+ G I    I  FEAL+ 
Sbjct: 421 VKQ--NIGDNDKYLLTYELDDEQISLVTLQISADDVVLVYNRATGIITRTAIQDFEALSL 478

Query: 403 FGVATITTQNTGEVEASYSLTF-DCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAK 461
            G  ++   NTG V + + ++   C++GV  +EE+   I P+ T   +FK+  +T++ + 
Sbjct: 479 EGQLSVDVLNTGYVSSDFRISIPSCTSGVQPIEEKRITIDPQMTETITFKMMTSTDKKSA 538

Query: 462 YTCSAILKDSDFSEVDRAECQFSTMA 487
           + C+  L DS    +      FST A
Sbjct: 539 HDCTINLYDSKNILLQSRNFTFSTKA 564


>gi|290983267|ref|XP_002674350.1| predicted protein [Naegleria gruberi]
 gi|284087940|gb|EFC41606.1| predicted protein [Naegleria gruberi]
          Length = 615

 Score =  141 bits (356), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 132/537 (24%), Positives = 226/537 (42%), Gaps = 58/537 (10%)

Query: 79  VVEVEENSTQKMRTVRIPPV-LTVNKTASYAVYELTYIRDVPYKPQEFYM---------- 127
           +VE+ +   +K   V + P+ + + K+A  AVY L Y++    KP E  +          
Sbjct: 2   LVELSDTVNEKGEKVNLRPIKIVIQKSAPKAVYPLLYVKTFNGKPTESIIYKDDILVPTC 61

Query: 128 --KTRKCEPDAG----ADVVKICERQPICCPCGPQRRIPSSCGNVFDKLLKG---KANTA 178
              ++   P  G    +   KI + Q  CC C   +    S  +    L  G     ++A
Sbjct: 62  DDSSKSAAPTCGWVKDSQGNKIPDSQGFCCSCSVGQMFGDSSASNRGALNCGFMQMKSSA 121

Query: 179 HCLRFPGDWFHVFGIGQRSIGFSVRIEV-KTG-SKVSEVTVGPENKTATSADNFLKVNLI 236
           HCLR    ++  + I    + F + + + +TG   + +VTV P +K A       +V L 
Sbjct: 122 HCLRLGEVYWDAYEIEGYVMSFEISVFIGETGFDDIGKVTVSPSSKLAQLPKGG-RVELE 180

Query: 237 GDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSY 296
           GDF  Y ++P +E  YL IP    P     +    + WM ++++  TL G EC+KIGVSY
Sbjct: 181 GDFSAYKSVPLYESKYLFIPSS--PKTSPIVVNGQANWMFIDKSMVTLSGSECDKIGVSY 238

Query: 297 EAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHS 356
             F  QP+ CS P  +CL NQ+ + R AD   +         +   F     +     + 
Sbjct: 239 AQFRNQPNACSRPALTCLANQIEDLRLADVELMKSGLKSGKYIVSNFGSFAVNKTNTGNV 298

Query: 357 FSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEV 416
               + E  NS + + +  ++++++  +S  +I    + TF +L++ G   ++ +N G  
Sbjct: 299 LEKYLDEDTNSQINLYINGENVKFLITKSAAEISEAYVKTFTSLSKEGEMLVSVKNKGAN 358

Query: 417 EASYSLTF-DCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSE 475
             SY +T  +CS  +  + +Q   +        +F++      A    C   L  SD  +
Sbjct: 359 GCSYVVTVTECSDNILTIVQQTVFVDASNKKELTFQVRSEQKLATTNQCKVTLLFSDGEK 418

Query: 476 VDRAECQFSTMATVLDNGSQITPFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACRR 535
           +      F +     +N  +                S E  GK   EG  D   G+    
Sbjct: 419 IQDITVTFDSKDYAYENAME---------------SSGEQTGKVETEG--DHSLGQC--- 458

Query: 536 KCSSFFDFSCHI--QYICLSWLVLFGLVLAIF-----PTVLVLLWLLHQKGLFDPLY 585
           KC+S FD  C +     C S+++  G V +I      P + V LW   + GLF  ++
Sbjct: 459 KCNSPFDVVCIVLNSSSCTSYII--GWVASIVGIIATPVIFVFLW---RCGLFGLMF 510


>gi|66823829|ref|XP_645269.1| hypothetical protein DDB_G0272452 [Dictyostelium discoideum AX4]
 gi|60473432|gb|EAL71378.1| hypothetical protein DDB_G0272452 [Dictyostelium discoideum AX4]
          Length = 327

 Score =  139 bits (351), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 92/300 (30%), Positives = 148/300 (49%), Gaps = 15/300 (5%)

Query: 170 LLKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVKTGSKVSEVT--VGPENKTATSA 227
           LL  ++++AHCL F    + V+ I +  + + +   +      + +T  +   N      
Sbjct: 19  LLGSQSSSAHCLSFSPMKYDVYNIAKTQVEYKITATLTYSYNQNPITQDIILSNSNPMGM 78

Query: 228 DNFLK--VNLIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLD 285
           D+F +  + ++GDF   T I  F +  +V P      QP  +    +  MLL++  F L 
Sbjct: 79  DSFSQAMIRIVGDFQSSTQINQFTDKKVVFPY----NQPNSI----NTAMLLDQNFFDLS 130

Query: 286 GLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFER 345
           GL CNKIGVSY AF  QP+ C++ F SCL NQ+ +Y  AD   I+  +   Y       +
Sbjct: 131 GLTCNKIGVSYSAFQNQPNKCAALFGSCLQNQIADYYNADVTLISNGKKGNYIASQFGTK 190

Query: 346 MNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFGV 405
           +    N  S S  I   E   + L I L+AD ++Y    SPGKIIS  I  FE++++ G+
Sbjct: 191 VAGDQN--SRSLKIRFDESHRTMLTITLKADSLQYRVDISPGKIISYQIDRFESMSKNGI 248

Query: 406 ATITTQNTGEVEASYSLTF-DCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTC 464
             +  QN G + + Y+L   +CS  +  ++ +   IK KE     F+I+ T+   + Y C
Sbjct: 249 LRVKVQNIGTINSDYTLAIVNCSGDINPIDSKDVTIKSKEIYSFEFQIFTTSKLDSSYQC 308


>gi|226230652|gb|ACO39319.1| hypothetical protein [Populus balsamifera]
 gi|226230656|gb|ACO39321.1| hypothetical protein [Populus balsamifera]
 gi|226230698|gb|ACO39342.1| hypothetical protein [Populus balsamifera]
 gi|226230712|gb|ACO39349.1| hypothetical protein [Populus balsamifera]
 gi|226230720|gb|ACO39353.1| hypothetical protein [Populus balsamifera]
 gi|226230724|gb|ACO39355.1| hypothetical protein [Populus balsamifera]
 gi|226230756|gb|ACO39371.1| hypothetical protein [Populus balsamifera]
 gi|226230764|gb|ACO39375.1| hypothetical protein [Populus balsamifera]
 gi|226230772|gb|ACO39379.1| hypothetical protein [Populus balsamifera]
 gi|226230774|gb|ACO39380.1| hypothetical protein [Populus balsamifera]
 gi|226230780|gb|ACO39383.1| hypothetical protein [Populus balsamifera]
          Length = 64

 Score =  134 bits (336), Expect = 2e-28,   Method: Composition-based stats.
 Identities = 58/64 (90%), Positives = 61/64 (95%)

Query: 235 LIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGV 294
           LIGDFVGY+NIPSFE+FYLVIPRQG PGQPQDLG NFSMWMLLER RFTLDG+ECNKIGV
Sbjct: 1   LIGDFVGYSNIPSFEDFYLVIPRQGEPGQPQDLGRNFSMWMLLERVRFTLDGVECNKIGV 60

Query: 295 SYEA 298
           SYEA
Sbjct: 61  SYEA 64


>gi|154344439|ref|XP_001568161.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134065498|emb|CAM43265.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 905

 Score =  133 bits (334), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 129/506 (25%), Positives = 210/506 (41%), Gaps = 59/506 (11%)

Query: 36  SKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIV----AEVVEVEENSTQKMR 91
           S S +  C    D + +NC  K+V+ + V  G    E S++    A  + ++  +  +  
Sbjct: 129 SSSLISYCSDSGD-EKINCKKKMVVTVTVEGGQLPDEESLLFLNSATDMTIKNGTAVQFS 187

Query: 92  TVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMK---TRKCEPD-----AGADVVKI 143
            +RI    T +++A    Y L Y+++   KP E  +K     +C  D     A   +   
Sbjct: 188 PIRI----TTSRSAVRYRYPLFYVQNYNAKPYEATVKGSLLNQCNADFDTNTATCGIAHD 243

Query: 144 CERQPI------CCPCG---------PQRRIPSSCGNVFDKLLKGKANTAHCLRFPGDWF 188
              +PI      CC C          P  R  + C N+FD+       TA CLRF   W+
Sbjct: 244 AVGKPIPYSQGFCCDCSMCQTLGLCLPDARANAGC-NIFDRY-----TTASCLRFTKRWY 297

Query: 189 HVFGIGQRSIGFSVR------IEVKTGSKVSEVTV--------GPENKTATSADNF-LKV 233
             + IG     ++V       + V  G+  +E  V         P +   T+ + + +  
Sbjct: 298 SGYTIGGYVTWYTVNLTLSRNVSVSGGAGSAEKVVTQKVVMHLSPSSNGETAGEEWDVMA 357

Query: 234 NLIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIG 293
            ++G++             L  P    P   + +    + WMLL     TLDG EC+K+G
Sbjct: 358 RVLGNYAPIVQPLDLTSRMLFAP--AIPPNDERVQAGAAEWMLLPTNLVTLDGRECDKVG 415

Query: 294 VSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAG 353
           VSYEAF  Q + C+    SCL +QL +YR  D  RI       Y      +   +     
Sbjct: 416 VSYEAFASQGNKCNLRPGSCLSSQLEDYRTTDLERIASGNKGQYMATSFGDFHLERDAVA 475

Query: 354 SHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVII--PTFEALTQFGVATITTQ 411
           S   S   T    + L I + ADD+EY    + GKI+S  +  P  EA T+ GV T+  +
Sbjct: 476 SPYISYRATSPAATMLSITISADDLEYTVGLASGKIVSAELNKPVLEASTKDGVMTVVVR 535

Query: 412 NTGEVEASYSL-TFDCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAK-YTCSAILK 469
           N   V     + T  CS GV  +  Q   +  ++ S  +F +Y   + A++  +C  +L+
Sbjct: 536 NAASVTGRVVVGTSSCSDGVFPITAQTLSLAAQQQSTVAFNVYMQDSYASENASCMVVLR 595

Query: 470 DSDFSEVDRAECQFSTMATVLDNGSQ 495
           ++     D     +   +T   NG+Q
Sbjct: 596 NAQEVITDLRTVSWKVSSTSFHNGTQ 621


>gi|125551606|gb|EAY97315.1| hypothetical protein OsI_19236 [Oryza sativa Indica Group]
          Length = 143

 Score =  132 bits (331), Expect = 9e-28,   Method: Composition-based stats.
 Identities = 66/120 (55%), Positives = 89/120 (74%), Gaps = 4/120 (3%)

Query: 31  GVQILSKSKLEKCEKRTDSDN-LNCTTKIVLNMAVPSGSSGGEASIVAEVVEVEE--NST 87
           G +ILSKS+LE C   +D+   L C  K+V+++AVPSG+SGGEAS+VA V  VEE  ++ 
Sbjct: 24  GTEILSKSRLESCSHDSDAGGRLKCDRKLVVDLAVPSGASGGEASLVARVAGVEEENDTP 83

Query: 88  QKMRTVRIPPVLTVNKTASYAVYELTYI-RDVPYKPQEFYMKTRKCEPDAGADVVKICER 146
              +++R PPV+TV+K+A+YA+Y LTY+ RDV Y+P E Y+KT KCEP AGA VV  CER
Sbjct: 84  SATKSIRDPPVITVSKSATYALYALTYLDRDVAYRPDEKYVKTHKCEPYAGAKVVGECER 143


>gi|401429134|ref|XP_003879049.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322495299|emb|CBZ30602.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 917

 Score =  131 bits (330), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 126/479 (26%), Positives = 209/479 (43%), Gaps = 56/479 (11%)

Query: 35  LSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIV----AEVVEVEENSTQKM 90
           +S S +  C    D +++ C  K+V+ + V      GE S++    A  + +++ +  + 
Sbjct: 131 VSSSLISYCSDSGD-ESIRCEKKMVVTVTVEGEQLPGEESLLFLNSATDMTIDDGTVVQF 189

Query: 91  RTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMK---TRKCEPDAGADVVK----- 142
             +RI    T +++A    Y L Y+++   KP E  ++     +C  D  AD        
Sbjct: 190 SPLRI----TTSRSAVRYRYPLFYVQNYNAKPYEATVRGNLLNQCNADFNADKATCGLAY 245

Query: 143 ------ICERQPICCPCG---------PQRRIPSSCGNVFDKLLKGKANTAHCLRFPGDW 187
                 I   Q  CC C          P  R  ++C N+FDK        A CLRF   W
Sbjct: 246 DAAGKPIPYSQGFCCDCSMCQTLGLCKPDARANAAC-NIFDKY-----TAASCLRFGQRW 299

Query: 188 FHVFGIGQRSIGFSVRIEVKTGSKVS---------EVTVGPENKTATSADNF-LKVNLIG 237
           +  + IG     ++V + +     VS         E+ + P N   T+ + + +   ++G
Sbjct: 300 YSGYTIGGYMTWYTVNLTLSRSVSVSGGADAVEKVEMHLSPSNNGETAGEGWDVMARIVG 359

Query: 238 DFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYE 297
           ++             L  P    P     +    + W+LL     TLDG EC+K+GVSYE
Sbjct: 360 NYAPVDQPLDLTSRMLFAPAI--PPNDVRVQAGAAEWLLLPTNLVTLDGRECDKVGVSYE 417

Query: 298 AFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSF 357
           AF  Q + C+    SCL +QL +YR AD  RI       Y +   F   N   +A +  +
Sbjct: 418 AFASQGNKCNLRPGSCLSSQLEDYRTADLERIAAGNKGQY-MATSFGDFNLENDAATSPY 476

Query: 358 -SIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVII--PTFEALTQFGVATITTQNTG 414
            S        + + I + ADD+EY    + GKI+S  +  PT EA T  GV T+  +NT 
Sbjct: 477 ISYLAASPAATMISITVSADDLEYTVGVASGKIVSADLNKPTLEAGTTDGVMTVMVRNTA 536

Query: 415 EVEASYSL-TFDCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAA-KYTCSAILKDS 471
            V     + T +CS GV  M  Q   +  ++ S  +FK+Y   + A+   +C+ +++++
Sbjct: 537 AVTGRLVVGTLNCSDGVFPMTAQQLSLAAQQQSAVTFKVYMQNSYASGDASCTVVVRNA 595


>gi|340057663|emb|CCC52009.1| conserved hypothetical protein [Trypanosoma vivax Y486]
          Length = 605

 Score =  130 bits (328), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 141/528 (26%), Positives = 214/528 (40%), Gaps = 62/528 (11%)

Query: 14  LLILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEA 73
           ++  F ++    P  +     ++ S ++ CE+    D + C  K+V+ ++V +G   G  
Sbjct: 27  MVTAFVLIGTHLPHHMAEGVFIASSSIDYCERNNKVDPVPCEKKMVVTLSVDAGQDAG-- 84

Query: 74  SIVAEVVEVEENSTQK----MRTVRIPPV-LTVNKTASYAVYELTYIRDVPYKPQEFYMK 128
             V EVV V E S +      R V   P+ LT  KT     Y L Y R+   KP E  + 
Sbjct: 85  --VEEVVLVREASDKTRDDDKRVVEFEPIYLTTKKTRVRYHYPLFYERNFNAKPYEEQIP 142

Query: 129 TRKCEP--------DAGADVVKICERQPI------CCPCGP---------QRRIPSSCGN 165
           T   +P         A   +     ++PI      CC CG            R   SC N
Sbjct: 143 TSLFDPCVDKPGSSKATCGIAHDNYQKPIPFSEGFCCNCGACQLAGICPSDSRGLGSC-N 201

Query: 166 VFDKLLKGKANTAHCLRFPGDWFHVFGIGQRSIGF----SVRIEVKTGSKVSE-----VT 216
           +F         +A CLR    W+  + IGQ +  +    ++R EV   S  S      ++
Sbjct: 202 IFQT-----TGSASCLRLGELWYSGYNIGQGTAWYRLHVTLRDEVDNNSAASTRGSATMS 256

Query: 217 VGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWML 276
           +GP+     S        L+GDFV            L  P    P + + +      WM 
Sbjct: 257 LGPDQPADFSEKFGAWARLVGDFVPPEMPLDLTGKMLFTP--ATPRRHERVIAGSREWMF 314

Query: 277 LERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREAD-----QNRINR 331
           L++   +L G ECNKIGVSYE F  Q S C S   +CL +QL +YR+ D       R  +
Sbjct: 315 LDKHLVSLQGRECNKIGVSYEGFVTQGSRCVSRPGTCLADQLEDYRQRDVVAEAHGRRGK 374

Query: 332 NQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKI-- 389
               L+G            N  S   +  +   L++ + I + AD + YV   SPG I  
Sbjct: 375 YMARLFG----DMYTGGTRNTSSPYIAFWLRGSLSTMVTITINADSLRYVQSVSPGTILR 430

Query: 390 ISVIIPTFEALTQFGVATITTQNTGEVEASYSLTF-DCSTGVTLMEEQYFIIKPKETSIR 448
           I ++  T  + T+ GV ++T  NTG  E+ Y L   +CS GV  +  Q   I     +  
Sbjct: 431 IKLMNKTVFSYTRSGVVSVTVLNTGRAESQYFLAVRNCSVGVHPIAAQTINIPSGHNATC 490

Query: 449 SFKIYPTTN-QAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQ 495
            F +Y   +       C   L+D+  +  D +      M     +GSQ
Sbjct: 491 LFDLYVQEDVMTPNVKCHVELRDARGNVTDTSLFYLRLMPVNRTSGSQ 538


>gi|226230642|gb|ACO39314.1| hypothetical protein [Populus balsamifera]
 gi|226230644|gb|ACO39315.1| hypothetical protein [Populus balsamifera]
 gi|226230646|gb|ACO39316.1| hypothetical protein [Populus balsamifera]
 gi|226230648|gb|ACO39317.1| hypothetical protein [Populus balsamifera]
 gi|226230650|gb|ACO39318.1| hypothetical protein [Populus balsamifera]
 gi|226230654|gb|ACO39320.1| hypothetical protein [Populus balsamifera]
 gi|226230658|gb|ACO39322.1| hypothetical protein [Populus balsamifera]
 gi|226230660|gb|ACO39323.1| hypothetical protein [Populus balsamifera]
 gi|226230662|gb|ACO39324.1| hypothetical protein [Populus balsamifera]
 gi|226230664|gb|ACO39325.1| hypothetical protein [Populus balsamifera]
 gi|226230666|gb|ACO39326.1| hypothetical protein [Populus balsamifera]
 gi|226230668|gb|ACO39327.1| hypothetical protein [Populus balsamifera]
 gi|226230670|gb|ACO39328.1| hypothetical protein [Populus balsamifera]
 gi|226230672|gb|ACO39329.1| hypothetical protein [Populus balsamifera]
 gi|226230674|gb|ACO39330.1| hypothetical protein [Populus balsamifera]
 gi|226230676|gb|ACO39331.1| hypothetical protein [Populus balsamifera]
 gi|226230678|gb|ACO39332.1| hypothetical protein [Populus balsamifera]
 gi|226230680|gb|ACO39333.1| hypothetical protein [Populus balsamifera]
 gi|226230682|gb|ACO39334.1| hypothetical protein [Populus balsamifera]
 gi|226230684|gb|ACO39335.1| hypothetical protein [Populus balsamifera]
 gi|226230686|gb|ACO39336.1| hypothetical protein [Populus balsamifera]
 gi|226230688|gb|ACO39337.1| hypothetical protein [Populus balsamifera]
 gi|226230690|gb|ACO39338.1| hypothetical protein [Populus balsamifera]
 gi|226230694|gb|ACO39340.1| hypothetical protein [Populus balsamifera]
 gi|226230696|gb|ACO39341.1| hypothetical protein [Populus balsamifera]
 gi|226230700|gb|ACO39343.1| hypothetical protein [Populus balsamifera]
 gi|226230702|gb|ACO39344.1| hypothetical protein [Populus balsamifera]
 gi|226230704|gb|ACO39345.1| hypothetical protein [Populus balsamifera]
 gi|226230706|gb|ACO39346.1| hypothetical protein [Populus balsamifera]
 gi|226230708|gb|ACO39347.1| hypothetical protein [Populus balsamifera]
 gi|226230710|gb|ACO39348.1| hypothetical protein [Populus balsamifera]
 gi|226230714|gb|ACO39350.1| hypothetical protein [Populus balsamifera]
 gi|226230716|gb|ACO39351.1| hypothetical protein [Populus balsamifera]
 gi|226230718|gb|ACO39352.1| hypothetical protein [Populus balsamifera]
 gi|226230722|gb|ACO39354.1| hypothetical protein [Populus balsamifera]
 gi|226230726|gb|ACO39356.1| hypothetical protein [Populus balsamifera]
 gi|226230728|gb|ACO39357.1| hypothetical protein [Populus balsamifera]
 gi|226230730|gb|ACO39358.1| hypothetical protein [Populus balsamifera]
 gi|226230732|gb|ACO39359.1| hypothetical protein [Populus balsamifera]
 gi|226230734|gb|ACO39360.1| hypothetical protein [Populus balsamifera]
 gi|226230736|gb|ACO39361.1| hypothetical protein [Populus balsamifera]
 gi|226230738|gb|ACO39362.1| hypothetical protein [Populus balsamifera]
 gi|226230740|gb|ACO39363.1| hypothetical protein [Populus balsamifera]
 gi|226230742|gb|ACO39364.1| hypothetical protein [Populus balsamifera]
 gi|226230744|gb|ACO39365.1| hypothetical protein [Populus balsamifera]
 gi|226230746|gb|ACO39366.1| hypothetical protein [Populus balsamifera]
 gi|226230748|gb|ACO39367.1| hypothetical protein [Populus balsamifera]
 gi|226230750|gb|ACO39368.1| hypothetical protein [Populus balsamifera]
 gi|226230752|gb|ACO39369.1| hypothetical protein [Populus balsamifera]
 gi|226230754|gb|ACO39370.1| hypothetical protein [Populus balsamifera]
 gi|226230758|gb|ACO39372.1| hypothetical protein [Populus balsamifera]
 gi|226230762|gb|ACO39374.1| hypothetical protein [Populus balsamifera]
 gi|226230766|gb|ACO39376.1| hypothetical protein [Populus balsamifera]
 gi|226230768|gb|ACO39377.1| hypothetical protein [Populus balsamifera]
 gi|226230770|gb|ACO39378.1| hypothetical protein [Populus balsamifera]
 gi|226230778|gb|ACO39382.1| hypothetical protein [Populus balsamifera]
 gi|226230782|gb|ACO39384.1| hypothetical protein [Populus balsamifera]
 gi|226230784|gb|ACO39385.1| hypothetical protein [Populus balsamifera]
 gi|226230786|gb|ACO39386.1| hypothetical protein [Populus balsamifera]
 gi|226230788|gb|ACO39387.1| hypothetical protein [Populus balsamifera]
 gi|226230790|gb|ACO39388.1| hypothetical protein [Populus balsamifera]
 gi|226230792|gb|ACO39389.1| hypothetical protein [Populus balsamifera]
 gi|226230794|gb|ACO39390.1| hypothetical protein [Populus balsamifera]
 gi|226230796|gb|ACO39391.1| hypothetical protein [Populus balsamifera]
 gi|226230798|gb|ACO39392.1| hypothetical protein [Populus balsamifera]
 gi|226230800|gb|ACO39393.1| hypothetical protein [Populus balsamifera]
 gi|226230802|gb|ACO39394.1| hypothetical protein [Populus balsamifera]
 gi|226230804|gb|ACO39395.1| hypothetical protein [Populus balsamifera]
 gi|226230806|gb|ACO39396.1| hypothetical protein [Populus balsamifera]
 gi|226230808|gb|ACO39397.1| hypothetical protein [Populus balsamifera]
 gi|226230810|gb|ACO39398.1| hypothetical protein [Populus balsamifera]
 gi|226230812|gb|ACO39399.1| hypothetical protein [Populus balsamifera]
 gi|226230814|gb|ACO39400.1| hypothetical protein [Populus balsamifera]
 gi|226230816|gb|ACO39401.1| hypothetical protein [Populus balsamifera]
 gi|226230818|gb|ACO39402.1| hypothetical protein [Populus balsamifera]
 gi|226230820|gb|ACO39403.1| hypothetical protein [Populus balsamifera]
          Length = 64

 Score =  130 bits (327), Expect = 2e-27,   Method: Composition-based stats.
 Identities = 57/64 (89%), Positives = 60/64 (93%)

Query: 235 LIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGV 294
           LIGDFVGY+NIPSFE+FYLVIPRQG  GQPQDLG NFSMWMLLER RFTLDG+ECNKIGV
Sbjct: 1   LIGDFVGYSNIPSFEDFYLVIPRQGESGQPQDLGRNFSMWMLLERVRFTLDGVECNKIGV 60

Query: 295 SYEA 298
           SYEA
Sbjct: 61  SYEA 64


>gi|226230692|gb|ACO39339.1| hypothetical protein [Populus balsamifera]
 gi|226230760|gb|ACO39373.1| hypothetical protein [Populus balsamifera]
 gi|226230776|gb|ACO39381.1| hypothetical protein [Populus balsamifera]
          Length = 64

 Score =  130 bits (326), Expect = 3e-27,   Method: Composition-based stats.
 Identities = 57/64 (89%), Positives = 60/64 (93%)

Query: 235 LIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGV 294
           LIGDFVGY+NIPSFE+FYLVIPRQG PGQ QDLG NFSMWMLLER RFTLDG+ECNKIGV
Sbjct: 1   LIGDFVGYSNIPSFEDFYLVIPRQGEPGQLQDLGRNFSMWMLLERVRFTLDGVECNKIGV 60

Query: 295 SYEA 298
           SYEA
Sbjct: 61  SYEA 64


>gi|168036567|ref|XP_001770778.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162677996|gb|EDQ64460.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 346

 Score =  121 bits (304), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 92/290 (31%), Positives = 133/290 (45%), Gaps = 23/290 (7%)

Query: 215 VTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMW 274
           +TV P    AT+ D    V+L GDF+ Y + P   + YL  P    P    +       W
Sbjct: 15  LTVSPTQMEATNKDRNCIVHLAGDFLNYRSFPQLNDVYLFTPNADDPHD--NPFQKRVKW 72

Query: 275 MLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINR-NQ 333
           +L+ +   T DGLECNKIGV +  F  Q   C  P  +CL +QLW + +A+         
Sbjct: 73  LLIPKGHVTDDGLECNKIGVGFTPFRIQERGCYEPVGTCLASQLWTFAQAEAAACALVPP 132

Query: 334 LPLYGV--EG----RFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPG 387
            P++ V  EG    R       P+  S   +I + +++ S + +E+ A  +E+   RSPG
Sbjct: 133 RPIFSVLKEGVVDLRIHNFEGDPD--SRVLTITLDQIMTSVVTLEVEASGMEFFVNRSPG 190

Query: 388 KIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTF-DCSTGVTLMEEQYFIIKPKETS 446
           KIIS  +PTFEA T++G   +  QNTG + + Y +    CS+G         II P+   
Sbjct: 191 KIISASVPTFEAYTRYGQMEVVVQNTGTIVSLYFIQVHACSSG---------IIDPEGPL 241

Query: 447 IRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQI 496
              + +Y     AA       L DS    V    CQF T  T    G Q+
Sbjct: 242 SNLYVLY--HKNAAVCVIRVDLLDSFAVNVFSQICQFQTTQTENSKGDQV 289


>gi|145492867|ref|XP_001432430.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124399542|emb|CAK65033.1| unnamed protein product [Paramecium tetraurelia]
          Length = 685

 Score =  121 bits (304), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 138/566 (24%), Positives = 242/566 (42%), Gaps = 89/566 (15%)

Query: 35  LSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIVAEVVEVEENST-QKMRTV 93
           L+ S+++ C+   ++D   C+  +++++ +       E S      +++ NST    +TV
Sbjct: 17  LTTSQIKVCDSNKNAD---CSENMLISLTI-------ENSFSTSTEQIQINSTILNNQTV 66

Query: 94  RI--PPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKT--RKCE-------PDAG---AD 139
           ++  P  LT+ KT  YA Y L Y ++   +P E  + +    C+       P  G   + 
Sbjct: 67  QLSTPFTLTITKTPVYAYYPLKYFQNYNSQPYELQIPSAVNPCDDNWTSNSPTCGFQYSS 126

Query: 140 VVKICERQPICCPCGPQRRIPSSCGNVFDKLLKGK--ANTAHCLRFPGDWFHVFGIGQRS 197
             K+ + Q  CC CG       +  +V   + K    A  A CLR+   W+  + I +  
Sbjct: 127 TNKVQDSQGFCCSCGSSEYSGQNDQSVRINICKNASVATMAFCLRYSPLWYSSYNISKFV 186

Query: 198 IGFSVRIEVK-TGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIP 256
           I +++ I +K +  +V + T+G E K      +  K+  I D++     PS E F L+ P
Sbjct: 187 IHYNITISIKYSNDEVEQYTLGSEVKEVKGESSIAKI--ISDYIPSNQPPSLESFMLMKP 244

Query: 257 RQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHN 316
               P     +    + +M + +  F   G ECNKIGVSY +F  + + C     SCL N
Sbjct: 245 --SSPTSHNRVQAGSAAYMFVPK-EFLGQG-ECNKIGVSYTSFKNERNSCKKLIRSCLQN 300

Query: 317 QLWNYREADQNRINRNQLPLYGVE--GRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELR 374
           QL +  + D  ++N N  P Y ++  G F+++N + N     FSI   + + + + +E+ 
Sbjct: 301 QLEDLYQNDIAQLNNNSQPTYLIQKYGEFKQININ-NDQYLQFSID--QQMFTTITLEIN 357

Query: 375 ADD-IEYVYQRS---PGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGV 430
               I Y+  +     G+I  V I  F   +  G+      NTG   + +   F+CST  
Sbjct: 358 TTGRISYIGNKQESVKGQIDLVEIHNFSIASGSGLLYAQITNTGGSLSEFKSFFNCSTNT 417

Query: 431 TLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVD---RAECQFSTMA 487
                          +I S ++ P          S I++      +D      C FS ++
Sbjct: 418 --------------ITINSTELEPLQ--------SIIIQQDINVSIDIKKSTSCNFSLLS 455

Query: 488 ---TVLD-NGSQITPFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDF 543
               +LD     +  F    +  N++ ++I S              GK C  KCS F D 
Sbjct: 456 NEGALLDWKIVYLNQFDNNTNQSNNYNQTITS-------------EGKVCEIKCSQFIDI 502

Query: 544 SCHIQYIC----LSWLVLFGLVLAIF 565
           SC++Q  C    +++  + G +L  F
Sbjct: 503 SCYLQNNCEKDAITFFTVLGGILLTF 528


>gi|320165667|gb|EFW42566.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 696

 Score =  119 bits (299), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 92/348 (26%), Positives = 154/348 (44%), Gaps = 14/348 (4%)

Query: 161 SSCGNVFDKLLKG--KANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVKTGSKVSEVTVG 218
           S+CG   D   K    + +AHC+RF   W++V  +    +       + TGS+   +TV 
Sbjct: 40  STCGVYMDANSKPIRDSQSAHCMRFDQLWYNVVALDPPQMAVKFTFTIFTGSENKTITVS 99

Query: 219 PENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIPRQG-GPGQPQDLGGNFSMWMLL 277
           P  +TA ++D  + V LIG F  +         YL+IP+    P  PQ   G  S WML+
Sbjct: 100 PSQRTAKNSDGSVIVRLIGSFQSFVADYDLTTNYLLIPQPATSPTSPQVALGR-SDWMLV 158

Query: 278 ERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLY 337
            ++   L G  C+KIGV +  F  Q   C+ P  SCL+NQ  ++  AD     + ++  Y
Sbjct: 159 PKSTVDLTGATCDKIGVGFTPFRYQEGQCTRPSGSCLNNQPKDFWTADTTLRRQGKMVNY 218

Query: 338 GVEGRFERMNQHPNAGS-HSFSIGVTEVLNSN--------LLIELRADDIEYVYQRSPGK 388
            +E   + +  +  A    S S G   VL +           +E+ AD + +        
Sbjct: 219 FLERYGDILGLYAGASDIVSMSPGQQYVLATRPRDPASILFTMEIAADSLTFYNNLGQAS 278

Query: 389 IISVIIPTFEALTQFGVATITTQNTGEVEASYSL-TFDCSTGVTLMEEQYFIIKPKETSI 447
           I    +  FE+L+Q G   +   +   +EA Y++   DC   +  + EQ   I   +   
Sbjct: 279 IKVFSVTDFESLSQRGTLRVLVASETPLEALYAVRVVDCVPAIFPIAEQSQSIGSYQAKW 338

Query: 448 RSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQ 495
            +F +   T       C+ +L +S+  ++D+    F T +T +  G+Q
Sbjct: 339 YTFTLQTMTPIGGNTNCTILLVNSNSDQLDKKFVSFRTNSTTIYKGNQ 386


>gi|452824579|gb|EME31581.1| hypothetical protein Gasu_12520 [Galdieria sulphuraria]
          Length = 422

 Score =  118 bits (295), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 76/296 (25%), Positives = 133/296 (44%), Gaps = 19/296 (6%)

Query: 266 DLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREAD 325
           D+    + WML++  + TL G ECNK+GVSY AF  + S C     SCL NQL N+ ++D
Sbjct: 19  DIPSEIAKWMLVDTDQVTLSGDECNKVGVSYSAFQDESSRCLRAVNSCLGNQLENFYKSD 78

Query: 326 QNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRS 385
              +       Y V+   +      +  +         + +S++L++  A+ + +V   S
Sbjct: 79  LKALQEGTSGNYFVQFFGDFDGNEVSGANPKMRFWTDRIQSSDILLQFAAESLFHVVDVS 138

Query: 386 PGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLMEEQYFIIKPKET 445
            GKII   +   +A ++ G  T+T QNTG+VEASY +   C   +  +  Q   I P +T
Sbjct: 139 EGKIIGANVNLVQAYSKNGKMTVTLQNTGKVEASYEVAVSCPNNILPILAQQVYILPNQT 198

Query: 446 SIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQITPFQPPKSS 505
              +F++         + C+  L++S    + R + +  +    +  G Q    QP  S 
Sbjct: 199 KNVTFQVDVENTHGGHFVCNVSLQNSIGQSISRYQVKVESSGINVSTGPQAG--QPSGSD 256

Query: 506 INDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSWLVLFGLV 561
                               ++ +  AC++ C SFF+  C  ++ C  WL +  ++
Sbjct: 257 ---------------GTSSTNYGSSSACQKSCGSFFNIICFFEHSC--WLNILYVI 295


>gi|328872922|gb|EGG21289.1| hypothetical protein DFA_01170 [Dictyostelium fasciculatum]
          Length = 749

 Score =  113 bits (282), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 128/558 (22%), Positives = 231/558 (41%), Gaps = 79/558 (14%)

Query: 29  VVGVQILSKSKLEKCEK----RTDSDNLNCTTKIVLNMAVPSGSSGGEASIVAEVVEVEE 84
           +V    +  S ++KC +       S NL+C+ K+ +++ + +     E     + V  E+
Sbjct: 19  IVEPTFIGSSTIKKCIRDGTTTETSANLDCSEKLFVSLTLNNNQLETEQ---IQAVVYED 75

Query: 85  NSTQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKTR------KCE----- 133
            ++  +     P  ++ +K+  +  + + Y   V  KP E  +  R      +CE     
Sbjct: 76  GTSGNLS---YPIEVSFSKSQVFIQHPVIYETTVSNKPYETVIYKRDDIILTECEDKPTQ 132

Query: 134 PDAGADVVK---ICERQPICCPCGPQRRIP---SSCGNVFDKLLKGKANTAHCLRFPGDW 187
              G  VV    + + Q  CC C          +S  N+   LL  ++++AHCL F    
Sbjct: 133 STCGYAVVNGSAVRDSQGFCCTCIFSDYFTQDHNSRANLKCTLLNDQSSSAHCLGFDKVL 192

Query: 188 FHVFGIGQRSIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLK-VNLIGDFVGYTNIP 246
           ++V+ I   +I + +   +K         + PE        NF + + L+   V  T + 
Sbjct: 193 YNVYAIQPGTILYQINATIKY--------LDPEF-------NFTRSIPLVVSPVSPTAVD 237

Query: 247 SFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFC 306
           + +   L+      P  P+         ++L+R  F   G EC+KIGVSY  F  QP+ C
Sbjct: 238 AKKNMILI----SDPTNPRTKMSPIQSSLILDRRLF---GDECDKIGVSYSKFQNQPNRC 290

Query: 307 SSPFWSCLHNQLWNYREADQNRINRNQ-----LPLYGVEGRFERMNQHPNAGSHSFSIGV 361
            + F +CL+NQ+ +Y + D +++++       L  +G +  F  ++   +  +    I +
Sbjct: 291 GAQFGTCLNNQIDDYFKEDTDKMSKGLKGNYILSNFGSQ-MFASLDSSSSQANRFIKIQI 349

Query: 362 TEVLNSNLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYS 421
            ++  + + +EL+AD +  +   SPGKII   + TFEA++  GV   + +NTG + A Y 
Sbjct: 350 DQIHQTQISLELKADQLRVIMNTSPGKIIEAYVKTFEAMSNNGVLVASIKNTGVIVAEYD 409

Query: 422 LTF-DCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAE 480
           +   +CS  +  +  Q   I   +     F I   +     Y C   L + +   +D   
Sbjct: 410 VQVKNCSQEINPIPAQRSSIAGGQYKTLQFDITTQSELKDTYYCYVDLLNGNAELLDSKL 469

Query: 481 CQFSTMATVLDNGSQITPFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACRRKCSSF 540
             F+   TV+ N        P  +   D        G  L  G         C   C  F
Sbjct: 470 VYFNVSETVIKN--------PQGTGTRD--------GDNLNIGFE-----LTCDDYCPDF 508

Query: 541 FDFSCHI-QYICLSWLVL 557
           F   C + Q  C S L++
Sbjct: 509 FQLLCFVSQPKCTSRLII 526


>gi|281205105|gb|EFA79298.1| hypothetical protein PPL_07716 [Polysphondylium pallidum PN500]
          Length = 698

 Score =  111 bits (278), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 126/517 (24%), Positives = 212/517 (41%), Gaps = 63/517 (12%)

Query: 32  VQILSKSKLEKC------EKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIVAEVVEVEEN 85
           + ++S S+++ C      E + D+  L C    V+++ + S     E S     V   +N
Sbjct: 13  IDVISSSQIQICKDDGTLESKKDNQYLKCQKMFVVSLTIDSNQDHTELSQFT--VNDVKN 70

Query: 86  STQKMRTVRIPPVLTVNKTASYAVYELTYIR--------DVPYKPQEFYMKTRKCEP--- 134
                 ++  P  ++ +K+  Y    L Y +        +V Y        + K  P   
Sbjct: 71  ENGDTFSLVYPVEISFSKSKQYGKSSLIYRKSYSEEKYENVHYTNDYLLFSSCKDSPSDH 130

Query: 135 ------DAGADVVKICERQPICCPC------GPQR--RIPSSCGNVFDKLLKGKANTAHC 180
                 DA  + V     Q  CC C      G  R  R   SC      L  G++++A C
Sbjct: 131 TCPTVRDASGNQVPY--SQGFCCSCDLGSYVGIDRDSRSHLSC-----TLFGGRSSSASC 183

Query: 181 LRFPGDWFHVFGIGQRSIGFSV-----RIEVKTG-SKVSEVTVGPENKTATSADNFLKVN 234
           +      +  + I      + +     +  V TG S+     +G +N    +  N + + 
Sbjct: 184 MAQRPLLYDSYSIEPPVTTYDIVVNITQFNVTTGASQTQTYRLGNDNLILNA--NGIVIK 241

Query: 235 LIGDFVGYTNIPSFEEFYLVIP-RQGGPGQP-QDLGGNFSMWMLLERTRFTLDGLECNKI 292
           L+GDF     + +FE+  L +   Q  P  P   L   F M  ++          ECNKI
Sbjct: 242 LVGDFASPQALRTFEDSMLFVNNEQSDPNNPIHKLP--FEMRAMIFSKSDVGSPNECNKI 299

Query: 293 GVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNA 352
              Y AF  QP+ CS+P  SCL NQ+  +R+ D     + +   Y ++        + N 
Sbjct: 300 ATDYVAFQNQPNQCSAPLNSCLDNQIKKFRDQDMALYAKGKKGQYLIKNYGATAEIYKNT 359

Query: 353 GSHS--FSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITT 410
           G+++    + ++    S + I+++AD+  +VY+ SPG I+S  + TFE+++  G   I T
Sbjct: 360 GNNNLFLQVELSGKQTSLVTIKIKADNFAFVYKESPGVIVSNRLETFESMSSDGNLYIQT 419

Query: 411 QNTGEVEASYSL-TFDCSTGVTLME-EQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAIL 468
           +N G     Y L   +C+  +T+    Q F +KP E     F+I   T       C A L
Sbjct: 420 KNIGATNTQYVLNVLNCTEAITVNNPSQVFTMKPNEIVETKFEIRTVTKFGGNQHCYADL 479

Query: 469 KDSDFSEV-DRAECQFSTMATVL------DNGSQITP 498
           K      + D    +F+T  TV+      + GS  TP
Sbjct: 480 KGFAMGTLFDSILIKFNTTDTVIKVPGYNETGSGDTP 516


>gi|302842680|ref|XP_002952883.1| hypothetical protein VOLCADRAFT_105707 [Volvox carteri f.
           nagariensis]
 gi|300261923|gb|EFJ46133.1| hypothetical protein VOLCADRAFT_105707 [Volvox carteri f.
           nagariensis]
          Length = 980

 Score =  105 bits (263), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 117/469 (24%), Positives = 185/469 (39%), Gaps = 83/469 (17%)

Query: 165 NVFDKLLKGKANTAHCLRFPGDWFHVFGI-GQRSIGFSVRIEV------KTGS------- 210
           N+F +L + +  +AHCLR    W+  + +    ++ F VR+EV      K  S       
Sbjct: 166 NMFPELER-EPVSAHCLRLDDQWYRGYNLRPGGTLQFGVRLEVHIPTAGKPASMAYVNGT 224

Query: 211 -KVSEVTVGPENKT---------ATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIPRQG- 259
            ++S+ TV   +++          T+++      L+GD   Y  +P      L+IPR   
Sbjct: 225 LRISKSTVITRSESLDLTLAGPLVTTSNKMTSARLLGDLSSYAQVPDLGTRALMIPRADF 284

Query: 260 -----GPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCL 314
                 PG    + G    WM++ RT  T DGL C+KIG SY AF  Q + C     +CL
Sbjct: 285 NETELYPGDSVPVNGR--TWMMVNRTMVTYDGLSCDKIGTSYTAFRNQQNACFRTESTCL 342

Query: 315 HNQLWNYREADQNRINRNQLPLY------GVEGRFERMNQHPNAGSHSFSIGVTEVLNSN 368
            NQL +  E DQ RI     PLY      GV G    M      G+    + +     S 
Sbjct: 343 RNQLKDLFEGDQKRIGSGMTPLYLLSQFNGVNG----MAVDAIDGAIYLRLPIASQPPS- 397

Query: 369 LLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQ--------NTGEVEASY 420
           +++ + AD ++     SPGK+ ++ +  F + T  G     T+        N G + A Y
Sbjct: 398 VMLTVSADTVQMTTALSPGKLSNLRMCEFGSTTSCGSLGFITRGHIFLSVANMGLLPADY 457

Query: 421 SLTF-DCS-TGVTLMEEQYFIIKPKETSIRS--FKIYPTTNQAAKYTCSAILKDSDFSEV 476
            +   DCS   V  +E +   +   +T   S    IY       + +C+  L D+    V
Sbjct: 458 IIVVSDCSLNNVWPIEARMITVNAGQTVNLSPPIPIYMNDTTTKERSCAVQLYDATGKAV 517

Query: 477 DRAECQFSTMATVLDNGSQITPFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACRRK 536
           DR +  F   A+           Q P  ++N                         C   
Sbjct: 518 DRQKLTFDATAS--------KGLQKPSRNVN-------------------ATNATNCDEV 550

Query: 537 CSSFFDFSCHIQYICLSWLVLFGLVLAIFPTVLVLLWLLHQKGLFDPLY 585
           C++    SC I+  C + +  F  +LA     + L+ L  +   F  LY
Sbjct: 551 CANPAGVSCFIENDCPAKMGRFLGILAAILAGITLMVLACKYSWFSKLY 599


>gi|449017067|dbj|BAM80469.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 708

 Score = 99.0 bits (245), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 119/566 (21%), Positives = 220/566 (38%), Gaps = 87/566 (15%)

Query: 13  FLLILFCI-LNLLSP--RCVVGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSS 69
           F+L+ F + L L++P     VG  +     +  C       ++ C+ K VL +AV +G++
Sbjct: 10  FVLLFFQVPLWLVAPFFSASVGGSLTGVGSIVTCLDSGRPGSIPCSKKWVLTLAVENGAT 69

Query: 70  GGEASIVA-EVVEVEENSTQKMRTVRIPPV---------LTVNKTASYAVYELTYIRDVP 119
              +S+ A + V    ++   +R+   P           +T+ K+     Y L Y  D  
Sbjct: 70  AASSSVSATQAVYGSSSANATVRSADNPNTVYAFKYQVHITLTKSRIRLDYPLYYQSDFN 129

Query: 120 YKPQEFYMKTRK-----------------CEPDAG-----------ADVVKICERQPICC 151
            KP E   K  +                  +P  G           AD  +I   Q  CC
Sbjct: 130 NKPYEIVYKYNQKGPLNWLDNQCVATWGSSDPTCGYAYNPSWSTKPAD--RILYSQGFCC 187

Query: 152 PCGPQRRIPSSCGNVFDKL------LKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIE 205
            C     +  S   +   L            +AHCLRF   W+  F IG+  + F + + 
Sbjct: 188 DCNAGDLLGLSPNRIRGGLDCSLLNFDNPTESAHCLRFDSLWYSAFQIGEPDVNFVILVN 247

Query: 206 V-------KTGSKVSE----------------VTVGPENKTATSADNFLKVNLIGDFVGY 242
           V        T   +S                 +++ P +    +++  +    IGDF  +
Sbjct: 248 VTKCPLANSTIKSISGLVGNQDQAIQNCSTEIISLSPSSPIGYASNGKISAQAIGDFAPW 307

Query: 243 TNIPSFEEFYLVIP-------RQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVS 295
              PS+ E    +P             +   +    + WML++    T+ G  C+KIGVS
Sbjct: 308 EGTPSYSEKLFFVPSVCTDTSEAWCVDRISYIPTEINRWMLIDNDLVTITGDTCDKIGVS 367

Query: 296 YEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVE--GRFERMNQHPNAG 353
           Y AF  +   C  P  SCLH+QL +Y ++D       ++  Y V+  G F+     P   
Sbjct: 368 YSAFTNEGQRCERPTQSCLHDQLQDYYDSDLALEQTGKVGSYFVQFFGDFDVSGLTPRNP 427

Query: 354 SHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVI--IPTFEALTQFGVAT--IT 409
              F    T+   + ++++  A+++ Y    +P + +  +  I  F + ++ G+    I 
Sbjct: 428 LLRFFTNRTQA--TEVVLQFAAEELFYTIYLAPARFLRHLSKINPFTSQSKGGLIDLWIV 485

Query: 410 TQNTGEVEASYSLTFDCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILK 469
           ++ TG+  A ++++  C   V  ++ Q   + P +    S  I  T        C+  L+
Sbjct: 486 SEGTGQNAAQFTVSASCEPNVEPIQAQIVTLAPGQLVSISLPIIETKATGGAGVCNCTLR 545

Query: 470 DSDFSEVDRAECQFSTMATVLDNGSQ 495
           ++    +D    +F+  +    +G+Q
Sbjct: 546 NALGQVLDVLVLEFNASSVRTTDGAQ 571


>gi|403356130|gb|EJY77656.1| HAP2-GCS1 domain containing protein [Oxytricha trifallax]
          Length = 751

 Score = 91.3 bits (225), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 123/537 (22%), Positives = 217/537 (40%), Gaps = 73/537 (13%)

Query: 79  VVEVEENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRD--VPYKPQEFYMKTRKCE--- 133
           + +VE ++T +M+ +     +   K+A+  VY+LTY++D       Q  Y ++  C+   
Sbjct: 5   ITKVENSTTGEMQQLNQWIQIGFKKSAAQIVYDLTYVQDFYASVTEQVVYTQSIFCDDSY 64

Query: 134 ----PDAG----ADVVKICERQPICCPCGPQRRIPSSCGNVFDKLLKG----KANT---- 177
               P  G     +  KI   Q  CC C       S    + DK  +G     ANT    
Sbjct: 65  NSNDPTCGWQYDKNGNKIQYSQGFCCDCPL-----SDIFTIADKETRGFSCILANTFYAT 119

Query: 178 AHCLRFPGDWFHVFGIGQRSIGFSVRIEVKTGSK------VSEVTVGPENKTATSADNFL 231
           AHCL+F  + F  + I    + +++  +++   K        ++ + P+ +     D+ +
Sbjct: 120 AHCLKFSSERFSAYKISPPRVEYTITAQIQIFDKNYNFYRQYDINLRPDRREKVIDDS-I 178

Query: 232 KVNLIGDFVGYTNIPSF-EEFYLVIPRQGGPGQPQDLGGNF-------SMWMLLERTRFT 283
            +++IGDF+  T  P +     L++P        +D G NF          +L++R   T
Sbjct: 179 YISIIGDFMP-TQFPVYYNNEILLVP-------SKDYGYNFYSTFDNCEYCLLVDREMIT 230

Query: 284 LDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRF 343
           + GLEC+KIG SY AF      C  P ++CL  Q+ +    D  RI   + P Y +    
Sbjct: 231 MTGLECDKIGTSYYAFQTAGDKCDQPVYTCLKLQIQDLIIDDFKRIQDKKTPNYLLNHVG 290

Query: 344 ERMNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKI----ISVIIPTFEA 399
            +        +   +     V ++ L  E+  D + ++  R+  K+    + ++   FE 
Sbjct: 291 SKTGLKFVQETGQLAASCPYVQSTQLKFEVNTDRMSFI--RAVVKMSISQVKLLNNGFEG 348

Query: 400 LTQFGVATITTQNTGEVEASYSLTF-DCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQ 458
            T FG+  I  +N  ++     L+  +CS  VT       +I     S RS+     +N 
Sbjct: 349 YTNFGLIMINIRNDEDLAGEGQLSISNCSDFVTFFGNSVQVISVPAYSERSYNFTVGSNS 408

Query: 459 ---AAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQITPFQPPKSSINDFFESIES 515
                  TCS  L +S  S ++    QF T AT      ++      +    ++  S E 
Sbjct: 409 NLPIENNTCSIQLTNSIGSILEDRSVQFQTTATNYQTVVEVAQGILSQQKEAEYLLSKEY 468

Query: 516 IGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAIFPTVLVLL 572
           +  K    L DF            FF   C+   +C   ++    V  + P +  +L
Sbjct: 469 LCGKC--DLEDF------------FFALFCYFYEVCTDQILRIIFVYILMPFLFAVL 511


>gi|443715870|gb|ELU07639.1| hypothetical protein CAPTEDRAFT_211680 [Capitella teleta]
          Length = 914

 Score = 89.4 bits (220), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 84/351 (23%), Positives = 146/351 (41%), Gaps = 21/351 (5%)

Query: 150 CCPCGPQRRI-PSSCGNVFDKLLKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVKT 208
           CC C P+++   S+C            N+AHCL+F   WF V  +GQ  +  S+R+E+  
Sbjct: 160 CCSCDPKKKNRDSACA--------PHQNSAHCLQFHPLWFTVSEVGQLHMKHSIRVELME 211

Query: 209 GSKVSEVTVGPENKTATS----ADNFLKVNLIGDFVGYTNIPSFEEFYLVIPRQGGPGQP 264
            ++ +E +   +    TS     ++ + +    D V  ++    E+  L+ P +  PG P
Sbjct: 212 PTESNEWSSVADLNIGTSQPVDMNDKVTIQYKMDTVNISDPLKVEDKLLLTP-ELDPGMP 270

Query: 265 --QDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYR 322
             Q+L      ++L+++    L G  C+ +G SY  F  Q   CSS   SC+ NQ     
Sbjct: 271 LSQELMKEMDKFLLVKKDLVDLSGGSCDSVGTSYPGFLNQKEACSSSLESCMKNQPLELL 330

Query: 323 EADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVY 382
            ADQ R++R +     +      ++   + GSH  +    EV  S + + + AD+++   
Sbjct: 331 MADQMRLSRGEAAQLMIGHLGLALSPTYSGGSH-LAFLSNEVHLSQVTVLIDADNLDLKT 389

Query: 383 QRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLMEEQYFIIKP 442
             S   II V+  + +  T      +T  N G ++        C   V +   Q   + P
Sbjct: 390 ASSDVAIIDVVSTSADHKT---TVKMTLFNAGILDVEVHAQMTCPWLVDIPASQKNRLMP 446

Query: 443 KETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNG 493
             ++   F      N   +  C+  + D    EV R E       + L  G
Sbjct: 447 YHSARVLFHFDADLND-TEVVCNVQVMDIQDEEVARREVSLKQSTSCLCMG 496


>gi|449664645|ref|XP_002159421.2| PREDICTED: protein HAPLESS 2-A [Hydra magnipapillata]
          Length = 385

 Score = 79.7 bits (195), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 88/373 (23%), Positives = 146/373 (39%), Gaps = 79/373 (21%)

Query: 8   LKLKHFLLILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSDNLN----------CTTK 57
            K+K  LL  F   N+      VG  ILSKS +E CE    S++L           C  K
Sbjct: 5   FKMKKQLLSSF--FNITVNIIFVGGLILSKSSIEFCENTGSSNDLKDPTNVVTQSACEKK 62

Query: 58  IVLNMAVPSGSSGGEASIVAEVVEVEENS-TQKMRTVRIPPVLTVNKTASYAVYELTY-- 114
           +V+ ++V  G+  GE   +  VV V +NS T +   +  P ++TV+K+  Y  +   +  
Sbjct: 63  MVVLLSV--GNKQGETEKLQAVVSVVQNSATNEFARLYNPFMITVSKSPVYLNFPFFFNG 120

Query: 115 --IRDVPYK-----PQEFYMK--TRKC--------------------------EPDAGAD 139
             + + PY+        +Y+   +R+C                          + D    
Sbjct: 121 ITVNNQPYEEIILSKNRWYVSDSSRQCLDQWQVEEEDDEHPTCGYQYTNSTQKQTDGTWK 180

Query: 140 VVK--ICERQPICCPCGPQRR---------IPSSCGNVFDKLLKGKANTAHCLRFPGDWF 188
            VK  I + Q  CC C    +           +  G +   L      +AHC+R    W+
Sbjct: 181 TVKTRIWDSQGFCCYCTQDLKNYYIKKDIQDANRAGIICKPLTNSPQASAHCMRMSNLWY 240

Query: 189 HVFGIGQRSIGFSVRIE--------VKTGSKV-----SEVTVGPENKTATSADNFLKVNL 235
            +    +    FS+ ++        V+  S +      E+ + P  K+AT + N +  N 
Sbjct: 241 TLNEFTESYRDFSIYVKAFDQITKVVQNKSYIDYVNGGEILLSPSQKSATGSYNRITGNY 300

Query: 236 IGDFVGYTNIPSFEEFYLVIPRQG---GPGQPQDLGGNFSMWMLLERTRFTLDGLECNKI 292
           +GD     + P     Y +IP       P +   L    S WM++ R   + D  +C+ I
Sbjct: 301 VGDLQPIKSYPVLTNNYFLIPFSSTNVDPKKESQLKSGISKWMIIPRDLVSTDAKQCDMI 360

Query: 293 GVSYEAFNGQPSF 305
           GV Y AF  Q ++
Sbjct: 361 GVGYSAFRNQAAY 373


>gi|449686992|ref|XP_004211319.1| PREDICTED: protein HAPLESS 2-A-like, partial [Hydra magnipapillata]
          Length = 331

 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 52/181 (28%), Positives = 92/181 (50%), Gaps = 8/181 (4%)

Query: 313 CLHNQLWNYREADQNRINRNQLPLY--GVEGRFERMNQHPNAGSHSFSIGVTEVLN---S 367
           CL NQ +N    D++R+ + ++P Y     G+   + Q  N G +   +   E+ +   S
Sbjct: 1   CLANQPYNKFMDDEDRLEKGKMPWYFPARYGKLAGVKQ--NIGDNDKYLLTYELDDEQIS 58

Query: 368 NLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTF-DC 426
            + +++ ADD+  VY R+ G I    I  FEAL+  G  ++   NTG V + + ++   C
Sbjct: 59  LVTLQISADDVVLVYNRATGIITRTAIQDFEALSLEGQLSVDVLNTGYVSSDFRISIPSC 118

Query: 427 STGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTM 486
           ++GV  +EE+   I P+ T   +FK+  +T++ + + C+  L DS    +      FST 
Sbjct: 119 TSGVQPIEEKRITIDPQMTETITFKMMTSTDKKSAHDCTINLYDSKNILLQSRNFTFSTK 178

Query: 487 A 487
           A
Sbjct: 179 A 179


>gi|242019036|ref|XP_002429972.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212515027|gb|EEB17234.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 1342

 Score = 69.3 bits (168), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 76/270 (28%), Positives = 122/270 (45%), Gaps = 52/270 (19%)

Query: 172  KGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEV-------KTGSKVSEVTVGPENKTA 224
            K   N+AHCLRF G W+ ++ I +  +  +V ++V          S+  ++T G   +  
Sbjct: 897  KCSGNSAHCLRFGGLWYGMYQIKKPVVAQTVGVQVFEKNVFFNGNSEWRDLTKGKMVRVG 956

Query: 225  T---SADNFL---KVNLIGDFVGY--TNIPSFEEFYLVIPR--QGGPGQPQDLGGNFSMW 274
            T    A++ L    +   GD      T+I S + + L+IP   +GGP +           
Sbjct: 957  TFLPKAEDELPTFSMTYAGDNTKKISTSIDS-DNYMLLIPSHMEGGPDE----------- 1004

Query: 275  MLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQ---LWNYREADQNRINR 331
             L+ R++    G ECN IG S+EAF+ QP  C+ P  SCL  Q   LW  RE  + RI  
Sbjct: 1005 YLVVRSKDVSSGSECNMIGTSFEAFSQQPDRCARPKGSCLTRQPVDLW--REDMEARIR- 1061

Query: 332  NQLPLYGVEGRF--ERMNQHPNAGSHSFSIGVTEVL--------NSNLLIELRADDIEYV 381
                  G  G++  E     PN+   ++S    E L         S + IE++AD    +
Sbjct: 1062 ------GKRGKYFVENFGTVPNSPVKTYSNLSGEYLALEYYGDYTSTIEIEMKADFNVMI 1115

Query: 382  YQRSPGKIISVII-PTFEALTQFGVATITT 410
             + S  +I SV +  T+   T+  ++ + T
Sbjct: 1116 RKGSSAQIPSVYVDSTYPDKTRIVLSVLNT 1145


>gi|326433608|gb|EGD79178.1| hypothetical protein PTSG_09908 [Salpingoeca sp. ATCC 50818]
          Length = 1226

 Score = 67.4 bits (163), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 119/551 (21%), Positives = 204/551 (37%), Gaps = 126/551 (22%)

Query: 34  ILSKSKLEKCEKRTDSDNL------NCTTKIVLNMAVPSGSSGGEASIVAEVVEVEENST 87
           +++ S+LE CE  + ++         C  K+V+ +++ +G    E   V EV    EN  
Sbjct: 30  VIANSRLEVCESTSATNQPLSSLGGTCKKKLVMTLSIGNGQQLPE---VVEVTNYIENGQ 86

Query: 88  QK--MRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKTRKCE------------ 133
           +K   +  RI P     K+ ++  Y L Y   +   P E + K ++              
Sbjct: 87  EKPLGQRYRIYPT----KSDAFVNYPLVYQSSINSLPYEHWFKVKRSNTQIAFFFDPCRD 142

Query: 134 ------PDAGADVVK---ICERQPICCPCGPQRRIPSSC--GNVFDKLLKGKANTAHCLR 182
                 P  G  V K   I   Q  CC C       ++   GN+  +L     ++AHCLR
Sbjct: 143 APTAEHPTCGYFVRKGERIPNSQGFCCKCKFFSGGGNNAYRGNLKCRLYSPHYSSAHCLR 202

Query: 183 FPGDWFHVFGIGQRSIGFSVRIEVKTGSK------------------------------V 212
           F  +W+  + + +  +  ++R+ V   +                               V
Sbjct: 203 FWPNWYDAWELDRAQVSHTIRLAVYKETSFNGTIPEPTQACKDSNARPTRRVNGFAFYCV 262

Query: 213 SEVTVGPENKTATSADNFLKVNLIGDFVG-------YTNI---PSFEEFY----LVIPRQ 258
             + +G +NK A S +  +    +GD           T +   P+F +F      + P +
Sbjct: 263 DVLLIGVQNKVARSTNGTVIATYLGDLAPSIFPHDLTTRVLLTPNFYQFENSTDPMFPFR 322

Query: 259 GGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAF--NGQPSFCSSPFWSCLHN 316
             P            W+LL++T   L G  CNK GVS+ AF  +G+   C+   +SCL N
Sbjct: 323 RSPAH----------WLLLDKTDVDLSGETCNKPGVSFTAFYQHGKSGGCNMDPFSCLDN 372

Query: 317 QLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVL--------NSN 368
           Q  +    D +R+   +   Y + GR       P     +    V  +L         S 
Sbjct: 373 QPAHLVAQDLDRLAAGRAAQY-MAGRL----GPPLVKRQAVGERVQNMLAFKADIPQESL 427

Query: 369 LLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVE--------ASY 420
             +E+ ADD           ++ V    FE L  +G+A+++      +E        ++ 
Sbjct: 428 YTLEIEADDSRIF-------VLDVSFSIFE-LGVYGLASVSRDIFLWIELARLDKRPSTV 479

Query: 421 SLTFDCSTGVTLMEEQYFIIKPKETSIRSFKIYP---TTNQAAKYTCSAILKDSDFSEVD 477
            L+ +CS  V  +       K  +T      I P   ++ Q   +TC    K++     D
Sbjct: 480 YLSVECSDLVLPVPTTAISWKQTDTGHLRDAIIPLRTSSTQEVVHTCRVEAKNARGRITD 539

Query: 478 RAECQFSTMAT 488
                F T AT
Sbjct: 540 TGFVAFKTTAT 550


>gi|281203885|gb|EFA78081.1| hypothetical protein PPL_08729 [Polysphondylium pallidum PN500]
          Length = 371

 Score = 66.2 bits (160), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 53/196 (27%), Positives = 91/196 (46%), Gaps = 21/196 (10%)

Query: 308 SPFWSCLHNQLWNYREADQNRINRNQLPLY--GVEGRFERMNQHPNAGSHSFSIGVTEVL 365
           +P  +CL NQ+  +R+ D        + LY  G +G+++ +N   +A  + ++      L
Sbjct: 179 APMNACLDNQIKKFRDQD--------MALYAKGKKGQYQIINYGASADIYRYTGNKNLFL 230

Query: 366 --------NSNLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVE 417
                    S L  +++AD+  Y Y+ SP   +S  + TFE+++  G+  I T+N G  +
Sbjct: 231 QVELGGKQTSVLTNKIKADNFAYAYKESPAVFVSNRLETFESMSTDGILYIQTKNIGATK 290

Query: 418 ASYSL-TFDCSTGVTL-MEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSE 475
             Y L   +C+ G+T+    Q F +KP E     F+I   T       C A LK      
Sbjct: 291 EQYDLNVLNCTNGITVNTPSQVFTMKPNEIFESKFEIRTVTKLGGIQHCYADLKGFAMGT 350

Query: 476 V-DRAECQFSTMATVL 490
           + D    +F+T  TV+
Sbjct: 351 LFDSILIKFNTTDTVI 366


>gi|167524322|ref|XP_001746497.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163775259|gb|EDQ88884.1| predicted protein [Monosiga brevicollis MX1]
          Length = 1058

 Score = 62.8 bits (151), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 126/558 (22%), Positives = 200/558 (35%), Gaps = 106/558 (18%)

Query: 14  LLILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEA 73
           LL  F I+ L SP   V  QI +  ++E+C  R  S   NC  K+VLN+ V   + GGE 
Sbjct: 7   LLACFTIMGL-SP--AVQAQIKAAGQIERC-LRDGSLEPNCERKMVLNLGVLGSTGGGEY 62

Query: 74  SIVAEVVEVEENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEF-------- 125
             + +    E+ +        I   + V KT+    Y L Y   V  KP E         
Sbjct: 63  YHLTQS-RTEDGAISDSENEFI---IVVQKTSVSIEYPLRYRGVVNNKPYEIAQPVQTLL 118

Query: 126 ----------------YMKTRKCEPDAGADVVKICERQPICCPCGPQRRI----PSSC-- 163
                           Y  +  C         K+   Q  CC C    ++    P+    
Sbjct: 119 GGLFGSGDKTGCKDSPYHSSPSCGWLIDGGGKKVEGSQGFCCRCSTADQLGIGMPTDSYR 178

Query: 164 GNVFDKLLKGKANTAHCLRFPGD-WFHVFGIGQRSIGFSVRIEV----KTGSKVSEVT-- 216
            N+   L      +AHC R+  + W+ V+      I F V + +      G   S  T  
Sbjct: 179 ANLDCGLFGKGQQSAHCFRYSDELWYGVYDFDPGHIRFKVYVSIYRKYAVGPGYSHATQD 238

Query: 217 ---------------------------VGPENKTATSADNFLKVNLIGDFVGYTNIPSFE 249
                                      VGP  +   ++D  + V   GDF     +    
Sbjct: 239 DVPAASPDCTVELPDEYADFRCQGVLEVGPHLRGGLTSDGAVSVVFGGDFATPVQLRDMS 298

Query: 250 EFYLVIP------RQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAF--NG 301
              L+ P            Q +D       +++++++   + G  C+KIGV   AF  +G
Sbjct: 299 SKTLLAPILETVDNWHTHEQTKD---GMDTYLVVDKSDIDMTGSTCDKIGVEPLAFYLHG 355

Query: 302 QPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPL-----YGVEGRFERMNQHPNAGSHS 356
           +   C     +CL+NQ  ++  ADQ  +N  + P      Y   G  +  +   +     
Sbjct: 356 KGGGCGVSEGTCLNNQPRDFFLADQALVNAGKRPFNLLSAYNPYGFAQVEDDEESRYGIR 415

Query: 357 FSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFG--VATITTQNTG 414
           F +   E   S++ I + ADD+           +++  P FEAL+  G  +A I + N  
Sbjct: 416 FPV---EQHISSITISVNADDLTVYTAVCRDMTLALGSPNFEALSGNGFILANIYS-NCP 471

Query: 415 EVEASYSLTFDCSTGVTLMEEQYFIIKPKETSIRSF-----KIYPTTNQAAKYTCSAILK 469
            + A  S++  C  G       Y      E  I SF      IY  + +A  + C  ++ 
Sbjct: 472 NMTALVSVSVICH-GNARGSASY------EREITSFIQLQLPIYVVSEEAGDHFCDVLVF 524

Query: 470 DSDFSEVDRAECQFSTMA 487
           D+    V      FST A
Sbjct: 525 DAVGYLVVNKSVTFSTSA 542


>gi|221504805|gb|EEE30470.1| conserved hypothetical protein [Toxoplasma gondii VEG]
          Length = 972

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 39/164 (23%), Positives = 80/164 (48%), Gaps = 16/164 (9%)

Query: 275 MLLERTRFTLDGLECNKIGVSYEAFNG-QPSFCSSPFWSCLHNQLWNYREADQNRINRNQ 333
           ++L++   ++ G EC+K+G   + +   +  FC+    +C+  QL  ++E D+ RI +N 
Sbjct: 269 IILDKDYVSVTGYECDKVGTGLDRWGDMRGEFCNLLPGTCITGQLRKFKEVDKLRIEQNL 328

Query: 334 LPLYGVE---GRFERMNQHPNAGSHSFSIGVTEVLN--------SNLLIELRADDIEYVY 382
            PLY ++   G F R   +P  G+   + G    L         S++  E+ A D+ ++ 
Sbjct: 329 APLYALKREFGGFPRYAPNPMNGTGFSTTGTRHYLGYDFGEQHYSDIRFEMDATDVTWLR 388

Query: 383 QRSPGKIISVIIPTFEALTQFGVATITTQ----NTGEVEASYSL 422
             SPG I  + +P  +A +   +     +    N+G  +A++++
Sbjct: 389 ATSPGHITFIEVPQLDACSSSTIGGCPLKAYVWNSGNEDAAFAV 432


>gi|237839869|ref|XP_002369232.1| hypothetical protein TGME49_085940 [Toxoplasma gondii ME49]
 gi|211966896|gb|EEB02092.1| hypothetical protein TGME49_085940 [Toxoplasma gondii ME49]
          Length = 972

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 39/164 (23%), Positives = 80/164 (48%), Gaps = 16/164 (9%)

Query: 275 MLLERTRFTLDGLECNKIGVSYEAFNG-QPSFCSSPFWSCLHNQLWNYREADQNRINRNQ 333
           ++L++   ++ G EC+K+G   + +   +  FC+    +C+  QL  ++E D+ RI +N 
Sbjct: 269 IILDKDYVSVTGYECDKVGTGLDRWGDMRGEFCNLLPGTCITGQLRKFKEVDKLRIEQNL 328

Query: 334 LPLYGVE---GRFERMNQHPNAGSHSFSIGVTEVLN--------SNLLIELRADDIEYVY 382
            PLY ++   G F R   +P  G+   + G    L         S++  E+ A D+ ++ 
Sbjct: 329 APLYALKREFGGFPRYAPNPMNGTGFSTTGTRHYLGYDFGEQHYSDIRFEMDATDVTWLR 388

Query: 383 QRSPGKIISVIIPTFEALTQFGVATITTQ----NTGEVEASYSL 422
             SPG I  + +P  +A +   +     +    N+G  +A++++
Sbjct: 389 ATSPGHITFIEVPQLDACSSSTIGGCPLKAYVWNSGNEDAAFAV 432


>gi|401404273|ref|XP_003881687.1| hypothetical protein NCLIV_014480 [Neospora caninum Liverpool]
 gi|325116100|emb|CBZ51654.1| hypothetical protein NCLIV_014480 [Neospora caninum Liverpool]
          Length = 1133

 Score = 59.7 bits (143), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 37/137 (27%), Positives = 66/137 (48%), Gaps = 12/137 (8%)

Query: 275 MLLERTRFTLDGLECNKIGVSYEAFNG-QPSFCSSPFWSCLHNQLWNYREADQNRINRNQ 333
           ++L++   ++ G EC+K+G   + +   +  FC     SC+  QL  ++E D+ RI +N 
Sbjct: 292 VILDKDYVSVTGYECDKVGTGLDRWGDMRGEFCYLLPGSCITGQLRKFKEVDRLRIEQNL 351

Query: 334 LPLYGVE---GRFERMNQHPNAGSHSFSIGVTEVLN--------SNLLIELRADDIEYVY 382
            PLY ++   G F R    P   +   S G    L         S++  E+ A+D+ ++ 
Sbjct: 352 APLYALKREFGGFPRYAPDPMNATSLSSAGTRHYLGYDFGEQHYSDIRFEMDANDVTWLR 411

Query: 383 QRSPGKIISVIIPTFEA 399
             SPG I  + +P  +A
Sbjct: 412 ATSPGHITFIEVPQLDA 428


>gi|221484611|gb|EEE22905.1| conserved hypothetical protein [Toxoplasma gondii GT1]
          Length = 972

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 38/164 (23%), Positives = 79/164 (48%), Gaps = 16/164 (9%)

Query: 275 MLLERTRFTLDGLECNKIGVSYEAFNG-QPSFCSSPFWSCLHNQLWNYREADQNRINRNQ 333
           ++L++   ++ G EC+K+G   + +   +  FC+    +C+  QL  ++E D+ RI +N 
Sbjct: 269 IILDKDYVSVTGYECDKVGTGLDRWGDMRGEFCNLLPGTCITGQLRKFKEVDKLRIEQNL 328

Query: 334 LPLYGVE---GRFERMNQHPNAGSHSFSIGVTEVLN--------SNLLIELRADDIEYVY 382
            PLY ++   G F R   +P   +   + G    L         S++  E+ A D+ ++ 
Sbjct: 329 APLYALKREFGGFPRYAPNPMNRTGFSTTGTRHYLGYDFGEQHYSDIRFEMDATDVTWLR 388

Query: 383 QRSPGKIISVIIPTFEALTQFGVATITTQ----NTGEVEASYSL 422
             SPG I  + +P  +A +   +     +    N+G  +A++++
Sbjct: 389 ATSPGHITFIEVPQLDACSSSTIGGCPLKAYVWNSGNEDAAFAV 432


>gi|270010014|gb|EFA06462.1| hypothetical protein TcasGA2_TC009345 [Tribolium castaneum]
          Length = 964

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 51/203 (25%), Positives = 83/203 (40%), Gaps = 30/203 (14%)

Query: 156 QRRIPSSCGNVFDKLLKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEV-------KT 208
           QRR   +C +    + +    + HCL F   W+ V+ I +  I   +RI++         
Sbjct: 286 QRRGGQTCDDADLNIPESFRESTHCLTFSNMWYSVYQISKPEIIHKLRIQIFQKYEDCHG 345

Query: 209 GSKVSEVTVGPENKTATSADNFLKVNLIGDFVG-----YTNIPSFEEFYLVIP--RQGGP 261
            +   ++T G   +  T    +++ ++I  +             ++   L+IP  R   P
Sbjct: 346 NTHWMDITQGKTIELGTQTPLYVEKDIIAKYCSEDIDFQDQALDYKNLKLLIPERRVVDP 405

Query: 262 GQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNY 321
            Q          +MLL +   + DG  C+  GV YEAF  Q   C+ P  SCL NQ    
Sbjct: 406 EQ----------FMLLPKNSVS-DGRTCDTAGVGYEAFFKQRKRCAQPQGSCLGNQPNQL 454

Query: 322 READ-----QNRINRNQLPLYGV 339
            E+D     Q R+ +  L  YG 
Sbjct: 455 HESDAEAVKQGRVGQYFLKFYGT 477


>gi|91085727|ref|XP_973371.1| PREDICTED: similar to synaptic vesicle protein 2 [Tribolium
           castaneum]
          Length = 1537

 Score = 56.6 bits (135), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 51/203 (25%), Positives = 83/203 (40%), Gaps = 30/203 (14%)

Query: 156 QRRIPSSCGNVFDKLLKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEV-------KT 208
           QRR   +C +    + +    + HCL F   W+ V+ I +  I   +RI++         
Sbjct: 286 QRRGGQTCDDADLNIPESFRESTHCLTFSNMWYSVYQISKPEIIHKLRIQIFQKYEDCHG 345

Query: 209 GSKVSEVTVGPENKTATSADNFLKVNLIGDFVG-----YTNIPSFEEFYLVIP--RQGGP 261
            +   ++T G   +  T    +++ ++I  +             ++   L+IP  R   P
Sbjct: 346 NTHWMDITQGKTIELGTQTPLYVEKDIIAKYCSEDIDFQDQALDYKNLKLLIPERRVVDP 405

Query: 262 GQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNY 321
            Q          +MLL +   + DG  C+  GV YEAF  Q   C+ P  SCL NQ    
Sbjct: 406 EQ----------FMLLPKNSVS-DGRTCDTAGVGYEAFFKQRKRCAQPQGSCLGNQPNQL 454

Query: 322 READ-----QNRINRNQLPLYGV 339
            E+D     Q R+ +  L  YG 
Sbjct: 455 HESDAEAVKQGRVGQYFLKFYGT 477


>gi|71029132|ref|XP_764209.1| hypothetical protein [Theileria parva strain Muguga]
 gi|68351163|gb|EAN31926.1| hypothetical protein TP04_0574 [Theileria parva]
          Length = 759

 Score = 48.5 bits (114), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 42/143 (29%), Positives = 67/143 (46%), Gaps = 21/143 (14%)

Query: 285 DGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNY--READQNRINRNQLPLYGVEGR 342
           DGL C+KIG+S + +  Q   C+S   SCL NQL +Y  +E D+ ++ +    LYGVE  
Sbjct: 390 DGLMCDKIGLSMKRWANQEEICNSSPGSCLKNQLKHYFDQEKDEAKLPK----LYGVEPT 445

Query: 343 FERMNQHPNAGSHSFSI-GVTEVLNSNLLIELRADDIEYVYQRSPGKIISVIIPTFEA-- 399
           F        A     S+  V E   + L    R   + Y++ +    +  + I TF+A  
Sbjct: 446 F-------TAVKKDLSLPAVKEANKTTLDDPNRIHTLTYIHSKD--DVTRLKIDTFDATV 496

Query: 400 ---LTQFGVATITTQNTGEVEAS 419
              ++ F    ++ +  GE E S
Sbjct: 497 TEIISDFPGFIVSAKMDGECEVS 519


>gi|224084920|ref|XP_002307449.1| predicted protein [Populus trichocarpa]
 gi|222856898|gb|EEE94445.1| predicted protein [Populus trichocarpa]
          Length = 228

 Score = 48.1 bits (113), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 41/75 (54%)

Query: 506 INDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAIF 565
           I  F      +   L+    DF+ GK+C   C+S +DF C+I+  C++ L+    VLA+ 
Sbjct: 5   IISFLSGFTKVIGDLFGSPLDFLAGKSCSSVCASPWDFFCYIENFCVASLLKMVAVLALL 64

Query: 566 PTVLVLLWLLHQKGL 580
             VL+  +LL++ G+
Sbjct: 65  YIVLLFFYLLYKTGI 79


>gi|209877042|ref|XP_002139963.1| hypothetical protein [Cryptosporidium muris RN66]
 gi|209555569|gb|EEA05614.1| hypothetical protein CMU_026210 [Cryptosporidium muris RN66]
          Length = 696

 Score = 48.1 bits (113), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 51/245 (20%), Positives = 97/245 (39%), Gaps = 33/245 (13%)

Query: 275 MLLERTRFTLDGLECNKIGVS---YEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINR 331
           ++L        G+ CNKI  S   + + NG+  FC  P  +C + Q+ +Y +        
Sbjct: 329 LILPPDTVDFTGVSCNKIASSIYTWSSVNGR--FCYHPPLTCQNVQIADYYKKLIKDQTS 386

Query: 332 NQLPLYGVEGR---------------FERMNQHPNAGSHSFSIGVT--EVLNSNLLIELR 374
            ++  + VE +                  M    N  +  F +G     + ++ ++  + 
Sbjct: 387 GKISEFSVEAQNSGEPQLIITPSNYNSSNMISDDNNSTLQFYLGYVFDSIFDTEIMFSVE 446

Query: 375 ADDIEYVYQRSPGKIISVIIPTFEALTQFGVA----TITTQNTGEVEASYSLTFDCSTG- 429
           A  + +V   +PG I  +  P  E+    G       I  +N+G  E+ + +     T  
Sbjct: 447 ASSVSWVASAAPGIITYIEPPPIESCFAMGYTGCPIKIYVRNSGTFESGFVVQIPYCTKD 506

Query: 430 ------VTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQF 483
                 V  +  Q   IK + T + +F I  +    +KY C+A+L +S    +D+    F
Sbjct: 507 SKPTNEVNPIMAQSRSIKAQSTGVFTFIIGVSVTSGSKYECTAVLYNSFSIHLDQHLFTF 566

Query: 484 STMAT 488
           ST ++
Sbjct: 567 STQSS 571


>gi|388557120|dbj|BAM16295.1| generative cell specific-1 [Eimeria tenella]
          Length = 834

 Score = 48.1 bits (113), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 41/170 (24%), Positives = 71/170 (41%), Gaps = 21/170 (12%)

Query: 275 MLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQL 334
           ++L+    T+DG  C+  GVS + + G+  FC     +C    L  + E ++      + 
Sbjct: 374 IVLDEQHVTVDGSTCDLPGVSLQQW-GRDGFCDYAQGTCFAKNLKWFHEYNEQAAELGRT 432

Query: 335 PLYGVE---GRFERMN----------QHPNAGS---HSFSIGVTEVLNSNLLIELRADDI 378
           PLY +E   G + R +              AG    H  +    +   S + IE+ A  I
Sbjct: 433 PLYALEYPPGNYPRYHVGLDNVDDAIDTSKAGPFELHRLAFAYPDSHKSKVRIEMNAGLI 492

Query: 379 EYVYQRSPGKIISVIIPT---FEALTQFGVA-TITTQNTGEVEASYSLTF 424
            ++   SPG+I S+  P     +    FG    +   N+G ++A+Y L  
Sbjct: 493 RWIQSTSPGQITSIAPPAPRECDNAQTFGCPLKVYVLNSGTIDATYYLEL 542


>gi|452824580|gb|EME31582.1| hypothetical protein Gasu_12530 [Galdieria sulphuraria]
          Length = 265

 Score = 47.4 bits (111), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 57/247 (23%), Positives = 96/247 (38%), Gaps = 49/247 (19%)

Query: 1   MRNQTKSLKLKHFLLILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSDNLNCTTKIVL 60
           MR    S  L  +++I   +  L + R      ++S  +++ C     S +L C    V+
Sbjct: 1   MRRTKPSCNL--YIIICVVLFYLKATR----ATLISAGEIQSC-TNNGSSSLQCDKMWVV 53

Query: 61  NMAVPSGSSGGEASIV-------AEVVEVEENSTQKMRTVRIPPVLTVNKTASYAVYELT 113
            +AV +G  G ++++        +E V+ + N   K         +T++K+     Y L 
Sbjct: 54  TLAVANGQQGVDSTVAKVFGSNQSEYVK-DPNDPNKAYLFNYTLHITLSKSKIAIEYPLV 112

Query: 114 YIRDVPYKPQEFYMK----------TRKC---------------EPDAGADVV-KICERQ 147
           Y++D   +P E   +          T  C               +P    D   +I   Q
Sbjct: 113 YLQDFNNQPYEIVYEANSNGPLEEYTNPCVDSWGSSNPTCGYAYDPPNEIDAANRIYNSQ 172

Query: 148 PICCPCGPQRRIPS------SCGNVFDKLLKGK-ANTAHCLRFPGDWFHVFGIGQRSIGF 200
             CC CG    +        SC N+   L + + +  AHCLRF   W+  F IG   +G 
Sbjct: 173 GFCCQCGVSDYLAEGTREGLSC-NLLGSLFQIEPSQAAHCLRFDPLWYSGFQIGTYEVGV 231

Query: 201 SVRIEVK 207
             RI +K
Sbjct: 232 QQRIYLK 238


>gi|449466348|ref|XP_004150888.1| PREDICTED: protein HAPLESS 2-like [Cucumis sativus]
          Length = 85

 Score = 47.0 bits (110), Expect = 0.038,   Method: Composition-based stats.
 Identities = 19/31 (61%), Positives = 25/31 (80%)

Query: 116 RDVPYKPQEFYMKTRKCEPDAGADVVKICER 146
           +DV  KP+E+Y+ TRKCE +A A VV+ICER
Sbjct: 39  KDVSCKPEEYYVTTRKCESNASARVVQICER 69


>gi|449520257|ref|XP_004167150.1| PREDICTED: protein HAPLESS 2-like [Cucumis sativus]
          Length = 85

 Score = 46.2 bits (108), Expect = 0.057,   Method: Composition-based stats.
 Identities = 19/31 (61%), Positives = 24/31 (77%)

Query: 116 RDVPYKPQEFYMKTRKCEPDAGADVVKICER 146
           +D   KP+EFY+ TRKCE +A A VV+ICER
Sbjct: 39  KDFSCKPEEFYVTTRKCESNASARVVQICER 69


>gi|389603325|ref|XP_001569028.2| similar to leishmania major. l411.4-like protein [Leishmania
           braziliensis MHOM/BR/75/M2904]
 gi|322505809|emb|CAM44161.2| similar to leishmania major. l411.4-like protein [Leishmania
           braziliensis MHOM/BR/75/M2904]
          Length = 570

 Score = 45.8 bits (107), Expect = 0.071,   Method: Compositional matrix adjust.
 Identities = 66/349 (18%), Positives = 139/349 (39%), Gaps = 66/349 (18%)

Query: 36  SKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIVAEVVEVEENSTQKM----- 90
           + + +  C+  + +    C  K+V+++ +   +  G  +I+   V VE    Q +     
Sbjct: 7   ATAYVRHCDATSSATPPGCVRKLVVDLTLDDRTLAG--AILETEVTVEHALHQSLFPHDA 64

Query: 91  -----------RTVRIPPV-LTVNKTASYAVYELTYIRDVPYKPQEFYMKT--------- 129
                        V +PP+ + + ++A    Y LTY+R  P   ++ Y+K          
Sbjct: 65  ASDVAGAAATSLQVSLPPIRVALRRSAVQVRYMLTYLRTFPAALRD-YVKVLHTAMSCDD 123

Query: 130 --RKCEPDAGADVVKICERQPICCPC-GPQRRIPSSCGNVFDK---LLKGKANTAHCLRF 183
              +C          +     +CC C G +  + +   NV  +     +  A  + C++ 
Sbjct: 124 GVTRCPSYTSMTGALVSAPLGVCCLCIGIECALTNEFCNVSMRGHFCFRTGAAGSICVQN 183

Query: 184 PGDWFHVFGIGQRSIGFSVRIEVKTGSKVSEVTV-----GPENKTATSADNFLKVNLIGD 238
            G  +H + +G     +++R+   +G  +++ T+      P  +   S  + ++ + +  
Sbjct: 184 EGIVYHGWSVGSPLPYYTLRLS-ASGQGIAQTTLQLTTDAPSAQAGASFLHLVQASGVSP 242

Query: 239 FVGYTNIPSFEEFYLVIP------------RQGGPGQPQDLGGNFSMWMLLERTRFTLDG 286
             G T +       L +P            R   P +          W+LL  +  +  G
Sbjct: 243 GEGGTTV-DIAGRVLFVPSAESSSGSTSHVRDDDPAE----------WLLLPASLVSNSG 291

Query: 287 LECNKIGVSYEAFNGQPSF--CSSPFWSCLHNQLWNYREADQNRINRNQ 333
            EC+K+G+S + F  Q S   C++   +C+ +QL +YRE D  +I + +
Sbjct: 292 NECDKVGISPDYFYSQSSTTQCNAQKGTCVRHQLADYREEDLAQIAQGK 340


>gi|157877293|ref|XP_001686969.1| similar to leishmania major. l411.4-like protein [Leishmania major
           strain Friedlin]
 gi|68130044|emb|CAJ09352.1| similar to leishmania major. l411.4-like protein [Leishmania major
           strain Friedlin]
          Length = 576

 Score = 45.8 bits (107), Expect = 0.076,   Method: Compositional matrix adjust.
 Identities = 63/284 (22%), Positives = 115/284 (40%), Gaps = 46/284 (16%)

Query: 93  VRIPPV-LTVNKTASYAVYELTYIRDVPYKPQEFY--MKTR--------KCEPDAGADVV 141
           V +PP+ + + + A    Y LTY+R  P   ++    +KT         +C         
Sbjct: 78  VSLPPITVAMRRGAVQMRYGLTYLRTFPAALRDSVRVLKTAMSCDDGVTRCPSYMSMTGT 137

Query: 142 KICERQPICCPCGP-QRRIPSSCGNVFDK---LLKGKANTAHCLRFPGDWFHVFGIGQRS 197
            +     +CC C   +  + S   N   +     +  A    C++  G  +H + +G  S
Sbjct: 138 LVSAPLGLCCLCTSVECALTSDLCNASMRAHFCFRTGAAGITCVQSEGITYHGWAVGSSS 197

Query: 198 IGFSVRIEVKTGSKVSEVTV-----GPENKTATSADNFLKVNLIGDFVGYTN-------- 244
             + + +   +G  ++  T+      PE +   SA   L+ +  G   G +N        
Sbjct: 198 PYYMMHLS-ASGRGIAPTTLQLTTDAPEVQKGASALQILRAS--GVLPGESNPTVDISGR 254

Query: 245 ---IPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNG 301
              +PS E          GP +  D     + W+LL     ++ G +C+K+G+S + F  
Sbjct: 255 VLFVPSAEHSSASRSISTGPVRDDDP----AEWLLLPAPLVSVSGNDCDKVGISPDYFYS 310

Query: 302 QPSF--CSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRF 343
             S   C++   +C+ +QL +YR AD  +I +      GV GR+
Sbjct: 311 LSSTKQCNAQKGTCVRHQLADYRAADLEQIAQ------GVGGRY 348


>gi|65335255|gb|AAY42350.1| excreted/secreted protein 37 [Leishmania major]
          Length = 611

 Score = 45.1 bits (105), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 63/284 (22%), Positives = 115/284 (40%), Gaps = 46/284 (16%)

Query: 93  VRIPPV-LTVNKTASYAVYELTYIRDVPYKPQEFY--MKTR--------KCEPDAGADVV 141
           V +PP+ + + + A    Y LTY+R  P   ++    +KT         +C         
Sbjct: 113 VSLPPITVAMRRGAVQMRYGLTYLRTFPAALRDSVRVLKTAMSCDDGVTRCPSYMSMTGT 172

Query: 142 KICERQPICCPCGP-QRRIPSSCGNVFDK---LLKGKANTAHCLRFPGDWFHVFGIGQRS 197
            +     +CC C   +  + S   N   +     +  A    C++  G  +H + +G  S
Sbjct: 173 LVSAPLGLCCLCTSVECALTSDLCNASMRAHFCFRTGAAGITCVQSEGITYHGWAVGSSS 232

Query: 198 IGFSVRIEVKTGSKVSEVTV-----GPENKTATSADNFLKVNLIGDFVGYTN-------- 244
             + + +   +G  ++  T+      PE +   SA   L+ +  G   G +N        
Sbjct: 233 PYYMMHLS-ASGRGIAPTTLQLTTDAPEVQKGASALQILRAS--GVLPGESNPTVDISGR 289

Query: 245 ---IPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNG 301
              +PS E          GP +  D     + W+LL     ++ G +C+K+G+S + F  
Sbjct: 290 VLFVPSAEHSSASRSISTGPVRDDDP----AEWLLLPAPLVSVSGNDCDKVGISPDYFYS 345

Query: 302 QPSF--CSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRF 343
             S   C++   +C+ +QL +YR AD  +I +      GV GR+
Sbjct: 346 LSSTKQCNAQKGTCVRHQLADYRAADLEQIAQ------GVGGRY 383


>gi|359492377|ref|XP_003634404.1| PREDICTED: uncharacterized protein LOC100854126 [Vitis vinifera]
          Length = 234

 Score = 44.3 bits (103), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 22/75 (29%), Positives = 39/75 (52%)

Query: 506 INDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAIF 565
           I  FF     +   L+    DF++GK+C   C   +DF C+I+  C++ L+   +V  + 
Sbjct: 5   IGSFFSGFARVIGDLFGSPLDFLSGKSCSSVCGITWDFICYIENFCVANLLKIAMVSFLL 64

Query: 566 PTVLVLLWLLHQKGL 580
             VL+  +LL + G+
Sbjct: 65  YIVLLFFYLLCKLGI 79


>gi|307176762|gb|EFN66162.1| hypothetical protein EAG_13618 [Camponotus floridanus]
          Length = 1820

 Score = 43.9 bits (102), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 42/177 (23%), Positives = 69/177 (38%), Gaps = 32/177 (18%)

Query: 177  TAHCLRFPGDWFHVFGIGQRSIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLI 236
            +AHCLRF   W+ V+ +    +   V ++V     +   ++  E+ T  S    +    +
Sbjct: 1265 SAHCLRFSDLWYSVYQLEDPIVEHIVYLQVYEKRTLRNGSIYWEDLTEDSV--IVYAIRL 1322

Query: 237  GDF------------VGYTNIPSFEEFY-------------LVIPRQ---GGPGQPQDLG 268
            G F              Y  IP + E               L++P        G P  + 
Sbjct: 1323 GTFNRHHRGSQGTIVFTYKKIPGWREEEEEEAPNLDVVRDRLLVPSSVTSKDSGYP--VK 1380

Query: 269  GNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREAD 325
            G  + ++++  +    +G EC+K GV + AF  QP  C     +CL NQ   YR  D
Sbjct: 1381 GEANEYLVVPASSINENGNECDKAGVGFAAFAKQPDRCGHVSGTCLKNQPLAYRRHD 1437


>gi|328705538|ref|XP_003242840.1| PREDICTED: hypothetical protein LOC100573999 [Acyrthosiphon pisum]
          Length = 754

 Score = 43.5 bits (101), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 22/49 (44%), Positives = 28/49 (57%), Gaps = 3/49 (6%)

Query: 274 WMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQ---LW 319
           +++L     +  G ECNK GVSYEAF  Q + C     SCL+NQ   LW
Sbjct: 409 YLILNANNISTKGDECNKAGVSYEAFFKQSNRCGVKRSSCLNNQPSHLW 457


>gi|398024716|ref|XP_003865519.1| similar to leishmania major. l411.4-like protein [Leishmania
           donovani]
 gi|322503756|emb|CBZ38842.1| similar to leishmania major. l411.4-like protein [Leishmania
           donovani]
          Length = 577

 Score = 43.1 bits (100), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 60/286 (20%), Positives = 114/286 (39%), Gaps = 49/286 (17%)

Query: 93  VRIPPV-LTVNKTASYAVYELTYIRDVPYKPQEFYMKTR----------KCEPDAGADVV 141
           V +PP+ + + + A    Y LTY+R  P   ++     R          +C         
Sbjct: 78  VSLPPITVAIQRGAVQMRYGLTYLRTFPAALRDSVRVLRTAMSCDDGVTRCPSYMSMTGA 137

Query: 142 KICERQPICCPCGP-QRRIPSSCGNVFDKL---LKGKANTAHCLRFPGDWFHVFGIGQRS 197
            +     +CC C   +  + S   N   +     +  A    C++  G  +H + +G  S
Sbjct: 138 LVSAPLGLCCLCTSVECALTSDLCNASMRAHFCFRTGAAGITCVQGEGITYHGWSVGSSS 197

Query: 198 IGFSVRIEVKTGSKVSEVTV-----GPENKTATSADNFLKVNLIGDFVGYTNIPSFE--E 250
             +++ +   +G  ++  T+      PE +   SA   L+ +   D +   + P  +   
Sbjct: 198 PYYTMNLSA-SGRGIAPTTLQLTTDAPEAQNGASALQLLRAS---DVLPEESNPKVDISG 253

Query: 251 FYLVIPR-----------QGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAF 299
             L +P              GP +  D     + W+LL     ++ G +C+K+G+S + F
Sbjct: 254 RVLFVPSAEHSRASRGTTSTGPVRDDDP----AEWLLLPAPLVSVSGNDCDKVGISPDYF 309

Query: 300 NGQPSF--CSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRF 343
               S   C++   +C+ +QL +YR AD  +I +      GV GR+
Sbjct: 310 YSLSSTTQCNAQKGTCVRHQLADYRAADLEQIAQ------GVGGRY 349


>gi|146103721|ref|XP_001469629.1| similar to leishmania major. l411.4-like protein [Leishmania
           infantum JPCM5]
 gi|134073999|emb|CAM72739.1| similar to leishmania major. l411.4-like protein [Leishmania
           infantum JPCM5]
          Length = 577

 Score = 43.1 bits (100), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 60/286 (20%), Positives = 114/286 (39%), Gaps = 49/286 (17%)

Query: 93  VRIPPV-LTVNKTASYAVYELTYIRDVPYKPQEFYMKTR----------KCEPDAGADVV 141
           V +PP+ + + + A    Y LTY+R  P   ++     R          +C         
Sbjct: 78  VSLPPITVAIQRGAVQMRYGLTYLRTFPAALRDSVRVLRTAMSCDDGVTRCPSYMSMTGT 137

Query: 142 KICERQPICCPCGP-QRRIPSSCGNVFDKL---LKGKANTAHCLRFPGDWFHVFGIGQRS 197
            +     +CC C   +  + S   N   +     +  A    C++  G  +H + +G  S
Sbjct: 138 LVSAPLGLCCLCTSVECALTSDLCNASMRAHFCFRTGAAGITCVQGEGITYHGWSVGSSS 197

Query: 198 IGFSVRIEVKTGSKVSEVTV-----GPENKTATSADNFLKVNLIGDFVGYTNIPSFE--E 250
             +++ +   +G  ++  T+      PE +   SA   L+ +   D +   + P  +   
Sbjct: 198 PYYTMHLSA-SGRGIAPTTLQLTTDAPEAQNGASALQLLRAS---DVLPEESNPKVDISG 253

Query: 251 FYLVIPR-----------QGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAF 299
             L +P              GP +  D     + W+LL     ++ G +C+K+G+S + F
Sbjct: 254 RVLFVPSAEHSRASRGTTSTGPVRDDDP----AEWLLLPAPLVSVSGNDCDKVGISPDYF 309

Query: 300 NGQPSF--CSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRF 343
               S   C++   +C+ +QL +YR AD  +I +      GV GR+
Sbjct: 310 YSLSSTTQCNAQKGTCVRHQLADYRAADLEQIAQ------GVGGRY 349


>gi|302141790|emb|CBI18993.3| unnamed protein product [Vitis vinifera]
          Length = 176

 Score = 42.4 bits (98), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 22/75 (29%), Positives = 39/75 (52%)

Query: 506 INDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAIF 565
           I  FF     +   L+    DF++GK+C   C   +DF C+I+  C++ L+   +V  + 
Sbjct: 5   IGSFFSGFARVIGDLFGSPLDFLSGKSCSSVCGITWDFICYIENFCVANLLKIAMVSFLL 64

Query: 566 PTVLVLLWLLHQKGL 580
             VL+  +LL + G+
Sbjct: 65  YIVLLFFYLLCKLGI 79


>gi|301614936|ref|XP_002936934.1| PREDICTED: sodium/calcium exchanger 3-like isoform 3 [Xenopus
           (Silurana) tropicalis]
          Length = 915

 Score = 41.6 bits (96), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 42/160 (26%), Positives = 61/160 (38%), Gaps = 43/160 (26%)

Query: 560 LVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRDFRSRRIDVDHPH-----V 614
           L L  FP  +VL W+  ++ LF   Y +    +++D  R     +   + DHP       
Sbjct: 235 LTLFFFPVCVVLAWVADKRLLF---YKYMHKKYRTDKHRAIMIET---EADHPKGIEMDG 288

Query: 615 HVRKHHKQEGRHHKL------EARRRRCGIHSDHKHKHSDRDTDY------YYYLHHVQK 662
            V   H  +G    +      E+RR    I  + K KH ++D D       YY L H QK
Sbjct: 289 KVMNSHFLDGNLLNMDGKEVDESRRDMIRILKELKQKHPEKDLDQLVEMANYYALSHQQK 348

Query: 663 D--------------------KHKHGRSKNSSVMQQLYLD 682
                                KH   +SK SS +Q + LD
Sbjct: 349 SRAFYRIQATRMMTGAGNILKKHAAEQSKRSSSLQDICLD 388


>gi|301614944|ref|XP_002936938.1| PREDICTED: sodium/calcium exchanger 3-like isoform 7 [Xenopus
           (Silurana) tropicalis]
          Length = 923

 Score = 41.6 bits (96), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 42/160 (26%), Positives = 61/160 (38%), Gaps = 43/160 (26%)

Query: 560 LVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRDFRSRRIDVDHPH-----V 614
           L L  FP  +VL W+  ++ LF   Y +    +++D  R     +   + DHP       
Sbjct: 235 LTLFFFPVCVVLAWVADKRLLF---YKYMHKKYRTDKHRAIMIET---EADHPKGIEMDG 288

Query: 615 HVRKHHKQEGRHHKL------EARRRRCGIHSDHKHKHSDRDTDY------YYYLHHVQK 662
            V   H  +G    +      E+RR    I  + K KH ++D D       YY L H QK
Sbjct: 289 KVMNSHFLDGNLLNMDGKEVDESRRDMIRILKELKQKHPEKDLDQLVEMANYYALSHQQK 348

Query: 663 D--------------------KHKHGRSKNSSVMQQLYLD 682
                                KH   +SK SS +Q + LD
Sbjct: 349 SRAFYRIQATRMMTGAGNILKKHAAEQSKRSSSLQDICLD 388


>gi|301614932|ref|XP_002936932.1| PREDICTED: sodium/calcium exchanger 3-like isoform 1 [Xenopus
           (Silurana) tropicalis]
          Length = 922

 Score = 41.6 bits (96), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 42/160 (26%), Positives = 61/160 (38%), Gaps = 43/160 (26%)

Query: 560 LVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRDFRSRRIDVDHPH-----V 614
           L L  FP  +VL W+  ++ LF   Y +    +++D  R     +   + DHP       
Sbjct: 235 LTLFFFPVCVVLAWVADKRLLF---YKYMHKKYRTDKHRAIMIET---EADHPKGIEMDG 288

Query: 615 HVRKHHKQEGRHHKL------EARRRRCGIHSDHKHKHSDRDTDY------YYYLHHVQK 662
            V   H  +G    +      E+RR    I  + K KH ++D D       YY L H QK
Sbjct: 289 KVMNSHFLDGNLLNMDGKEVDESRRDMIRILKELKQKHPEKDLDQLVEMANYYALSHQQK 348

Query: 663 D--------------------KHKHGRSKNSSVMQQLYLD 682
                                KH   +SK SS +Q + LD
Sbjct: 349 SRAFYRIQATRMMTGAGNILKKHAAEQSKRSSSLQDICLD 388


>gi|449500727|ref|XP_004161179.1| PREDICTED: uncharacterized protein LOC101227573 [Cucumis sativus]
          Length = 238

 Score = 41.6 bits (96), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 30/81 (37%), Positives = 48/81 (59%), Gaps = 6/81 (7%)

Query: 502 PKSSINDFFESIESI-GKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSWLVLFGL 560
             S  +D F +I  I G  L     DF++G++C   C S +DF C+I+  C++ L+  G+
Sbjct: 5   ASSLASDVFSAIGKIFGSPL-----DFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGM 59

Query: 561 VLAIFPTVLVLLWLLHQKGLF 581
           V  +   VL+LL+LLH+ G+F
Sbjct: 60  VFILSYFVLLLLYLLHKIGIF 80


>gi|449449908|ref|XP_004142706.1| PREDICTED: uncharacterized protein LOC101218855 [Cucumis sativus]
          Length = 238

 Score = 41.6 bits (96), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 30/81 (37%), Positives = 48/81 (59%), Gaps = 6/81 (7%)

Query: 502 PKSSINDFFESIESI-GKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSWLVLFGL 560
             S  +D F +I  I G  L     DF++G++C   C S +DF C+I+  C++ L+  G+
Sbjct: 5   ASSLASDVFSAIGKIFGSPL-----DFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGM 59

Query: 561 VLAIFPTVLVLLWLLHQKGLF 581
           V  +   VL+LL+LLH+ G+F
Sbjct: 60  VFILSYFVLLLLYLLHKIGIF 80


>gi|301614934|ref|XP_002936933.1| PREDICTED: sodium/calcium exchanger 3-like isoform 2 [Xenopus
           (Silurana) tropicalis]
          Length = 919

 Score = 41.6 bits (96), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 42/160 (26%), Positives = 61/160 (38%), Gaps = 43/160 (26%)

Query: 560 LVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRDFRSRRIDVDHPH-----V 614
           L L  FP  +VL W+  ++ LF   Y +    +++D  R     +   + DHP       
Sbjct: 235 LTLFFFPVCVVLAWVADKRLLF---YKYMHKKYRTDKHRAIMIET---EADHPKGIEMDG 288

Query: 615 HVRKHHKQEGRHHKL------EARRRRCGIHSDHKHKHSDRDTDY------YYYLHHVQK 662
            V   H  +G    +      E+RR    I  + K KH ++D D       YY L H QK
Sbjct: 289 KVMNSHFLDGNLLNMDGKEVDESRRDMIRILKELKQKHPEKDLDQLVEMANYYALSHQQK 348

Query: 663 D--------------------KHKHGRSKNSSVMQQLYLD 682
                                KH   +SK SS +Q + LD
Sbjct: 349 SRAFYRIQATRMMTGAGNILKKHAAEQSKRSSSLQDICLD 388


>gi|301614938|ref|XP_002936935.1| PREDICTED: sodium/calcium exchanger 3-like isoform 4 [Xenopus
           (Silurana) tropicalis]
          Length = 925

 Score = 41.6 bits (96), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 42/160 (26%), Positives = 61/160 (38%), Gaps = 43/160 (26%)

Query: 560 LVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRDFRSRRIDVDHPH-----V 614
           L L  FP  +VL W+  ++ LF   Y +    +++D  R     +   + DHP       
Sbjct: 235 LTLFFFPVCVVLAWVADKRLLF---YKYMHKKYRTDKHRAIMIET---EADHPKGIEMDG 288

Query: 615 HVRKHHKQEGRHHKL------EARRRRCGIHSDHKHKHSDRDTDY------YYYLHHVQK 662
            V   H  +G    +      E+RR    I  + K KH ++D D       YY L H QK
Sbjct: 289 KVMNSHFLDGNLLNMDGKEVDESRRDMIRILKELKQKHPEKDLDQLVEMANYYALSHQQK 348

Query: 663 D--------------------KHKHGRSKNSSVMQQLYLD 682
                                KH   +SK SS +Q + LD
Sbjct: 349 SRAFYRIQATRMMTGAGNILKKHAAEQSKRSSSLQDICLD 388


>gi|301614942|ref|XP_002936937.1| PREDICTED: sodium/calcium exchanger 3-like isoform 6 [Xenopus
           (Silurana) tropicalis]
          Length = 919

 Score = 41.6 bits (96), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 42/160 (26%), Positives = 61/160 (38%), Gaps = 43/160 (26%)

Query: 560 LVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRDFRSRRIDVDHPH-----V 614
           L L  FP  +VL W+  ++ LF   Y +    +++D  R     +   + DHP       
Sbjct: 235 LTLFFFPVCVVLAWVADKRLLF---YKYMHKKYRTDKHRAIMIET---EADHPKGIEMDG 288

Query: 615 HVRKHHKQEGRHHKL------EARRRRCGIHSDHKHKHSDRDTDY------YYYLHHVQK 662
            V   H  +G    +      E+RR    I  + K KH ++D D       YY L H QK
Sbjct: 289 KVMNSHFLDGNLLNMDGKEVDESRRDMIRILKELKQKHPEKDLDQLVEMANYYALSHQQK 348

Query: 663 D--------------------KHKHGRSKNSSVMQQLYLD 682
                                KH   +SK SS +Q + LD
Sbjct: 349 SRAFYRIQATRMMTGAGNILKKHAAEQSKRSSSLQDICLD 388


>gi|301614940|ref|XP_002936936.1| PREDICTED: sodium/calcium exchanger 3-like isoform 5 [Xenopus
           (Silurana) tropicalis]
          Length = 916

 Score = 41.2 bits (95), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 42/160 (26%), Positives = 61/160 (38%), Gaps = 43/160 (26%)

Query: 560 LVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRDFRSRRIDVDHPH-----V 614
           L L  FP  +VL W+  ++ LF   Y +    +++D  R     +   + DHP       
Sbjct: 235 LTLFFFPVCVVLAWVADKRLLF---YKYMHKKYRTDKHRAIMIET---EADHPKGIEMDG 288

Query: 615 HVRKHHKQEGRHHKL------EARRRRCGIHSDHKHKHSDRDTDY------YYYLHHVQK 662
            V   H  +G    +      E+RR    I  + K KH ++D D       YY L H QK
Sbjct: 289 KVMNSHFLDGNLLNMDGKEVDESRRDMIRILKELKQKHPEKDLDQLVEMANYYALSHQQK 348

Query: 663 D--------------------KHKHGRSKNSSVMQQLYLD 682
                                KH   +SK SS +Q + LD
Sbjct: 349 SRAFYRIQATRMMTGAGNILKKHAAEQSKRSSSLQDICLD 388


>gi|392967161|ref|ZP_10332579.1| TonB-dependent receptor plug [Fibrisoma limi BUZ 3]
 gi|387843958|emb|CCH54627.1| TonB-dependent receptor plug [Fibrisoma limi BUZ 3]
          Length = 1078

 Score = 41.2 bits (95), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 42/184 (22%), Positives = 73/184 (39%), Gaps = 27/184 (14%)

Query: 262 GQPQDLGGNFSMWMLLERT-------RFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCL 314
           G   DL  N+  W++L  T       RF       N    SY  +    SF ++  +  +
Sbjct: 611 GYYADLTSNYKNWLILNGTFRYDQTSRFYKSTRPTNSW--SYPYYGAAVSFIATDAFPAI 668

Query: 315 HNQLWNYRE--ADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIG------------ 360
            +   NY +  A+ N+   + +PLYG++  +      P   +   ++G            
Sbjct: 669 KSSFLNYAKIRANYNKNANDNIPLYGLDLAYGNGGGFPYGNTVGLTVGNRLPDANLRPEV 728

Query: 361 --VTEVLNSNLLIELRAD-DIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTG-EV 416
              TE+     L+  R + D+    QRS G++I+V +P     +   +    T+N G E 
Sbjct: 729 VYSTEIGGEFQLLNDRINVDVSAYSQRSEGQVITVRVPNTTGFSSLLINVGETKNWGYEA 788

Query: 417 EASY 420
           E  Y
Sbjct: 789 EVKY 792


>gi|332027092|gb|EGI67188.1| hypothetical protein G5I_04344 [Acromyrmex echinatior]
          Length = 1545

 Score = 40.8 bits (94), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 39/167 (23%), Positives = 65/167 (38%), Gaps = 26/167 (15%)

Query: 177  TAHCLRFPGDWFHVFGIGQRSIGFSVRIEVK---------------TGSKVSE-----VT 216
            +AHCLRF   W+ V+ +    +  +V ++V                TG  V       + 
Sbjct: 967  SAHCLRFSDLWYSVYQLEDPIVDHAVYLQVYEKRVLANGSTYWKDLTGDSVVRQVVYAIR 1026

Query: 217  VGPENK-----TATSADNFLKVNLIG-DFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGN 270
            +G  N+       T A  + +V ++G +     N+    +  LV       G      G 
Sbjct: 1027 LGTFNRHHRGNQDTIAFAYKEVKMLGREEDEIPNLDVVRDRLLVPSSVTSKGFEYPAEGE 1086

Query: 271  FSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQ 317
               ++++  +     G EC+K GV + AF  QP  C     +CL NQ
Sbjct: 1087 SGEYLVIPASSINESGNECDKAGVGFAAFAKQPDRCERVRGTCLKNQ 1133


>gi|3970809|emb|CAA10220.1| sodium-calcium exchanger III [Oncorhynchus mykiss]
          Length = 263

 Score = 40.8 bits (94), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 38/128 (29%), Positives = 55/128 (42%), Gaps = 26/128 (20%)

Query: 560 LVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQR-----IRDFRSRRID-VDHPH 613
           L LA FP  ++L WL  ++ LF   Y +    +++DN R         RS+ I+ +D   
Sbjct: 138 LTLAFFPICVILAWLADRRLLF---YKFMHKKYRADNHRGVIIETEHERSKGIEMMDGGG 194

Query: 614 VHVRKHHKQE-GRHHKL----------EARRRRCGIHSDHKHKHSDRDTDY------YYY 656
             V  H   + G  H L          E+RR    I  D K KH +++ D       YY 
Sbjct: 195 KMVNSHFAHDGGAAHNLISLIEGKEVDESRRDMIRILKDLKQKHPEKEMDQLVEMANYYA 254

Query: 657 LHHVQKDK 664
           L H QK +
Sbjct: 255 LSHQQKSR 262


>gi|63099310|gb|AAY32773.1| solute carrier family 8 member 3 [Mixophyes balbus]
          Length = 377

 Score = 40.8 bits (94), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 43/160 (26%), Positives = 61/160 (38%), Gaps = 43/160 (26%)

Query: 560 LVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRDFRSRRIDVDHPHV----- 614
           L L  FP  +VL W+  ++ LF   Y +    +++D  R     +   + +HP       
Sbjct: 136 LTLFFFPVCVVLAWVADRRLLF---YKYMHKKYRTDKHRAIMIET---EAEHPKGIEMDG 189

Query: 615 HVRKHHKQEGRHHKL------EARRRRCGIHSDHKHKHSDRDTDY------YYYLHHVQK 662
            V   H  +G    +      E+RR    I  D K KH ++D D       YY L H QK
Sbjct: 190 KVMNSHFLDGNLLNMDGKEVDESRRDMIRILKDLKQKHPEKDLDQLVEMANYYALSHQQK 249

Query: 663 D--------------------KHKHGRSKNSSVMQQLYLD 682
                                KH   +SK SS +Q L LD
Sbjct: 250 SRAFYRIQATRMMTGAGNILKKHAAEQSKRSSSLQDLCLD 289


>gi|63099306|gb|AAY32771.1| solute carrier family 8 member 3 [Rheobatrachus silus]
          Length = 377

 Score = 40.8 bits (94), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 42/160 (26%), Positives = 61/160 (38%), Gaps = 43/160 (26%)

Query: 560 LVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRDFRSRRIDVDHPHV----- 614
           L L  FP  +VL W+  ++ LF   Y +    +++D  R     +   + +HP       
Sbjct: 136 LTLFFFPVCVVLAWVADRRLLF---YKYMHKKYRTDKHRAIMIET---EAEHPKGIEMDG 189

Query: 615 HVRKHHKQEGRHHKL------EARRRRCGIHSDHKHKHSDRDTDY------YYYLHHVQK 662
            +   H  +G    +      E+RR    I  D K KH ++D D       YY L H QK
Sbjct: 190 KIMNSHFLDGNLLNMDGKEVDESRRDMIRILKDLKQKHPEKDLDQLVEMANYYALSHQQK 249

Query: 663 D--------------------KHKHGRSKNSSVMQQLYLD 682
                                KH   +SK SS MQ + LD
Sbjct: 250 SRAFYRIQATRMMTGAGNILKKHAAEQSKRSSSMQDICLD 289


>gi|225443423|ref|XP_002267790.1| PREDICTED: uncharacterized protein LOC100249538 [Vitis vinifera]
          Length = 230

 Score = 40.8 bits (94), Expect = 2.9,   Method: Composition-based stats.
 Identities = 18/54 (33%), Positives = 32/54 (59%)

Query: 528 ITGKACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAIFPTVLVLLWLLHQKGLF 581
           I G +C   C+  +D +C I+++C+S LV   LVL +    L+  +L+ + G+F
Sbjct: 27  IFGDSCEGVCAGTWDITCFIEHLCVSNLVKLFLVLGLCYITLLFFYLMFKLGIF 80


>gi|297735741|emb|CBI18428.3| unnamed protein product [Vitis vinifera]
          Length = 808

 Score = 40.4 bits (93), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 18/54 (33%), Positives = 32/54 (59%)

Query: 528 ITGKACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAIFPTVLVLLWLLHQKGLF 581
           I G +C   C+  +D +C I+++C+S LV   LVL +    L+  +L+ + G+F
Sbjct: 605 IFGDSCEGVCAGTWDITCFIEHLCVSNLVKLFLVLGLCYITLLFFYLMFKLGIF 658


>gi|238478576|ref|NP_001154356.1| uncharacterized protein [Arabidopsis thaliana]
 gi|5263325|gb|AAD41427.1|AC007727_16 F8K7.16 [Arabidopsis thaliana]
 gi|332192026|gb|AEE30147.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 233

 Score = 40.0 bits (92), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 23/77 (29%), Positives = 42/77 (54%), Gaps = 2/77 (2%)

Query: 506 INDFFESI-ESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAI 564
           ++ FF     SIG      L DF++GK+C   C S +DF C+++  C++ L    L+L +
Sbjct: 5   MDSFFTGFSHSIGNFFGSPL-DFLSGKSCSSVCPSPWDFICYVENFCVANLAKTALILIL 63

Query: 565 FPTVLVLLWLLHQKGLF 581
               L  +++L++ G +
Sbjct: 64  SYFFLFFIYMLYKVGFW 80


>gi|63099280|gb|AAY32758.1| solute carrier family 8 member 3 [Xenopus (Silurana) tropicalis]
          Length = 377

 Score = 40.0 bits (92), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 42/160 (26%), Positives = 61/160 (38%), Gaps = 43/160 (26%)

Query: 560 LVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRDFRSRRIDVDHPH-----V 614
           L L  FP  +VL W+  ++ LF   Y +    +++D  R     +   + DHP       
Sbjct: 136 LTLFFFPVCVVLAWVADKRLLF---YKYMHKKYRTDKHRAIMIET---EADHPKGIEMDG 189

Query: 615 HVRKHHKQEGRHHKL------EARRRRCGIHSDHKHKHSDRDTDY------YYYLHHVQK 662
            V   H  +G    +      E+RR    I  + K KH ++D D       YY L H QK
Sbjct: 190 KVMNSHFLDGNLLNMDGKEVDESRRDMIRILKELKQKHPEKDLDQLVEMANYYALSHQQK 249

Query: 663 D--------------------KHKHGRSKNSSVMQQLYLD 682
                                KH   +SK SS +Q + LD
Sbjct: 250 SRAFYRIQATRMMTGAGNILKKHAAEQSKRSSSLQDICLD 289


>gi|63099318|gb|AAY32777.1| solute carrier family 8 member 3 [Myobatrachus gouldii]
          Length = 377

 Score = 40.0 bits (92), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 42/160 (26%), Positives = 61/160 (38%), Gaps = 43/160 (26%)

Query: 560 LVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRDFRSRRIDVDHPHV----- 614
           L L  FP  +VL W+  ++ LF   Y +    +++D  R     +   + +HP       
Sbjct: 136 LTLFFFPVCVVLAWVADKRLLF---YKYMHKKYRTDKHRAIMIET---EAEHPKGIEMDG 189

Query: 615 HVRKHHKQEGRHHKL------EARRRRCGIHSDHKHKHSDRDTDY------YYYLHHVQK 662
            V   H  +G    +      E+RR    I  D K KH ++D D       YY L H QK
Sbjct: 190 KVMNSHFLDGNLLNMDGKEVDESRRDMIRILKDLKQKHPEKDLDQLVEMANYYALSHQQK 249

Query: 663 D--------------------KHKHGRSKNSSVMQQLYLD 682
                                KH   +SK SS +Q + LD
Sbjct: 250 SRAFYRIQATRMMTGAGNILKKHAAEQSKRSSSLQDICLD 289


>gi|63099282|gb|AAY32759.1| solute carrier family 8 member 3 [Heleophryne purcelli]
          Length = 377

 Score = 39.7 bits (91), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 42/160 (26%), Positives = 61/160 (38%), Gaps = 43/160 (26%)

Query: 560 LVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRDFRSRRIDVDHPHV----- 614
           L L  FP  +VL W+  ++ LF   Y +    +++D  R     +   + +HP       
Sbjct: 136 LTLFFFPVCVVLAWVADRRLLF---YKYMHKKYRTDKHRAIMIET---EAEHPKGIEMDG 189

Query: 615 HVRKHHKQEGRHHKL------EARRRRCGIHSDHKHKHSDRDTDY------YYYLHHVQK 662
            V   H  +G    +      E+RR    I  D K KH ++D D       YY L H QK
Sbjct: 190 KVMNSHFLDGNLLNMDGKEVDESRRDMIRILKDLKQKHPEKDLDQLVEMANYYALSHQQK 249

Query: 663 D--------------------KHKHGRSKNSSVMQQLYLD 682
                                KH   +SK SS +Q + LD
Sbjct: 250 SRAFYRIQATRMMTGAGNILKKHAAEQSKRSSSLQDICLD 289


>gi|63099276|gb|AAY32756.1| solute carrier family 8 member 3 [Limnodynastes salmini]
          Length = 377

 Score = 39.7 bits (91), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 42/160 (26%), Positives = 61/160 (38%), Gaps = 43/160 (26%)

Query: 560 LVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRDFRSRRIDVDHPHV----- 614
           L L  FP  +VL W+  ++ LF   Y +    +++D  R     +   + +HP       
Sbjct: 136 LTLFFFPVCVVLAWVADRRLLF---YKYMHKKYRTDKHRAIMIET---EAEHPKGIEMDG 189

Query: 615 HVRKHHKQEGRHHKL------EARRRRCGIHSDHKHKHSDRDTDY------YYYLHHVQK 662
            V   H  +G    +      E+RR    I  D K KH ++D D       YY L H QK
Sbjct: 190 KVMNSHFLDGNLLNMDGKEVDESRRDMIRILKDLKQKHPEKDLDQLVEMANYYALSHQQK 249

Query: 663 D--------------------KHKHGRSKNSSVMQQLYLD 682
                                KH   +SK SS +Q + LD
Sbjct: 250 SRAFYRIQATRMMTGAGNILKKHAAEQSKRSSSLQDICLD 289


>gi|297845152|ref|XP_002890457.1| hypothetical protein ARALYDRAFT_313065 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297336299|gb|EFH66716.1| hypothetical protein ARALYDRAFT_313065 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 232

 Score = 38.9 bits (89), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 23/77 (29%), Positives = 41/77 (53%), Gaps = 2/77 (2%)

Query: 506 INDFFESIE-SIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAI 564
           ++ FF     SIG      L DF++GK+C   C S +DF C ++  C++ L    L+L +
Sbjct: 5   LDSFFTGFSHSIGNFFGSPL-DFLSGKSCSSVCPSPWDFICFVENFCVANLAKAALILIL 63

Query: 565 FPTVLVLLWLLHQKGLF 581
               L  +++L++ G +
Sbjct: 64  SYFFLFFIYMLYKVGFW 80


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.322    0.137    0.424 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 11,193,821,965
Number of Sequences: 23463169
Number of extensions: 482742170
Number of successful extensions: 1092119
Number of sequences better than 100.0: 407
Number of HSP's better than 100.0 without gapping: 116
Number of HSP's successfully gapped in prelim test: 291
Number of HSP's that attempted gapping in prelim test: 1087273
Number of HSP's gapped (non-prelim): 2882
length of query: 701
length of database: 8,064,228,071
effective HSP length: 150
effective length of query: 551
effective length of database: 8,839,720,017
effective search space: 4870685729367
effective search space used: 4870685729367
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 81 (35.8 bits)