BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 038458
         (347 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|449443945|ref|XP_004139736.1| PREDICTED: uncharacterized protein LOC101209112 [Cucumis sativus]
          Length = 1341

 Score =  608 bits (1567), Expect = e-171,   Method: Compositional matrix adjust.
 Identities = 289/347 (83%), Positives = 315/347 (90%)

Query: 1   SVIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSV 60
           +V+R LD  RW KAEE TAELIA IQP+P SEERRNAVA YV+RLI++CFPCQVFTFGSV
Sbjct: 26  TVMRMLDSERWSKAEERTAELIACIQPNPPSEERRNAVADYVQRLIMKCFPCQVFTFGSV 85

Query: 61  PLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKII 120
           PLKTYLPD DIDL AFS +Q LK+TWAH VRDMLE+EEKNE+AEFRVKEVQYI+AEVKII
Sbjct: 86  PLKTYLPDGDIDLTAFSKNQNLKETWAHQVRDMLESEEKNENAEFRVKEVQYIKAEVKII 145

Query: 121 KCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHH 180
           KCLV+N VVDI+F+QLGGLCTLCFL+EVDHLIN+NHLFKRSIILIKAWCYYESRILG HH
Sbjct: 146 KCLVENIVVDISFDQLGGLCTLCFLEEVDHLINQNHLFKRSIILIKAWCYYESRILGAHH 205

Query: 181 GLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPD 240
           GLIS+YAL TLVLYIFHVFN SFAGPLEVLYRFLEFFSKFDWDNFC+SLWGPVPIS LPD
Sbjct: 206 GLISTYALETLVLYIFHVFNNSFAGPLEVLYRFLEFFSKFDWDNFCVSLWGPVPISSLPD 265

Query: 241 VTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGR 300
           VTAEPPRKDGG LLLSK FL++C   YA FPGGQENQGQPFVSKHFNVIDPLRVNNNLGR
Sbjct: 266 VTAEPPRKDGGELLLSKLFLEACSAVYAVFPGGQENQGQPFVSKHFNVIDPLRVNNNLGR 325

Query: 301 SVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           SVSKGNFFRIR+AF F AK LARL +CP ED+  E+NQFF+NT +RH
Sbjct: 326 SVSKGNFFRIRSAFAFGAKRLARLFECPREDILAELNQFFLNTWERH 372


>gi|356520288|ref|XP_003528795.1| PREDICTED: uncharacterized protein LOC100809742 [Glycine max]
          Length = 1331

 Score =  605 bits (1561), Expect = e-171,   Method: Composition-based stats.
 Identities = 294/349 (84%), Positives = 317/349 (90%), Gaps = 2/349 (0%)

Query: 1   SVIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQV--FTFG 58
           SVI+ LD  RWLKAE+ TAELIA IQP+P SEERRNAVA YV+RLI++CFPCQV  FTFG
Sbjct: 26  SVIQVLDSERWLKAEQRTAELIACIQPNPPSEERRNAVADYVQRLIMKCFPCQVGVFTFG 85

Query: 59  SVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVK 118
           SVPLKTYLPD DIDL AFS +Q LKD+WAH VRDMLENEEKNE+AEF VKEVQYIQAEVK
Sbjct: 86  SVPLKTYLPDGDIDLTAFSKNQNLKDSWAHQVRDMLENEEKNENAEFHVKEVQYIQAEVK 145

Query: 119 IIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGG 178
           IIKCLV+N VVDI+FNQLGGLCTLCFL+EVD+LIN+NHLFKRSIILIKAWCYYESRILG 
Sbjct: 146 IIKCLVENIVVDISFNQLGGLCTLCFLEEVDNLINQNHLFKRSIILIKAWCYYESRILGA 205

Query: 179 HHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLL 238
           HHGLIS+YAL TLVLYIFHVFN SFAGPLEVLYRFLEFFSKFDW+NFC+SLWGPVPIS L
Sbjct: 206 HHGLISTYALETLVLYIFHVFNNSFAGPLEVLYRFLEFFSKFDWENFCVSLWGPVPISSL 265

Query: 239 PDVTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNL 298
           PDVTAEPPRKDGG LLLSK FLD+C   YA FPGGQENQGQPFVSKHFNVIDPLRVNNNL
Sbjct: 266 PDVTAEPPRKDGGDLLLSKLFLDACSSVYAVFPGGQENQGQPFVSKHFNVIDPLRVNNNL 325

Query: 299 GRSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           GRSVSKGNFFRIR+AF F AK LARLLDCP E+L++EVNQFF NT +RH
Sbjct: 326 GRSVSKGNFFRIRSAFAFGAKKLARLLDCPEEELFSEVNQFFFNTWERH 374


>gi|356560284|ref|XP_003548423.1| PREDICTED: uncharacterized protein LOC100800527 [Glycine max]
          Length = 1337

 Score =  600 bits (1546), Expect = e-169,   Method: Composition-based stats.
 Identities = 292/349 (83%), Positives = 316/349 (90%), Gaps = 2/349 (0%)

Query: 1   SVIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQV--FTFG 58
           SVI+ LD  RWLKAE+ TAELIA IQP+P SEERRNAVA YV+RLI++CFPCQV  FTFG
Sbjct: 26  SVIQVLDSERWLKAEQRTAELIACIQPNPPSEERRNAVADYVQRLIMKCFPCQVRVFTFG 85

Query: 59  SVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVK 118
           SVPLKTYLPD DIDL AFS +Q LKD+WAH VRDMLENEEKNE+AEF VKEVQYIQAEVK
Sbjct: 86  SVPLKTYLPDGDIDLTAFSKNQNLKDSWAHQVRDMLENEEKNENAEFHVKEVQYIQAEVK 145

Query: 119 IIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGG 178
           IIKCLV+N VVDI+FNQLGGLCTLCFL+EVD+LIN+NHLFKRSIILIKAWCYYESRILG 
Sbjct: 146 IIKCLVENIVVDISFNQLGGLCTLCFLEEVDNLINQNHLFKRSIILIKAWCYYESRILGA 205

Query: 179 HHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLL 238
           HHGLIS+YAL TLVLYIFHVFN SFAGPLEVLYRFLEFFSKFDW+NFC+SLWGPVPIS L
Sbjct: 206 HHGLISTYALETLVLYIFHVFNNSFAGPLEVLYRFLEFFSKFDWENFCVSLWGPVPISSL 265

Query: 239 PDVTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNL 298
           PDVTAEPPRKDGG LLLSK FLD+C   YA FPGGQENQGQPFVSKHFNVIDPLRVNNNL
Sbjct: 266 PDVTAEPPRKDGGDLLLSKLFLDACSSVYAVFPGGQENQGQPFVSKHFNVIDPLRVNNNL 325

Query: 299 GRSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           GRSVSKGNFFRIR+AF F AK LARLLDC  ++L++EVNQFF NT +RH
Sbjct: 326 GRSVSKGNFFRIRSAFAFGAKRLARLLDCSEDELFSEVNQFFFNTWERH 374


>gi|225454502|ref|XP_002277075.1| PREDICTED: uncharacterized protein LOC100241322 [Vitis vinifera]
          Length = 1295

 Score =  581 bits (1498), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 282/347 (81%), Positives = 306/347 (88%), Gaps = 1/347 (0%)

Query: 1   SVIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSV 60
           S IR LD  RWL AEE TAELIA IQP+  SEE RNAVA YV+R+++QCFPCQVFTFGSV
Sbjct: 25  SAIRVLDTERWLIAEERTAELIACIQPNQPSEELRNAVADYVQRIVVQCFPCQVFTFGSV 84

Query: 61  PLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKII 120
           PLKTYLPD DIDL AFS++Q LKDTWA+ VRDML++EEKNE+AEFRVKEVQYIQAEVKII
Sbjct: 85  PLKTYLPDGDIDLTAFSNNQNLKDTWANQVRDMLQSEEKNENAEFRVKEVQYIQAEVKII 144

Query: 121 KCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHH 180
           KCLV+N VVDI+FNQLGGLCTLCFL+EVDHLIN+NHLFKRSIILIKAWCYYESRILG HH
Sbjct: 145 KCLVENIVVDISFNQLGGLCTLCFLEEVDHLINQNHLFKRSIILIKAWCYYESRILGAHH 204

Query: 181 GLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPD 240
           GLIS+YAL TLVLYIFHVFN SF GPLEVLYRFLEFFS FDWDNFC+SLWGPVPIS LPD
Sbjct: 205 GLISTYALETLVLYIFHVFNNSFTGPLEVLYRFLEFFSSFDWDNFCVSLWGPVPISSLPD 264

Query: 241 VTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGR 300
           VTAEPPR+D G LLLSK FLD+C   YA FP GQE QGQ F+SKHFNVIDPLRVNNNLGR
Sbjct: 265 VTAEPPRQDSGELLLSKLFLDACSSVYAVFPHGQEKQGQSFISKHFNVIDPLRVNNNLGR 324

Query: 301 SVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           SVSKGNFFRIR+AF F AK LARLLD P E++  EVNQ FMNT +RH
Sbjct: 325 SVSKGNFFRIRSAFAFGAKRLARLLD-PKENIIFEVNQLFMNTWERH 370


>gi|297745424|emb|CBI40504.3| unnamed protein product [Vitis vinifera]
          Length = 1229

 Score =  580 bits (1494), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 282/347 (81%), Positives = 306/347 (88%), Gaps = 1/347 (0%)

Query: 1   SVIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSV 60
           S IR LD  RWL AEE TAELIA IQP+  SEE RNAVA YV+R+++QCFPCQVFTFGSV
Sbjct: 25  SAIRVLDTERWLIAEERTAELIACIQPNQPSEELRNAVADYVQRIVVQCFPCQVFTFGSV 84

Query: 61  PLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKII 120
           PLKTYLPD DIDL AFS++Q LKDTWA+ VRDML++EEKNE+AEFRVKEVQYIQAEVKII
Sbjct: 85  PLKTYLPDGDIDLTAFSNNQNLKDTWANQVRDMLQSEEKNENAEFRVKEVQYIQAEVKII 144

Query: 121 KCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHH 180
           KCLV+N VVDI+FNQLGGLCTLCFL+EVDHLIN+NHLFKRSIILIKAWCYYESRILG HH
Sbjct: 145 KCLVENIVVDISFNQLGGLCTLCFLEEVDHLINQNHLFKRSIILIKAWCYYESRILGAHH 204

Query: 181 GLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPD 240
           GLIS+YAL TLVLYIFHVFN SF GPLEVLYRFLEFFS FDWDNFC+SLWGPVPIS LPD
Sbjct: 205 GLISTYALETLVLYIFHVFNNSFTGPLEVLYRFLEFFSSFDWDNFCVSLWGPVPISSLPD 264

Query: 241 VTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGR 300
           VTAEPPR+D G LLLSK FLD+C   YA FP GQE QGQ F+SKHFNVIDPLRVNNNLGR
Sbjct: 265 VTAEPPRQDSGELLLSKLFLDACSSVYAVFPHGQEKQGQSFISKHFNVIDPLRVNNNLGR 324

Query: 301 SVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           SVSKGNFFRIR+AF F AK LARLLD P E++  EVNQ FMNT +RH
Sbjct: 325 SVSKGNFFRIRSAFAFGAKRLARLLD-PKENIIFEVNQLFMNTWERH 370


>gi|255564741|ref|XP_002523365.1| hypothetical protein RCOM_0719270 [Ricinus communis]
 gi|223537453|gb|EEF39081.1| hypothetical protein RCOM_0719270 [Ricinus communis]
          Length = 1334

 Score =  569 bits (1467), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 284/347 (81%), Positives = 308/347 (88%)

Query: 1   SVIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSV 60
           SVIR LD  RW KAEE TAELI  I+P+  SE RRNAVA YV RLI +CFPC+VFTFGSV
Sbjct: 19  SVIRVLDSERWAKAEERTAELIDCIKPNEPSERRRNAVADYVERLITKCFPCRVFTFGSV 78

Query: 61  PLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKII 120
           PLKTYLPD DIDL AFS+ Q++K+TWAH VRD+LENEEKNE+AEFRVKEVQYIQAEVKII
Sbjct: 79  PLKTYLPDGDIDLTAFSEGQSMKETWAHQVRDVLENEEKNENAEFRVKEVQYIQAEVKII 138

Query: 121 KCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHH 180
           KCLV+N VVDI+F+QLGGLCTLCFL+EVDHLIN++HLFK+SIILIKAWCYYESRILG HH
Sbjct: 139 KCLVENIVVDISFDQLGGLCTLCFLEEVDHLINQDHLFKKSIILIKAWCYYESRILGAHH 198

Query: 181 GLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPD 240
           GLIS+YAL TLVLYIFHVFN SFAGPLEVLYRFLEFFSKFDWDNFC+SLWGPVPIS LPD
Sbjct: 199 GLISTYALETLVLYIFHVFNNSFAGPLEVLYRFLEFFSKFDWDNFCVSLWGPVPISSLPD 258

Query: 241 VTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGR 300
           VTAEPPRKDGG LLLSK FL +C   YA  PGG E+QGQ F SKHFNVIDPLRVNNNLGR
Sbjct: 259 VTAEPPRKDGGELLLSKLFLKACGAVYAVSPGGPESQGQTFTSKHFNVIDPLRVNNNLGR 318

Query: 301 SVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           SVSKGNFFRIR+AF F AK LARLLDCP ED++ EVNQFFMNT DRH
Sbjct: 319 SVSKGNFFRIRSAFAFGAKRLARLLDCPKEDIHFEVNQFFMNTWDRH 365


>gi|302143676|emb|CBI22537.3| unnamed protein product [Vitis vinifera]
          Length = 1359

 Score =  566 bits (1458), Expect = e-159,   Method: Composition-based stats.
 Identities = 266/347 (76%), Positives = 295/347 (85%)

Query: 1   SVIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSV 60
           SV R LD  R   AEE T +LIA IQP+  SEERR AVA+YV+ LI++CF C+VF FGSV
Sbjct: 25  SVTRALDQERLSLAEERTKQLIACIQPNQPSEERREAVASYVKSLIMKCFSCKVFPFGSV 84

Query: 61  PLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKII 120
           PLKTYLPD DIDL AFS    LKDTWA+ VRD+LE EEK+  AEFRVKEVQYIQAEVKII
Sbjct: 85  PLKTYLPDGDIDLTAFSKSPNLKDTWANEVRDILEREEKSGDAEFRVKEVQYIQAEVKII 144

Query: 121 KCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHH 180
           KCLV+N VVDI+FNQLGGLCTLCFL+EVDHLI++ HLFKRSIILIKAWCYYESRILG HH
Sbjct: 145 KCLVENIVVDISFNQLGGLCTLCFLEEVDHLISQKHLFKRSIILIKAWCYYESRILGAHH 204

Query: 181 GLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPD 240
           GLIS+YAL TLVLYIF VFN SFAGPLEVLYRFLEFFSKFDW+N+C+SLWGPVPIS LPD
Sbjct: 205 GLISTYALETLVLYIFRVFNNSFAGPLEVLYRFLEFFSKFDWENYCVSLWGPVPISSLPD 264

Query: 241 VTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGR 300
           VTA+PPRKD G LLLSK FLD+C   YA  P GQEN  QPF+SK+FNVIDPLR NNNLGR
Sbjct: 265 VTADPPRKDSGELLLSKLFLDACSSVYAVLPVGQENPEQPFISKYFNVIDPLRTNNNLGR 324

Query: 301 SVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           SVSKGNFFRIR+AF F A+ LARLLDCP +++  EVNQFFMNT +RH
Sbjct: 325 SVSKGNFFRIRSAFAFGAQRLARLLDCPKDNVIAEVNQFFMNTWERH 371


>gi|225462743|ref|XP_002268106.1| PREDICTED: uncharacterized protein LOC100248390 [Vitis vinifera]
          Length = 1353

 Score =  565 bits (1456), Expect = e-158,   Method: Composition-based stats.
 Identities = 266/347 (76%), Positives = 295/347 (85%)

Query: 1   SVIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSV 60
           SV R LD  R   AEE T +LIA IQP+  SEERR AVA+YV+ LI++CF C+VF FGSV
Sbjct: 25  SVTRALDQERLSLAEERTKQLIACIQPNQPSEERREAVASYVKSLIMKCFSCKVFPFGSV 84

Query: 61  PLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKII 120
           PLKTYLPD DIDL AFS    LKDTWA+ VRD+LE EEK+  AEFRVKEVQYIQAEVKII
Sbjct: 85  PLKTYLPDGDIDLTAFSKSPNLKDTWANEVRDILEREEKSGDAEFRVKEVQYIQAEVKII 144

Query: 121 KCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHH 180
           KCLV+N VVDI+FNQLGGLCTLCFL+EVDHLI++ HLFKRSIILIKAWCYYESRILG HH
Sbjct: 145 KCLVENIVVDISFNQLGGLCTLCFLEEVDHLISQKHLFKRSIILIKAWCYYESRILGAHH 204

Query: 181 GLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPD 240
           GLIS+YAL TLVLYIF VFN SFAGPLEVLYRFLEFFSKFDW+N+C+SLWGPVPIS LPD
Sbjct: 205 GLISTYALETLVLYIFRVFNNSFAGPLEVLYRFLEFFSKFDWENYCVSLWGPVPISSLPD 264

Query: 241 VTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGR 300
           VTA+PPRKD G LLLSK FLD+C   YA  P GQEN  QPF+SK+FNVIDPLR NNNLGR
Sbjct: 265 VTADPPRKDSGELLLSKLFLDACSSVYAVLPVGQENPEQPFISKYFNVIDPLRTNNNLGR 324

Query: 301 SVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           SVSKGNFFRIR+AF F A+ LARLLDCP +++  EVNQFFMNT +RH
Sbjct: 325 SVSKGNFFRIRSAFAFGAQRLARLLDCPKDNVIAEVNQFFMNTWERH 371


>gi|42566126|ref|NP_191728.2| nucleotidyltransferase [Arabidopsis thaliana]
 gi|332646720|gb|AEE80241.1| nucleotidyltransferase [Arabidopsis thaliana]
          Length = 1303

 Score =  561 bits (1445), Expect = e-157,   Method: Composition-based stats.
 Identities = 271/348 (77%), Positives = 303/348 (87%), Gaps = 1/348 (0%)

Query: 1   SVIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGS 59
           SV RPLD  RW KAE+ TA+LIA IQP+P SE+RRNAVA+YVRRLI++CFP  Q+F FGS
Sbjct: 29  SVTRPLDAERWAKAEDRTAKLIACIQPNPPSEDRRNAVASYVRRLIMECFPQVQIFMFGS 88

Query: 60  VPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKI 119
           VPLKTYLPD DIDL AFS +Q LKD+WA+LVRDMLE EEKNE+AEF VKEVQYIQAEVKI
Sbjct: 89  VPLKTYLPDGDIDLTAFSANQNLKDSWANLVRDMLEKEEKNENAEFHVKEVQYIQAEVKI 148

Query: 120 IKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGH 179
           IKCLV+N VVDI+FNQ+GGLCTLCFL+EVDH IN+NHLFKRSIILIKAWCYYESRILG H
Sbjct: 149 IKCLVENIVVDISFNQIGGLCTLCFLEEVDHYINQNHLFKRSIILIKAWCYYESRILGAH 208

Query: 180 HGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLP 239
           HGLIS+YAL TLVLYIF++FN SF+GPLEVLYRFLEFFSKFDW NFCLSLWGPVP+S LP
Sbjct: 209 HGLISTYALETLVLYIFYLFNNSFSGPLEVLYRFLEFFSKFDWQNFCLSLWGPVPVSSLP 268

Query: 240 DVTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLG 299
           DVTAEPPR+D G L +S++F  +C   YA     QE QGQPFVSKHFNVIDPLR NNNLG
Sbjct: 269 DVTAEPPRRDVGELRVSEAFYRACSRVYAVNIAPQEIQGQPFVSKHFNVIDPLRENNNLG 328

Query: 300 RSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           RSVSKGNFFRIR+AFT  AK L RLL+CP E+L +EVNQFFMNT +RH
Sbjct: 329 RSVSKGNFFRIRSAFTLGAKKLTRLLECPKENLIHEVNQFFMNTWERH 376


>gi|242036527|ref|XP_002465658.1| hypothetical protein SORBIDRAFT_01g043240 [Sorghum bicolor]
 gi|241919512|gb|EER92656.1| hypothetical protein SORBIDRAFT_01g043240 [Sorghum bicolor]
          Length = 1333

 Score =  551 bits (1420), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 264/346 (76%), Positives = 294/346 (84%)

Query: 2   VIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVP 61
           V R LDP RW  AE+ TAELIARIQP+ +SE RR AV  YV+RLI+ C  CQVFTFGSVP
Sbjct: 16  VTRRLDPERWAVAEDRTAELIARIQPNAYSEGRRLAVYHYVQRLIMNCLSCQVFTFGSVP 75

Query: 62  LKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIK 121
           LKTYLPD DID+ AFS+ + LK+ WA+LVRD LE EEKNE+AEF VKEVQYIQAEVKIIK
Sbjct: 76  LKTYLPDGDIDVTAFSNSEELKEIWANLVRDALEREEKNENAEFHVKEVQYIQAEVKIIK 135

Query: 122 CLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHG 181
           CLV+N VVDI+FNQ+GGLCTLCFL+E+D+LI+ NHLFKRSIILIKAWC+YESRILG HHG
Sbjct: 136 CLVENIVVDISFNQVGGLCTLCFLEEIDNLISRNHLFKRSIILIKAWCFYESRILGAHHG 195

Query: 182 LISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDV 241
           LIS+YAL TLVLYIFH+FN SF GPLEVLYRFLEFFS FDW+ FCLSLWGPVPIS LPD+
Sbjct: 196 LISTYALETLVLYIFHIFNNSFTGPLEVLYRFLEFFSNFDWEKFCLSLWGPVPISSLPDM 255

Query: 242 TAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRS 301
           TAEPPR D G LLL+KSFLD+C  AY   P  QENQGQPFVSKHFNVIDPLR NNNLGRS
Sbjct: 256 TAEPPRMDSGELLLNKSFLDTCSSAYGVVPRTQENQGQPFVSKHFNVIDPLRANNNLGRS 315

Query: 302 VSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           VSKGNFFRIR+AF + AK L +LL+CP EDL  E+NQFF NT  RH
Sbjct: 316 VSKGNFFRIRSAFAYGAKRLGKLLECPKEDLIAELNQFFTNTWIRH 361


>gi|218192316|gb|EEC74743.1| hypothetical protein OsI_10487 [Oryza sativa Indica Group]
          Length = 1316

 Score =  550 bits (1417), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 272/347 (78%), Positives = 293/347 (84%), Gaps = 1/347 (0%)

Query: 2   VIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVP 61
           V R LD  RW  AE  TAELIARIQP+  SE RR AV  YVRRLI  C  CQVFTFGSVP
Sbjct: 14  VTRRLDGERWAAAEVRTAELIARIQPNADSERRRRAVYDYVRRLITNCLSCQVFTFGSVP 73

Query: 62  LKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIK 121
           LKTYLPD DID+ AFSD + LKDTWA+LVRD LE+EEK+E+AEFRVKEVQYIQAEVKIIK
Sbjct: 74  LKTYLPDGDIDVTAFSDSEELKDTWANLVRDALEHEEKSENAEFRVKEVQYIQAEVKIIK 133

Query: 122 CLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHG 181
           CLVDN VVDI+FNQ+GGLCTLCFL+EVD LI++NHLFKRSIILIKAWC+YESRILG HHG
Sbjct: 134 CLVDNIVVDISFNQVGGLCTLCFLEEVDALISQNHLFKRSIILIKAWCFYESRILGAHHG 193

Query: 182 LISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDV 241
           LIS+YAL TLVLYIFHVFN  F GPLEVLYRFLEFFS FDW+ FCLSL GPVPIS LPD+
Sbjct: 194 LISTYALETLVLYIFHVFNNCFTGPLEVLYRFLEFFSNFDWEKFCLSLSGPVPISSLPDM 253

Query: 242 TAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQG-QPFVSKHFNVIDPLRVNNNLGR 300
           TAEPPR D   LLLSKSFLD C YAYA  P  QE+QG QPFVSKHFNVIDPLR NNNLGR
Sbjct: 254 TAEPPRMDAAELLLSKSFLDKCSYAYAVTPRIQESQGQQPFVSKHFNVIDPLRTNNNLGR 313

Query: 301 SVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           SVSKGNFFRIR+AF+F AK LA+LL+CP EDL  EVNQFF NT  RH
Sbjct: 314 SVSKGNFFRIRSAFSFGAKRLAKLLECPKEDLIAEVNQFFTNTWIRH 360


>gi|357113459|ref|XP_003558520.1| PREDICTED: uncharacterized protein LOC100841269 [Brachypodium
           distachyon]
          Length = 1305

 Score =  550 bits (1417), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 264/346 (76%), Positives = 290/346 (83%)

Query: 2   VIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVP 61
           V R LDP RW  AE  TAELIARIQP+  SE RR AV  YVRRLI+ C  C+VFTFGSVP
Sbjct: 14  VTRRLDPERWAVAESRTAELIARIQPNAHSEGRRLAVYNYVRRLIMNCLSCEVFTFGSVP 73

Query: 62  LKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIK 121
           LKTYLPD DID+ AFS+ + LKDTWA+LVRD LE+EEK+E+AEF VKEVQYIQAEVKIIK
Sbjct: 74  LKTYLPDGDIDVTAFSNSEELKDTWANLVRDALEHEEKSENAEFCVKEVQYIQAEVKIIK 133

Query: 122 CLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHG 181
           CLVDN VVDI+FNQ+GGLCTLCFL+EVD+LIN +HLFKRSIIL+KAWC+YESRILG HHG
Sbjct: 134 CLVDNIVVDISFNQVGGLCTLCFLEEVDNLINHSHLFKRSIILVKAWCFYESRILGAHHG 193

Query: 182 LISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDV 241
           LIS+YAL TLVLYIFHVFN SF GPLEVLYRFLEFF  FDW+ FCLSLWGPVPIS LPD+
Sbjct: 194 LISTYALETLVLYIFHVFNNSFTGPLEVLYRFLEFFGNFDWEKFCLSLWGPVPISSLPDM 253

Query: 242 TAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRS 301
           TAEPPR D G LLL K FLD+C  AY   P  QE QGQPFVSKHFNVIDPLR NNNLGRS
Sbjct: 254 TAEPPRMDTGELLLGKPFLDNCNQAYGVMPRTQETQGQPFVSKHFNVIDPLRTNNNLGRS 313

Query: 302 VSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           V KGN+FRIR+AF F AK LA+LL+CP ED+  EVNQFF NT  RH
Sbjct: 314 VGKGNYFRIRSAFCFGAKKLAKLLECPKEDIITEVNQFFTNTLTRH 359


>gi|108706800|gb|ABF94595.1| Nucleotidyltransferase domain containing protein, expressed [Oryza
           sativa Japonica Group]
          Length = 1316

 Score =  550 bits (1417), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 272/347 (78%), Positives = 293/347 (84%), Gaps = 1/347 (0%)

Query: 2   VIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVP 61
           V R LD  RW  AE  TAELIARIQP+  SE RR AV  YVRRLI  C  CQVFTFGSVP
Sbjct: 14  VTRRLDGERWAAAEVRTAELIARIQPNADSERRRRAVYDYVRRLITNCLSCQVFTFGSVP 73

Query: 62  LKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIK 121
           LKTYLPD DID+ AFSD + LKDTWA+LVRD LE+EEK+E+AEFRVKEVQYIQAEVKIIK
Sbjct: 74  LKTYLPDGDIDVTAFSDSEELKDTWANLVRDALEHEEKSENAEFRVKEVQYIQAEVKIIK 133

Query: 122 CLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHG 181
           CLVDN VVDI+FNQ+GGLCTLCFL+EVD LI++NHLFKRSIILIKAWC+YESRILG HHG
Sbjct: 134 CLVDNIVVDISFNQVGGLCTLCFLEEVDALISQNHLFKRSIILIKAWCFYESRILGAHHG 193

Query: 182 LISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDV 241
           LIS+YAL TLVLYIFHVFN  F GPLEVLYRFLEFFS FDW+ FCLSL GPVPIS LPD+
Sbjct: 194 LISTYALETLVLYIFHVFNNCFTGPLEVLYRFLEFFSNFDWEKFCLSLSGPVPISSLPDM 253

Query: 242 TAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQG-QPFVSKHFNVIDPLRVNNNLGR 300
           TAEPPR D   LLLSKSFLD C YAYA  P  QE+QG QPFVSKHFNVIDPLR NNNLGR
Sbjct: 254 TAEPPRMDAAELLLSKSFLDKCSYAYAVTPRIQESQGQQPFVSKHFNVIDPLRTNNNLGR 313

Query: 301 SVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           SVSKGNFFRIR+AF+F AK LA+LL+CP EDL  EVNQFF NT  RH
Sbjct: 314 SVSKGNFFRIRSAFSFGAKRLAKLLECPKEDLIAEVNQFFTNTWIRH 360


>gi|414865287|tpg|DAA43844.1| TPA: hypothetical protein ZEAMMB73_609786 [Zea mays]
 gi|414865288|tpg|DAA43845.1| TPA: hypothetical protein ZEAMMB73_609786 [Zea mays]
          Length = 1332

 Score =  548 bits (1412), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 262/346 (75%), Positives = 292/346 (84%)

Query: 2   VIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVP 61
           V R LDP RW  AE  TAELIARIQP+ +SE RR AV  YV+RLI+ C  CQVFTFGSVP
Sbjct: 16  VTRRLDPERWAVAEGRTAELIARIQPNAYSEGRRLAVYHYVQRLIMNCLSCQVFTFGSVP 75

Query: 62  LKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIK 121
           LKTYLPD DID+ AFS+ + LK+ WA+LVRD LE EEKNE+AEF VKEVQYIQAEVKIIK
Sbjct: 76  LKTYLPDGDIDVTAFSNSEELKEIWANLVRDALEREEKNENAEFHVKEVQYIQAEVKIIK 135

Query: 122 CLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHG 181
           CLV+N VVDI+FNQ+GGLCTLCFL+E+D+LI+ENHLFKRSIILIKAWC+YESRILG HHG
Sbjct: 136 CLVENIVVDISFNQVGGLCTLCFLEEIDNLISENHLFKRSIILIKAWCFYESRILGAHHG 195

Query: 182 LISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDV 241
           LIS+YAL TLVLYIFH+FN SF GPLEVLYRFLEFFS FDW+ FCLSLWGPVPIS LPD+
Sbjct: 196 LISTYALETLVLYIFHIFNNSFTGPLEVLYRFLEFFSNFDWEKFCLSLWGPVPISSLPDM 255

Query: 242 TAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRS 301
           TAEPPR D G LLL+KSFLD+C  AY   P  QEN  QPF+SKHFNVIDPLR NNNLGRS
Sbjct: 256 TAEPPRIDSGELLLNKSFLDTCSSAYGVVPHTQENHSQPFISKHFNVIDPLRTNNNLGRS 315

Query: 302 VSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           VSKGNFFRIR+AF + AK L +LL+CP EDL  E+NQFF NT  RH
Sbjct: 316 VSKGNFFRIRSAFAYGAKRLGKLLECPKEDLIGELNQFFTNTWIRH 361


>gi|414865289|tpg|DAA43846.1| TPA: hypothetical protein ZEAMMB73_609786 [Zea mays]
          Length = 1348

 Score =  548 bits (1412), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 262/346 (75%), Positives = 292/346 (84%)

Query: 2   VIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVP 61
           V R LDP RW  AE  TAELIARIQP+ +SE RR AV  YV+RLI+ C  CQVFTFGSVP
Sbjct: 16  VTRRLDPERWAVAEGRTAELIARIQPNAYSEGRRLAVYHYVQRLIMNCLSCQVFTFGSVP 75

Query: 62  LKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIK 121
           LKTYLPD DID+ AFS+ + LK+ WA+LVRD LE EEKNE+AEF VKEVQYIQAEVKIIK
Sbjct: 76  LKTYLPDGDIDVTAFSNSEELKEIWANLVRDALEREEKNENAEFHVKEVQYIQAEVKIIK 135

Query: 122 CLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHG 181
           CLV+N VVDI+FNQ+GGLCTLCFL+E+D+LI+ENHLFKRSIILIKAWC+YESRILG HHG
Sbjct: 136 CLVENIVVDISFNQVGGLCTLCFLEEIDNLISENHLFKRSIILIKAWCFYESRILGAHHG 195

Query: 182 LISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDV 241
           LIS+YAL TLVLYIFH+FN SF GPLEVLYRFLEFFS FDW+ FCLSLWGPVPIS LPD+
Sbjct: 196 LISTYALETLVLYIFHIFNNSFTGPLEVLYRFLEFFSNFDWEKFCLSLWGPVPISSLPDM 255

Query: 242 TAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRS 301
           TAEPPR D G LLL+KSFLD+C  AY   P  QEN  QPF+SKHFNVIDPLR NNNLGRS
Sbjct: 256 TAEPPRIDSGELLLNKSFLDTCSSAYGVVPHTQENHSQPFISKHFNVIDPLRTNNNLGRS 315

Query: 302 VSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           VSKGNFFRIR+AF + AK L +LL+CP EDL  E+NQFF NT  RH
Sbjct: 316 VSKGNFFRIRSAFAYGAKRLGKLLECPKEDLIGELNQFFTNTWIRH 361


>gi|6850860|emb|CAB71099.1| putative protein [Arabidopsis thaliana]
          Length = 1388

 Score =  542 bits (1397), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 271/348 (77%), Positives = 303/348 (87%), Gaps = 1/348 (0%)

Query: 1   SVIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGS 59
           SV RPLD  RW KAE+ TA+LIA IQP+P SE+RRNAVA+YVRRLI++CFP  Q+F FGS
Sbjct: 29  SVTRPLDAERWAKAEDRTAKLIACIQPNPPSEDRRNAVASYVRRLIMECFPQVQIFMFGS 88

Query: 60  VPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKI 119
           VPLKTYLPD DIDL AFS +Q LKD+WA+LVRDMLE EEKNE+AEF VKEVQYIQAEVKI
Sbjct: 89  VPLKTYLPDGDIDLTAFSANQNLKDSWANLVRDMLEKEEKNENAEFHVKEVQYIQAEVKI 148

Query: 120 IKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGH 179
           IKCLV+N VVDI+FNQ+GGLCTLCFL+EVDH IN+NHLFKRSIILIKAWCYYESRILG H
Sbjct: 149 IKCLVENIVVDISFNQIGGLCTLCFLEEVDHYINQNHLFKRSIILIKAWCYYESRILGAH 208

Query: 180 HGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLP 239
           HGLIS+YAL TLVLYIF++FN SF+GPLEVLYRFLEFFSKFDW NFCLSLWGPVP+S LP
Sbjct: 209 HGLISTYALETLVLYIFYLFNNSFSGPLEVLYRFLEFFSKFDWQNFCLSLWGPVPVSSLP 268

Query: 240 DVTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLG 299
           DVTAEPPR+D G L +S++F  +C   YA     QE QGQPFVSKHFNVIDPLR NNNLG
Sbjct: 269 DVTAEPPRRDVGELRVSEAFYRACSRVYAVNIAPQEIQGQPFVSKHFNVIDPLRENNNLG 328

Query: 300 RSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           RSVSKGNFFRIR+AFT  AK L RLL+CP E+L +EVNQFFMNT +RH
Sbjct: 329 RSVSKGNFFRIRSAFTLGAKKLTRLLECPKENLIHEVNQFFMNTWERH 376


>gi|297817502|ref|XP_002876634.1| nucleotidyltransferase [Arabidopsis lyrata subsp. lyrata]
 gi|297322472|gb|EFH52893.1| nucleotidyltransferase [Arabidopsis lyrata subsp. lyrata]
          Length = 1302

 Score =  540 bits (1392), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 271/348 (77%), Positives = 302/348 (86%), Gaps = 1/348 (0%)

Query: 1   SVIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGS 59
           SV R LD  RW KAE+ TA+LIA IQP+P SE+RRNAVA+YVRRLI++CFP  Q+F FGS
Sbjct: 29  SVTRQLDAERWAKAEDRTAKLIACIQPNPPSEDRRNAVASYVRRLIMECFPQVQIFMFGS 88

Query: 60  VPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKI 119
           VPLKTYLPD DIDL AFS +Q LKD+WA+LVRDMLE EEKNE+AEF VKEVQYIQAEVKI
Sbjct: 89  VPLKTYLPDGDIDLTAFSANQNLKDSWANLVRDMLEKEEKNENAEFHVKEVQYIQAEVKI 148

Query: 120 IKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGH 179
           IKCLV+N VVDI+FNQ+GGLCTLCFL+EVDH IN+NHLFKRSIILIKAWCYYESRILG H
Sbjct: 149 IKCLVENIVVDISFNQIGGLCTLCFLEEVDHYINQNHLFKRSIILIKAWCYYESRILGAH 208

Query: 180 HGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLP 239
           HGLIS+YAL TLVLYIF++FN SF+GPLEVLYRFLEFFSKFDW NFCLSLWGPVP+S LP
Sbjct: 209 HGLISTYALETLVLYIFYLFNNSFSGPLEVLYRFLEFFSKFDWQNFCLSLWGPVPVSSLP 268

Query: 240 DVTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLG 299
           DVTA PPRKD G L +S++F  +C   YA     QE QGQPFVSKHFNVIDPLR NNNLG
Sbjct: 269 DVTAAPPRKDVGELRVSEAFYRACSKVYAVNIAPQEIQGQPFVSKHFNVIDPLRENNNLG 328

Query: 300 RSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           RSVSKGNFFRIR+AFT  AK LARLL+CP E+L +EVNQFFMNT +RH
Sbjct: 329 RSVSKGNFFRIRSAFTLGAKKLARLLECPKENLIHEVNQFFMNTWERH 376


>gi|413956606|gb|AFW89255.1| hypothetical protein ZEAMMB73_893455 [Zea mays]
          Length = 1316

 Score =  538 bits (1387), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 259/346 (74%), Positives = 292/346 (84%)

Query: 2   VIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVP 61
           + R LDP RW  AE+ TAELIA IQP+ +SE RR AV  YV+RLI+ C  CQVFTFGSVP
Sbjct: 16  MTRRLDPERWAVAEDRTAELIACIQPNVYSEGRRLAVYHYVQRLIMNCLSCQVFTFGSVP 75

Query: 62  LKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIK 121
           LKTYLPD DID+ AFS+ + LK+ WA+LVRD LE EEK+E+AEF VKEVQYIQAEVKIIK
Sbjct: 76  LKTYLPDGDIDVTAFSNSEELKEIWANLVRDALEREEKDENAEFHVKEVQYIQAEVKIIK 135

Query: 122 CLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHG 181
           CLV+N VVDI+FNQ+GGLCTLCFL+E+D+LI++NHLFKRSIILIKAWC+YESRILG HHG
Sbjct: 136 CLVENIVVDISFNQVGGLCTLCFLEEIDNLISQNHLFKRSIILIKAWCFYESRILGAHHG 195

Query: 182 LISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDV 241
           LIS+YAL TLVLYIFH+FN SF GPLEVLYRFLEFFS FDW+ FCLSLWGPVPIS LPD+
Sbjct: 196 LISTYALETLVLYIFHIFNNSFTGPLEVLYRFLEFFSNFDWEKFCLSLWGPVPISSLPDM 255

Query: 242 TAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRS 301
           TA PPR D G LLL+KSFLD+C  AY   P  QENQGQPFVSKHFNVIDPLR NNNLGRS
Sbjct: 256 TAIPPRMDSGELLLNKSFLDTCSSAYGVVPHTQENQGQPFVSKHFNVIDPLRTNNNLGRS 315

Query: 302 VSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           VSKGNFFRIR+AF + AK L +LL+CP E L  E+NQFF NT  RH
Sbjct: 316 VSKGNFFRIRSAFAYGAKRLGKLLECPKEALIPELNQFFTNTWIRH 361


>gi|224118186|ref|XP_002317752.1| predicted protein [Populus trichocarpa]
 gi|222858425|gb|EEE95972.1| predicted protein [Populus trichocarpa]
          Length = 353

 Score =  536 bits (1380), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 260/342 (76%), Positives = 293/342 (85%)

Query: 6   LDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTY 65
           L+  RW  AEE TAELIA IQP+  SEERR AV  YV+RLI++CFPCQVFTFGSVPLKTY
Sbjct: 1   LELERWAIAEERTAELIACIQPNQPSEERRTAVLGYVQRLIMKCFPCQVFTFGSVPLKTY 60

Query: 66  LPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVD 125
           LPD DID+  F++ Q LK TWA  V+D+L++EEK+E+AEF VKEVQYIQAEVKIIKCLV+
Sbjct: 61  LPDGDIDITVFTESQDLKKTWADEVKDILQHEEKSENAEFHVKEVQYIQAEVKIIKCLVE 120

Query: 126 NFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISS 185
           N VVDI+FNQLGGLCTLCFL+EVD LI++NHLFKRSIILIKAWCYYESRILG HHGLIS+
Sbjct: 121 NIVVDISFNQLGGLCTLCFLEEVDQLISQNHLFKRSIILIKAWCYYESRILGAHHGLIST 180

Query: 186 YALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEP 245
           YAL TLVLYIFHVFN  FAGPLEVLYRFLEFFSKFDW++FC+SLWGPVPIS LP+VTA  
Sbjct: 181 YALETLVLYIFHVFNNRFAGPLEVLYRFLEFFSKFDWEHFCISLWGPVPISSLPNVTALS 240

Query: 246 PRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKG 305
           PR+DGG +LLS+ FL+ C   YA FP  QENQ Q FVSK+FNVIDPLR NNNLGRSVSKG
Sbjct: 241 PREDGGQILLSQLFLEVCSSVYAVFPSQQENQEQSFVSKYFNVIDPLRTNNNLGRSVSKG 300

Query: 306 NFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           NF+RIR+AF F A+ LARLLDCP E+L  E NQFFMNT DRH
Sbjct: 301 NFYRIRSAFAFGAQRLARLLDCPKENLLAEFNQFFMNTWDRH 342


>gi|147867191|emb|CAN79954.1| hypothetical protein VITISV_027426 [Vitis vinifera]
          Length = 1388

 Score =  491 bits (1263), Expect = e-136,   Method: Composition-based stats.
 Identities = 239/347 (68%), Positives = 265/347 (76%), Gaps = 31/347 (8%)

Query: 1   SVIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSV 60
           SV R LD  R   AEE T +LIA IQP+  SEERR AVA+YV+ LI++CF C+VF FGSV
Sbjct: 25  SVTRALDQERLSLAEERTKQLIACIQPNQPSEERREAVASYVKSLIMKCFSCKVFPFGSV 84

Query: 61  PLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKII 120
           PLKTYLPD DIDL AFS    LKDTWA+ VRD+LE EEK+  AEFRVKEVQYIQAEV   
Sbjct: 85  PLKTYLPDGDIDLTAFSKSPNLKDTWANEVRDILEREEKSGDAEFRVKEVQYIQAEV--- 141

Query: 121 KCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHH 180
                                       DHLI++ HLFKRSIILIKAWCYYESRILG HH
Sbjct: 142 ----------------------------DHLISQKHLFKRSIILIKAWCYYESRILGAHH 173

Query: 181 GLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPD 240
           GLIS+YAL TLVLYIF VFN SFAGPLEVLYRFLEFFSKFDW+N+C+SLWGPVPIS LPD
Sbjct: 174 GLISTYALETLVLYIFRVFNNSFAGPLEVLYRFLEFFSKFDWENYCVSLWGPVPISSLPD 233

Query: 241 VTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGR 300
           VTA+PPRKD G LLLSK FLD+C   YA  P GQEN  QPF+SK+FNVIDPLR NNNLGR
Sbjct: 234 VTADPPRKDSGELLLSKLFLDACSSVYAVLPVGQENPEQPFISKYFNVIDPLRTNNNLGR 293

Query: 301 SVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           SVSKGNFFRIR+AF F A+ LARLLDCP +++  EVNQFFMNT +RH
Sbjct: 294 SVSKGNFFRIRSAFAFGAQRLARLLDCPKDNVIAEVNQFFMNTWERH 340


>gi|302802985|ref|XP_002983246.1| hypothetical protein SELMODRAFT_43579 [Selaginella moellendorffii]
 gi|300148931|gb|EFJ15588.1| hypothetical protein SELMODRAFT_43579 [Selaginella moellendorffii]
          Length = 351

 Score =  479 bits (1233), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 239/347 (68%), Positives = 279/347 (80%), Gaps = 11/347 (3%)

Query: 6   LDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTY 65
           LD  RWL+AE  T ELI RIQP  FSE+RR AVA YV RLI +CF C+VFTFGSVPL+TY
Sbjct: 1   LDDERWLQAENRTGELITRIQPTKFSEDRRRAVADYVERLIRKCFDCEVFTFGSVPLRTY 60

Query: 66  LPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEV-KIIKCLV 124
           LPD DIDL AFS  Q L+++WA+ VR +LE EE+++ AEFRVKEVQYIQAEV KIIKCLV
Sbjct: 61  LPDGDIDLTAFSGHQHLQESWANDVRAVLEAEERSKDAEFRVKEVQYIQAEVVKIIKCLV 120

Query: 125 DNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLIS 184
           +N VVDI+FNQLGGLCTLCFL+EVD LI  +HLFKRSIIL+KAWCYYESRILG HHGLIS
Sbjct: 121 ENIVVDISFNQLGGLCTLCFLEEVDRLIGRDHLFKRSIILVKAWCYYESRILGAHHGLIS 180

Query: 185 SYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAE 244
           +YAL TLVLYIFHVF+ S  GPL VLYRFLEFFS FDWD +CLSLWGP+P+S LPD+   
Sbjct: 181 TYALETLVLYIFHVFHASLRGPLGVLYRFLEFFSNFDWDKYCLSLWGPIPLSALPDM--- 237

Query: 245 PPRKDGGVLLLSKSFLDSCRYAYADFPGGQEN----QGQPFVSKHFNVIDPLRVNNNLGR 300
              +DGG LLL+K FLDSC  AYA  P G  N    Q + F SK+ NV+DPL+  NNLGR
Sbjct: 238 ---QDGGPLLLTKHFLDSCSRAYAVMPNGNINGSIVQSRVFGSKYLNVVDPLKTTNNLGR 294

Query: 301 SVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           SV+KGNF+RIR AF F A+ LAR+L+CP ED+ +EV++FF+NT DRH
Sbjct: 295 SVNKGNFYRIRNAFGFGARKLARILECPLEDVADEVDKFFLNTWDRH 341


>gi|302755776|ref|XP_002961312.1| hypothetical protein SELMODRAFT_70578 [Selaginella moellendorffii]
 gi|300172251|gb|EFJ38851.1| hypothetical protein SELMODRAFT_70578 [Selaginella moellendorffii]
          Length = 351

 Score =  478 bits (1231), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 238/347 (68%), Positives = 279/347 (80%), Gaps = 11/347 (3%)

Query: 6   LDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTY 65
           LD  RW++AE  T ELI RIQP  FSE+RR AVA YV RLI +CF C+VFTFGSVPL+TY
Sbjct: 1   LDDERWVQAENRTGELITRIQPTKFSEDRRRAVADYVERLIRKCFDCEVFTFGSVPLRTY 60

Query: 66  LPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEV-KIIKCLV 124
           LPD DIDL AFS  Q L+++WA+ VR +LE EE+++ AEFRVKEVQYIQAEV KIIKCLV
Sbjct: 61  LPDGDIDLTAFSGHQHLQESWANDVRAVLEAEERSKDAEFRVKEVQYIQAEVVKIIKCLV 120

Query: 125 DNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLIS 184
           +N VVDI+FNQLGGLCTLCFL+EVD LI  +HLFKRSIIL+KAWCYYESRILG HHGLIS
Sbjct: 121 ENIVVDISFNQLGGLCTLCFLEEVDRLIGRDHLFKRSIILVKAWCYYESRILGAHHGLIS 180

Query: 185 SYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAE 244
           +YAL TLVLYIFHVF+ S  GPL VLYRFLEFFS FDWD +CLSLWGP+P+S LPD+   
Sbjct: 181 TYALETLVLYIFHVFHASLRGPLGVLYRFLEFFSNFDWDKYCLSLWGPIPLSALPDM--- 237

Query: 245 PPRKDGGVLLLSKSFLDSCRYAYADFPGGQEN----QGQPFVSKHFNVIDPLRVNNNLGR 300
              +DGG LLL+K FLDSC  AYA  P G  N    Q + F SK+ NV+DPL+  NNLGR
Sbjct: 238 ---QDGGPLLLTKHFLDSCSRAYAVMPNGNINGSIVQSRVFGSKYLNVVDPLKTTNNLGR 294

Query: 301 SVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           SV+KGNF+RIR AF F A+ LAR+L+CP ED+ +EV++FF+NT DRH
Sbjct: 295 SVNKGNFYRIRNAFGFGARKLARILECPLEDVADEVDKFFLNTWDRH 341


>gi|147820621|emb|CAN67650.1| hypothetical protein VITISV_005081 [Vitis vinifera]
          Length = 1572

 Score =  444 bits (1143), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 222/301 (73%), Positives = 242/301 (80%), Gaps = 29/301 (9%)

Query: 47  IQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFR 106
           ++C   +VFTFGSVPLKTYLPD DIDL AFS++Q LKDTWA+                  
Sbjct: 222 VKC-ATRVFTFGSVPLKTYLPDGDIDLTAFSNNQNLKDTWAN------------------ 262

Query: 107 VKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIK 166
                    +VKIIKCLV+N VVDI+FNQLGGLCTLCFL+EVDHLIN+NHLFKRSIILIK
Sbjct: 263 ---------QVKIIKCLVENIVVDISFNQLGGLCTLCFLEEVDHLINQNHLFKRSIILIK 313

Query: 167 AWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFC 226
           AWCYYESRILG HHGLIS+YAL TLVLYIFHVFN SF GPLEVLYRFLEFFS FDWDNFC
Sbjct: 314 AWCYYESRILGAHHGLISTYALETLVLYIFHVFNNSFTGPLEVLYRFLEFFSSFDWDNFC 373

Query: 227 LSLWGPVPISLLPDVTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHF 286
           +SLWGPVPIS LPDVTAEPPR+D G LLLSK FLD+C   YA FP GQE QGQ F+SKHF
Sbjct: 374 VSLWGPVPISSLPDVTAEPPRQDSGELLLSKLFLDACSSVYAVFPHGQEKQGQSFISKHF 433

Query: 287 NVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDR 346
           NVIDPLRVNNNLGRSVSKGNFFRIR+AF F AK LARLLD P E++  EVNQ FMNT +R
Sbjct: 434 NVIDPLRVNNNLGRSVSKGNFFRIRSAFAFGAKRLARLLD-PKENIIFEVNQLFMNTWER 492

Query: 347 H 347
           H
Sbjct: 493 H 493


>gi|358347363|ref|XP_003637727.1| hypothetical protein MTR_100s0017, partial [Medicago truncatula]
 gi|355503662|gb|AES84865.1| hypothetical protein MTR_100s0017, partial [Medicago truncatula]
          Length = 827

 Score =  422 bits (1085), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 197/231 (85%), Positives = 210/231 (90%)

Query: 117 VKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRIL 176
           VK++KCLV+N VVDI+FNQLGGLCTLCFL+EVD LIN NHLFKRSIILIKAWCYYESRIL
Sbjct: 109 VKLVKCLVENIVVDISFNQLGGLCTLCFLEEVDGLINHNHLFKRSIILIKAWCYYESRIL 168

Query: 177 GGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPIS 236
           G HHGLIS+YAL TLVLYIFHVFN SFAGPLEVLYRFLEFFSKFDWDNFC+SLWGPVPIS
Sbjct: 169 GAHHGLISTYALETLVLYIFHVFNNSFAGPLEVLYRFLEFFSKFDWDNFCVSLWGPVPIS 228

Query: 237 LLPDVTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNN 296
            LPDVTAEPPRKD G LLL KSFLD+C   YA FPGG ENQGQPFVSKHFNVIDPLRVNN
Sbjct: 229 SLPDVTAEPPRKDAGELLLHKSFLDACSTVYAVFPGGPENQGQPFVSKHFNVIDPLRVNN 288

Query: 297 NLGRSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           NLGRSVSKGNFFRIR+AF F AK LARLLDCP ++L+ EVNQFF+NT DRH
Sbjct: 289 NLGRSVSKGNFFRIRSAFAFGAKKLARLLDCPKDELFLEVNQFFLNTWDRH 339


>gi|449449962|ref|XP_004142733.1| PREDICTED: uncharacterized protein LOC101207419 [Cucumis sativus]
          Length = 898

 Score =  406 bits (1044), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 200/343 (58%), Positives = 254/343 (74%), Gaps = 1/343 (0%)

Query: 5   PLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKT 64
           P+    W +AEE T  +I+++QP   SE RR AV  YV+RLI     C+VF FGSVPLKT
Sbjct: 38  PIGVDYWRRAEEATQAIISQVQPTVVSERRRKAVIDYVQRLIRGRLRCEVFPFGSVPLKT 97

Query: 65  YLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLV 124
           YLPD DIDL A      +++  A  V  +L +E++N  AEF VK+VQ I+AEVK++KCLV
Sbjct: 98  YLPDGDIDLTALGG-SNVEEALASDVCSVLNSEDQNGAAEFVVKDVQLIRAEVKLVKCLV 156

Query: 125 DNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLIS 184
            N VVDI+FNQLGGLCTLCFL+++D  I ++HLFKRSIILIKAWCYYESRILG HHGLIS
Sbjct: 157 QNIVVDISFNQLGGLCTLCFLEKIDRRIGKDHLFKRSIILIKAWCYYESRILGAHHGLIS 216

Query: 185 SYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAE 244
           +YAL TLVLYIFH+F+ +  GPL+VLY+FL++FSKFDWDN+C+SL GPV IS LP++ AE
Sbjct: 217 TYALETLVLYIFHLFHSALNGPLQVLYKFLDYFSKFDWDNYCISLNGPVRISSLPELVAE 276

Query: 245 PPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSK 304
            P   GG LLLS  FL SC   ++    G E   + F  KH N++DPL+ NNNLGRSVSK
Sbjct: 277 TPDNGGGDLLLSTDFLQSCLETFSVPARGYEANSRAFPIKHLNIVDPLKENNNLGRSVSK 336

Query: 305 GNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           GNF+RIR+AF++ A+ L  +L  P +++ +EV +FF NT DRH
Sbjct: 337 GNFYRIRSAFSYGARKLGFILSHPEDNVVDEVRKFFSNTLDRH 379


>gi|357463851|ref|XP_003602207.1| Poly(A) RNA polymerase cid14 [Medicago truncatula]
 gi|355491255|gb|AES72458.1| Poly(A) RNA polymerase cid14 [Medicago truncatula]
          Length = 768

 Score =  396 bits (1018), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 191/346 (55%), Positives = 261/346 (75%), Gaps = 2/346 (0%)

Query: 2   VIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVP 61
           V + L+  +W + E+ T EL+  ++P+P SE  RN + +Y++ LII   P +VF FGSVP
Sbjct: 18  VPKVLERSKWSQVEDRTIELLQFLEPNPKSETLRNNIVSYIKGLIISHVPVKVFEFGSVP 77

Query: 62  LKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIK 121
           LKTYL D DIDL  F +++   + +   ++ +LE+E  NE ++FRVKEVQ + AEVKIIK
Sbjct: 78  LKTYLRDGDIDLTIFGNNELFPEIFIPHIQQILESEMNNEFSKFRVKEVQLVNAEVKIIK 137

Query: 122 CLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHG 181
           CLV+ FV+DI+FNQL GLC+LCFLDEVD+LI+ NH+FKRS+ILIKAWCY+ESR+LG   G
Sbjct: 138 CLVEKFVIDISFNQLSGLCSLCFLDEVDYLISRNHIFKRSVILIKAWCYHESRLLGSKSG 197

Query: 182 LISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDV 241
           L S+YAL  LVLY+F+++N  F GPLEVL+RFLEFFSKFDW N+C+SL GPVP+  LP++
Sbjct: 198 LFSTYALEILVLYLFNLYNNEFVGPLEVLFRFLEFFSKFDWGNYCISLSGPVPLDSLPNM 257

Query: 242 TAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRS 301
           TA+ PRKD   LLL++SFL + ++ Y      Q+N+ + FVSKH N+IDPL+ NNNLG S
Sbjct: 258 TADCPRKDRQDLLLTESFLIASKFCYG--WRNQKNREKHFVSKHINIIDPLQENNNLGHS 315

Query: 302 VSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           +S+GNFFRI++A  + A+ + R+LDC +E L +E + FF NT +RH
Sbjct: 316 ISRGNFFRIKSAIAYGAEQMMRILDCTDEYLISEFDHFFENTWNRH 361


>gi|359481238|ref|XP_002266958.2| PREDICTED: uncharacterized protein LOC100258499 [Vitis vinifera]
          Length = 884

 Score =  396 bits (1017), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 198/339 (58%), Positives = 246/339 (72%), Gaps = 1/339 (0%)

Query: 9   GRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPD 68
            +W +AE    E+I  +QP   SEERR  V  YV+ LI     C+VF FGSVPLKTYLPD
Sbjct: 37  AQWARAENTVQEIICEVQPTEVSEERRKEVVDYVQGLIRVRVGCEVFPFGSVPLKTYLPD 96

Query: 69  RDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFV 128
            DIDL AF     ++DT A+ V  +LE E++N  AEF VK+VQ I AEVK++KCLV N V
Sbjct: 97  GDIDLTAFGG-PAVEDTLAYEVYSVLEAEDQNRAAEFVVKDVQLIHAEVKLVKCLVQNIV 155

Query: 129 VDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYAL 188
           VDI+FNQLGGLCTLCFL+++D LI ++HLFKRSIILIKAWCYYESRILG HHGLIS+YAL
Sbjct: 156 VDISFNQLGGLCTLCFLEQIDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYAL 215

Query: 189 VTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRK 248
            TLVLYIF +F+    GPL VLY+FL++FSKFDWDN+C+SL GPV IS LP++ AE P  
Sbjct: 216 ETLVLYIFLLFHSLLNGPLAVLYKFLDYFSKFDWDNYCVSLNGPVRISSLPEMIAETPEN 275

Query: 249 DGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFF 308
            G   LL+   L  C   ++    G E   + FV KHFN++DPL+ NNNLGRSVSKGNF+
Sbjct: 276 VGADPLLNNDILRDCLDRFSVPSRGLETNSRTFVQKHFNIVDPLKENNNLGRSVSKGNFY 335

Query: 309 RIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           RIR+AFT+ A+ L R+L  P + +  E+ +FF NT +RH
Sbjct: 336 RIRSAFTYGARKLGRILLQPEDKISEELCKFFTNTLERH 374


>gi|297735556|emb|CBI18050.3| unnamed protein product [Vitis vinifera]
          Length = 824

 Score =  395 bits (1016), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 198/339 (58%), Positives = 246/339 (72%), Gaps = 1/339 (0%)

Query: 9   GRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPD 68
            +W +AE    E+I  +QP   SEERR  V  YV+ LI     C+VF FGSVPLKTYLPD
Sbjct: 37  AQWARAENTVQEIICEVQPTEVSEERRKEVVDYVQGLIRVRVGCEVFPFGSVPLKTYLPD 96

Query: 69  RDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFV 128
            DIDL AF     ++DT A+ V  +LE E++N  AEF VK+VQ I AEVK++KCLV N V
Sbjct: 97  GDIDLTAFGG-PAVEDTLAYEVYSVLEAEDQNRAAEFVVKDVQLIHAEVKLVKCLVQNIV 155

Query: 129 VDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYAL 188
           VDI+FNQLGGLCTLCFL+++D LI ++HLFKRSIILIKAWCYYESRILG HHGLIS+YAL
Sbjct: 156 VDISFNQLGGLCTLCFLEQIDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYAL 215

Query: 189 VTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRK 248
            TLVLYIF +F+    GPL VLY+FL++FSKFDWDN+C+SL GPV IS LP++ AE P  
Sbjct: 216 ETLVLYIFLLFHSLLNGPLAVLYKFLDYFSKFDWDNYCVSLNGPVRISSLPEMIAETPEN 275

Query: 249 DGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFF 308
            G   LL+   L  C   ++    G E   + FV KHFN++DPL+ NNNLGRSVSKGNF+
Sbjct: 276 VGADPLLNNDILRDCLDRFSVPSRGLETNSRTFVQKHFNIVDPLKENNNLGRSVSKGNFY 335

Query: 309 RIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           RIR+AFT+ A+ L R+L  P + +  E+ +FF NT +RH
Sbjct: 336 RIRSAFTYGARKLGRILLQPEDKISEELCKFFTNTLERH 374


>gi|222616508|gb|EEE52640.1| hypothetical protein OsJ_34991 [Oryza sativa Japonica Group]
          Length = 801

 Score =  385 bits (988), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 192/339 (56%), Positives = 240/339 (70%), Gaps = 6/339 (1%)

Query: 11  WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRD 70
           W   E     ++ARIQP+P SE+RR AV AYV+ L+     CQVF FGSVPLKTYLPD D
Sbjct: 30  WDPLEAAAGAVVARIQPNPPSEDRRAAVIAYVQGLLRFNVGCQVFPFGSVPLKTYLPDGD 89

Query: 71  IDLGAF--SDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFV 128
           IDL AF  S D+ L    A  V+ +LE+EE  + AEF VK+VQYI AEVK++KC+V N +
Sbjct: 90  IDLTAFGHSSDEIL----AKQVQAVLESEEARKDAEFEVKDVQYIHAEVKLVKCIVQNII 145

Query: 129 VDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYAL 188
           VDI+FNQ GGLCTLCFL++VD    +NHLFKRSI+LIKAWCYYESRILG HHGLIS+YAL
Sbjct: 146 VDISFNQFGGLCTLCFLEKVDQKFEKNHLFKRSIMLIKAWCYYESRILGAHHGLISTYAL 205

Query: 189 VTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRK 248
             LVLYIFH+F+G+  GPL VLYRFL+++SKFDWDN  +SL+GP+ +S LP++  + P  
Sbjct: 206 EILVLYIFHLFHGTLDGPLAVLYRFLDYYSKFDWDNKGISLYGPISLSSLPELVTDSPDT 265

Query: 249 DGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFF 308
                 + + FL  C   +   P   E   Q F  K FN++DPL+ +NNLGRSVSKGNF 
Sbjct: 266 VNDDFTMREDFLKECAQWFTVLPRNSEKNTQVFPRKFFNIVDPLKQSNNLGRSVSKGNFL 325

Query: 309 RIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           RIR+AF F A+ L ++L  P+    +EVNQFF NT  RH
Sbjct: 326 RIRSAFDFGARKLGKILQVPDNFTVDEVNQFFRNTLKRH 364


>gi|77548394|gb|ABA91191.1| nucleotidyltransferase family protein, putative, expressed [Oryza
           sativa Japonica Group]
          Length = 783

 Score =  384 bits (987), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 191/344 (55%), Positives = 241/344 (70%), Gaps = 6/344 (1%)

Query: 6   LDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTY 65
           + P  W   E     ++ARIQP+P SE+RR AV AYV+ L+     CQVF FGSVPLKTY
Sbjct: 25  ISPEAWDPLEAAAGAVVARIQPNPPSEDRRAAVIAYVQHLLRCTVGCQVFPFGSVPLKTY 84

Query: 66  LPDRDIDLGAF--SDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL 123
           LPD DIDL AF  S D+ L    A  V+ +LE+EE  + AEF VK+VQYI AEVK++KC+
Sbjct: 85  LPDGDIDLTAFGHSSDEIL----AKQVQAVLESEEARKDAEFEVKDVQYIHAEVKLVKCI 140

Query: 124 VDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLI 183
           V N +VDI+FNQ GGLCTLCFL++VD    + HLFKRSI+LIKAWCYYESRILG HHGLI
Sbjct: 141 VQNIIVDISFNQFGGLCTLCFLEKVDQKFEKYHLFKRSIMLIKAWCYYESRILGAHHGLI 200

Query: 184 SSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTA 243
           S+YAL  LVLYIFH+F+G+  GPL VLYRFL+++SKFDWDN  +SL+GP+ +S LP++  
Sbjct: 201 STYALEILVLYIFHLFHGTLDGPLAVLYRFLDYYSKFDWDNKGISLYGPISLSSLPELVT 260

Query: 244 EPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVS 303
           + P        + + FL  C   +   P   E   Q F  K FN++DPL+ +NNLGRSVS
Sbjct: 261 DSPDTVNDDFTMREDFLKECAQWFTVLPRNSEKNTQVFPRKFFNIVDPLKQSNNLGRSVS 320

Query: 304 KGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           KGNF RIR+AF F A+ L +++  P+    +EVNQFF NT  RH
Sbjct: 321 KGNFLRIRSAFDFGARKLGKIIQVPDNFTMDEVNQFFRNTLKRH 364


>gi|115483835|ref|NP_001065579.1| Os11g0114700 [Oryza sativa Japonica Group]
 gi|77548393|gb|ABA91190.1| nucleotidyltransferase family protein, putative, expressed [Oryza
           sativa Japonica Group]
 gi|113644283|dbj|BAF27424.1| Os11g0114700 [Oryza sativa Japonica Group]
 gi|215694848|dbj|BAG90039.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218185112|gb|EEC67539.1| hypothetical protein OsI_34858 [Oryza sativa Indica Group]
 gi|222615390|gb|EEE51522.1| hypothetical protein OsJ_32709 [Oryza sativa Japonica Group]
          Length = 801

 Score =  384 bits (987), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 191/344 (55%), Positives = 241/344 (70%), Gaps = 6/344 (1%)

Query: 6   LDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTY 65
           + P  W   E     ++ARIQP+P SE+RR AV AYV+ L+     CQVF FGSVPLKTY
Sbjct: 25  ISPEAWDPLEAAAGAVVARIQPNPPSEDRRAAVIAYVQHLLRCTVGCQVFPFGSVPLKTY 84

Query: 66  LPDRDIDLGAF--SDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL 123
           LPD DIDL AF  S D+ L    A  V+ +LE+EE  + AEF VK+VQYI AEVK++KC+
Sbjct: 85  LPDGDIDLTAFGHSSDEIL----AKQVQAVLESEEARKDAEFEVKDVQYIHAEVKLVKCI 140

Query: 124 VDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLI 183
           V N +VDI+FNQ GGLCTLCFL++VD    + HLFKRSI+LIKAWCYYESRILG HHGLI
Sbjct: 141 VQNIIVDISFNQFGGLCTLCFLEKVDQKFEKYHLFKRSIMLIKAWCYYESRILGAHHGLI 200

Query: 184 SSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTA 243
           S+YAL  LVLYIFH+F+G+  GPL VLYRFL+++SKFDWDN  +SL+GP+ +S LP++  
Sbjct: 201 STYALEILVLYIFHLFHGTLDGPLAVLYRFLDYYSKFDWDNKGISLYGPISLSSLPELVT 260

Query: 244 EPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVS 303
           + P        + + FL  C   +   P   E   Q F  K FN++DPL+ +NNLGRSVS
Sbjct: 261 DSPDTVNDDFTMREDFLKECAQWFTVLPRNSEKNTQVFPRKFFNIVDPLKQSNNLGRSVS 320

Query: 304 KGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           KGNF RIR+AF F A+ L +++  P+    +EVNQFF NT  RH
Sbjct: 321 KGNFLRIRSAFDFGARKLGKIIQVPDNFTMDEVNQFFRNTLKRH 364


>gi|224124740|ref|XP_002319410.1| predicted protein [Populus trichocarpa]
 gi|222857786|gb|EEE95333.1| predicted protein [Populus trichocarpa]
          Length = 681

 Score =  384 bits (985), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 191/337 (56%), Positives = 238/337 (70%), Gaps = 1/337 (0%)

Query: 11  WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRD 70
           W +AEE+  E++ RI P   S  +R  V  YV+RLI      +VF +GSVPLKTYLPD D
Sbjct: 58  WERAEEVATEIVYRIHPTVESSFKRKQVIDYVQRLIRYSLGFEVFPYGSVPLKTYLPDGD 117

Query: 71  IDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVD 130
           IDL A S    +++     V  +L  EE NE A + VK+V  I AEVK+IKC+V N VVD
Sbjct: 118 IDLTAISS-PAIEEALVSDVYTVLRGEELNEDALYEVKDVHCIDAEVKLIKCIVQNTVVD 176

Query: 131 IAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVT 190
           I+FNQLGGLCTLCFL+EVD L+ +NHLFKRSIILIKAWCYYESRILG HHGLIS+YAL T
Sbjct: 177 ISFNQLGGLCTLCFLEEVDRLVGKNHLFKRSIILIKAWCYYESRILGAHHGLISTYALET 236

Query: 191 LVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDG 250
           L+LYIFH+F+ S  GPL VLY+FL++FSKFDW+N+C+SL GPV  S LP++ A+PP    
Sbjct: 237 LILYIFHLFHSSLNGPLAVLYKFLDYFSKFDWENYCISLNGPVCKSSLPNIVAKPPENVS 296

Query: 251 GVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRI 310
           G LLLS  FL  C   +       E   +PF  KH N++DPL+ NNNLGRSV++GNFFRI
Sbjct: 297 GELLLSDEFLKDCVDRFYVPSRKPEMNSRPFPQKHLNIVDPLKENNNLGRSVNRGNFFRI 356

Query: 311 RTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           R+AF +  + L R+L  P E + +E+  FF NT DRH
Sbjct: 357 RSAFKYGGRKLGRILLLPREKIADELKTFFANTLDRH 393


>gi|359478494|ref|XP_002276607.2| PREDICTED: uncharacterized protein LOC100253523 [Vitis vinifera]
          Length = 854

 Score =  383 bits (983), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 190/337 (56%), Positives = 243/337 (72%), Gaps = 1/337 (0%)

Query: 11  WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRD 70
           W  AE  T E++A++QP   S   R  V  YV+RLI  C  C+VF +GSVPLKTYL D D
Sbjct: 40  WAAAERATQEIVAKMQPTLGSMRERQEVIDYVQRLIGCCLGCEVFPYGSVPLKTYLLDGD 99

Query: 71  IDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVD 130
           IDL A      +++  A  V  +L+ EE+NE+AEF VK++Q+I AEVK++KCLV + V+D
Sbjct: 100 IDLTALCS-SNVEEALASDVHAVLKGEEQNENAEFEVKDIQFITAEVKLVKCLVKDIVID 158

Query: 131 IAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVT 190
           I+FNQLGGL TLCFL++VD LI ++HLFKRSIILIK+WCYYESRILG HHGLIS+YAL  
Sbjct: 159 ISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKSWCYYESRILGAHHGLISTYALEI 218

Query: 191 LVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDG 250
           LVLYIFH+F+ S  GPL VLYRFL++FSKFDWDN+C+SL GPV  S LPD+ AE P    
Sbjct: 219 LVLYIFHLFHLSLDGPLAVLYRFLDYFSKFDWDNYCISLNGPVCKSSLPDIVAELPENGQ 278

Query: 251 GVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRI 310
             LLLS+ FL +C   ++    G E   + F  KH N+IDPLR NNNLGRSV+KGNF+RI
Sbjct: 279 DDLLLSEEFLRNCVDMFSVPFRGLETNSRTFPLKHLNIIDPLRENNNLGRSVNKGNFYRI 338

Query: 311 RTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           R+AF + +  L ++L  P E + +E+  FF +T +RH
Sbjct: 339 RSAFKYGSHKLGQILSLPREVIQDELKNFFASTLERH 375


>gi|224145449|ref|XP_002325647.1| predicted protein [Populus trichocarpa]
 gi|222862522|gb|EEF00029.1| predicted protein [Populus trichocarpa]
          Length = 533

 Score =  382 bits (982), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 191/337 (56%), Positives = 239/337 (70%), Gaps = 1/337 (0%)

Query: 11  WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRD 70
           W +AEE T E++ RI P   S  +R  +  YV+RLI      +VF +GSVPLKTYLPD D
Sbjct: 58  WERAEEFTREIVYRIHPTVESNFKRKQIIGYVQRLIKSSLGFEVFPYGSVPLKTYLPDGD 117

Query: 71  IDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVD 130
           IDL + S    +++     +  +L  EE NE + F VK+V  I AEVK+IKC+V N VVD
Sbjct: 118 IDLTSISS-PAIEEALVSDIHAVLRREELNEDSTFEVKDVHCIDAEVKLIKCIVQNTVVD 176

Query: 131 IAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVT 190
           I+FNQLGGLCTLCFL+EVD L+ +NHLFKRSIILIKAWCYYESRILG HHGLIS+YAL T
Sbjct: 177 ISFNQLGGLCTLCFLEEVDRLVGKNHLFKRSIILIKAWCYYESRILGAHHGLISTYALET 236

Query: 191 LVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDG 250
           L+LYIFH+F+ S  GPL VLYRFLE+FSKFDW+N+C+SL GPV  S LP++ AEP     
Sbjct: 237 LILYIFHLFHCSLNGPLAVLYRFLEYFSKFDWENYCISLNGPVCKSSLPNIVAEPLENGQ 296

Query: 251 GVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRI 310
           G LLLS  FL  C   ++      E   +PF  KH N++DPL+ NNNLGRSV++GNFFRI
Sbjct: 297 GELLLSDEFLKDCADRFSVPSRKPEMNSRPFPQKHLNIVDPLKENNNLGRSVNRGNFFRI 356

Query: 311 RTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           R+AF + A+ L ++L  P E + +E+  FF NT DRH
Sbjct: 357 RSAFKYGARKLGQILLLPKERIADELKIFFANTLDRH 393


>gi|79597803|ref|NP_850678.2| NT domain of poly(A) polymerase and terminal uridylyl
           transferase-containing protein [Arabidopsis thaliana]
 gi|332645293|gb|AEE78814.1| NT domain of poly(A) polymerase and terminal uridylyl
           transferase-containing protein [Arabidopsis thaliana]
          Length = 829

 Score =  380 bits (976), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 191/340 (56%), Positives = 245/340 (72%), Gaps = 1/340 (0%)

Query: 8   PGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLP 67
           P  W++ EE T E+I ++ P   SE+RR  V  YV++LI     C+V +FGSVPLKTYLP
Sbjct: 31  PELWMRVEEATREIIEQVHPTLVSEDRRRDVILYVQKLIRMTLGCEVHSFGSVPLKTYLP 90

Query: 68  DRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNF 127
           D DIDL AF      ++  A  V  +LE EE N  ++F VK+VQ I+AEVK++KCLV N 
Sbjct: 91  DGDIDLTAFGG-LYHEEELAAKVFAVLEREEHNLSSQFVVKDVQLIRAEVKLVKCLVQNI 149

Query: 128 VVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYA 187
           VVDI+FNQ+GG+CTLCFL+++DHLI ++HLFKRSIILIKAWCYYESRILG  HGLIS+YA
Sbjct: 150 VVDISFNQIGGICTLCFLEKIDHLIGKDHLFKRSIILIKAWCYYESRILGAFHGLISTYA 209

Query: 188 LVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPR 247
           L TLVLYIFH+F+ S  GPL VLY+FL++FSKFDWD++C+SL GPV +S LPD+  E P 
Sbjct: 210 LETLVLYIFHLFHSSLNGPLAVLYKFLDYFSKFDWDSYCISLNGPVCLSSLPDIVVETPE 269

Query: 248 KDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNF 307
             G  LLL+  FL  C   Y+    G E   + F SKH N++DPL+  NNLGRSVSKGNF
Sbjct: 270 NGGEDLLLTSEFLKECLEMYSVPSRGFETNPRGFQSKHLNIVDPLKETNNLGRSVSKGNF 329

Query: 308 FRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           +RIR+AFT+ A+ L +L    +E + +E+ +FF N   RH
Sbjct: 330 YRIRSAFTYGARKLGQLFLQSDEAISSELRKFFSNMLLRH 369


>gi|297816424|ref|XP_002876095.1| hypothetical protein ARALYDRAFT_485514 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321933|gb|EFH52354.1| hypothetical protein ARALYDRAFT_485514 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 829

 Score =  378 bits (970), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 189/340 (55%), Positives = 243/340 (71%), Gaps = 1/340 (0%)

Query: 8   PGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLP 67
           P  W++ EE T E+I ++ P   SE+RR  V  YV++LI     C+V +FGSVPLKTYLP
Sbjct: 31  PEFWMRVEEATREIIEQVHPTLVSEDRRRDVILYVQKLIRITLGCEVHSFGSVPLKTYLP 90

Query: 68  DRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNF 127
           D DIDL AF      ++  A  V  +LE EE N  + F VK+VQ I+AEVK++KCLV N 
Sbjct: 91  DGDIDLTAFGG-LYHEEELAAKVFSVLEREEHNVSSHFVVKDVQLIRAEVKLVKCLVQNI 149

Query: 128 VVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYA 187
           VVDI+FNQ+GG+CTLCFL+++DHLI ++HLFKRSIILIKAWCYYESRILG  HGLIS+YA
Sbjct: 150 VVDISFNQIGGICTLCFLEKIDHLIGKDHLFKRSIILIKAWCYYESRILGAFHGLISTYA 209

Query: 188 LVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPR 247
           L TLVLYIFH+F+ S  GPL VLY+FL++FSKFDWDN+C+SL GPV +S LP++  E P 
Sbjct: 210 LETLVLYIFHLFHSSLNGPLAVLYKFLDYFSKFDWDNYCISLNGPVCLSSLPEIVVETPE 269

Query: 248 KDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNF 307
             G   LL+  FL  C   Y+    G E   + F SKH N++DPL+  NNLGRSVSKGNF
Sbjct: 270 NGGEDFLLTSEFLKECMEMYSVPSRGFETNQRGFQSKHLNIVDPLKETNNLGRSVSKGNF 329

Query: 308 FRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           +RIR+AFT+ A+ L ++    +E + +E+ +FF N   RH
Sbjct: 330 YRIRSAFTYGARKLGQIFLQSDEAIKSELRKFFSNMLLRH 369


>gi|255554485|ref|XP_002518281.1| nucleic acid binding protein, putative [Ricinus communis]
 gi|223542501|gb|EEF44041.1| nucleic acid binding protein, putative [Ricinus communis]
          Length = 821

 Score =  374 bits (961), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 185/337 (54%), Positives = 237/337 (70%), Gaps = 1/337 (0%)

Query: 11  WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRD 70
           W +AE+ T +++ RI P   ++  R  V  YV+ LI      QVF +GSVPLKTYLPD D
Sbjct: 51  WERAEQATLQIVYRIHPTVEADCNRKHVVEYVQSLIQSSLGFQVFPYGSVPLKTYLPDGD 110

Query: 71  IDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVD 130
           IDL A  +   + D     V  +L  EE+N  A ++VK+V +I AEVK+IKC+V + VVD
Sbjct: 111 IDLTAIINPAGV-DASVSDVHAVLRREEQNRDAPYKVKDVHFIDAEVKLIKCIVHDIVVD 169

Query: 131 IAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVT 190
           I+FNQLGGL TLCFL++VD LI ++HLFKRSIILIKAWCYYESRILG HHGLIS+YAL T
Sbjct: 170 ISFNQLGGLSTLCFLEQVDQLIGKSHLFKRSIILIKAWCYYESRILGAHHGLISTYALET 229

Query: 191 LVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDG 250
           L+LYIFH+F+ S  GPL VLYRFL++FSKFDWDN+C+SL GPV  S LP + AEPP    
Sbjct: 230 LILYIFHLFHSSLNGPLMVLYRFLDYFSKFDWDNYCISLNGPVCKSSLPKIVAEPPETGR 289

Query: 251 GVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRI 310
           G LLL   FL +     +      E   +PF  KH N++DPLR NNNLGRSV++GNF+RI
Sbjct: 290 GNLLLDDEFLRNSVKMLSVPSRSPEMNSRPFTQKHLNIVDPLRENNNLGRSVNRGNFYRI 349

Query: 311 RTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           R+AF + A+ L  +L   ++ + NE+++FF NT DRH
Sbjct: 350 RSAFKYGARKLGHILSLQSDRMINELDKFFANTLDRH 386


>gi|326531888|dbj|BAK01320.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 702

 Score =  374 bits (959), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 186/342 (54%), Positives = 236/342 (69%), Gaps = 1/342 (0%)

Query: 6   LDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTY 65
           + P  W         ++ RIQP   SE RR AV  YV+RL+     C VF FGSVPLKTY
Sbjct: 27  ISPDAWAPFGAAALGVVGRIQPTVASEGRRAAVVDYVQRLVKCSVGCSVFPFGSVPLKTY 86

Query: 66  LPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVD 125
           LPD DIDL AF    +  ++ A+ VR +LE+EE+ + AEF +K+VQYI AEVK++KC V 
Sbjct: 87  LPDGDIDLAAFGSTCS-DESIANEVRAILESEERRKDAEFEIKDVQYINAEVKLVKCFVQ 145

Query: 126 NFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISS 185
           N VVDI+FNQ+GGL TLCFL++VD    +NHLFKRSI+LIKAWCYYESRILG HHGLIS+
Sbjct: 146 NIVVDISFNQIGGLYTLCFLEQVDQRFEKNHLFKRSIVLIKAWCYYESRILGAHHGLIST 205

Query: 186 YALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEP 245
           YAL TLVLYIFH+F+ S  GPL VLYRFL+++SKFDWDN  +SL GP+ +S LPD+  +P
Sbjct: 206 YALETLVLYIFHLFHESLDGPLAVLYRFLDYYSKFDWDNRGISLHGPISLSSLPDLVTDP 265

Query: 246 PRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKG 305
           P       L  + FL  C   +   P   E   +PF  K  N++DPL+ +NNLGRSVSKG
Sbjct: 266 PGIHDDCFLEREEFLRECAQMFTVPPRHYERTTRPFPRKFLNIVDPLKPSNNLGRSVSKG 325

Query: 306 NFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           NF+RIR+AF   A+ L ++L  P   + +EVNQFF +T  R+
Sbjct: 326 NFYRIRSAFDLGARKLGKILQVPANSIVDEVNQFFRSTLKRN 367


>gi|356553166|ref|XP_003544929.1| PREDICTED: uncharacterized protein LOC100816328 [Glycine max]
          Length = 779

 Score =  374 bits (959), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 187/333 (56%), Positives = 247/333 (74%), Gaps = 2/333 (0%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDIDLG 74
           E+ TAE+++RI+P   ++ RR  V  YV+RLI     C+VF +GSVPLKTYLPD DIDL 
Sbjct: 45  EKTTAEILSRIRPTLAADRRRREVVDYVQRLIRYGARCEVFPYGSVPLKTYLPDGDIDLT 104

Query: 75  AFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFN 134
           A S  Q ++D     VR +L  EE NE +E+ VK+V++I AEVK++KC+V + VVDI+FN
Sbjct: 105 ALSC-QNIEDGLVSDVRAVLHGEEINEASEYEVKDVRFIDAEVKLVKCIVQDIVVDISFN 163

Query: 135 QLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLY 194
           QLGGL TLCFL++VD L+ ++HLFKRSIILIKAWCYYESR+LG HHGLIS+YAL TLVLY
Sbjct: 164 QLGGLSTLCFLEKVDRLVAKDHLFKRSIILIKAWCYYESRVLGAHHGLISTYALETLVLY 223

Query: 195 IFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVLL 254
           IFH F+ S  GPL VLYRFL++FSKFDWDN+C+SL GPV  S  P++ AE P ++GG  L
Sbjct: 224 IFHQFHVSLDGPLAVLYRFLDYFSKFDWDNYCVSLKGPVGKSSPPNIVAEVP-ENGGNTL 282

Query: 255 LSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAF 314
           L++ F+ SC  +++    G +   + F  KH N+IDPL+ NNNLGRSV+KGNF+RIR+AF
Sbjct: 283 LTEEFIRSCVESFSLPSRGADLNLRAFPQKHLNIIDPLKENNNLGRSVNKGNFYRIRSAF 342

Query: 315 TFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
            + A+ L  +L  P + +  E+ +FF NT +RH
Sbjct: 343 KYGARKLGWILMLPEDRITEELIRFFTNTLERH 375


>gi|356500940|ref|XP_003519288.1| PREDICTED: uncharacterized protein LOC100814626 [Glycine max]
          Length = 780

 Score =  372 bits (956), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 187/333 (56%), Positives = 246/333 (73%), Gaps = 2/333 (0%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDIDLG 74
           E  TAE++ RI+P   ++ RR  V  YV+RLI     C+VF +GSVPLKTYLPD DIDL 
Sbjct: 45  ERNTAEILRRIRPTLAADRRRREVVDYVQRLIRYGARCEVFPYGSVPLKTYLPDGDIDLT 104

Query: 75  AFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFN 134
           A S  + ++D     VR +L  EE NE AE+ VK+V++I AEVK++KC+V + VVDI+FN
Sbjct: 105 ALSC-ENIEDGLVSDVRAVLHGEEINEAAEYEVKDVRFIDAEVKLVKCIVQDIVVDISFN 163

Query: 135 QLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLY 194
           QLGGL TLCFL++VD L+ ++HLFKRSIILIKAWCYYESR+LG HHGLIS+YAL TLVLY
Sbjct: 164 QLGGLSTLCFLEKVDRLVAKDHLFKRSIILIKAWCYYESRVLGAHHGLISTYALETLVLY 223

Query: 195 IFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVLL 254
           IFH F+ S  GPL VLYRFL++FSKFDWDN+C+SL GPV  + LP++ AE P ++GG  L
Sbjct: 224 IFHQFHVSLDGPLAVLYRFLDYFSKFDWDNYCVSLKGPVSKTSLPNIVAEVP-ENGGNTL 282

Query: 255 LSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAF 314
           L++ F+ SC  +++    G +   + F  KH N+IDPL+ NNNLGRSV+KGNF+RIR+AF
Sbjct: 283 LTEEFIRSCVESFSVPSRGADLNLRAFPQKHLNIIDPLKENNNLGRSVNKGNFYRIRSAF 342

Query: 315 TFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
            + A+ L  +L  P + +  E+ +FF NT +RH
Sbjct: 343 KYGARKLGWILRLPEDRIAEELIRFFANTLERH 375


>gi|357153090|ref|XP_003576335.1| PREDICTED: uncharacterized protein LOC100826374, partial
           [Brachypodium distachyon]
          Length = 769

 Score =  372 bits (954), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 191/341 (56%), Positives = 236/341 (69%), Gaps = 7/341 (2%)

Query: 10  RWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDR 69
           RW   EE    ++ RIQP   SE RR AV  YV+RL+     C+VF FGSVPLKTYLPD 
Sbjct: 10  RWRAFEEAALGVVGRIQPSAPSEGRRAAVVHYVQRLVRHAVGCEVFPFGSVPLKTYLPDG 69

Query: 70  DIDLGAF---SDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDN 126
           DIDL AF   S D+ L    A+ VR +LE+EE  + AEF VK+VQYI AEVK++KCLV N
Sbjct: 70  DIDLTAFGSISSDENL----ANEVRAVLESEELRKDAEFEVKDVQYIHAEVKLVKCLVQN 125

Query: 127 FVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSY 186
            VVDI+FNQ+GGLCTLCFL++VD    + HLFK+SI+LIKAWCYYESRILG HHGLIS+Y
Sbjct: 126 IVVDISFNQIGGLCTLCFLEQVDQRFGKEHLFKKSIMLIKAWCYYESRILGAHHGLISTY 185

Query: 187 ALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPP 246
           AL  LVL IFH+F+ S  GPL VLYRFL+++SKFDWDN  +SL+GPV +S LP++ ++ P
Sbjct: 186 ALEILVLCIFHLFHKSLDGPLAVLYRFLDYYSKFDWDNKGISLYGPVLLSSLPELVSDAP 245

Query: 247 RKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGN 306
               G  L  + FL  C   +   P   E   + F  K  N++DPL+ NNNLGRSVSKGN
Sbjct: 246 VTHDGDFLKREEFLRECAQTFTVPPRNSEKNTRLFSRKFLNIVDPLKQNNNLGRSVSKGN 305

Query: 307 FFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           FFRIR+AF   A+ L ++L   +     EVNQFF NT  R+
Sbjct: 306 FFRIRSAFDLGARKLGKILKEASSSAVPEVNQFFRNTLKRN 346


>gi|326492351|dbj|BAK01959.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 724

 Score =  370 bits (950), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 187/349 (53%), Positives = 238/349 (68%), Gaps = 8/349 (2%)

Query: 6   LDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLI-------IQCFPCQVFTFG 58
           + P  W   E     ++ RIQP   SE RR AV  YV+RL+       +   P  VF FG
Sbjct: 27  ISPDAWAPFEAAALGVVGRIQPTVASEGRRAAVVDYVQRLVKCSVGCSVPVTPFPVFPFG 86

Query: 59  SVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVK 118
           SVPLKTYLPD DIDL AF    +  ++ A+ VR +LE+EE+ + AEF +K+VQYI AEVK
Sbjct: 87  SVPLKTYLPDGDIDLAAFGSTCS-DESIANEVRAILESEERRKDAEFEIKDVQYINAEVK 145

Query: 119 IIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGG 178
           ++KC V N VVDI+FNQ+GGL TLCFL++VD    +NHLFKRSI+LIKAWCYYESRILG 
Sbjct: 146 LVKCFVQNIVVDISFNQIGGLYTLCFLEQVDQRFEKNHLFKRSIVLIKAWCYYESRILGA 205

Query: 179 HHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLL 238
           HHGLIS+YAL TLVLYIFH+F+ S  GPL VLYRFL+++SKFDWDN  +SL GP+ +S L
Sbjct: 206 HHGLISTYALETLVLYIFHLFHESLDGPLAVLYRFLDYYSKFDWDNRGISLHGPISLSSL 265

Query: 239 PDVTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNL 298
           PD+  +PP       L  + FL  C   +   P   E   +PF  K  N++DPL+ +NNL
Sbjct: 266 PDLVTDPPGIHDDCFLEREEFLRECAQMFTVPPRHYERTTRPFPRKFLNIVDPLKPSNNL 325

Query: 299 GRSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           GRSVSKGNF+RIR+AF   A+ L ++L  P   + +EVNQFF +T  R+
Sbjct: 326 GRSVSKGNFYRIRSAFDLGARKLGKILQVPANSIVDEVNQFFRSTLKRN 374


>gi|357155485|ref|XP_003577136.1| PREDICTED: uncharacterized protein LOC100840351 [Brachypodium
           distachyon]
          Length = 739

 Score =  369 bits (948), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 191/342 (55%), Positives = 238/342 (69%), Gaps = 1/342 (0%)

Query: 6   LDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTY 65
           + P  W   E     +I RIQP   SE  R +V  Y++RL+     CQVF FGSVPLKTY
Sbjct: 25  VSPEVWEPLEAAALAVIGRIQPTIPSEGLRASVVDYIQRLVRCSVGCQVFPFGSVPLKTY 84

Query: 66  LPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVD 125
           LPD DIDL AF    +  ++ A+ VR +LE EE+ E AEF VK+VQYI AEVK++KC V 
Sbjct: 85  LPDGDIDLTAFGSTYS-DESLANEVRAILEAEERREDAEFEVKDVQYIHAEVKLVKCFVQ 143

Query: 126 NFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISS 185
           N VVDI+FNQ+GGLCTLCFL++VD    +NHLFKRSIILIKAWCYYESRILG HHGLIS+
Sbjct: 144 NIVVDISFNQMGGLCTLCFLEQVDQRFEKNHLFKRSIILIKAWCYYESRILGAHHGLIST 203

Query: 186 YALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEP 245
           YAL TLVLYIFH+F+ S  GPL VLYRFL+++SKFDWDN  +SL+GPV +S LP++  EP
Sbjct: 204 YALETLVLYIFHLFHESLDGPLAVLYRFLDYYSKFDWDNKGISLYGPVSLSSLPELVTEP 263

Query: 246 PRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKG 305
                   L  + FL  C   +   P   E   +PF  K+FN++DPL+ +NNLGRSVSKG
Sbjct: 264 TGTHDDSFLQREEFLKECAKMFTVPPRLNEKNTRPFYQKYFNIVDPLKQSNNLGRSVSKG 323

Query: 306 NFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           NF+RIR+AF   A+ L ++L  P     +EVNQFF +T  R+
Sbjct: 324 NFYRIRSAFDLGARKLGKILQMPANSTVDEVNQFFKSTLKRN 365


>gi|449526634|ref|XP_004170318.1| PREDICTED: uncharacterized LOC101207419 [Cucumis sativus]
          Length = 816

 Score =  369 bits (948), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 180/295 (61%), Positives = 226/295 (76%), Gaps = 1/295 (0%)

Query: 53  QVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQY 112
           QVF FGSVPLKTYLPD DIDL A      +++  A  V  +L +E++N  AEF VK+VQ 
Sbjct: 4   QVFPFGSVPLKTYLPDGDIDLTALGG-SNVEEALASDVCSVLNSEDQNGAAEFVVKDVQL 62

Query: 113 IQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYE 172
           I+AEVK++KCLV N VVDI+FNQLGGLCTLCFL+++D  I ++HLFKRSIILIKAWCYYE
Sbjct: 63  IRAEVKLVKCLVQNIVVDISFNQLGGLCTLCFLEKIDRRIGKDHLFKRSIILIKAWCYYE 122

Query: 173 SRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGP 232
           SRILG HHGLIS+YAL TLVLYIFH+F+ +  GPL+VLY+FL++FSKFDWDN+C+SL GP
Sbjct: 123 SRILGAHHGLISTYALETLVLYIFHLFHSALNGPLQVLYKFLDYFSKFDWDNYCISLNGP 182

Query: 233 VPISLLPDVTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPL 292
           V IS LP++ AE P   GG LLLS  FL SC   ++    G E   + F  KH N++DPL
Sbjct: 183 VRISSLPELVAETPDNGGGDLLLSTDFLQSCLETFSVPARGYEANSRAFPIKHLNIVDPL 242

Query: 293 RVNNNLGRSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           + NNNLGRSVSKGNF+RIR+AF++ A+ L  +L  P +++ +EV +FF NT DRH
Sbjct: 243 KENNNLGRSVSKGNFYRIRSAFSYGARKLGFILSHPEDNVVDEVRKFFSNTLDRH 297


>gi|168037604|ref|XP_001771293.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162677382|gb|EDQ63853.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 2035

 Score =  367 bits (942), Expect = 5e-99,   Method: Compositional matrix adjust.
 Identities = 193/373 (51%), Positives = 240/373 (64%), Gaps = 41/373 (10%)

Query: 11  WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQV---------------- 54
           W +AE  TAELI  ++P   SEERR AV  +V RLI   F C+V                
Sbjct: 587 WTRAEGQTAELIDSLKPTRLSEERRTAVTGFVERLIRDRFECEVSALPHELNGFIVRSSA 646

Query: 55  --------FTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFR 106
                     FGSVPLKTYLPD DIDL  F+ +  LK+TWA  V   L+  E +  AEFR
Sbjct: 647 GAVRYSAVIRFGSVPLKTYLPDGDIDLYIFARND-LKETWAQDVLKALKQAEDDADAEFR 705

Query: 107 VKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIK 166
           VKEVQYIQAEVK+IKCLV+N VVDI+FNQ+GGL TLCFL+ VD  +  NHLFKRS+IL+K
Sbjct: 706 VKEVQYIQAEVKLIKCLVENIVVDISFNQIGGLSTLCFLERVDEEVGLNHLFKRSVILVK 765

Query: 167 AWCYYESRILGGHHGLISSYALVTLVLYIFHVFNG--SFAGPLEVLYRFLEFFSKFDWDN 224
           AWCYYESRILG HHGLIS++AL TLVLYIFHVF+   S  GPLEVLY FL +F  FDWD 
Sbjct: 766 AWCYYESRILGAHHGLISTFALETLVLYIFHVFHSMRSLHGPLEVLYLFLTYFCNFDWDQ 825

Query: 225 FCLSLWGPVPISLLPDVTAEPPRKD-------------GGVLLLSKSFLDSCRYAYADFP 271
           +CLS+WGPVP+  +P  ++E  +KD             GG L  S+ F++ C   Y+D  
Sbjct: 826 YCLSIWGPVPLDHIPKNSSELSQKDGGWRTVARSPWEVGGKLYFSEEFIEECINRYSDVR 885

Query: 272 GGQE-NQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKGLARLLDCPNE 330
            G E +QG+ F  K+ NV+DP+R  NNLGRSV+ G+F RIR+AF   A+ L  + +CP +
Sbjct: 886 AGSESSQGRIFNPKYLNVLDPIRHTNNLGRSVNVGSFKRIRSAFGLGARTLGEVFECPKD 945

Query: 331 DLYNEVNQFFMNT 343
            +  +   FF  T
Sbjct: 946 QITEKFKSFFSCT 958


>gi|255564100|ref|XP_002523048.1| nucleic acid binding protein, putative [Ricinus communis]
 gi|223537731|gb|EEF39352.1| nucleic acid binding protein, putative [Ricinus communis]
          Length = 644

 Score =  358 bits (918), Expect = 3e-96,   Method: Compositional matrix adjust.
 Identities = 178/342 (52%), Positives = 238/342 (69%), Gaps = 3/342 (0%)

Query: 6   LDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTY 65
           +D   WL AE+ T E++  +QP   SE++R  V  Y++RLI   +  +VF FGSVPLKTY
Sbjct: 28  IDSELWLMAEKRTQEILWVLQPSSSSEQKRKEVIDYIQRLIKHHYATEVFPFGSVPLKTY 87

Query: 66  LPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVD 125
           LPD DIDL A S  Q +++  A  V D+L   E+N  +E  VK+V+YIQA+VK++KC V 
Sbjct: 88  LPDGDIDLTALSH-QNMEEDLAREVCDILTYAEQNLESE--VKDVRYIQAQVKVVKCSVK 144

Query: 126 NFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISS 185
           N  VDI+FNQ+ GLC LCFL++VD LI ++HL K SIILIKAWC+YESRILG HHGL+S+
Sbjct: 145 NISVDISFNQMAGLCALCFLEQVDQLIGKDHLLKHSIILIKAWCFYESRILGAHHGLLST 204

Query: 186 YALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEP 245
           YAL  LVLYI +VF+ S  GPL VLYRFLE++S FDWDN+C+++ GPV IS LP++  E 
Sbjct: 205 YALEILVLYIVNVFHSSLPGPLAVLYRFLEYYSTFDWDNYCVTINGPVAISSLPEIMTEA 264

Query: 246 PRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKG 305
           P  +   LLL+  FL  C+  ++      EN G  F  KH N++DPL+ +NNLGRSVSKG
Sbjct: 265 PYSNRNELLLTPEFLKRCKERFSVPIKAVENGGHEFSIKHLNILDPLKDSNNLGRSVSKG 324

Query: 306 NFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           NF RI+ A ++ A+ L  +L  P E++   +  FF+NT DR+
Sbjct: 325 NFHRIKCALSYGAQRLGEILMLPGENMGAGLENFFINTLDRN 366


>gi|255559667|ref|XP_002520853.1| nucleic acid binding protein, putative [Ricinus communis]
 gi|223539984|gb|EEF41562.1| nucleic acid binding protein, putative [Ricinus communis]
          Length = 655

 Score =  355 bits (912), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 176/342 (51%), Positives = 235/342 (68%), Gaps = 3/342 (0%)

Query: 6   LDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTY 65
           +D   WL AE+   E++  +QP   SE++R  V  Y++RLI   F  +V  FGSVPLKTY
Sbjct: 28  IDSELWLMAEKRAQEILWILQPSLASEQKRKVVIDYIQRLIKHHFATEVLPFGSVPLKTY 87

Query: 66  LPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVD 125
           LPD DIDL A S  Q +++     + ++L  EE+N  +E  VK+V+YIQA+VKI+KC V 
Sbjct: 88  LPDGDIDLTALSH-QNMEEDLVREICNILTYEEQNSESE--VKDVRYIQAQVKIVKCSVK 144

Query: 126 NFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISS 185
           N  VDI+FNQ+ GLC LCFL++VD LI ++HL K SIILIKAWC+YESRILG HHGL+S+
Sbjct: 145 NISVDISFNQMAGLCALCFLEQVDQLIGKDHLLKCSIILIKAWCFYESRILGAHHGLLST 204

Query: 186 YALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEP 245
           YAL  LVLYI + F+ S  GPL VLYRFLE++S FDWDN+C+++ GPV +S LP++  E 
Sbjct: 205 YALEILVLYIINAFHSSLPGPLAVLYRFLEYYSTFDWDNYCVTINGPVAVSSLPEIMTES 264

Query: 246 PRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKG 305
           P  +G  LLL   FL  C+  ++      EN G  F  KH N++DPL+ NNNLGRSVSKG
Sbjct: 265 PYNNGNELLLCPEFLKRCKEKFSVPIKAVENGGHEFSIKHLNILDPLKDNNNLGRSVSKG 324

Query: 306 NFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           NF RI+ A ++ A+ L  +L  P E++   +  FF+NT DR+
Sbjct: 325 NFHRIKCALSYGAQRLGEILALPGENMGAGLEIFFINTLDRN 366


>gi|293332253|ref|NP_001168029.1| uncharacterized protein LOC100381756 [Zea mays]
 gi|223945595|gb|ACN26881.1| unknown [Zea mays]
 gi|413924674|gb|AFW64606.1| hypothetical protein ZEAMMB73_425366 [Zea mays]
          Length = 833

 Score =  355 bits (911), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 180/342 (52%), Positives = 234/342 (68%), Gaps = 1/342 (0%)

Query: 6   LDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTY 65
           + P  W + E     ++ +IQP   SE  R AV  YV+RL       QVF FGSVPLKTY
Sbjct: 23  VSPDAWRRFETAALAVVNKIQPTAASEHLRAAVVDYVQRLFWFQARYQVFPFGSVPLKTY 82

Query: 66  LPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVD 125
           LPD DIDL  F       +  A+ V  +L++EE+ + +EF VK+VQY+ AEVK++KCLV 
Sbjct: 83  LPDGDIDLTLFGP-AISDENLANEVCTILKSEERRKDSEFEVKDVQYVPAEVKLVKCLVQ 141

Query: 126 NFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISS 185
           N VVDI+ NQ+GGLCTLCFL++VD    ++HLFK+SIILIK WCYYESRILG HHGLIS+
Sbjct: 142 NIVVDISVNQIGGLCTLCFLEKVDQHFGKDHLFKKSIILIKDWCYYESRILGAHHGLIST 201

Query: 186 YALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEP 245
           YAL TLVLYIFH+F+ S  GPL VLYRFL+++SKFDWDN  +SL+GPV +S LP++  +P
Sbjct: 202 YALETLVLYIFHIFHKSLDGPLAVLYRFLDYYSKFDWDNKGISLFGPVSLSSLPELVTDP 261

Query: 246 PRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKG 305
           P       L  + FL  C  +++  P   E   + F  +  N++DPL+ +NNLGRSVSKG
Sbjct: 262 PDIQDDDFLQREEFLKECIESFSVLPRNSETNPRLFSRRFLNIVDPLKQSNNLGRSVSKG 321

Query: 306 NFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           NF+RIR+AF F A+ L ++L  P+     EVNQFF NT  R+
Sbjct: 322 NFYRIRSAFDFGARKLGKILQVPSCLTVGEVNQFFRNTLKRN 363


>gi|413924673|gb|AFW64605.1| hypothetical protein ZEAMMB73_425366 [Zea mays]
          Length = 815

 Score =  355 bits (910), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 180/342 (52%), Positives = 234/342 (68%), Gaps = 1/342 (0%)

Query: 6   LDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTY 65
           + P  W + E     ++ +IQP   SE  R AV  YV+RL       QVF FGSVPLKTY
Sbjct: 23  VSPDAWRRFETAALAVVNKIQPTAASEHLRAAVVDYVQRLFWFQARYQVFPFGSVPLKTY 82

Query: 66  LPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVD 125
           LPD DIDL  F       +  A+ V  +L++EE+ + +EF VK+VQY+ AEVK++KCLV 
Sbjct: 83  LPDGDIDLTLFGP-AISDENLANEVCTILKSEERRKDSEFEVKDVQYVPAEVKLVKCLVQ 141

Query: 126 NFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISS 185
           N VVDI+ NQ+GGLCTLCFL++VD    ++HLFK+SIILIK WCYYESRILG HHGLIS+
Sbjct: 142 NIVVDISVNQIGGLCTLCFLEKVDQHFGKDHLFKKSIILIKDWCYYESRILGAHHGLIST 201

Query: 186 YALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEP 245
           YAL TLVLYIFH+F+ S  GPL VLYRFL+++SKFDWDN  +SL+GPV +S LP++  +P
Sbjct: 202 YALETLVLYIFHIFHKSLDGPLAVLYRFLDYYSKFDWDNKGISLFGPVSLSSLPELVTDP 261

Query: 246 PRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKG 305
           P       L  + FL  C  +++  P   E   + F  +  N++DPL+ +NNLGRSVSKG
Sbjct: 262 PDIQDDDFLQREEFLKECIESFSVLPRNSETNPRLFSRRFLNIVDPLKQSNNLGRSVSKG 321

Query: 306 NFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           NF+RIR+AF F A+ L ++L  P+     EVNQFF NT  R+
Sbjct: 322 NFYRIRSAFDFGARKLGKILQVPSCLTVGEVNQFFRNTLKRN 363


>gi|108708029|gb|ABF95824.1| expressed protein [Oryza sativa Japonica Group]
          Length = 1004

 Score =  350 bits (897), Expect = 7e-94,   Method: Compositional matrix adjust.
 Identities = 179/337 (53%), Positives = 231/337 (68%), Gaps = 12/337 (3%)

Query: 13  KAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDID 72
           +AEE   E++ R++P   SE RR AV  Y RRL+     C+VF +GSVPLKTYLPD D+D
Sbjct: 35  RAEEAAGEVVRRVRPTEASERRRAAVVGYARRLVGTALGCEVFAYGSVPLKTYLPDGDVD 94

Query: 73  L---GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVV 129
           L   G  S   TL D   H+    L++EE+N  AEF VK++Q I AEV++IKC ++N VV
Sbjct: 95  LTVLGNTSYGSTLIDDIYHI----LQSEEQNCDAEFEVKDLQLINAEVRLIKCTIENIVV 150

Query: 130 DIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALV 189
           DI+FNQ GG+C LCFL+ VD  + +NHL K SIILIKAWCYYESR+LG HHGLIS+YAL 
Sbjct: 151 DISFNQTGGICALCFLELVDRKVGKNHLVKNSIILIKAWCYYESRLLGAHHGLISTYALE 210

Query: 190 TLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKD 249
           TL+LYIF++F+ S  GPLEVLYRFLE+FSKFDWDN+C+SL GPV +S LP+   E     
Sbjct: 211 TLILYIFNLFHKSLHGPLEVLYRFLEYFSKFDWDNYCISLNGPVALSSLPNQIVEATNTP 270

Query: 250 GGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFR 309
           G  LL  K FL++            E     F SK+ N+IDPL+ +NNLGRSV+K +F R
Sbjct: 271 GSDLLFDKEFLNNSVQKTDSNACNTE-----FRSKYLNIIDPLKEHNNLGRSVNKASFNR 325

Query: 310 IRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDR 346
           IRTAF++ A+ L ++L    E + +E+  FF NT +R
Sbjct: 326 IRTAFSYGAQKLGQVLLLQPELIPDEIYGFFKNTLNR 362


>gi|242069725|ref|XP_002450139.1| hypothetical protein SORBIDRAFT_05g001080 [Sorghum bicolor]
 gi|241935982|gb|EES09127.1| hypothetical protein SORBIDRAFT_05g001080 [Sorghum bicolor]
          Length = 835

 Score =  350 bits (897), Expect = 7e-94,   Method: Compositional matrix adjust.
 Identities = 181/345 (52%), Positives = 237/345 (68%), Gaps = 7/345 (2%)

Query: 6   LDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTY 65
           + P  W + E     ++ +IQP   SE+ R AV  YV+RL       QVF FGSVPLKTY
Sbjct: 23  VSPDAWRRFETAALAVVNKIQPTAASEQLRAAVIEYVQRLFWFQARYQVFPFGSVPLKTY 82

Query: 66  LPDRDIDLGAFS---DDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKC 122
           LPD DIDL  F     D+ L    A+ V  +L++EE+ + +EF VK+V Y+ AEVK++KC
Sbjct: 83  LPDGDIDLTLFGPAISDENL----ANEVCAILKSEERRKDSEFEVKDVHYVPAEVKLVKC 138

Query: 123 LVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGL 182
           LV N VVDI+ NQ+GGLCTLCFL++VD    +NHLFKRSI+L+K WCYYESRILG HHGL
Sbjct: 139 LVQNIVVDISVNQIGGLCTLCFLEKVDQNFGKNHLFKRSIMLVKDWCYYESRILGAHHGL 198

Query: 183 ISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVT 242
           IS+YAL TLVLYIFH+F+ S  GPL VLYRFL+++SKFDWDN  +SL+GPV +S LP++ 
Sbjct: 199 ISTYALETLVLYIFHIFHKSLDGPLAVLYRFLDYYSKFDWDNKGISLFGPVSLSSLPELV 258

Query: 243 AEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSV 302
            +PP       L  + FL  C  +++  P   E   + F  +  N++DPL+ +NNLGRSV
Sbjct: 259 TDPPDTQDDDFLQREEFLKECTESFSVLPRNSETNPRVFSRRFLNIVDPLKQSNNLGRSV 318

Query: 303 SKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           SKGNF+RIR+AF F A+ L ++L  P+    +EVNQFF NT  R+
Sbjct: 319 SKGNFYRIRSAFDFGARKLGKILQVPSCLTVSEVNQFFRNTLKRN 363


>gi|115452887|ref|NP_001050044.1| Os03g0336700 [Oryza sativa Japonica Group]
 gi|108708028|gb|ABF95823.1| expressed protein [Oryza sativa Japonica Group]
 gi|113548515|dbj|BAF11958.1| Os03g0336700 [Oryza sativa Japonica Group]
          Length = 1035

 Score =  350 bits (897), Expect = 7e-94,   Method: Compositional matrix adjust.
 Identities = 179/337 (53%), Positives = 231/337 (68%), Gaps = 12/337 (3%)

Query: 13  KAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDID 72
           +AEE   E++ R++P   SE RR AV  Y RRL+     C+VF +GSVPLKTYLPD D+D
Sbjct: 35  RAEEAAGEVVRRVRPTEASERRRAAVVGYARRLVGTALGCEVFAYGSVPLKTYLPDGDVD 94

Query: 73  L---GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVV 129
           L   G  S   TL D   H+    L++EE+N  AEF VK++Q I AEV++IKC ++N VV
Sbjct: 95  LTVLGNTSYGSTLIDDIYHI----LQSEEQNCDAEFEVKDLQLINAEVRLIKCTIENIVV 150

Query: 130 DIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALV 189
           DI+FNQ GG+C LCFL+ VD  + +NHL K SIILIKAWCYYESR+LG HHGLIS+YAL 
Sbjct: 151 DISFNQTGGICALCFLELVDRKVGKNHLVKNSIILIKAWCYYESRLLGAHHGLISTYALE 210

Query: 190 TLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKD 249
           TL+LYIF++F+ S  GPLEVLYRFLE+FSKFDWDN+C+SL GPV +S LP+   E     
Sbjct: 211 TLILYIFNLFHKSLHGPLEVLYRFLEYFSKFDWDNYCISLNGPVALSSLPNQIVEATNTP 270

Query: 250 GGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFR 309
           G  LL  K FL++            E     F SK+ N+IDPL+ +NNLGRSV+K +F R
Sbjct: 271 GSDLLFDKEFLNNSVQKTDSNACNTE-----FRSKYLNIIDPLKEHNNLGRSVNKASFNR 325

Query: 310 IRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDR 346
           IRTAF++ A+ L ++L    E + +E+  FF NT +R
Sbjct: 326 IRTAFSYGAQKLGQVLLLQPELIPDEIYGFFKNTLNR 362


>gi|414882101|tpg|DAA59232.1| TPA: hypothetical protein ZEAMMB73_861907 [Zea mays]
          Length = 906

 Score =  348 bits (893), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 177/341 (51%), Positives = 231/341 (67%), Gaps = 9/341 (2%)

Query: 11  WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRD 70
           W + E     ++  IQP   SE  R A+  YV+RL+      QVF FGSVPLKTYLPD D
Sbjct: 29  WRRFESAALGILYTIQPSATSEHLRAAIIDYVQRLLASHSGVQVFPFGSVPLKTYLPDGD 88

Query: 71  IDLGAF----SDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDN 126
           IDL  F    SD++   +  A     +L++EE  + +EF VK+VQYI AEVK++KC+V N
Sbjct: 89  IDLTTFGPAISDEKLANEVCA-----ILKSEEHRKDSEFDVKDVQYIHAEVKLVKCVVQN 143

Query: 127 FVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSY 186
            +VDI+ NQ+GGLCTLCFL++VD    + HLFKRS++LIK WCYYE+RILG HHGLIS+Y
Sbjct: 144 IIVDISVNQIGGLCTLCFLEKVDENFGKKHLFKRSVMLIKDWCYYETRILGAHHGLISTY 203

Query: 187 ALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPP 246
           AL  LVLYIFH+F+ S  GPL VLYRFL+++S+FDWD   +SL+GPV +S LPD+  +PP
Sbjct: 204 ALEILVLYIFHIFHKSLNGPLAVLYRFLDYYSQFDWDAKGISLFGPVSLSSLPDLVTDPP 263

Query: 247 RKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGN 306
                  LL + FL  C  A++  P   E   Q F  K  N++DPL+ +NNLGRSVS+GN
Sbjct: 264 VIHDDGFLLREKFLRECADAFSVPPRNSEKDAQLFSRKFLNIVDPLKQSNNLGRSVSRGN 323

Query: 307 FFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           F+RIR+AF F A+ L ++L  P     +EVNQFF NT  R+
Sbjct: 324 FYRIRSAFDFGARKLGKILQRPVCYTVDEVNQFFGNTLKRN 364


>gi|414882102|tpg|DAA59233.1| TPA: hypothetical protein ZEAMMB73_861907 [Zea mays]
          Length = 875

 Score =  348 bits (893), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 177/341 (51%), Positives = 231/341 (67%), Gaps = 9/341 (2%)

Query: 11  WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRD 70
           W + E     ++  IQP   SE  R A+  YV+RL+      QVF FGSVPLKTYLPD D
Sbjct: 29  WRRFESAALGILYTIQPSATSEHLRAAIIDYVQRLLASHSGVQVFPFGSVPLKTYLPDGD 88

Query: 71  IDLGAF----SDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDN 126
           IDL  F    SD++   +  A     +L++EE  + +EF VK+VQYI AEVK++KC+V N
Sbjct: 89  IDLTTFGPAISDEKLANEVCA-----ILKSEEHRKDSEFDVKDVQYIHAEVKLVKCVVQN 143

Query: 127 FVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSY 186
            +VDI+ NQ+GGLCTLCFL++VD    + HLFKRS++LIK WCYYE+RILG HHGLIS+Y
Sbjct: 144 IIVDISVNQIGGLCTLCFLEKVDENFGKKHLFKRSVMLIKDWCYYETRILGAHHGLISTY 203

Query: 187 ALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPP 246
           AL  LVLYIFH+F+ S  GPL VLYRFL+++S+FDWD   +SL+GPV +S LPD+  +PP
Sbjct: 204 ALEILVLYIFHIFHKSLNGPLAVLYRFLDYYSQFDWDAKGISLFGPVSLSSLPDLVTDPP 263

Query: 247 RKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGN 306
                  LL + FL  C  A++  P   E   Q F  K  N++DPL+ +NNLGRSVS+GN
Sbjct: 264 VIHDDGFLLREKFLRECADAFSVPPRNSEKDAQLFSRKFLNIVDPLKQSNNLGRSVSRGN 323

Query: 307 FFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           F+RIR+AF F A+ L ++L  P     +EVNQFF NT  R+
Sbjct: 324 FYRIRSAFDFGARKLGKILQRPVCYTVDEVNQFFGNTLKRN 364


>gi|357112328|ref|XP_003557961.1| PREDICTED: uncharacterized protein LOC100823912 [Brachypodium
           distachyon]
          Length = 1051

 Score =  348 bits (892), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 180/330 (54%), Positives = 227/330 (68%), Gaps = 7/330 (2%)

Query: 21  LIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDIDL---GAFS 77
           ++ R+QP   SE RR  V  Y RR++     C+VF FGSVPLKTYLPD DIDL   G  S
Sbjct: 38  VVRRVQPTEASERRRAEVIDYARRIVGTALGCEVFAFGSVPLKTYLPDGDIDLTVLGNAS 97

Query: 78  DDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLG 137
            D TL D     V  +L + E+N  AEF VK++++I AEVK+IKC ++N +VDI+FNQ G
Sbjct: 98  CDSTLIDD----VYCILGSGEQNSDAEFEVKDLEHIDAEVKLIKCTIENIIVDISFNQTG 153

Query: 138 GLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFH 197
           G+C LCFL+ VD  I +NHLFKRSIILIKAWCYYESR+LG HHGLIS+YAL TL+LYIF+
Sbjct: 154 GICALCFLELVDRKIGKNHLFKRSIILIKAWCYYESRLLGAHHGLISTYALETLILYIFN 213

Query: 198 VFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVLLLSK 257
           +F+ S  GPLEVLYRFLE+FSKFDWDN+C+SL GPV +S LP++  E        LL  K
Sbjct: 214 LFHKSLHGPLEVLYRFLEYFSKFDWDNYCISLNGPVALSSLPNLIVEGTNIPVDDLLFDK 273

Query: 258 SFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFR 317
            FL S     +  P   + +   F  KH N+IDPL+  NNLGRSV+K NF RIRTAF++ 
Sbjct: 274 EFLHSSVEKASVPPRDSDARCTKFRVKHLNIIDPLKECNNLGRSVNKANFSRIRTAFSYG 333

Query: 318 AKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           A+ L + L  P+E +  E+  FF NT  R+
Sbjct: 334 ARKLGQYLMLPSERISGEIFGFFKNTLKRN 363


>gi|242041009|ref|XP_002467899.1| hypothetical protein SORBIDRAFT_01g036080 [Sorghum bicolor]
 gi|241921753|gb|EER94897.1| hypothetical protein SORBIDRAFT_01g036080 [Sorghum bicolor]
          Length = 1046

 Score =  343 bits (881), Expect = 5e-92,   Method: Compositional matrix adjust.
 Identities = 171/328 (52%), Positives = 225/328 (68%), Gaps = 1/328 (0%)

Query: 20  ELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSDD 79
           E++ R++P   SE RR  V  Y RRL+     C+VF FGSVPLKTYLPD DIDL    + 
Sbjct: 35  EVVRRVRPTEASERRRADVVDYARRLVGSALGCEVFAFGSVPLKTYLPDGDIDLTVLGN- 93

Query: 80  QTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGL 139
            +   T  + V  +LE+EE+N  AEF VK ++ I AEV++IKC + N ++DI+FNQ GG+
Sbjct: 94  TSYDSTLVNDVYCILESEEQNSDAEFIVKNLERIDAEVRLIKCTIGNIIIDISFNQTGGI 153

Query: 140 CTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVF 199
           C LCFL+ VD  + +NHLFKRSIILIKAWCYYESR+LG HHGLIS+YAL  L+LYIF++F
Sbjct: 154 CALCFLELVDRKVGKNHLFKRSIILIKAWCYYESRLLGAHHGLISTYALEVLILYIFNLF 213

Query: 200 NGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVLLLSKSF 259
           + S   PLEVLYRFLE+FSKFDWDN+C+SL GPV +S LP++T E        LL  K F
Sbjct: 214 HKSLHSPLEVLYRFLEYFSKFDWDNYCISLNGPVALSSLPNLTVEATITHTSDLLFDKEF 273

Query: 260 LDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRAK 319
           L S        P   ++    F  KH N++DPL+ +NNLGRSV++ +F RIRTAF + A+
Sbjct: 274 LKSSMDKATVPPKNSDSCYTRFRPKHLNIVDPLKEHNNLGRSVNRASFNRIRTAFLYGAR 333

Query: 320 GLARLLDCPNEDLYNEVNQFFMNTRDRH 347
            L  +L  P+E + +E+  FF NT +R+
Sbjct: 334 KLGHILMLPSEVIPDEIYGFFKNTLERN 361


>gi|414888115|tpg|DAA64129.1| TPA: hypothetical protein ZEAMMB73_121752 [Zea mays]
          Length = 942

 Score =  343 bits (879), Expect = 9e-92,   Method: Compositional matrix adjust.
 Identities = 176/332 (53%), Positives = 231/332 (69%), Gaps = 11/332 (3%)

Query: 19  AELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSD 78
            E++ R+ P   +E RR  V AY+RRLI  C  C+VF FGSVPL+TYLPD D+D+    +
Sbjct: 68  GEVVLRVHPTREAERRRQDVIAYLRRLIGSCLGCEVFAFGSVPLRTYLPDGDVDITVLGN 127

Query: 79  DQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGG 138
              L  T+   VR ML++E++N  AEF++  +Q+I AEVK+IKC+++N +VD++FNQ+GG
Sbjct: 128 TW-LNSTFIDDVRSMLQSEQENCDAEFKLTGLQFINAEVKLIKCVIENIIVDVSFNQIGG 186

Query: 139 LCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHV 198
           + T CFL+ VD  I +NHLFKRSI+LIKAWCY+ESRILG HHGLIS+YAL TLVLYIF++
Sbjct: 187 VSTFCFLELVDRQIGQNHLFKRSIMLIKAWCYHESRILGAHHGLISTYALETLVLYIFNM 246

Query: 199 FNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVLL---L 255
           F+ S  GPLE LYRFLE+FSKFDWD + +SL G V +S L   T EP    G  LL   L
Sbjct: 247 FHKSLHGPLEALYRFLEYFSKFDWDRYGISLNGQVDLSSL---TVEPTDVQGESLLGKEL 303

Query: 256 SKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFT 315
            + +LD       +F G     G  F  K  N+IDPL+ NNNLGRSVSK NF+RIR+AF+
Sbjct: 304 QQGYLDRLVVIPNEFDGC----GTQFRQKFLNIIDPLKANNNLGRSVSKANFYRIRSAFS 359

Query: 316 FRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           F A+ L ++L  P+E + +E+  FF NT  RH
Sbjct: 360 FGAQKLGQILLLPSEYIRDEIYGFFANTLKRH 391


>gi|414591190|tpg|DAA41761.1| TPA: hypothetical protein ZEAMMB73_453733 [Zea mays]
          Length = 918

 Score =  341 bits (875), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 171/334 (51%), Positives = 227/334 (67%), Gaps = 2/334 (0%)

Query: 14  AEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDIDL 73
           AE    E++ R+ P   +E RR  V AY+ RLI     C+VF FGSVPL+TYLPD D+D+
Sbjct: 75  AEAAAGEVLLRVHPTREAERRRQDVIAYLTRLIGSSLGCEVFAFGSVPLRTYLPDGDVDI 134

Query: 74  GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAF 133
               +   L  T    VR ML++E++N  AE ++  + +I AEVK+IKC+++N +VD++F
Sbjct: 135 TVLGNTW-LNSTLIDDVRSMLQSEQENCDAELKLTGLHFIDAEVKLIKCVIENIIVDVSF 193

Query: 134 NQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVL 193
           NQ+GG+ T CFL+ VD  + +NHLFKRSI+L KAWCY+ESRILG HHGLIS+YAL TLVL
Sbjct: 194 NQIGGVSTFCFLELVDRQVGKNHLFKRSIMLTKAWCYHESRILGAHHGLISTYALETLVL 253

Query: 194 YIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVL 253
           YIF++F+ S  GPLEVLY+FLE+FSKFDWD + +SL GPV +S LP +T EP     G L
Sbjct: 254 YIFNMFHKSLHGPLEVLYKFLEYFSKFDWDRYGISLNGPVDLSSLPSLTVEPTEVQ-GEL 312

Query: 254 LLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTA 313
           LL K F           P   +     F  K  N++DPL+ NNNLGRSVSK NF+RIR+A
Sbjct: 313 LLGKDFHQGSLDRLVVIPNEFDGCDTQFRQKFLNIVDPLKANNNLGRSVSKANFYRIRSA 372

Query: 314 FTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           F+F A+ L ++L  P+E + +E+  FF NT  RH
Sbjct: 373 FSFGAQKLGQILLLPSEYICDEIYGFFSNTLKRH 406


>gi|224146203|ref|XP_002325920.1| predicted protein [Populus trichocarpa]
 gi|222862795|gb|EEF00302.1| predicted protein [Populus trichocarpa]
          Length = 490

 Score =  339 bits (869), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 175/343 (51%), Positives = 233/343 (67%), Gaps = 7/343 (2%)

Query: 6   LDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTY 65
           +DP  WL AE+ T E++  IQP   SE +R  V  Y++ LI   F  +VF FGSVPLKTY
Sbjct: 28  IDPELWLMAEKRTQEILYTIQPTFASEHKRMEVINYIQSLIKYYFTVEVFAFGSVPLKTY 87

Query: 66  LPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVD 125
           LPD DIDL   S  Q +++  A  V  +L+ EE +   EF+V +VQYI A+VK++KC V 
Sbjct: 88  LPDGDIDLMVLSH-QNMEEELARGVCTLLQREELD--PEFQVNDVQYIHAQVKLVKCSVK 144

Query: 126 NFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISS 185
           N  VDI+FNQ+ G   LCFL++VD LI ++HLFKRSIILIKAWC+YESRILG HHGLIS+
Sbjct: 145 NISVDISFNQMAGPSALCFLEQVDQLIGQDHLFKRSIILIKAWCFYESRILGAHHGLIST 204

Query: 186 YALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEP 245
           YAL  LVL I +VF+ S   PL VLY+FL+++S FDWDN+C+S+ GP+PIS  P   +  
Sbjct: 205 YALQILVLNIINVFHSSLPDPLAVLYKFLDYYSAFDWDNYCVSINGPIPISSFPQTDST- 263

Query: 246 PRKDGGVLLLSKSFLDSCRYAYADFPGGQ-ENQGQPFVSKHFNVIDPLRVNNNLGRSVSK 304
              +G   L+S+ FL + R  +A FP  + EN    F  KH N++DPL+ +NNLGRSV+K
Sbjct: 264 -HNNGNESLISQEFLRNFREKFA-FPMKELENGAHEFPIKHLNIVDPLKSSNNLGRSVNK 321

Query: 305 GNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           GNF RIR A ++ A+ L  ++  P E +   + +FFMNT DR+
Sbjct: 322 GNFHRIRGALSYGAQRLGEIIALPGEAMGGRLEKFFMNTLDRN 364


>gi|42565972|ref|NP_191191.2| PAP/OAS1 substrate-binding domain-containing protein [Arabidopsis
           thaliana]
 gi|30725328|gb|AAP37686.1| At3g56320 [Arabidopsis thaliana]
 gi|110736147|dbj|BAF00045.1| hypothetical protein [Arabidopsis thaliana]
 gi|332645988|gb|AEE79509.1| PAP/OAS1 substrate-binding domain-containing protein [Arabidopsis
           thaliana]
          Length = 603

 Score =  338 bits (866), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 168/343 (48%), Positives = 231/343 (67%), Gaps = 4/343 (1%)

Query: 5   PLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKT 64
           P+D   W+ AEE   E++  IQP   S+  RN +  YVR LI+     +VF+FGSVPLKT
Sbjct: 34  PIDADSWMIAEERAHEILCTIQPALVSDRSRNEIIDYVRTLIMSHEGIEVFSFGSVPLKT 93

Query: 65  YLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLV 124
           YLPD DIDL   +      D +  L    L+NEE+   +EF   +VQ+I A+VK+IKC +
Sbjct: 94  YLPDGDIDLTVLTKQNMDDDFYGQLC-SRLQNEER--ESEFHATDVQFIPAQVKVIKCNI 150

Query: 125 DNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLIS 184
            N  VDI+FNQ  GLC LCFL++VD L   +HLFKRSIIL+KAWCYYESRILG + GLIS
Sbjct: 151 RNIAVDISFNQTAGLCALCFLEQVDQLFGRDHLFKRSIILVKAWCYYESRILGANTGLIS 210

Query: 185 SYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAE 244
           +YAL  LVLYI ++F+ S +GPL VLY+FL+++  FDW+N+C+S+ GPVPIS LP++TA 
Sbjct: 211 TYALAVLVLYIINLFHSSLSGPLAVLYKFLDYYGSFDWNNYCISVNGPVPISSLPELTAA 270

Query: 245 PPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSK 304
            P ++G  LLL + FL +C   Y+      ++ G  F  KH N++DPL+ +NNLG+SV++
Sbjct: 271 SP-ENGHELLLDEKFLRNCVELYSAPTKAVDSNGLEFPIKHLNIVDPLKYSNNLGKSVTQ 329

Query: 305 GNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           GN  RIR AFT  A+ L  +L  P + +   + +FF N+ +R+
Sbjct: 330 GNVQRIRHAFTLGARKLRDVLSLPGDTMGWRLEKFFRNSLERN 372


>gi|326517667|dbj|BAK03752.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 334

 Score =  337 bits (865), Expect = 4e-90,   Method: Compositional matrix adjust.
 Identities = 170/302 (56%), Positives = 213/302 (70%), Gaps = 1/302 (0%)

Query: 6   LDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTY 65
           +  G W   E+  A ++ RIQP   SE+RR AV  YV+RLI     C+VF FGSVPLKTY
Sbjct: 30  ISAGAWRPFEDAAAAVVGRIQPSVSSEDRRAAVVHYVQRLIRCSVGCEVFPFGSVPLKTY 89

Query: 66  LPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVD 125
           LPD DIDL AF    +  +  A+ VR +LE+EE  + AEF VK+VQYI AEVK++KCLV 
Sbjct: 90  LPDGDIDLTAFGSASS-DENLANEVRAVLESEELRKDAEFEVKDVQYIHAEVKLVKCLVQ 148

Query: 126 NFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISS 185
           N VVDI+FNQ+GGLCTLCFL++VD    + HLFK+SI+LIKAWCYYESRILG HHGLIS+
Sbjct: 149 NIVVDISFNQIGGLCTLCFLEQVDERFGKKHLFKKSIMLIKAWCYYESRILGAHHGLIST 208

Query: 186 YALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEP 245
           YAL  LVLYIFH+F+ S  GPL VLYRFL+++SKFDWDN  +SL+GPVP+S LP++ ++ 
Sbjct: 209 YALEILVLYIFHLFHKSLDGPLAVLYRFLDYYSKFDWDNKGISLYGPVPLSSLPELVSDT 268

Query: 246 PRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKG 305
           P       L  + FL      +   P   E   + F+ K  N++DPL+ NNNLGRSVSKG
Sbjct: 269 PDTHDVDFLKREEFLKEFAQMFTVPPRSFERNNRLFLRKFLNIVDPLKQNNNLGRSVSKG 328

Query: 306 NF 307
            F
Sbjct: 329 FF 330


>gi|414866687|tpg|DAA45244.1| TPA: hypothetical protein ZEAMMB73_273182 [Zea mays]
          Length = 1050

 Score =  335 bits (860), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 167/324 (51%), Positives = 221/324 (68%), Gaps = 1/324 (0%)

Query: 24  RIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
           R++P   SE RR  V  Y RRL+     C+VF FGSVPLKTYLPD DIDL    +  +  
Sbjct: 37  RVRPTEASERRRAEVVDYARRLVGSALGCEVFAFGSVPLKTYLPDGDIDLTVLGN-TSYD 95

Query: 84  DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLC 143
            T  + V  +LE+EE+N  AEF VK+++ I AEV++IKC + N +VDI+FNQ GG+C LC
Sbjct: 96  STLVNDVFCILESEEQNSDAEFVVKDLERIDAEVRLIKCTIGNIIVDISFNQTGGICALC 155

Query: 144 FLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSF 203
           FL+ VD  + +NHLFKRSIILIKAWCYYESR+LG HHGLIS+YAL  L+LY+F++F+ S 
Sbjct: 156 FLELVDRKVGKNHLFKRSIILIKAWCYYESRLLGAHHGLISTYALEVLILYVFNLFHKSL 215

Query: 204 AGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVLLLSKSFLDSC 263
             P+EVLYRFLE+FSKFDWDN+C+SL GPV +S LP++  E        LL  K FL S 
Sbjct: 216 HSPVEVLYRFLEYFSKFDWDNYCISLNGPVALSSLPNLIVEATVTHTSDLLFDKEFLKSS 275

Query: 264 RYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKGLAR 323
                  P   ++    F  KH N++DPL+  NNLGRSV++ +F RIRTAF + A+ L  
Sbjct: 276 MDKATVPPKNSDSCYPRFRPKHLNIVDPLKEYNNLGRSVNRASFNRIRTAFLYGARKLGH 335

Query: 324 LLDCPNEDLYNEVNQFFMNTRDRH 347
           ++  P+E + +E+ +FF NT  R+
Sbjct: 336 IVTLPSEVIPDEIYEFFKNTLGRN 359


>gi|414866686|tpg|DAA45243.1| TPA: hypothetical protein ZEAMMB73_273182 [Zea mays]
          Length = 1056

 Score =  335 bits (860), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 167/324 (51%), Positives = 221/324 (68%), Gaps = 1/324 (0%)

Query: 24  RIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
           R++P   SE RR  V  Y RRL+     C+VF FGSVPLKTYLPD DIDL    +  +  
Sbjct: 37  RVRPTEASERRRAEVVDYARRLVGSALGCEVFAFGSVPLKTYLPDGDIDLTVLGN-TSYD 95

Query: 84  DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLC 143
            T  + V  +LE+EE+N  AEF VK+++ I AEV++IKC + N +VDI+FNQ GG+C LC
Sbjct: 96  STLVNDVFCILESEEQNSDAEFVVKDLERIDAEVRLIKCTIGNIIVDISFNQTGGICALC 155

Query: 144 FLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSF 203
           FL+ VD  + +NHLFKRSIILIKAWCYYESR+LG HHGLIS+YAL  L+LY+F++F+ S 
Sbjct: 156 FLELVDRKVGKNHLFKRSIILIKAWCYYESRLLGAHHGLISTYALEVLILYVFNLFHKSL 215

Query: 204 AGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVLLLSKSFLDSC 263
             P+EVLYRFLE+FSKFDWDN+C+SL GPV +S LP++  E        LL  K FL S 
Sbjct: 216 HSPVEVLYRFLEYFSKFDWDNYCISLNGPVALSSLPNLIVEATVTHTSDLLFDKEFLKSS 275

Query: 264 RYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKGLAR 323
                  P   ++    F  KH N++DPL+  NNLGRSV++ +F RIRTAF + A+ L  
Sbjct: 276 MDKATVPPKNSDSCYPRFRPKHLNIVDPLKEYNNLGRSVNRASFNRIRTAFLYGARKLGH 335

Query: 324 LLDCPNEDLYNEVNQFFMNTRDRH 347
           ++  P+E + +E+ +FF NT  R+
Sbjct: 336 IVTLPSEVIPDEIYEFFKNTLGRN 359


>gi|168035287|ref|XP_001770142.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162678668|gb|EDQ65124.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1504

 Score =  333 bits (853), Expect = 9e-89,   Method: Compositional matrix adjust.
 Identities = 167/296 (56%), Positives = 204/296 (68%), Gaps = 9/296 (3%)

Query: 56  TFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQA 115
           TFGSVPLKTYLPD DIDL AF+    +K TW     + L+  + N ++EFRVKEVQ I A
Sbjct: 143 TFGSVPLKTYLPDGDIDLSAFTPSPDVKRTWIQDTYNALQKAKDNPNSEFRVKEVQLIHA 202

Query: 116 EVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRI 175
           EVKI+KC V+N +VD++F+QLGGL TLCFL EVD LI E+HLFKRSIIL+KAWCYYESRI
Sbjct: 203 EVKIVKCFVENILVDVSFDQLGGLGTLCFLVEVDKLIGEDHLFKRSIILVKAWCYYESRI 262

Query: 176 LGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPI 235
           LG H GL+S+YA+  LVLYIF  F+ S  GPL+VLY FLEFFS FDWDN+C+SL  P+P+
Sbjct: 263 LGAHCGLMSTYAVEALVLYIFDKFHASLRGPLQVLYLFLEFFSSFDWDNYCVSLSSPIPL 322

Query: 236 SLLPDVT---------AEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHF 286
             L   +         A   R+DGG L  +K FL +C   Y   P  Q  +   F  K  
Sbjct: 323 KSLSKDSEKLEDLQKLALSTRRDGGELFFTKEFLVACETEYGVVPVSQITKSNKFTVKCL 382

Query: 287 NVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMN 342
           N+ DPLR +NNLGRSV++GNF RIR AF F A+ L R+L C  ED+  E+ QFF N
Sbjct: 383 NISDPLRSSNNLGRSVNQGNFARIRRAFDFGARTLRRVLSCTEEDVPAELEQFFKN 438



 Score = 42.0 bits (97), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 23/55 (41%), Positives = 30/55 (54%)

Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTY 65
          W KAE   AELI  +QP+  SE+RR  V  YVR L+  C   Q     ++ LK +
Sbjct: 36 WAKAELRAAELITSLQPNEASEQRRQDVIDYVRGLVKGCIYGQCLHSEALCLKHF 90


>gi|7572930|emb|CAB87431.1| putative protein [Arabidopsis thaliana]
          Length = 614

 Score =  331 bits (848), Expect = 4e-88,   Method: Compositional matrix adjust.
 Identities = 169/354 (47%), Positives = 231/354 (65%), Gaps = 15/354 (4%)

Query: 5   PLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKT 64
           P+D   W+ AEE   E++  IQP   S+  RN +  YVR LI+     +VF+FGSVPLKT
Sbjct: 34  PIDADSWMIAEERAHEILCTIQPALVSDRSRNEIIDYVRTLIMSHEGIEVFSFGSVPLKT 93

Query: 65  YLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLV 124
           YLPD DIDL   +      D +  L    L+NEE+   +EF   +VQ+I A+VK+IKC +
Sbjct: 94  YLPDGDIDLTVLTKQNMDDDFYGQLC-SRLQNEER--ESEFHATDVQFIPAQVKVIKCNI 150

Query: 125 DNFVVDIAFNQLGGLCTLCFLD-----------EVDHLINENHLFKRSIILIKAWCYYES 173
            N  VDI+FNQ  GLC LCFL+           EVD L   +HLFKRSIIL+KAWCYYES
Sbjct: 151 RNIAVDISFNQTAGLCALCFLEQVLSAIQNQAPEVDQLFGRDHLFKRSIILVKAWCYYES 210

Query: 174 RILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPV 233
           RILG + GLIS+YAL  LVLYI ++F+ S +GPL VLY+FL+++  FDW+N+C+S+ GPV
Sbjct: 211 RILGANTGLISTYALAVLVLYIINLFHSSLSGPLAVLYKFLDYYGSFDWNNYCISVNGPV 270

Query: 234 PISLLPDVTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLR 293
           PIS LP++TA  P ++G  LLL + FL +C   Y+      ++ G  F  KH N++DPL+
Sbjct: 271 PISSLPELTAASP-ENGHELLLDEKFLRNCVELYSAPTKAVDSNGLEFPIKHLNIVDPLK 329

Query: 294 VNNNLGRSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
            +NNLG+SV++GN  RIR AFT  A+ L  +L  P + +   + +FF N+ +R+
Sbjct: 330 YSNNLGKSVTQGNVQRIRHAFTLGARKLRDVLSLPGDTMGWRLEKFFRNSLERN 383


>gi|297820390|ref|XP_002878078.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323916|gb|EFH54337.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 602

 Score =  330 bits (845), Expect = 7e-88,   Method: Compositional matrix adjust.
 Identities = 165/337 (48%), Positives = 228/337 (67%), Gaps = 4/337 (1%)

Query: 11  WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRD 70
           W+ AEE   E++  IQP   S++ RN +  YVR LI      +VF+FGSVPLKTYLPD D
Sbjct: 40  WMIAEERAHEILCTIQPALVSDKSRNEIIDYVRTLIKSHDGIEVFSFGSVPLKTYLPDGD 99

Query: 71  IDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVD 130
           IDL   +      D +  L    L+NEE+   +EF   +VQ+I A+VK+IKC + N  VD
Sbjct: 100 IDLTVLTKQNMDDDFYGQLC-SRLQNEER--ESEFHATDVQFIPAQVKVIKCNIRNIAVD 156

Query: 131 IAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVT 190
           I+FNQ  GLC LCFL++VD L   +HLFKRSIIL+KAWCYYESRILG + GLIS+YAL  
Sbjct: 157 ISFNQTAGLCALCFLEQVDQLFGRDHLFKRSIILVKAWCYYESRILGANTGLISTYALAV 216

Query: 191 LVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDG 250
           LVLYI ++F+ S +GPL VLY+FL+++  FDW+N+C+S+ GPVPIS LP++TA  P ++G
Sbjct: 217 LVLYIINLFHSSLSGPLAVLYKFLDYYGSFDWNNYCISVNGPVPISSLPELTAASP-ENG 275

Query: 251 GVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRI 310
             LLL + FL +C   ++      ++ G  F  KH N++DPL+ +NNLG+SV++GN  RI
Sbjct: 276 HELLLDEKFLRNCVELFSAPTKAVDSNGLDFPIKHLNIVDPLKYSNNLGKSVTQGNVQRI 335

Query: 311 RTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           R AFT  A+ L  +L  P + +   + +FF N+ +R+
Sbjct: 336 RHAFTLGARKLRDVLSLPGDTMGWRLEKFFRNSLERN 372


>gi|326490774|dbj|BAJ90054.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 1030

 Score =  329 bits (844), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 168/325 (51%), Positives = 222/325 (68%), Gaps = 7/325 (2%)

Query: 26  QPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDIDL---GAFSDDQTL 82
           QP   S+ RR  V  + RR++     C+VF FGSVPLKTYLPD DIDL   G  S   TL
Sbjct: 43  QPTQASDRRRAEVVDHARRIVGTALGCEVFVFGSVPLKTYLPDGDIDLTVIGNTSCGSTL 102

Query: 83  KDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTL 142
            D   H+    LE+ E+N  AEF VK++++I AEV++IKC + N +VDI+FNQ GG+C +
Sbjct: 103 IDDVYHI----LESGEENGDAEFEVKDLEHIDAEVRLIKCTIGNIIVDISFNQTGGICAV 158

Query: 143 CFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGS 202
            FL+ VD  + +NHLFKRSIILIK WCYYESR+LG HHGLIS+YAL TL+LY+F++F+ S
Sbjct: 159 SFLELVDRKVGKNHLFKRSIILIKGWCYYESRLLGAHHGLISTYALETLILYVFNLFHKS 218

Query: 203 FAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVLLLSKSFLDS 262
             GPLEVLYRFLE+FSKFDWD +C+SL GPV +S LP++  E     G  LL  + FLD+
Sbjct: 219 LHGPLEVLYRFLEYFSKFDWDKYCISLNGPVALSSLPNLIVEGLNVPGDDLLFDREFLDN 278

Query: 263 CRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKGLA 322
                +  P   + +   F  K  N+IDPL+  NNLGRSV++ NF RIRTAF+F A+ L 
Sbjct: 279 SVEKASAPPRNSDARCSKFRVKCLNIIDPLKECNNLGRSVNRANFHRIRTAFSFGARKLG 338

Query: 323 RLLDCPNEDLYNEVNQFFMNTRDRH 347
           ++L  P E + +++  FF NT +R+
Sbjct: 339 QILMLPPELIPDDIFAFFKNTLERN 363


>gi|218200261|gb|EEC82688.1| hypothetical protein OsI_27344 [Oryza sativa Indica Group]
          Length = 1001

 Score =  327 bits (837), Expect = 6e-87,   Method: Compositional matrix adjust.
 Identities = 163/328 (49%), Positives = 220/328 (67%), Gaps = 5/328 (1%)

Query: 20  ELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSDD 79
           E++ R+QP   +E  R  +  Y++ L      C+VF FGSVPLKTYLPD DID+    + 
Sbjct: 54  EVVLRVQPTEEAERTRQGIIGYLKLLFGTALGCEVFAFGSVPLKTYLPDGDIDITILGNT 113

Query: 80  QTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGL 139
                T+   VR +LE EE+ + A+  +  +Q+I AEVK+IKC++DN VVDI+FNQ+GG+
Sbjct: 114 AP-DSTFISEVRGILELEEQEDGADVAITGLQFIDAEVKLIKCVIDNIVVDISFNQIGGV 172

Query: 140 CTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVF 199
            TLC L+ VDH +  +HLFKRSI+LIKAWCY+ES ILG H GLIS+YAL  LVLYIF++F
Sbjct: 173 TTLCLLELVDHEVGNDHLFKRSIMLIKAWCYHESHILGAHRGLISTYALEVLVLYIFNIF 232

Query: 200 NGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVLLLSKSF 259
           + S   PLEVLY+FLE+FSKFDWD +C+SL GPVP+S LP++T EP      +L      
Sbjct: 233 HKSLHSPLEVLYKFLEYFSKFDWDKYCISLNGPVPLSSLPNLTVEPSGIHDELLFGPNGS 292

Query: 260 LDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRAK 319
            D       D  G   N    F  K+ N+IDP++ +NNLGRSVSKG+F+RIR AF+F A+
Sbjct: 293 CDRLIVLKKDSDGSNMN----FRPKYLNIIDPIKSSNNLGRSVSKGSFYRIRGAFSFGAQ 348

Query: 320 GLARLLDCPNEDLYNEVNQFFMNTRDRH 347
            L+++L  P + +  E+  FF+NT   H
Sbjct: 349 NLSQILMLPTDLIPTEIFGFFVNTLKSH 376


>gi|222637691|gb|EEE67823.1| hypothetical protein OsJ_25591 [Oryza sativa Japonica Group]
          Length = 1001

 Score =  325 bits (833), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 162/328 (49%), Positives = 220/328 (67%), Gaps = 5/328 (1%)

Query: 20  ELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSDD 79
           E++ R+QP   ++  R  +  Y++ L      C+VF FGSVPLKTYLPD DID+    + 
Sbjct: 54  EVVLRVQPTEEADRTRQGIIGYLKLLFGTALGCEVFAFGSVPLKTYLPDGDIDITILGNT 113

Query: 80  QTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGL 139
                T+   VR +LE EE+ + A+  +  +Q+I AEVK+IKC++DN VVDI+FNQ+GG+
Sbjct: 114 AP-DSTFISEVRGILELEEQEDGADVAITGLQFIDAEVKLIKCVIDNIVVDISFNQIGGV 172

Query: 140 CTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVF 199
            TLC L+ VDH +  +HLFKRSI+LIKAWCY+ES ILG H GLIS+YAL  LVLYIF++F
Sbjct: 173 TTLCLLELVDHEVGNDHLFKRSIMLIKAWCYHESHILGAHRGLISTYALEVLVLYIFNIF 232

Query: 200 NGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVLLLSKSF 259
           + S   PLEVLY+FLE+FSKFDWD +C+SL GPVP+S LP++T EP      +L      
Sbjct: 233 HKSLHSPLEVLYKFLEYFSKFDWDKYCISLNGPVPLSSLPNLTVEPSGIHDELLFGPNGS 292

Query: 260 LDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRAK 319
            D       D  G   N    F  K+ N+IDP++ +NNLGRSVSKG+F+RIR AF+F A+
Sbjct: 293 CDRLIVLKKDSDGSNMN----FRPKYLNIIDPIKSSNNLGRSVSKGSFYRIRGAFSFGAQ 348

Query: 320 GLARLLDCPNEDLYNEVNQFFMNTRDRH 347
            L+++L  P + +  E+  FF+NT   H
Sbjct: 349 NLSQILMLPTDLIPTEIFGFFVNTLKSH 376


>gi|115488182|ref|NP_001066578.1| Os12g0283100 [Oryza sativa Japonica Group]
 gi|77554657|gb|ABA97453.1| Nucleotidyltransferase domain containing protein, expressed [Oryza
           sativa Japonica Group]
 gi|113649085|dbj|BAF29597.1| Os12g0283100 [Oryza sativa Japonica Group]
 gi|222616913|gb|EEE53045.1| hypothetical protein OsJ_35772 [Oryza sativa Japonica Group]
          Length = 989

 Score =  322 bits (824), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 159/327 (48%), Positives = 215/327 (65%), Gaps = 12/327 (3%)

Query: 21  LIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSDDQ 80
           ++ R+ P   +E RR  V  Y+RRL+     C+V  FGSVPLK+YLPD D+D+    +  
Sbjct: 56  VLLRVAPTEEAERRRQDVVGYLRRLLGTALGCEVIAFGSVPLKSYLPDGDVDITVLGN-T 114

Query: 81  TLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLC 140
            L       V  +LE+EE++  AE  +K + +I AEVK+IKC+++N VVDI+FNQ+GG+ 
Sbjct: 115 ALDGACISDVHSILESEEQDSGAELEIKGLHFIDAEVKLIKCVIENIVVDISFNQIGGVS 174

Query: 141 TLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFN 200
           TLCFL+  D  + +NHLFKRSI+LIKAWCY+ESRILG HHGL+S+YAL TLVLYIF++F+
Sbjct: 175 TLCFLELADRKVGKNHLFKRSIMLIKAWCYHESRILGAHHGLLSTYALETLVLYIFNIFH 234

Query: 201 GSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVLLLSKSFL 260
            S  GPLE LY+FLE+FSKFDWD +C+SL GPV +S LP    EP            S  
Sbjct: 235 KSLHGPLEALYKFLEYFSKFDWDKYCISLNGPVLLSSLPSPAVEP-----------SSIQ 283

Query: 261 DSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKG 320
           D   +     P   +     F  KH N+IDPL+ +NNLGRSVS+G+F+RIR A +F A+ 
Sbjct: 284 DELLFGKKTLPEVSDGSNINFCLKHLNIIDPLKWSNNLGRSVSRGSFYRIRGALSFGAQK 343

Query: 321 LARLLDCPNEDLYNEVNQFFMNTRDRH 347
           L ++L   ++ +  E+  FF NT  RH
Sbjct: 344 LGQILMLHSDLIPTEIFGFFANTLKRH 370


>gi|218186672|gb|EEC69099.1| hypothetical protein OsI_37998 [Oryza sativa Indica Group]
          Length = 989

 Score =  322 bits (824), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 159/327 (48%), Positives = 215/327 (65%), Gaps = 12/327 (3%)

Query: 21  LIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSDDQ 80
           ++ R+ P   +E RR  V  Y+RRL+     C+V  FGSVPLK+YLPD D+D+    +  
Sbjct: 56  VLLRVAPTEEAERRRQDVVGYLRRLLGTALGCEVIAFGSVPLKSYLPDGDVDITVLGN-T 114

Query: 81  TLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLC 140
            L       V  +LE+EE++  AE  +K + +I AEVK+IKC+++N VVDI+FNQ+GG+ 
Sbjct: 115 ALDGACISDVHSILESEEQDSGAELEIKGLHFIDAEVKLIKCVIENIVVDISFNQIGGVS 174

Query: 141 TLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFN 200
           TLCFL+  D  + +NHLFKRSI+LIKAWCY+ESRILG HHGL+S+YAL TLVLYIF++F+
Sbjct: 175 TLCFLELADRKVGKNHLFKRSIMLIKAWCYHESRILGAHHGLLSTYALETLVLYIFNIFH 234

Query: 201 GSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVLLLSKSFL 260
            S  GPLE LY+FLE+FSKFDWD +C+SL GPV +S LP    EP            S  
Sbjct: 235 KSLHGPLEALYKFLEYFSKFDWDKYCISLNGPVLLSSLPSPAVEP-----------SSIQ 283

Query: 261 DSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKG 320
           D   +     P   +     F  KH N+IDPL+ +NNLGRSVS+G+F+RIR A +F A+ 
Sbjct: 284 DELLFGKKTLPEVSDGSNINFCLKHLNIIDPLKWSNNLGRSVSRGSFYRIRGALSFGAQK 343

Query: 321 LARLLDCPNEDLYNEVNQFFMNTRDRH 347
           L ++L   ++ +  E+  FF NT  RH
Sbjct: 344 LGQILMLHSDLIPTEIFGFFANTLKRH 370


>gi|356561857|ref|XP_003549193.1| PREDICTED: uncharacterized protein LOC100787145 [Glycine max]
          Length = 684

 Score =  319 bits (817), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 166/348 (47%), Positives = 230/348 (66%), Gaps = 7/348 (2%)

Query: 1   SVIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSV 60
           S +  +D   W  AEE   E++  I+P   SE  R  V  YV+RLI   +  +V  FGSV
Sbjct: 14  SQLLSIDEELWRMAEERAQEILWTIEPIVLSEVNRKDVIDYVQRLIRGYYGAEVLPFGSV 73

Query: 61  PLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKII 120
           PLKTYLPD DIDL A S +   +D  A  V ++L++    +  E++VK++QYI+A+V+++
Sbjct: 74  PLKTYLPDGDIDLTALSHEDAEED-LAQAVCNILQS---GDDPEYQVKDIQYIRAQVRLV 129

Query: 121 KCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHH 180
           KC V N  VDI+FNQ+ G+CTL FL++VD L+ +NH+FK SIILIKAWCYYESR+LGGHH
Sbjct: 130 KCTVKNIAVDISFNQMAGICTLRFLEQVDQLVGKNHIFKHSIILIKAWCYYESRLLGGHH 189

Query: 181 GLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPD 240
           GL+S+YA+  LVLYI + F+ S  GPLEVLY FL+++  FDWD+  +S+WGP P+S LP+
Sbjct: 190 GLLSTYAVEILVLYIINRFHSSVRGPLEVLYIFLDYYGSFDWDHNYVSIWGPKPLSSLPE 249

Query: 241 VTAEPPRKDGGVLLLSKSFLDSCRYAYADFPG-GQENQGQPFVSKHFNVIDPLRVNNNLG 299
           + AE P  D G  LL K FL + R     FP    E     F  K  N++DPLR +NNLG
Sbjct: 250 I-AETPECDQGEFLLQKEFLRNYR-NMCSFPSRASETMTHEFPVKFMNILDPLRNDNNLG 307

Query: 300 RSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           RSV+  N  R+R A ++ A+ L ++L  P E++   + +FF +T DR+
Sbjct: 308 RSVNIANLHRVRFALSYGARRLKQILTLPGENMGAALEKFFFSTLDRN 355


>gi|356570171|ref|XP_003553264.1| PREDICTED: uncharacterized protein LOC100797780 [Glycine max]
          Length = 644

 Score =  308 bits (789), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 161/348 (46%), Positives = 225/348 (64%), Gaps = 7/348 (2%)

Query: 1   SVIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSV 60
           S +  +D   W  AEE   E++  IQP+  SE  R  V  YV+RLI   +  +V  FGSV
Sbjct: 14  SQLLSIDKELWQMAEERAQEILWTIQPNVLSEVNRKDVIDYVQRLIRGYYGAEVLPFGSV 73

Query: 61  PLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKII 120
           PLKTYLPD DIDL A S +   +D    L + +    +  +  E++VK+++YI+A+V+++
Sbjct: 74  PLKTYLPDGDIDLTALSHEDAEED----LAQAVCYVLQSGDDPEYQVKDIKYIRAQVRLV 129

Query: 121 KCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHH 180
           KC V N  VDI+FNQ+ G+CTL FL++VD L+ +NH+FKRSIILIKAWCYYESR+LGGHH
Sbjct: 130 KCTVKNIAVDISFNQMAGICTLRFLEQVDQLVGKNHIFKRSIILIKAWCYYESRLLGGHH 189

Query: 181 GLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPD 240
           GL+S+YA+  LVLYI + F+ S  GPLEVLY FL+++  FDWD+  +S+WGP P+S  P+
Sbjct: 190 GLLSTYAVEILVLYIINRFHSSVRGPLEVLYIFLDYYGSFDWDHNYVSIWGPKPLSSFPE 249

Query: 241 VTAEPPRKDGGVLLLSKSFLDSCRYAYADFPG-GQENQGQPFVSKHFNVIDPLRVNNNLG 299
           + AE    D G  LL K FL + R     FP    +     F  K  N++DPLR +NNLG
Sbjct: 250 I-AETLECDHGEFLLQKEFLRNYR-NMCSFPSRATKTMTHEFPVKFMNILDPLRNDNNLG 307

Query: 300 RSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           RSV+  +  R R A ++ A+ L ++L  P E +   + +FF +T DR+
Sbjct: 308 RSVNIASLHRFRFALSYGARRLKQILTLPGETMGAALEKFFFSTLDRN 355


>gi|413924678|gb|AFW64610.1| hypothetical protein ZEAMMB73_859338 [Zea mays]
          Length = 474

 Score =  308 bits (789), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 167/348 (47%), Positives = 224/348 (64%), Gaps = 14/348 (4%)

Query: 6   LDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLI-IQCFPCQVFTFGSVPLKT 64
           + P  W + E  T  ++ +I P   S+  R  V  YV+RL  +     QV +FGSVPLKT
Sbjct: 67  ISPDDWRRLEGATFSVMCKIHPTVSSQHLRARVIDYVQRLFRLHHDGYQVISFGSVPLKT 126

Query: 65  YLPDRDIDL----GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKII 120
           YLPD DIDL     A SD+    +  A     +L++EE+ + +EF VK+V+Y+ AEVK++
Sbjct: 127 YLPDGDIDLTLLCAAISDENLENEVCA-----ILKSEEQRKDSEFEVKDVKYVPAEVKLV 181

Query: 121 KCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHH 180
           KC V N  VDI+ NQ+GG   + FL++VD  + +N+L +RSI+LIK WCYYES ILG   
Sbjct: 182 KCKVQNIAVDISVNQIGGPNKVYFLEKVDQNLGKNNLLRRSIMLIKHWCYYESCILGAQR 241

Query: 181 GLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPD 240
           GL+S+YAL TLVLYIFHVF+ S  GPL VLYRFL+++SKFDWDN  +SL+GP+ +S LP+
Sbjct: 242 GLVSTYALETLVLYIFHVFHKSLDGPLAVLYRFLDYYSKFDWDNKGISLFGPISLSSLPE 301

Query: 241 VTAEPP--RKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNL 298
           +  EPP  R DG   L  ++FL  C  A++  P   E   Q F  K  N++DPL+ +NNL
Sbjct: 302 LVTEPPYTRDDG--FLSREAFLKDCAKAFSVPPINSEENPQVFSKKFVNIVDPLKQSNNL 359

Query: 299 GRSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDR 346
           GRS+SKGN  RIR  F F A  L ++L  P     NE+N+FF NT  R
Sbjct: 360 GRSISKGNLGRIRKEFYFGACKLGKILQAPACFSANEINRFFRNTLSR 407


>gi|242082774|ref|XP_002441812.1| hypothetical protein SORBIDRAFT_08g002707 [Sorghum bicolor]
 gi|241942505|gb|EES15650.1| hypothetical protein SORBIDRAFT_08g002707 [Sorghum bicolor]
          Length = 546

 Score =  307 bits (787), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 174/389 (44%), Positives = 230/389 (59%), Gaps = 48/389 (12%)

Query: 6   LDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLI------------------- 46
           + P  W + E     ++ +IQP   SE  R+AV  Y++RL+                   
Sbjct: 29  IPPDAWRRFESAALGVVNKIQPTVASENFRSAVIDYLKRLLGSRAGVQSWLLPFLPFHFY 88

Query: 47  -------------------IQCFPCQ-------VFTFGSVPLKTYLPDRDIDLGAFSDDQ 80
                              I    C        VF FGSVPLKTYLPD DIDL AFS   
Sbjct: 89  VFFGAKPVRDYEYKCVTVWIYFVGCALESLCDLVFPFGSVPLKTYLPDGDIDLTAFSPAI 148

Query: 81  TLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLC 140
           +  +  A+ V  +L +E+  + +EF VK+VQYI AEVK++KCLV N VVDI+ NQ+GGL 
Sbjct: 149 S-DENLANQVYAILSSEQHRKDSEFDVKDVQYIHAEVKLVKCLVQNIVVDISVNQIGGLS 207

Query: 141 TLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFN 200
           TLCFL++VD    + HL KRSI+LIK WCYYESRILG  +GL+S+YAL  LVLY+F +F+
Sbjct: 208 TLCFLEKVDENFGKKHLLKRSIVLIKDWCYYESRILGAQNGLLSTYALEVLVLYVFLIFH 267

Query: 201 GSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEP--PRKDGGVLLLSKS 258
            S  GPL VLYRFL+F+SKFDWD+  +SL+GPV +S LP++  +P  P  D    +  + 
Sbjct: 268 RSLGGPLAVLYRFLDFYSKFDWDSKGISLFGPVSLSSLPNLVTDPHLPAIDDDFFVPREK 327

Query: 259 FLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRA 318
            L      ++  P   E   Q F  K  N++DPL+ +NNLGRSV+KGNF+RIR+AF F A
Sbjct: 328 ILRKYAEDFSAPPRNSERDAQVFSRKFLNIVDPLKQSNNLGRSVNKGNFYRIRSAFDFGA 387

Query: 319 KGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           + L ++L  P     NEVNQFF NT  R+
Sbjct: 388 RKLGKILQMPVCYTVNEVNQFFSNTLKRN 416


>gi|356507300|ref|XP_003522406.1| PREDICTED: uncharacterized protein LOC100813790 [Glycine max]
          Length = 692

 Score =  305 bits (781), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 155/347 (44%), Positives = 227/347 (65%), Gaps = 5/347 (1%)

Query: 1   SVIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSV 60
           S +  +D   W  AE+   E++  IQP+  SE  R  V  YV+RLI   +  +V  FGSV
Sbjct: 15  SQLLSIDEELWQMAEDRVQEILWTIQPNVLSEVNRKDVIDYVQRLIRDYYGAEVLPFGSV 74

Query: 61  PLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKII 120
           PLKTYLPD D+DL     +   +D  A  + ++L++    + +E++VK++QYI+A+V+++
Sbjct: 75  PLKTYLPDGDVDLTTLIHEDA-EDDLAQAICNVLKS---GDDSEYQVKDIQYIRAQVRLV 130

Query: 121 KCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHH 180
           KC V N  VDI+FNQ+ G+ TL FL++VD L+ +NH+FKRSIILIK WCYY+SR+LGGHH
Sbjct: 131 KCTVKNIAVDISFNQMAGIYTLRFLEQVDQLVGKNHIFKRSIILIKGWCYYDSRLLGGHH 190

Query: 181 GLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPD 240
           GL+S+YA+  LVLYI + F+ S  GPLEVLY FL+++  FDWD+  +S+WGP  +S LP+
Sbjct: 191 GLLSTYAVEILVLYIINRFHSSVRGPLEVLYIFLDYYGSFDWDHNYISIWGPKSLSSLPE 250

Query: 241 VTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGR 300
           + AE P  D G  LL K FL + +   +   G  E     F  K  N++DPLR +NNLGR
Sbjct: 251 I-AEAPECDQGEFLLQKEFLGNYKNMCSYPAGASETLTHEFPVKFMNILDPLRNDNNLGR 309

Query: 301 SVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           SVS  +  R+R AF++  + L ++   P E++   + +FF +T +R+
Sbjct: 310 SVSIASLHRLRFAFSYGVQKLKQIFTLPGENMGAALEKFFSSTLNRN 356


>gi|356570173|ref|XP_003553265.1| PREDICTED: uncharacterized protein LOC100798838 [Glycine max]
          Length = 626

 Score =  305 bits (780), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 158/343 (46%), Positives = 221/343 (64%), Gaps = 5/343 (1%)

Query: 1   SVIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSV 60
           S +  +D   W   EE   E++  IQP+  SE  R  +  YV+RLI +    QVF FGS 
Sbjct: 15  SQLLSIDEELWRMIEERAQEILWTIQPNVLSEVNRKNIIDYVQRLIGEYCGAQVFPFGSF 74

Query: 61  PLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKII 120
           PLKTYLPD DIDL A S +   +D    LVR +    +  + +E++VK++++I+A+V+++
Sbjct: 75  PLKTYLPDGDIDLTALSHEDEEED----LVRAVCNILKSEDDSEYQVKDIEHIRAQVQVV 130

Query: 121 KCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHH 180
           KC V N  VDI+FNQ+ GL TL FL++VD L+ +NH+FKRS+ILIK+WCYYESRILG H 
Sbjct: 131 KCTVKNIPVDISFNQMAGLYTLFFLEQVDQLVGKNHIFKRSVILIKSWCYYESRILGAHC 190

Query: 181 GLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPD 240
           GL+S+YA   LVLYI + F+ S  GPL VLY FL+++S FDW++  +S+WGP  +S LP+
Sbjct: 191 GLLSTYATEILVLYIINRFHSSVRGPLAVLYVFLDYYSSFDWEHNYISIWGPKVLSSLPE 250

Query: 241 VTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGR 300
           +  + P  D G  LL K FL + R   +      E     F  KH N++DPLR NNNLGR
Sbjct: 251 I-VDTPEYDQGEFLLQKEFLKNYRDMCSSKAKASETMTNAFPVKHMNILDPLRNNNNLGR 309

Query: 301 SVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNT 343
           SV+ GN  RIR AF+  ++ L ++L    E++   + +FF NT
Sbjct: 310 SVNIGNLSRIRLAFSLGSQRLKQILTLAGENMGAALEKFFFNT 352


>gi|326521958|dbj|BAK04107.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 1031

 Score =  304 bits (778), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 157/327 (48%), Positives = 216/327 (66%), Gaps = 2/327 (0%)

Query: 21  LIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSDDQ 80
           ++ R+ P   +E RR+ +  Y + LI   F C+VF FGSVPLKTYLPD D+D+   ++  
Sbjct: 157 VLLRLHPTEEAERRRHKIIDYAKNLIGTTFGCEVFAFGSVPLKTYLPDGDVDITILTN-V 215

Query: 81  TLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLC 140
            L + +   V  +L  E+ NE AEF +KE+Q I A+VKIIKC++DN V+DI+FNQ+GG+ 
Sbjct: 216 NLDNNFVQDVCCLLAAEQSNEAAEFALKEIQVINAKVKIIKCVIDNLVMDISFNQVGGVS 275

Query: 141 TLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFN 200
           TLCFL+  +  I ++HLFKRSIILIKAWCY+E  I G +H L+S+YAL  L+LYIF++F+
Sbjct: 276 TLCFLEMANKEIGKDHLFKRSIILIKAWCYHEGSIHGSNHWLMSTYALEVLILYIFNLFH 335

Query: 201 GSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVLLLSKSFL 260
               GPL+ LY+FLE++SKFDWDN CL+L GPVP+S L + TA  P      LLLSK  L
Sbjct: 336 TVLHGPLQALYKFLEYYSKFDWDNQCLTLNGPVPLSSLRNYTAG-PTGSNEELLLSKEPL 394

Query: 261 DSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKG 320
           +       D P G + +G  F  K+ N+IDPL+  NNLG S+S+ N   IR AF   A+ 
Sbjct: 395 EPSLRRLFDLPAGSDGRGPEFRLKYLNIIDPLKGGNNLGTSISEANSRVIRDAFAAGAEK 454

Query: 321 LARLLDCPNEDLYNEVNQFFMNTRDRH 347
           L ++L  P E +  +V  FF +T  +H
Sbjct: 455 LGQILKLPCELIAEQVYVFFTHTLGKH 481


>gi|359486339|ref|XP_002274554.2| PREDICTED: uncharacterized protein LOC100253615 [Vitis vinifera]
          Length = 755

 Score =  303 bits (776), Expect = 8e-80,   Method: Compositional matrix adjust.
 Identities = 167/338 (49%), Positives = 223/338 (65%), Gaps = 6/338 (1%)

Query: 11  WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRD 70
           W   +    E++  IQP   SE+RR  +  YV+RLI   F  +V  FGS+PLKTYLPD D
Sbjct: 32  WSITKLTIQEILCAIQPTIVSEQRRKEIIDYVQRLIRDSFGNEVLPFGSMPLKTYLPDGD 91

Query: 71  IDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVD 130
           IDL A   +   +D +A  V  +LE E +   +EFRV+++ YI+A+VKI+KC+V +  VD
Sbjct: 92  IDLTALCPENDEED-FARDVCTLLEGE-RQMGSEFRVEDISYIRAKVKIVKCMVQDISVD 149

Query: 131 IAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVT 190
           I+FNQ GGL TLCFL+++D LI ++HLFKRS+ILIKAWCYYE RILG H GL+S+YAL  
Sbjct: 150 ISFNQTGGLSTLCFLEQIDILIGKDHLFKRSVILIKAWCYYEGRILGSHCGLLSTYALEI 209

Query: 191 LVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDG 250
           LVLY+ ++F  S   PL VLYRFL+++S FDW+ F +S+ GPV IS L  +T  P   D 
Sbjct: 210 LVLYVINLFYSSLYCPLAVLYRFLDYYSTFDWEKFGVSVLGPVSISSL--LTGAPETAD- 266

Query: 251 GVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRI 310
             LL+++ FL SC+ A+A      E   QPF+ KH N+ DPLR  NNLGRS+S GN +R 
Sbjct: 267 KPLLINEEFLWSCKEAFAVSIRASECTKQPFLVKHINIQDPLRDYNNLGRSISLGNSYRF 326

Query: 311 RTAFTFRAKGLARLLDCPNEDLYNE-VNQFFMNTRDRH 347
           R A +  A+ L  +L    E   NE + +FF NT DR+
Sbjct: 327 RYAISVGAQRLKEILLMLPEGRMNEGLKEFFNNTLDRN 364


>gi|302835555|ref|XP_002949339.1| hypothetical protein VOLCADRAFT_117152 [Volvox carteri f.
           nagariensis]
 gi|300265641|gb|EFJ49832.1| hypothetical protein VOLCADRAFT_117152 [Volvox carteri f.
           nagariensis]
          Length = 3433

 Score =  298 bits (764), Expect = 2e-78,   Method: Composition-based stats.
 Identities = 156/322 (48%), Positives = 204/322 (63%), Gaps = 18/322 (5%)

Query: 18  TAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFT---FGSVPLKTYLPDRDIDLG 74
           T  LI+RI+P   S +RR  +  +V  ++ +CF     T   FGSVPLKTYLPD DIDL 
Sbjct: 34  TDTLISRIRPTGLSLQRRWVITEHVTSIVKRCFAPHDVTAIPFGSVPLKTYLPDGDIDLS 93

Query: 75  AFSDD-------QTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNF 127
            +S+        + L+DTWA  ++  LE E  N  A FRV  VQ I AEVK++KCLVDN 
Sbjct: 94  IYSESPRAQALKEALRDTWATQLQVCLEEEANNPTAVFRVANVQVIHAEVKLLKCLVDNI 153

Query: 128 VVDIAFNQLGGLCTLCFLDEVDHLINE-----NHLFKRSIILIKAWCYYESRILGGHHGL 182
           VVDI+F Q+GGL T  FL++VD  +++      HLFK SIIL+K WCYYESR+LG HHGL
Sbjct: 154 VVDISFFQVGGLNTYNFLEDVDRFVDQCIPVRKHLFKDSIILVKGWCYYESRVLGAHHGL 213

Query: 183 ISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVT 242
           IS+YAL TLVLY+ ++++     PL+VLY+FL   S FDW+N+CLSL GP+P+S  P   
Sbjct: 214 ISTYALETLVLYVINLYHRELTNPLQVLYKFLVECSCFDWENYCLSLEGPIPLSSFPKPV 273

Query: 243 AEPPRKDGGVLLLSKSFLDSCRYAYADFPG--GQENQGQPFVSKHFNVIDPLRVNNNLGR 300
            E P       LL+K F+    + Y + P    Q  + +PF  K  NV+DP+   NNLGR
Sbjct: 274 VETPEALQRDALLTKDFMARAYFKYTE-PQLRAQGGEPKPFAIKQLNVMDPILPGNNLGR 332

Query: 301 SVSKGNFFRIRTAFTFRAKGLA 322
           SVSK ++ RIR AF   A+ LA
Sbjct: 333 SVSKASYLRIRRAFEHGARMLA 354


>gi|356518940|ref|XP_003528133.1| PREDICTED: uncharacterized protein LOC100815787 [Glycine max]
          Length = 680

 Score =  298 bits (762), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 153/347 (44%), Positives = 226/347 (65%), Gaps = 7/347 (2%)

Query: 1   SVIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSV 60
           S +  +D   W  AE+   E++  I+P+  SE  R  V  YV+RLI   +  +V  FGSV
Sbjct: 15  SQLVSIDEELWRMAEDRVQEILWTIEPNVLSEVNRKDVIDYVQRLIKGYYGAKVLPFGSV 74

Query: 61  PLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKII 120
           PLKTYLPD D+DL     +   +D  A  + ++L++    + +E++VK++QYI+A+V+++
Sbjct: 75  PLKTYLPDGDVDLTTLIHEDAEED-LAQAICNILKS---GDDSEYQVKDIQYIRAQVRLV 130

Query: 121 KCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHH 180
           KC V N  VDI+FNQ+ G+ TL FL++VD L+ +NH+FKRSIILIKAWCYY+SR+LGGH+
Sbjct: 131 KCTVKNIAVDISFNQMAGIYTLRFLEQVDQLVGKNHIFKRSIILIKAWCYYDSRLLGGHY 190

Query: 181 GLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPD 240
           GL+S+YA+  LVLYI + F+    GPLEVLY FL+++S FDWD+  +S+WGP  +S LP+
Sbjct: 191 GLLSTYAVEILVLYIINRFHSVVRGPLEVLYIFLDYYSSFDWDHNYVSIWGPKSLSSLPE 250

Query: 241 VTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGR 300
           +T   P  D G  LL K FL + +   +      E     F  K  N++DPLR +NNLGR
Sbjct: 251 IT---PECDQGEFLLQKEFLTNYKNMCSYPTRASETLTHEFPVKFMNILDPLRNDNNLGR 307

Query: 301 SVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           SVS  +  R+R AF + A+ L ++   P E++   + +FF +T +R+
Sbjct: 308 SVSIASLHRLRFAFAYSAQKLKQIFTLPGENMGAALEKFFFSTLERN 354


>gi|357491471|ref|XP_003616023.1| Poly(A) RNA polymerase cid14 [Medicago truncatula]
 gi|355517358|gb|AES98981.1| Poly(A) RNA polymerase cid14 [Medicago truncatula]
          Length = 387

 Score =  293 bits (750), Expect = 8e-77,   Method: Compositional matrix adjust.
 Identities = 168/342 (49%), Positives = 210/342 (61%), Gaps = 61/342 (17%)

Query: 14  AEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQ-------------------- 53
           AE+ TAE++ RIQP   ++ RR  V  YV+RLI     C+                    
Sbjct: 44  AEQTTAEILRRIQPTLAADRRRREVVDYVQRLIRYGARCEKLLPNVWRKLDFEVRIFRIG 103

Query: 54  -VFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQY 112
            VF +GSVPLKTYLPD DIDL A S  Q ++D     V  +L  EE N+ AE+ VK+V++
Sbjct: 104 KVFPYGSVPLKTYLPDGDIDLTALSP-QNIEDGLVSDVHAVLRGEENNDAAEYEVKDVRF 162

Query: 113 IQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYE 172
           I AE         N VVDI+FNQLGGL TLCFL++VD L+ ++H+FKRSIILIKAWCYYE
Sbjct: 163 IDAE---------NIVVDISFNQLGGLSTLCFLEKVDRLVAKDHIFKRSIILIKAWCYYE 213

Query: 173 SRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPL------------------------- 207
           SRILG HHGLIS+YAL TLVLYIFH F+ S  GPL                         
Sbjct: 214 SRILGAHHGLISTYALETLVLYIFHRFHVSLDGPLAEKERKRNLNHIMLVMHPFNKHFMH 273

Query: 208 ----EVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVLLLSKSFLDSC 263
               +VLYRFL++FSKFDWDN+C+SL GPV  S  PDV AE   ++GG  LL+  F+ SC
Sbjct: 274 PALFQVLYRFLDYFSKFDWDNYCVSLKGPVAKSSPPDVVAE-ALENGGNTLLTDEFIRSC 332

Query: 264 RYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKG 305
             +++  P G +   + F  KH N+IDPL+ NNNLGRSV+KG
Sbjct: 333 VESFSVPPRGLDLNLRAFPHKHLNIIDPLKENNNLGRSVNKG 374


>gi|297823987|ref|XP_002879876.1| hypothetical protein ARALYDRAFT_903345 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297325715|gb|EFH56135.1| hypothetical protein ARALYDRAFT_903345 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 516

 Score =  291 bits (746), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 152/337 (45%), Positives = 218/337 (64%), Gaps = 7/337 (2%)

Query: 11  WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRD 70
           WL AEE   E++  IQP   SE  RN +  +++ L+ +    +VF FGSVPLKTYLPD D
Sbjct: 33  WLIAEERAQEILFAIQPMYLSERSRNEIINHLQTLMRERLGIEVFLFGSVPLKTYLPDGD 92

Query: 71  IDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVD 130
           IDL   +    +++  A  +R++LE E     ++F+V +VQYI A+VK+IKC + N  +D
Sbjct: 93  IDLTVLTP-YGMEENCAKALRNILEAERG--ESDFQVTDVQYIHAQVKVIKCTIRNVALD 149

Query: 131 IAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVT 190
           I+FNQ+ GL  LCFL++VD     +HLFKRSIILIKAWC+YESRILG ++GLIS+YAL  
Sbjct: 150 ISFNQMAGLSALCFLEQVDRAFGRDHLFKRSIILIKAWCFYESRILGANNGLISTYALAI 209

Query: 191 LVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDG 250
           LVL I ++   S +GPL VLY+F++F+  FDW+N+C+++ G VPIS  PD+T     +  
Sbjct: 210 LVLNIVNMSYSSVSGPLAVLYKFMDFYGSFDWENYCITVTGLVPISSFPDITETRNHE-- 267

Query: 251 GVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRI 310
             + L + F   C  +Y+      E   + F  KH+N++DPL+ +NNLGRSVS+GN  R+
Sbjct: 268 --VFLDEKFFRECIESYSGPANVVEANRKYFPVKHYNILDPLKHSNNLGRSVSEGNAIRL 325

Query: 311 RTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           R  F   A+ L  +L  P E +  ++  FF N+ DR+
Sbjct: 326 RHCFRRGAQKLRDVLTFPGETVGWKLEDFFGNSLDRN 362


>gi|159471748|ref|XP_001694018.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158277185|gb|EDP02954.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 633

 Score =  290 bits (742), Expect = 8e-76,   Method: Compositional matrix adjust.
 Identities = 156/330 (47%), Positives = 209/330 (63%), Gaps = 18/330 (5%)

Query: 18  TAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFT---FGSVPLKTYLPDRDIDLG 74
           T  LI+RI+P   S +RR  +  +V +L+ +CF     T   FGSVPLKTYLPD DIDL 
Sbjct: 31  TDTLISRIRPTTLSLQRRFVITEHVTQLVKRCFAPHDVTAVPFGSVPLKTYLPDGDIDLS 90

Query: 75  AFS--------DDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDN 126
            +S         DQ L+DTWA  ++  LE+E  N HA F+V  VQ I AEVK++KCLVDN
Sbjct: 91  IYSYSSRAQSLKDQ-LRDTWATTLQLCLEDEANNPHAAFKVANVQVIHAEVKLLKCLVDN 149

Query: 127 FVVDIAFNQLGGLCTLCFLDEVDHLINE-----NHLFKRSIILIKAWCYYESRILGGHHG 181
            VVDI+F Q+GGL T  FL++VD  +++      HLFK SIIL+K WCYYESR+LG HHG
Sbjct: 150 IVVDISFFQIGGLNTYNFLEDVDAFVDKAITARKHLFKDSIILVKGWCYYESRVLGAHHG 209

Query: 182 LISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDV 241
           LIS+YAL TLVLY+ ++++   + PL+VLY+FL   S FDW+ +CL+L GP+P++  P+ 
Sbjct: 210 LISTYALETLVLYVINLYHRELSNPLQVLYKFLVECSGFDWERYCLTLQGPIPLASFPNP 269

Query: 242 TAEPPRKDGGVLLLSKSFLDSCRYAY-ADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGR 300
             E P       LL++ F+      Y A        + +PF  K  NV+DP+  NNNLGR
Sbjct: 270 VVETPEPLQREPLLTEHFMTRAYNKYTAPQVAAMGGEVKPFAIKQLNVMDPILPNNNLGR 329

Query: 301 SVSKGNFFRIRTAFTFRAKGLARLLDCPNE 330
           SVSK ++ RIR AF   A+ LA + +   E
Sbjct: 330 SVSKASYLRIRRAFEHGARMLAAIAEQTKE 359


>gi|384253068|gb|EIE26543.1| hypothetical protein COCSUDRAFT_39611 [Coccomyxa subellipsoidea
           C-169]
          Length = 1155

 Score =  288 bits (737), Expect = 3e-75,   Method: Composition-based stats.
 Identities = 144/292 (49%), Positives = 187/292 (64%), Gaps = 4/292 (1%)

Query: 53  QVFTFGSVPLKTYLPDRDIDLGAFSDD-QTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQ 111
           + + FGSVPLKTYLPD DIDL  F      L+D W + +  +LE E +N     RVK+VQ
Sbjct: 5   EAYMFGSVPLKTYLPDGDIDLAVFQGKGPRLRDVWTYELSALLEAEGRNALNPHRVKDVQ 64

Query: 112 YIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYY 171
            I AEVK++KCLVDN VVDI+F+ LGGLCT+ FL+ +D  I + HLFKRS+IL+KAWCYY
Sbjct: 65  IINAEVKLLKCLVDNIVVDISFDTLGGLCTVAFLESIDRHIGKQHLFKRSVILVKAWCYY 124

Query: 172 ESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWG 231
           ESR+LG HHGL+S+YAL T+VLYIF++++     PL+VL +FL  FSKFDWD   LSL G
Sbjct: 125 ESRLLGAHHGLLSTYALETMVLYIFNMYHHELQSPLKVLRKFLVVFSKFDWDGHALSLQG 184

Query: 232 PVPISLLPDVTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDP 291
           P+P+S  PD   EP     G  LL    L +    Y+     Q+  G+ F  K+ N++DP
Sbjct: 185 PIPLSSFPDPQVEPVAGAEGGALLRGDVLKTMLEMYSPV---QQGPGKAFTIKNMNIMDP 241

Query: 292 LRVNNNLGRSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNT 343
           L   NNLGRSV+K +  RIR A       L  + D   ++    V+ FF NT
Sbjct: 242 LLPTNNLGRSVNKASKARIRKALAHGCHMLDSIFDKVGQEATEAVDGFFRNT 293


>gi|356518706|ref|XP_003528019.1| PREDICTED: uncharacterized protein LOC100788864 [Glycine max]
          Length = 721

 Score =  285 bits (730), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 146/342 (42%), Positives = 213/342 (62%), Gaps = 5/342 (1%)

Query: 6   LDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTY 65
           +D   W   EE   E++  IQP+  SE  R  V  YV++LI   +  +VF FGS PLKTY
Sbjct: 20  IDEELWRMTEERIQEILWTIQPNVLSEMNRKNVLNYVQKLIGDYYDTKVFPFGSFPLKTY 79

Query: 66  LPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVD 125
           LPD DIDL   + +    D   +L +++    E      ++VK++++I+A+V+++KC V 
Sbjct: 80  LPDGDIDLTVINHE----DEEENLAKEICTILECANDLIYQVKDIEHIRAQVQVVKCTVK 135

Query: 126 NFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISS 185
           N  +DI FNQ+ GLCTLCFL++VD L  +NH+FKRSIILIKAWC Y+SR+LG  HGL+S+
Sbjct: 136 NIPIDITFNQMTGLCTLCFLEQVDQLAGKNHIFKRSIILIKAWCCYDSRLLGSQHGLLST 195

Query: 186 YALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEP 245
           YA   LVLYI + F+ S   PLEVLY F +++  FDW++  +S+WGP  +S LP++  + 
Sbjct: 196 YATEVLVLYIINRFHASVRDPLEVLYIFFDYYGTFDWEHNYMSIWGPKALSSLPEI-VDR 254

Query: 246 PRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKG 305
           P  D    LL K FL + R  ++      E     F  KH N++DPLR +NNLGRSV++ 
Sbjct: 255 PECDQDEFLLHKEFLINYRDIFSSKAKSSETTTNTFPVKHINILDPLRNDNNLGRSVNEA 314

Query: 306 NFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           +F RIR A ++ AK   ++     E++   + +FF +T  R+
Sbjct: 315 SFHRIRFALSYGAKKFKQIFTLAGENMGEALEKFFFDTLQRN 356


>gi|30688308|ref|NP_850331.1| nucleotidyltransferase protein [Arabidopsis thaliana]
 gi|145330711|ref|NP_001078031.1| nucleotidyltransferase protein [Arabidopsis thaliana]
 gi|186506897|ref|NP_001118485.1| nucleotidyltransferase protein [Arabidopsis thaliana]
 gi|186506900|ref|NP_001118486.1| nucleotidyltransferase protein [Arabidopsis thaliana]
 gi|60547743|gb|AAX23835.1| hypothetical protein At2g40520 [Arabidopsis thaliana]
 gi|330254746|gb|AEC09840.1| nucleotidyltransferase protein [Arabidopsis thaliana]
 gi|330254747|gb|AEC09841.1| nucleotidyltransferase protein [Arabidopsis thaliana]
 gi|330254748|gb|AEC09842.1| nucleotidyltransferase protein [Arabidopsis thaliana]
 gi|330254749|gb|AEC09843.1| nucleotidyltransferase protein [Arabidopsis thaliana]
          Length = 502

 Score =  285 bits (729), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 146/343 (42%), Positives = 217/343 (63%), Gaps = 7/343 (2%)

Query: 5   PLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKT 64
           P++   WL AE    E++  IQP+  +E  RN + + ++ L+ +    +V+ FGS+PLKT
Sbjct: 27  PIEAEVWLIAEARAQEILCAIQPNYLAERSRNKIISNLQTLLWERLGIEVYLFGSMPLKT 86

Query: 65  YLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLV 124
           YLPD DIDL   +   + +D  A  V  +LE E  N  ++ +V  VQY+QA+VK+IKC +
Sbjct: 87  YLPDGDIDLTVLTHHASEEDC-ARAVCCVLEAEMGN--SDLQVTGVQYVQAKVKVIKCSI 143

Query: 125 DNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLIS 184
            +   DI+FNQL GL  LCFL++VD     +HLFK+SIIL+KAWC+YESRILG + GLIS
Sbjct: 144 RDVAFDISFNQLAGLGALCFLEQVDKAFGRDHLFKKSIILVKAWCFYESRILGANSGLIS 203

Query: 185 SYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAE 244
           +YAL  LVL I ++   S +GPL VLY+F+ ++  FDW N+C+++ GPVPIS LPD+T  
Sbjct: 204 TYALAILVLNIVNMSYSSLSGPLAVLYKFINYYGSFDWKNYCVTVTGPVPISSLPDITET 263

Query: 245 PPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSK 304
              +    + L + F   C   Y+   G  E   + F  K++N++DPL+ +NNLGRSV+K
Sbjct: 264 GNHE----VFLDEKFFRECMELYSGETGVVEASRKYFPVKYYNILDPLKHSNNLGRSVTK 319

Query: 305 GNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           GN  R+R  F    + L  +L  P E++  ++ +FF  + +R+
Sbjct: 320 GNMVRLRNCFMLGVQKLRDVLTLPGENVGWKLEKFFNVSLERN 362


>gi|21805733|gb|AAM76764.1| hypothetical protein [Arabidopsis thaliana]
          Length = 502

 Score =  285 bits (728), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 145/343 (42%), Positives = 217/343 (63%), Gaps = 7/343 (2%)

Query: 5   PLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKT 64
           P++   WL AE    E++  +QP+  +E  RN + + ++ L+ +    +V+ FGS+PLKT
Sbjct: 27  PIEAEVWLIAEARAQEILCAVQPNYLAERSRNKIISNLQTLLWERLGIEVYLFGSMPLKT 86

Query: 65  YLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLV 124
           YLPD DIDL   +   + +D  A  V  +LE E  N  ++ +V  VQY+QA+VK+IKC +
Sbjct: 87  YLPDGDIDLTVLTHHASEEDC-ARAVCCVLEAEMGN--SDLQVTGVQYVQAKVKVIKCSI 143

Query: 125 DNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLIS 184
            +   DI+FNQL GL  LCFL++VD     +HLFK+SIIL+KAWC+YESRILG + GLIS
Sbjct: 144 RDVAFDISFNQLAGLGALCFLEQVDKAFGRDHLFKKSIILVKAWCFYESRILGANSGLIS 203

Query: 185 SYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAE 244
           +YAL  LVL I ++   S +GPL VLY+F+ ++  FDW N+C+++ GPVPIS LPD+T  
Sbjct: 204 TYALAILVLNIVNMSYSSLSGPLAVLYKFINYYGSFDWKNYCVTVTGPVPISSLPDITET 263

Query: 245 PPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSK 304
              +    + L + F   C   Y+   G  E   + F  K++N++DPL+ +NNLGRSV+K
Sbjct: 264 GNHE----VFLDEKFFRECMELYSGETGVVEASRKYFPVKYYNILDPLKHSNNLGRSVTK 319

Query: 305 GNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           GN  R+R  F    + L  +L  P E++  ++ +FF  + +R+
Sbjct: 320 GNMVRLRNCFMLGVQKLRDVLTLPGENVGWKLEKFFNVSLERN 362


>gi|297736507|emb|CBI25378.3| unnamed protein product [Vitis vinifera]
          Length = 893

 Score =  278 bits (712), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 150/295 (50%), Positives = 198/295 (67%), Gaps = 3/295 (1%)

Query: 54  VFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI 113
           V  FGS+PLKTYLPD DIDL A   +   +D +A  V  +LE E +   +EFRV+++ YI
Sbjct: 210 VLPFGSMPLKTYLPDGDIDLTALCPENDEED-FARDVCTLLEGE-RQMGSEFRVEDISYI 267

Query: 114 QAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYES 173
           +A+VKI+KC+V +  VDI+FNQ GGL TLCFL+++D LI ++HLFKRS+ILIKAWCYYE 
Sbjct: 268 RAKVKIVKCMVQDISVDISFNQTGGLSTLCFLEQIDILIGKDHLFKRSVILIKAWCYYEG 327

Query: 174 RILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPV 233
           RILG H GL+S+YAL  LVLY+ ++F  S   PL VLYRFL+++S FDW+ F +S+ GPV
Sbjct: 328 RILGSHCGLLSTYALEILVLYVINLFYSSLYCPLAVLYRFLDYYSTFDWEKFGVSVLGPV 387

Query: 234 PISLLPDVTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLR 293
            IS L     E        LL+++ FL SC+ A+A      E   QPF+ KH N+ DPLR
Sbjct: 388 SISSLLTGAPEAAETADKPLLINEEFLWSCKEAFAVSIRASECTKQPFLVKHINIQDPLR 447

Query: 294 VNNNLGRSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNE-VNQFFMNTRDRH 347
             NNLGRS+S GN +R R A +  A+ L  +L    E   NE + +FF NT DR+
Sbjct: 448 DYNNLGRSISLGNSYRFRYAISVGAQRLKEILLMLPEGRMNEGLKEFFNNTLDRN 502


>gi|357116041|ref|XP_003559793.1| PREDICTED: uncharacterized protein LOC100830879 [Brachypodium
           distachyon]
          Length = 899

 Score =  271 bits (692), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 143/296 (48%), Positives = 190/296 (64%), Gaps = 2/296 (0%)

Query: 8   PGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLP 67
           P +   AE   A ++  + P   +E RR  V  + RRLI   F CQV T+GSVPLKTYLP
Sbjct: 33  PEQMRVAEAAAAGVLRCLLPTEEAERRRRQVTDHARRLIGTNFGCQVLTYGSVPLKTYLP 92

Query: 68  DRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNF 127
           D DID+   +  + L  T    VR++L  EEKN  AEF ++  +Y+ A+VK+ KC + N 
Sbjct: 93  DGDIDVTILTH-KPLDSTIIDDVRNLLNAEEKNTDAEFVLESRRYVDAQVKVFKCNIANI 151

Query: 128 VVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYA 187
            VDI+FNQ+GG+ TLCFL+ VD  + ++HLFKRSIILIKAWCY E+RI G    L+S+YA
Sbjct: 152 DVDISFNQIGGVSTLCFLELVDTEVGKDHLFKRSIILIKAWCYNEARIQGSDQWLLSTYA 211

Query: 188 LVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPR 247
           L  L+LYIF++F+ S  GP E LY FLE++SKFDW  +C++L GPVP+S L + TAEP  
Sbjct: 212 LEILILYIFNMFHNSLHGPFEALYMFLEYYSKFDWGKYCVTLDGPVPLSSLANFTAEPAV 271

Query: 248 KDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVS 303
            +   LLL K  L +        P G +     F  K  N+IDPL+ +NNLGRS+S
Sbjct: 272 AN-DELLLGKESLSASSDRLLVLPKGSDRHDPEFRPKILNIIDPLKGDNNLGRSIS 326


>gi|147780178|emb|CAN75522.1| hypothetical protein VITISV_043595 [Vitis vinifera]
          Length = 733

 Score =  265 bits (678), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 153/330 (46%), Positives = 202/330 (61%), Gaps = 37/330 (11%)

Query: 53  QVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQY 112
           +V  FGS+PLKTYLPD DIDL A   +   +D +A  V  +LE E +   +EFRV+++ Y
Sbjct: 11  EVLPFGSMPLKTYLPDGDIDLTALCPENDEED-FARDVCTLLEGE-RQMGSEFRVEDISY 68

Query: 113 IQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYE 172
           I+A+VKI+KC+V +  VDI+FNQ GGL TLCFL+++D LI ++HLFKRS+ILIKAWCYYE
Sbjct: 69  IRAKVKIVKCMVQDISVDISFNQTGGLSTLCFLEQIDILIGKDHLFKRSVILIKAWCYYE 128

Query: 173 SRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGP 232
            RILG H GL+S+YAL  LVLY+ ++F  S   PL VLYRFL+++S FDW+ F +S+ GP
Sbjct: 129 GRILGSHCGLLSTYALEILVLYVINLFYSSLYCPLAVLYRFLDYYSTFDWEKFGVSVLGP 188

Query: 233 VPISLL-------------------------------PDVT---AEPPRKDGGVLLLSKS 258
           V IS L                               PD     AE        LL+++ 
Sbjct: 189 VSISSLLTGARESCLIMWLCLMVCFFRLIGLPFYLIFPDFVLFVAEAAETADKPLLINEE 248

Query: 259 FLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRA 318
           FL SC+ A+A      E   QPF+ KH N+ DPLR  NNLGRS+S GN +R R A +  A
Sbjct: 249 FLWSCKEAFAVSIRASECTKQPFLVKHINIQDPLRDYNNLGRSISLGNSYRFRYAISVGA 308

Query: 319 KGLARLLDCPNEDLYNE-VNQFFMNTRDRH 347
           + L  +L    E   NE + +FF NT DR+
Sbjct: 309 QRLKEILLMLPEGRMNEGLKEFFNNTLDRN 338


>gi|297745772|emb|CBI15828.3| unnamed protein product [Vitis vinifera]
          Length = 929

 Score =  256 bits (655), Expect = 9e-66,   Method: Compositional matrix adjust.
 Identities = 157/412 (38%), Positives = 222/412 (53%), Gaps = 76/412 (18%)

Query: 11  WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRD 70
           W  AE  T E++A++QP   S   R  V  YV+RLI  C  C+VF +GSVPLKTYL D D
Sbjct: 40  WAAAERATQEIVAKMQPTLGSMRERQEVIDYVQRLIGCCLGCEVFPYGSVPLKTYLLDGD 99

Query: 71  IDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVD 130
           IDL A      +++  A  V  +L+ EE+NE+AEF VK++Q+I AEVK++KCLV + V+D
Sbjct: 100 IDLTALCS-SNVEEALASDVHAVLKGEEQNENAEFEVKDIQFITAEVKLVKCLVKDIVID 158

Query: 131 IAFNQLGGLCTLCFLDEVDHLIN-------ENHLFKRSIILI--------KAWCYYES-- 173
           I+FNQLGGL TLCFL++   L+        + ++ + S++++          +C Y S  
Sbjct: 159 ISFNQLGGLSTLCFLEQWFILLTSYGETQMKENIIEASLLVLWFLYWHIWSLYCIYPSFT 218

Query: 174 ----------------------------------RILGGHHGLISSYALV-------TLV 192
                                             R++G  H    S  L+       + +
Sbjct: 219 SVQNHKRENPWFHMYGVQFLCNYSFKPLLSVIVDRLIGKDHLFKRSIILIKSWCYYESRI 278

Query: 193 LYIFHVFNGSFAGPLEVLY-----------------RFLEFFSKFDWDNFCLSLWGPVPI 235
           L   H    ++A  + VLY                 RFL++FSKFDWDN+C+SL GPV  
Sbjct: 279 LGAHHGLISTYALEILVLYIFHLFHLSLDGPLAVLYRFLDYFSKFDWDNYCISLNGPVCK 338

Query: 236 SLLPDVTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVN 295
           S LPD+ AE P      LLLS+ FL +C   ++    G E   + F  KH N+IDPLR N
Sbjct: 339 SSLPDIVAELPENGQDDLLLSEEFLRNCVDMFSVPFRGLETNSRTFPLKHLNIIDPLREN 398

Query: 296 NNLGRSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           NNLGRSV+KGNF+RIR+AF + +  L ++L  P E + +E+  FF +T +RH
Sbjct: 399 NNLGRSVNKGNFYRIRSAFKYGSHKLGQILSLPREVIQDELKNFFASTLERH 450


>gi|255083767|ref|XP_002508458.1| predicted protein [Micromonas sp. RCC299]
 gi|226523735|gb|ACO69716.1| predicted protein [Micromonas sp. RCC299]
          Length = 1269

 Score =  251 bits (641), Expect = 3e-64,   Method: Composition-based stats.
 Identities = 150/355 (42%), Positives = 201/355 (56%), Gaps = 28/355 (7%)

Query: 20  ELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQ---VFTFGSVPLKTYLPDRDIDLGAF 76
           ELI  ++P   S+ RR  V  ++  L+  CF  +   V  FGSVPL+TYLPD DID+   
Sbjct: 30  ELIDVLRPTEQSDRRRRGVFRHIASLVDGCFAGENVLVTAFGSVPLRTYLPDGDIDVCLL 89

Query: 77  SDDQTL-KDTWAHLVRDMLEN----------EEKNEHAEFRVKEVQYIQAEVKIIKCLVD 125
              + L +D W   +R  +E           E  +  AEF V E+  I AEVK++K + D
Sbjct: 90  GPHELLSRDDWTVRLRAHVERAEAAAAEASIELGSPVAEFAVSEIHIIHAEVKLMKLICD 149

Query: 126 NFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISS 185
             VVD++ NQ GGL  L FL+EV+  I +  +FKRSI+LIKAW +YE R+LG HH LIS+
Sbjct: 150 GVVVDVSANQFGGLAALGFLEEVNAFIGKGEIFKRSIVLIKAWGFYEGRLLGAHHALIST 209

Query: 186 YALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVT--- 242
           YAL TLVLYI + F+   + PLEVL++FL FF+ FDWD F +S+ GPVP+  L  VT   
Sbjct: 210 YALETLVLYILNRFHKELSTPLEVLHKFLVFFADFDWDKFAVSVHGPVPLEDLHKVTGPI 269

Query: 243 AEPPRKDGGVLLLSKSFLDSCRYAY------ADFPGGQENQGQPFVSKHFNVIDPLRVNN 296
            + P       LL+  F+      Y      A   GG ++  +P   K+ NV+DPL  +N
Sbjct: 270 GKRPEVHAEGALLTPDFMWRMMDKYGNESVSAKLGGGADSTPRPMARKYLNVVDPLLSSN 329

Query: 297 NLGRSVSKGNFFRIRTAFTFRAKGLARLLDCPN-EDLYNEV---NQFFMNTRDRH 347
           NLGRSVS+GN  RIR A    A+ L  L +     + +  V    QFF NT  RH
Sbjct: 330 NLGRSVSQGNAKRIRKALALGAQRLTALRESSTGGECFGAVRMLEQFFGNTM-RH 383


>gi|145341816|ref|XP_001415999.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144576222|gb|ABO94291.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 904

 Score =  247 bits (630), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 135/319 (42%), Positives = 191/319 (59%), Gaps = 14/319 (4%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQ---VFTFGSVPLKTYLPDRDI 71
           E +T EL+A ++P   SE RR AV  ++++L  +CF      V  +GSVPL+ YLPD DI
Sbjct: 36  ETLTNELVASLRPTEMSEIRRRAVFEHIKQLAQECFGTAHTLVSAYGSVPLRAYLPDGDI 95

Query: 72  DLGAFSDDQTL-KDTWAHLVRDMLENEEKNEHA--EFRVKEVQYIQAEVKIIKCLVDNFV 128
           D+    D + + K  W    R  +E  E       EF V EV  I AEV+++KC+VD  +
Sbjct: 96  DVCLLGDHRVIDKAQWTTKFRKHIEKAEAEADPPHEFAVSEVSVINAEVRLMKCIVDGMM 155

Query: 129 VDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYAL 188
           VD++ NQ GGL +L FL+E++  I  + LF RSIIL+KAW +YE RILG HH LIS+YAL
Sbjct: 156 VDVSANQFGGLASLGFLEEMNAFIGRDDLFVRSIILVKAWGFYEGRILGAHHALISTYAL 215

Query: 189 VTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRK 248
            TLVLYI + ++     PL VL++ L  F++FDW+ + L++ GPV I  +    A PP +
Sbjct: 216 ETLVLYIINKYHADLTCPLSVLHKLLSVFAEFDWEGYALTIHGPVAIEGI----ATPPDE 271

Query: 249 --DGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGN 306
             +GG  L+++ F+ +    Y+           P   K+ N+IDPL  NNNLGRSVS GN
Sbjct: 272 CLEGG--LITEEFMRTMLSTYSCEFMRAAASSAPVTVKYMNIIDPLLPNNNLGRSVSCGN 329

Query: 307 FFRIRTAFTFRAKGLARLL 325
           + R+R A    A+ L  L+
Sbjct: 330 YRRVRAALKLGAQRLDALM 348


>gi|308799699|ref|XP_003074630.1| DNA polymerase sigma (ISS) [Ostreococcus tauri]
 gi|116000801|emb|CAL50481.1| DNA polymerase sigma (ISS) [Ostreococcus tauri]
          Length = 875

 Score =  246 bits (628), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 135/317 (42%), Positives = 187/317 (58%), Gaps = 13/317 (4%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQ---VFTFGSVPLKTYLPDRDI 71
           E +T EL+  ++P   SE RR AV  +++ L   CF      V  +GSVPL+ YLPD DI
Sbjct: 32  ETLTNELVESLRPTAKSEMRRRAVFEHIKELAQGCFGTAHTLVSVYGSVPLRAYLPDGDI 91

Query: 72  DLGAFSDDQTL-KDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVD 130
           D+    D + + K +W    +  +E  E     EF V EV  I AEV+++KC+VD  +VD
Sbjct: 92  DVCLLGDHRVIDKASWTTKFQKHIEKVEAESDFEFAVSEVSVINAEVRLMKCIVDGMMVD 151

Query: 131 IAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVT 190
           ++ NQ GGL +L FL+E +  I  + LF RSIIL+KAW +YE RILG HH LI++YAL T
Sbjct: 152 VSANQFGGLASLGFLEETNAFIGRDDLFVRSIILVKAWGFYEGRILGAHHALIATYALET 211

Query: 191 LVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPR-KD 249
           LVLYI + +      PL VL++ L  F  FDW+ + L++ GPV    L D    PP   +
Sbjct: 212 LVLYIINKYYAELTCPLSVLHKLLRVFGDFDWEGYVLTIHGPV---ALEDANNIPPGCLE 268

Query: 250 GGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFR 309
           GG  LL++ F+ S    Y      + +   P V K+ N+IDPL  NNNLGRSVS GN+ R
Sbjct: 269 GG--LLTEEFMQSMLCQYGQI---ETSNSAPVVVKYMNIIDPLVPNNNLGRSVSCGNYRR 323

Query: 310 IRTAFTFRAKGLARLLD 326
           +R A    A+ L +L++
Sbjct: 324 VRAALRLGARHLDKLME 340


>gi|147817122|emb|CAN62161.1| hypothetical protein VITISV_017634 [Vitis vinifera]
          Length = 1147

 Score =  246 bits (628), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 119/201 (59%), Positives = 149/201 (74%)

Query: 147 EVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGP 206
           ++D LI ++HLFKRSIILIKAWCYYESRILG HHGLIS+YAL TLVLYIFH+F+    GP
Sbjct: 405 KIDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSLLNGP 464

Query: 207 LEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVLLLSKSFLDSCRYA 266
           L VLY+FL++FSKFDWDN+C+SL GPV IS LP++ AE P   G   LL    L  C   
Sbjct: 465 LAVLYKFLDYFSKFDWDNYCVSLNGPVRISSLPEMIAETPENVGADPLLGNDXLRDCLDR 524

Query: 267 YADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKGLARLLD 326
           ++    G E   + FV KHFN++DPL+ NNNLGRSVSKGNF+RIR+AFT+ A+ L R+L 
Sbjct: 525 FSVPSRGLETNSRTFVQKHFNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGRILL 584

Query: 327 CPNEDLYNEVNQFFMNTRDRH 347
            P + +  E+ +FF NT +RH
Sbjct: 585 QPEDKISEELCKFFTNTLERH 605



 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 66/125 (52%), Positives = 85/125 (68%), Gaps = 3/125 (2%)

Query: 54  VFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI 113
           VF FGSVPLKTYLPD DIDL AF     ++DT A+ V  +LE E++N  AEF VK+VQ I
Sbjct: 186 VFPFGSVPLKTYLPDGDIDLTAFGG-PAVEDTLAYEVYSVLEAEDQNRAAEFVVKDVQLI 244

Query: 114 QAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLIN--ENHLFKRSIILIKAWCYY 171
            AEVK++KCLV N VVDI+FNQLGGLCTLCFL++   + +  E    KR  +  + +   
Sbjct: 245 HAEVKLVKCLVQNIVVDISFNQLGGLCTLCFLEQQKAIWDGVEERFLKRLSLWKRQYISK 304

Query: 172 ESRIL 176
             R++
Sbjct: 305 GXRLM 309



 Score = 38.9 bits (89), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 18/44 (40%), Positives = 24/44 (54%)

Query: 10 RWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQ 53
          +W +AE    E+I  +QP   SEERR  V  YV+ LI     C+
Sbjct: 38 QWARAENTVQEIICEVQPTEVSEERRKEVVDYVQGLIRVRVGCE 81


>gi|302125450|emb|CBI35537.3| unnamed protein product [Vitis vinifera]
          Length = 398

 Score =  242 bits (617), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 125/207 (60%), Positives = 152/207 (73%), Gaps = 2/207 (0%)

Query: 45  LIIQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAE 104
           LI  C  C+VF +GSVPLK YL D DIDL        +++  A  V  +L+ E +NE+AE
Sbjct: 55  LIRCCLGCEVFPYGSVPLKIYLLDGDIDLTVLCSS-NVEEALASDVHAVLKGERQNENAE 113

Query: 105 FRVKEVQY-IQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSII 163
           F VK VQ+ I  EVK +KCLV + V+DI+FNQLGGL TLCFL +VD LI ++HLFKRSII
Sbjct: 114 FEVKNVQFNIIVEVKPVKCLVKDIVIDISFNQLGGLSTLCFLKQVDRLIGKDHLFKRSII 173

Query: 164 LIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWD 223
           LIK+ CYYESRILG +HGLIS+YAL  LVLYIFH+F+ S  GPL V YRFL++FSKFDWD
Sbjct: 174 LIKSRCYYESRILGAYHGLISTYALEILVLYIFHLFHSSLDGPLAVGYRFLDYFSKFDWD 233

Query: 224 NFCLSLWGPVPISLLPDVTAEPPRKDG 250
           N+C+SL G V  S LPD+ AE P   G
Sbjct: 234 NYCISLNGSVCKSSLPDIVAELPENGG 260


>gi|6572083|emb|CAB63026.1| putative protein [Arabidopsis thaliana]
          Length = 764

 Score =  240 bits (613), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 142/340 (41%), Positives = 190/340 (55%), Gaps = 62/340 (18%)

Query: 8   PGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLP 67
           P  W++ EE T E+I ++ P   SE+RR  V  YV++LI     C+V +FGSVPLKTYLP
Sbjct: 31  PELWMRVEEATREIIEQVHPTLVSEDRRRDVILYVQKLIRMTLGCEVHSFGSVPLKTYLP 90

Query: 68  DRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNF 127
           D DIDL AF      ++  A  V  +LE EE N  ++F VK+VQ I+AEVK++KCLV N 
Sbjct: 91  DGDIDLTAFGG-LYHEEELAAKVFAVLEREEHNLSSQFVVKDVQLIRAEVKLVKCLVQNI 149

Query: 128 VVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYA 187
           VVDI+FNQ+GG   +C L                       C+ E               
Sbjct: 150 VVDISFNQIGG---ICTL-----------------------CFLE--------------- 168

Query: 188 LVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPR 247
                               +VLY+FL++FSKFDWD++C+SL GPV +S LPD+  E P 
Sbjct: 169 --------------------KVLYKFLDYFSKFDWDSYCISLNGPVCLSSLPDIVVETPE 208

Query: 248 KDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNF 307
             G  LLL+  FL  C   Y+    G E   + F SKH N++DPL+  NNLGRSVSKGNF
Sbjct: 209 NGGEDLLLTSEFLKECLEMYSVPSRGFETNPRGFQSKHLNIVDPLKETNNLGRSVSKGNF 268

Query: 308 FRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           +RIR+AFT+ A+ L +L    +E + +E+ +FF N   RH
Sbjct: 269 YRIRSAFTYGARKLGQLFLQSDEAISSELRKFFSNMLLRH 308


>gi|30693508|ref|NP_190730.2| NT domain of poly(A) polymerase and terminal uridylyl
           transferase-containing protein [Arabidopsis thaliana]
 gi|332645292|gb|AEE78813.1| NT domain of poly(A) polymerase and terminal uridylyl
           transferase-containing protein [Arabidopsis thaliana]
          Length = 755

 Score =  240 bits (613), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 142/340 (41%), Positives = 190/340 (55%), Gaps = 62/340 (18%)

Query: 8   PGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLP 67
           P  W++ EE T E+I ++ P   SE+RR  V  YV++LI     C+V +FGSVPLKTYLP
Sbjct: 31  PELWMRVEEATREIIEQVHPTLVSEDRRRDVILYVQKLIRMTLGCEVHSFGSVPLKTYLP 90

Query: 68  DRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNF 127
           D DIDL AF      ++  A  V  +LE EE N  ++F VK+VQ I+AEVK++KCLV N 
Sbjct: 91  DGDIDLTAFGG-LYHEEELAAKVFAVLEREEHNLSSQFVVKDVQLIRAEVKLVKCLVQNI 149

Query: 128 VVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYA 187
           VVDI+FNQ+GG   +C L                       C+ E               
Sbjct: 150 VVDISFNQIGG---ICTL-----------------------CFLE--------------- 168

Query: 188 LVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPR 247
                               +VLY+FL++FSKFDWD++C+SL GPV +S LPD+  E P 
Sbjct: 169 --------------------KVLYKFLDYFSKFDWDSYCISLNGPVCLSSLPDIVVETPE 208

Query: 248 KDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNF 307
             G  LLL+  FL  C   Y+    G E   + F SKH N++DPL+  NNLGRSVSKGNF
Sbjct: 209 NGGEDLLLTSEFLKECLEMYSVPSRGFETNPRGFQSKHLNIVDPLKETNNLGRSVSKGNF 268

Query: 308 FRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           +RIR+AFT+ A+ L +L    +E + +E+ +FF N   RH
Sbjct: 269 YRIRSAFTYGARKLGQLFLQSDEAISSELRKFFSNMLLRH 308


>gi|168035607|ref|XP_001770301.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162678518|gb|EDQ64976.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1631

 Score =  239 bits (609), Expect = 2e-60,   Method: Composition-based stats.
 Identities = 113/215 (52%), Positives = 149/215 (69%), Gaps = 16/215 (7%)

Query: 145 LDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGS-- 202
           +D  D  + +NHLFKRS+IL+KAWCYYESRILG HHGLIS+YAL TLVLYIFHVF+    
Sbjct: 255 IDRNDFELKQNHLFKRSVILVKAWCYYESRILGAHHGLISTYALETLVLYIFHVFHPKRR 314

Query: 203 FAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVT-------------AEPPRKD 249
             GPLEVLY FL +F  FDWD +C+++WGPVP++ + +++             AE PRKD
Sbjct: 315 LRGPLEVLYLFLVYFCNFDWDKYCVTMWGPVPLARITEISSGSARKTFRISDFAEAPRKD 374

Query: 250 GGVLLLSKSFLDSCRYAYADFPGGQE-NQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFF 308
            G LLLSK FL+ C  +Y+D  GGQE +Q + F++K  NV+DP+R  NNLGRSV+ G+F 
Sbjct: 375 RGKLLLSKEFLERCIDSYSDAKGGQESSQRRNFITKFLNVLDPIRDTNNLGRSVNVGSFK 434

Query: 309 RIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNT 343
           RIR+AF   A+ L  +L+CP + +  +   FF  T
Sbjct: 435 RIRSAFGLGARTLGEVLECPTDQINEKFKSFFSCT 469



 Score =  174 bits (442), Expect = 5e-41,   Method: Composition-based stats.
 Identities = 88/142 (61%), Positives = 105/142 (73%), Gaps = 1/142 (0%)

Query: 6   LDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTY 65
           L+ G W + E  TAELI  I+P   SEERR AV A+V+RLI   F C+V  FGSVPLKTY
Sbjct: 31  LEDGWWSRVEGHTAELIDSIKPTRSSEERRTAVTAFVQRLIRDRFDCKVVKFGSVPLKTY 90

Query: 66  LPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVD 125
           LPD DIDL  F+ +  LK+TWA  V   L+  E++ +AEFRVKEVQYIQAEVK+IKCLV+
Sbjct: 91  LPDGDIDLTIFARND-LKETWAQDVVKALKQAEEDTNAEFRVKEVQYIQAEVKLIKCLVE 149

Query: 126 NFVVDIAFNQLGGLCTLCFLDE 147
           N VVDI+FNQ GGL T CFL+E
Sbjct: 150 NIVVDISFNQTGGLSTFCFLEE 171


>gi|290976573|ref|XP_002671014.1| predicted protein [Naegleria gruberi]
 gi|284084579|gb|EFC38270.1| predicted protein [Naegleria gruberi]
          Length = 763

 Score =  239 bits (609), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 135/376 (35%), Positives = 200/376 (53%), Gaps = 66/376 (17%)

Query: 4   RPLDPGR--WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVP 61
            P+D     + +   +  +L+ RIQP   SE+ R  V   +   +++    + + +GSV 
Sbjct: 153 EPIDESTSCFRRCNSLIQQLLYRIQPSSESEKHRKEVFDIIA-AVLELANLKTYLYGSVA 211

Query: 62  LKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEE---------------KNEHAEFR 106
            KTYLPD DIDL  F  ++   +  +  V ++L ++                KN H   +
Sbjct: 212 FKTYLPDGDIDLSVFVSNEEYLELSSQNVNNLLSHQPQVNDSTISYVHNVLLKNMHIGLK 271

Query: 107 ----------------------------VKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGG 138
                                       ++++ +I AEVK+IKC V+N  +D++  Q+GG
Sbjct: 272 QQLADPSIPWYNKARSLFSEIQRNNLAYIEDMTFINAEVKLIKCTVNNIPIDMSSGQIGG 331

Query: 139 LCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHV 198
           L TLCFL EVD  I +NHLFKRSIIL+K+W YYESRILG HHGL+S+Y L  L++Y+F +
Sbjct: 332 LSTLCFLHEVDDKIADNHLFKRSIILMKSWSYYESRILGSHHGLVSTYGLTVLLMYMFRL 391

Query: 199 FNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTA---------EPPRKD 249
           +      PL+ LYRFL ++S FDW NF +S++GP+P+  + D  +          P R D
Sbjct: 392 Y--KIETPLQALYRFLNYYSTFDWTNFGISIYGPIPLGAINDHKSIEDFYYENLPPERHD 449

Query: 250 GGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFR 309
                L+ SFL SC+  Y     G  +  + F  K+ N++DPLR  NNLGRSV+  NF R
Sbjct: 450 S----LTSSFLQSCKSKY-----GTVDSSKTFTIKNLNIVDPLRDFNNLGRSVNYNNFLR 500

Query: 310 IRTAFTFRAKGLARLL 325
           IR A    +K +  +L
Sbjct: 501 IRRAIKKGSKTITDIL 516


>gi|110738268|dbj|BAF01063.1| hypothetical protein [Arabidopsis thaliana]
          Length = 660

 Score =  237 bits (605), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 117/200 (58%), Positives = 147/200 (73%)

Query: 148 VDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPL 207
           +DHLI ++HLFKRSIILIKAWCYYESRILG  HGLIS+YAL TLVLYIFH+F+ S  GPL
Sbjct: 1   IDHLIGKDHLFKRSIILIKAWCYYESRILGAFHGLISTYALETLVLYIFHLFHSSLNGPL 60

Query: 208 EVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVLLLSKSFLDSCRYAY 267
            VLY+FL++FSKFDWD++C+SL GPV +S LPD+  E P   G  LLL+  FL  C   Y
Sbjct: 61  AVLYKFLDYFSKFDWDSYCISLNGPVCLSSLPDIVVETPENGGEDLLLTSEFLKECLEMY 120

Query: 268 ADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKGLARLLDC 327
           +    G E   + F SKH N++DPL+  NNLGRSVSKGNF+RIR+AFT+ A+ L +L   
Sbjct: 121 SVPSRGFETNPRGFQSKHLNIVDPLKETNNLGRSVSKGNFYRIRSAFTYGARKLGQLFLQ 180

Query: 328 PNEDLYNEVNQFFMNTRDRH 347
            +E + +E+ +FF N   RH
Sbjct: 181 SDEAISSELRKFFSNMLLRH 200


>gi|307104056|gb|EFN52312.1| hypothetical protein CHLNCDRAFT_58914 [Chlorella variabilis]
          Length = 740

 Score =  227 bits (578), Expect = 8e-57,   Method: Compositional matrix adjust.
 Identities = 136/329 (41%), Positives = 187/329 (56%), Gaps = 40/329 (12%)

Query: 23  ARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSDDQT- 81
           A ++ +   E+R+ AVA     L+ +C   + F FGSVPL+  LPD DID+  F+   T 
Sbjct: 367 ASLEVEQLLEQRQAAVA-----LVQECLQVEAFMFGSVPLRAVLPDGDIDISFFATAATT 421

Query: 82  -----------------------LKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVK 118
                                  L+DTWA  +   LE E     A F++++VQ IQAEVK
Sbjct: 422 PSSPSGNGGEQPGHRAGASPPGDLRDTWASQLLRALEREAVRPDAPFKIRDVQIIQAEVK 481

Query: 119 IIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGG 178
           ++KC+V + VVD++F+ +GGLCT+ FL+  D  I   HLFKRSI+L+KAWCYYESR+LG 
Sbjct: 482 LVKCVVHDVVVDVSFDTVGGLCTVAFLEAADRRIGRQHLFKRSILLLKAWCYYESRLLGA 541

Query: 179 HHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLL 238
           HHGLISSYAL  LVLYIF++ +     PL+VL RFL     FDW+ +CL+L GP+PI+ L
Sbjct: 542 HHGLISSYALEVLVLYIFNLHHAELHTPLDVLRRFLAVLGSFDWERYCLALQGPLPIADL 601

Query: 239 PDVTAEPPR--KDGGVLLLSKSFLDSCRYAYA----DFPGGQENQGQPFVS-----KHFN 287
             +  +       G   LL   F+      Y+         QE  G   V+     KH N
Sbjct: 602 HKLHVDRTALVSSGTEPLLDADFMRGVLQHYSVQHLSQQQQQEAAGMQLVAPRFPLKHLN 661

Query: 288 VIDPLRVNNNLGRSVSKGNFFRIRTAFTF 316
           ++DPL  +NNLGRSVSK ++ R++ A   
Sbjct: 662 IVDPLLPSNNLGRSVSKASYARVKKALAL 690


>gi|428171015|gb|EKX39935.1| hypothetical protein GUITHDRAFT_113927 [Guillardia theta CCMP2712]
          Length = 632

 Score =  219 bits (557), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 129/337 (38%), Positives = 191/337 (56%), Gaps = 31/337 (9%)

Query: 16  EITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQ------VFTFGSVPLKTYLPDR 69
           E   E++ ++QP   +E  R  V  YV++LI      +      V  FGSVPLKTYLP  
Sbjct: 17  EQADEIVRQLQPHRRAERHRLTVFEYVKKLIKHVADEENKTEIYVHRFGSVPLKTYLPHG 76

Query: 70  DIDLGAFSDDQTLKDTWAHLVRDMLENEEKNE--------HA---EFRVKEVQYIQAE-- 116
           D+D+ AF+ +    D W   ++  LE+E K          H+   + R +  + +  +  
Sbjct: 77  DLDVTAFAAN----DLWLERLKAKLEDEAKKNDMYVVSGVHSVPRDLRAQSREELGKKDQ 132

Query: 117 -----VKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYY 171
                VK++KC V+   VDI  N LGG+C LCFL++VD ++  +HLFKR+ IL+K+WCY+
Sbjct: 133 GPVEIVKVVKCQVNGISVDITANALGGMCNLCFLEKVDTMLKRDHLFKRATILVKSWCYF 192

Query: 172 ESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWG 231
           ES IL   +GL+S+YAL TLVL I ++F+     PL+VL RFLE+++ FDW N CL++ G
Sbjct: 193 ESHILSSQNGLLSTYALETLVLCIVNIFHEELQTPLDVLKRFLEYYANFDWRNHCLTMRG 252

Query: 232 PVPISLLPDVTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQ-ENQGQPFVSKHFNVID 290
           PV  S +P     P   +    LL+ + L    +      G Q +N+G  F  K+ N+ D
Sbjct: 253 PVNRSNIPPGGEVPHLDNEPSYLLNDAILQEDSHLQFLMSGLQDDNRG--FQWKYMNICD 310

Query: 291 PLRVNNNLGRSVSKGNFFRIRTAFTFRAKGLARLLDC 327
           PL   NN+GRSVS+ + +RI +AF    + L+ LL C
Sbjct: 311 PLSTRNNIGRSVSRSSAYRIASAFRHGWQSLSGLLYC 347


>gi|414866688|tpg|DAA45245.1| TPA: hypothetical protein ZEAMMB73_273182, partial [Zea mays]
          Length = 260

 Score =  219 bits (557), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 106/194 (54%), Positives = 138/194 (71%), Gaps = 1/194 (0%)

Query: 24  RIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
           R++P   SE RR  V  Y RRL+     C+VF FGSVPLKTYLPD DIDL    +  +  
Sbjct: 37  RVRPTEASERRRAEVVDYARRLVGSALGCEVFAFGSVPLKTYLPDGDIDLTVLGN-TSYD 95

Query: 84  DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLC 143
            T  + V  +LE+EE+N  AEF VK+++ I AEV++IKC + N +VDI+FNQ GG+C LC
Sbjct: 96  STLVNDVFCILESEEQNSDAEFVVKDLERIDAEVRLIKCTIGNIIVDISFNQTGGICALC 155

Query: 144 FLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSF 203
           FL+ VD  + +NHLFKRSIILIKAWCYYESR+LG HHGLIS+YAL  L+LY+F++F+ S 
Sbjct: 156 FLELVDRKVGKNHLFKRSIILIKAWCYYESRLLGAHHGLISTYALEVLILYVFNLFHKSL 215

Query: 204 AGPLEVLYRFLEFF 217
             P+EV  +   +F
Sbjct: 216 HSPVEVCLKRFTYF 229


>gi|297612542|ref|NP_001065982.2| Os12g0114200 [Oryza sativa Japonica Group]
 gi|255669984|dbj|BAF29001.2| Os12g0114200, partial [Oryza sativa Japonica Group]
          Length = 178

 Score =  218 bits (554), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 111/168 (66%), Positives = 130/168 (77%), Gaps = 6/168 (3%)

Query: 53  QVFTFGSVPLKTYLPDRDIDLGAF--SDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEV 110
           QVF FGSVPLKTYLPD DIDL AF  S D+ L    A  V+ +LE+EE  + AEF VK+V
Sbjct: 1   QVFPFGSVPLKTYLPDGDIDLTAFGHSSDEIL----AKQVQAVLESEEARKDAEFEVKDV 56

Query: 111 QYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCY 170
           QYI AEVK++KC+V N +VDI+FNQ GGLCTLCFL++VD    +NHLFKRSI+LIKAWCY
Sbjct: 57  QYIHAEVKLVKCIVQNIIVDISFNQFGGLCTLCFLEKVDQKFEKNHLFKRSIMLIKAWCY 116

Query: 171 YESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFS 218
           YESRILG HHGLIS+YAL  LVLYIFH+F+G+  GPL V   F  F S
Sbjct: 117 YESRILGAHHGLISTYALEILVLYIFHLFHGTLDGPLAVSSDFQLFCS 164


>gi|77553482|gb|ABA96278.1| nucleotidyltransferase family protein, putative, expressed [Oryza
           sativa Japonica Group]
 gi|215769169|dbj|BAH01398.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 622

 Score =  211 bits (538), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 101/185 (54%), Positives = 129/185 (69%)

Query: 163 ILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDW 222
           +LIKAWCYYESRILG HHGLIS+YAL  LVLYIFH+F+G+  GPL VLYRFL+++SKFDW
Sbjct: 1   MLIKAWCYYESRILGAHHGLISTYALEILVLYIFHLFHGTLDGPLAVLYRFLDYYSKFDW 60

Query: 223 DNFCLSLWGPVPISLLPDVTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFV 282
           DN  +SL+GP+ +S LP++  + P        + + FL  C   +   P   E   Q F 
Sbjct: 61  DNKGISLYGPISLSSLPELVTDSPDTVNDDFTMREDFLKECAQWFTVLPRNSEKNTQVFP 120

Query: 283 SKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMN 342
            K FN++DPL+ +NNLGRSVSKGNF RIR+AF F A+ L ++L  P+    +EVNQFF N
Sbjct: 121 RKFFNIVDPLKQSNNLGRSVSKGNFLRIRSAFDFGARKLGKILQVPDNFTVDEVNQFFRN 180

Query: 343 TRDRH 347
           T  RH
Sbjct: 181 TLKRH 185


>gi|226506494|ref|NP_001141604.1| uncharacterized protein LOC100273722 [Zea mays]
 gi|194705246|gb|ACF86707.1| unknown [Zea mays]
 gi|413924676|gb|AFW64608.1| hypothetical protein ZEAMMB73_859338 [Zea mays]
          Length = 251

 Score =  191 bits (486), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 99/186 (53%), Positives = 125/186 (67%), Gaps = 4/186 (2%)

Query: 163 ILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDW 222
           +LIK WCYYES ILG   GL+S+YAL TLVLYIFHVF+ S  GPL VLYRFL+++SKFDW
Sbjct: 1   MLIKHWCYYESCILGAQRGLVSTYALETLVLYIFHVFHKSLDGPLAVLYRFLDYYSKFDW 60

Query: 223 DNFCLSLWGPVPISLLPDVTAEPP--RKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQP 280
           DN  +SL+GP+ +S LP++  EPP  R DG   L  ++FL  C  A++  P   E   Q 
Sbjct: 61  DNKGISLFGPISLSSLPELVTEPPYTRDDG--FLSREAFLKDCAKAFSVPPINSEENPQV 118

Query: 281 FVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFF 340
           F  K  N++DPL+ +NNLGRS+SKGN  RIR  F F A  L ++L  P     NE+N+FF
Sbjct: 119 FSKKFVNIVDPLKQSNNLGRSISKGNLGRIRKEFYFGACKLGKILQAPACFSANEINRFF 178

Query: 341 MNTRDR 346
            NT  R
Sbjct: 179 RNTLSR 184


>gi|218192781|gb|EEC75208.1| hypothetical protein OsI_11468 [Oryza sativa Indica Group]
          Length = 860

 Score =  191 bits (484), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 103/193 (53%), Positives = 124/193 (64%), Gaps = 38/193 (19%)

Query: 54  VFTFGSVPLKTYLPDRDIDL---GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEV 110
           VF +GSVPLKTYLPD D+DL   G  S   TL D   H+    L++EE+N  AEF VK++
Sbjct: 20  VFAYGSVPLKTYLPDGDVDLTVLGNTSYGSTLIDDIYHI----LQSEEQNCDAEFEVKDL 75

Query: 111 QYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCY 170
           Q I AEV                               D  + +NHL K SIILIKAWCY
Sbjct: 76  QLINAEV-------------------------------DRKVGKNHLVKNSIILIKAWCY 104

Query: 171 YESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLW 230
           YESR+LG HHGLIS+YAL TL+LYIF++F+ S  GPLEVLYRFLE+FSKFDWDN+C+SL 
Sbjct: 105 YESRLLGAHHGLISTYALETLILYIFNLFHKSLHGPLEVLYRFLEYFSKFDWDNYCISLN 164

Query: 231 GPVPISLLPDVTA 243
           GPV +S LP+  A
Sbjct: 165 GPVALSSLPNQIA 177


>gi|303287038|ref|XP_003062808.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226455444|gb|EEH52747.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 781

 Score =  177 bits (448), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 99/229 (43%), Positives = 135/229 (58%), Gaps = 13/229 (5%)

Query: 120 IKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGH 179
           +KC+ D  VVDI+ NQ GGL TL FL+EVD  I  + +FKRSIILIKAW +YE R+LG H
Sbjct: 1   MKCIADGVVVDISANQFGGLATLGFLEEVDAFIARDGIFKRSIILIKAWGFYEGRVLGAH 60

Query: 180 HGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLP 239
           H LIS+YAL TLVLY+ + ++   + PLEVL++FL +F+ F+WD + +S+ GPV +  L 
Sbjct: 61  HALISTYALETLVLYVLNAYHEELSTPLEVLHKFLTYFADFEWDAYAVSIHGPVRLDALE 120

Query: 240 DVTAEPPRKDGGVLL---LSKSFLDSCRYAYADFPGGQENQG--------QPFVSKHFNV 288
               +      G LL    +K  LD  +Y        ++ Q         +    KH NV
Sbjct: 121 KGVRDADAPARGPLLTPAFTKRVLD--KYGNDAIINAEKGQAGPGGGGNRRAMQPKHLNV 178

Query: 289 IDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVN 337
           IDPL  +NNLGRSVS+GN  RI+ A    A  L  L +   + +  E+N
Sbjct: 179 IDPLLPSNNLGRSVSQGNAKRIQKALRLGAAKLTSLRNAMRDGVSCELN 227


>gi|401410712|ref|XP_003884804.1| hypothetical protein NCLIV_052020 [Neospora caninum Liverpool]
 gi|325119222|emb|CBZ54776.1| hypothetical protein NCLIV_052020 [Neospora caninum Liverpool]
          Length = 3449

 Score =  172 bits (437), Expect = 2e-40,   Method: Composition-based stats.
 Identities = 77/206 (37%), Positives = 125/206 (60%), Gaps = 9/206 (4%)

Query: 54  VFTFGSVPLKTYLPDRDIDLGAFS--------DDQTLKDTWAHLVRDMLENEEKNEHAEF 105
           V+ +GS PL+T+LPD D+D+G  S        + +   D    ++ D  + E+   H  F
Sbjct: 358 VYRYGSFPLRTFLPDGDLDIGIISYNRRTGVVEGEEESDALLAVLLDKFQREDVKTHKTF 417

Query: 106 RVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILI 165
            ++E   + AEV+I+KC+V    VD++ N++GG C+L FL+  D  I  +HLFKRS++LI
Sbjct: 418 PLREASLVDAEVRILKCIVSGIAVDVSVNKVGGCCSLVFLELADRRIGRHHLFKRSVLLI 477

Query: 166 KAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGS-FAGPLEVLYRFLEFFSKFDWDN 224
           K+W  YES +LG   GL+++Y +  LVL++FHV   S    PL +LY F  ++S F WD 
Sbjct: 478 KSWFAYESHLLGSRSGLLATYCVEALVLHLFHVLPASLLPTPLHLLYHFFSYYSSFHWDR 537

Query: 225 FCLSLWGPVPISLLPDVTAEPPRKDG 250
           + ++  GP+P++ +   ++ P R+ G
Sbjct: 538 YAVTACGPLPLTFITRASSVPDRRGG 563



 Score = 41.2 bits (95), Expect = 0.67,   Method: Composition-based stats.
 Identities = 19/46 (41%), Positives = 27/46 (58%)

Query: 280 PFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKGLARLL 325
           PF+ +  NV+DPL   NNL RSVS+  F+R+  A     + L  +L
Sbjct: 889 PFLFRSMNVVDPLHNGNNLARSVSETAFYRLLHAMKKGLQALTHIL 934


>gi|221502484|gb|EEE28211.1| conserved hypothetical protein [Toxoplasma gondii VEG]
          Length = 3297

 Score =  169 bits (427), Expect = 2e-39,   Method: Composition-based stats.
 Identities = 77/213 (36%), Positives = 127/213 (59%), Gaps = 9/213 (4%)

Query: 54  VFTFGSVPLKTYLPDRDIDLGAFS--------DDQTLKDTWAHLVRDMLENEEKNEHAEF 105
           V+ +GS PL+T+LPD D+D+G  S        + +   D    ++ +  +  E   H  F
Sbjct: 224 VYRYGSFPLRTFLPDGDLDIGVISFNRRTGVLEGEEESDALLAVLLEKFQRAEVKSHKTF 283

Query: 106 RVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILI 165
            ++E   + AEV+I+KC+V    VD++ N++GG C+L FL+  D  I  NHLFKRS++LI
Sbjct: 284 PLREASLVDAEVRILKCIVSGIAVDVSVNKVGGCCSLVFLELADRRIGRNHLFKRSVLLI 343

Query: 166 KAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGS-FAGPLEVLYRFLEFFSKFDWDN 224
           K+W  YES +LG   GL+++Y +  LVL++FHVF  +    PL +LY+F  ++S F WD 
Sbjct: 344 KSWFAYESHLLGSRSGLLATYCVEALVLHLFHVFPAALLPTPLHLLYQFFSYYSSFHWDR 403

Query: 225 FCLSLWGPVPISLLPDVTAEPPRKDGGVLLLSK 257
           + ++  G +P++ +   ++   R+ G   L S+
Sbjct: 404 YAVTACGALPLTFITRTSSVQDRRGGSAPLPSR 436



 Score = 41.6 bits (96), Expect = 0.57,   Method: Composition-based stats.
 Identities = 19/46 (41%), Positives = 28/46 (60%)

Query: 280 PFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKGLARLL 325
           PF+ +  NV+DPL   NNL RSVS+  F+R+  A     + L ++L
Sbjct: 738 PFLFRSMNVVDPLHNGNNLARSVSETAFYRLLHAMKKGLQALTQVL 783


>gi|221482136|gb|EEE20497.1| conserved hypothetical protein [Toxoplasma gondii GT1]
          Length = 3441

 Score =  169 bits (427), Expect = 2e-39,   Method: Composition-based stats.
 Identities = 77/213 (36%), Positives = 127/213 (59%), Gaps = 9/213 (4%)

Query: 54  VFTFGSVPLKTYLPDRDIDLGAFS--------DDQTLKDTWAHLVRDMLENEEKNEHAEF 105
           V+ +GS PL+T+LPD D+D+G  S        + +   D    ++ +  +  E   H  F
Sbjct: 369 VYRYGSFPLRTFLPDGDLDIGVISFNRRTGVLEGEEESDALLAVLLEKFQRAEVKSHKTF 428

Query: 106 RVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILI 165
            ++E   + AEV+I+KC+V    VD++ N++GG C+L FL+  D  I  NHLFKRS++LI
Sbjct: 429 PLREASLVDAEVRILKCIVSGIAVDVSVNKVGGCCSLVFLELADRRIGRNHLFKRSVLLI 488

Query: 166 KAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGS-FAGPLEVLYRFLEFFSKFDWDN 224
           K+W  YES +LG   GL+++Y +  LVL++FHVF  +    PL +LY+F  ++S F WD 
Sbjct: 489 KSWFAYESHLLGSRSGLLATYCVEALVLHLFHVFPAALLPTPLHLLYQFFSYYSSFHWDR 548

Query: 225 FCLSLWGPVPISLLPDVTAEPPRKDGGVLLLSK 257
           + ++  G +P++ +   ++   R+ G   L S+
Sbjct: 549 YAVTACGALPLTFITRTSSVQDRRGGSAPLPSR 581



 Score = 41.6 bits (96), Expect = 0.57,   Method: Composition-based stats.
 Identities = 19/46 (41%), Positives = 28/46 (60%)

Query: 280 PFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKGLARLL 325
           PF+ +  NV+DPL   NNL RSVS+  F+R+  A     + L ++L
Sbjct: 883 PFLFRSMNVVDPLHNGNNLARSVSETAFYRLLHAMKKGLQALTQVL 928


>gi|237843045|ref|XP_002370820.1| hypothetical protein TGME49_014990 [Toxoplasma gondii ME49]
 gi|211968484|gb|EEB03680.1| hypothetical protein TGME49_014990 [Toxoplasma gondii ME49]
          Length = 3436

 Score =  169 bits (427), Expect = 2e-39,   Method: Composition-based stats.
 Identities = 77/213 (36%), Positives = 127/213 (59%), Gaps = 9/213 (4%)

Query: 54  VFTFGSVPLKTYLPDRDIDLGAFS--------DDQTLKDTWAHLVRDMLENEEKNEHAEF 105
           V+ +GS PL+T+LPD D+D+G  S        + +   D    ++ +  +  E   H  F
Sbjct: 363 VYRYGSFPLRTFLPDGDLDIGVISFNRRTGVLEGEEESDALLAVLLEKFQRAEVKSHKTF 422

Query: 106 RVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILI 165
            ++E   + AEV+I+KC+V    VD++ N++GG C+L FL+  D  I  NHLFKRS++LI
Sbjct: 423 PLREASLVDAEVRILKCIVSGIAVDVSVNKVGGCCSLVFLELADRRIGRNHLFKRSVLLI 482

Query: 166 KAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGS-FAGPLEVLYRFLEFFSKFDWDN 224
           K+W  YES +LG   GL+++Y +  LVL++FHVF  +    PL +LY+F  ++S F WD 
Sbjct: 483 KSWFAYESHLLGSRSGLLATYCVEALVLHLFHVFPAALLPTPLHLLYQFFSYYSSFHWDR 542

Query: 225 FCLSLWGPVPISLLPDVTAEPPRKDGGVLLLSK 257
           + ++  G +P++ +   ++   R+ G   L S+
Sbjct: 543 YAVTACGALPLTFITRTSSVQDRRGGSAPLPSR 575



 Score = 41.6 bits (96), Expect = 0.57,   Method: Composition-based stats.
 Identities = 19/46 (41%), Positives = 28/46 (60%)

Query: 280 PFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKGLARLL 325
           PF+ +  NV+DPL   NNL RSVS+  F+R+  A     + L ++L
Sbjct: 877 PFLFRSMNVVDPLHNGNNLARSVSETAFYRLLHAMKKGLQALTQVL 922


>gi|222624434|gb|EEE58566.1| hypothetical protein OsJ_09878 [Oryza sativa Japonica Group]
          Length = 1064

 Score =  167 bits (423), Expect = 7e-39,   Method: Composition-based stats.
 Identities = 81/106 (76%), Positives = 86/106 (81%), Gaps = 1/106 (0%)

Query: 243 AEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQ-PFVSKHFNVIDPLRVNNNLGRS 301
           AEPPR D   LLLSKSFLD C YAYA  P  QE+QGQ PFVSKHFNVIDPLR NNNLGRS
Sbjct: 3   AEPPRMDAAELLLSKSFLDKCSYAYAVTPRIQESQGQQPFVSKHFNVIDPLRTNNNLGRS 62

Query: 302 VSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           VSKGNFFRIR+AF+F AK LA+LL+CP EDL  EVNQFF NT  RH
Sbjct: 63  VSKGNFFRIRSAFSFGAKRLAKLLECPKEDLIAEVNQFFTNTWIRH 108


>gi|412992209|emb|CCO19922.1| predicted protein [Bathycoccus prasinos]
          Length = 1318

 Score =  165 bits (418), Expect = 3e-38,   Method: Composition-based stats.
 Identities = 101/286 (35%), Positives = 145/286 (50%), Gaps = 50/286 (17%)

Query: 105 FRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIIL 164
             VK++  I A+V+++KC+VD  VVD++ NQ GGL TL FL EV+  I +N LFKRS+IL
Sbjct: 289 LEVKDIVVIHADVRLLKCVVDGIVVDVSANQFGGLATLAFLKEVNSKIGKNDLFKRSVIL 348

Query: 165 IKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNG-------------SFAGPLEVLY 211
           +KAW +YESRILG  + L+S+YAL TL++     FN                A PL+VL 
Sbjct: 349 VKAWAFYESRILGAPYALLSTYALKTLIICALRRFNKKESKSDATKTKKREIATPLDVLR 408

Query: 212 RFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEP-----------------------PRK 248
            F E+ S F W+   ++++G VP+  L  V+                            +
Sbjct: 409 IFFEYVSDFPWETHAVTIFGDVPVEKLDKVSVREFSSSSKSEKNKNKNNDDEREEKDDEE 468

Query: 249 DGGVLLLSKSFLDSCRYAYADFPGGQEN--------QGQPFV-----SKHFNVIDPLRVN 295
                LL  +F+D+   +Y        N        +  PF      +KH +++DPL   
Sbjct: 469 AEEDPLLDDTFVDTILKSYGPDSRPDANVLLNIGNGKKAPFRRRAIGAKHLHILDPLSET 528

Query: 296 NNLGRSVSKGNFFRIRTAFTFRAKGLARL-LDCPNEDLYNEVNQFF 340
           NNLGRSVS GNF R+R AF   A+ L RL ++   E++      FF
Sbjct: 529 NNLGRSVSLGNFARVRAAFRLGAERLKRLEMESEPENITRGFEYFF 574



 Score = 55.1 bits (131), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 33/87 (37%), Positives = 43/87 (49%), Gaps = 15/87 (17%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQ--------------VFTFGSV 60
           E +T ELIA ++P   SE+RR  V   +  LI +CF  +              V  FGSV
Sbjct: 129 EALTEELIASLRPSKQSEKRRRMVFRKMESLIRECFEKEFEGEGVNEKKNTIVVSAFGSV 188

Query: 61  PLKTYLPDRDIDLGAFSDDQTL-KDTW 86
           P  TYLPD DID+    D + L   +W
Sbjct: 189 PFGTYLPDGDIDVCILGDHEVLDSQSW 215


>gi|224064218|ref|XP_002301405.1| predicted protein [Populus trichocarpa]
 gi|222843131|gb|EEE80678.1| predicted protein [Populus trichocarpa]
          Length = 141

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 94/116 (81%), Positives = 99/116 (85%)

Query: 1   SVIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSV 60
           SVIR LD  RW KAEE TAELIA IQP+  SEE RNAVA YV+RLI +CFPCQVFTFGSV
Sbjct: 26  SVIRVLDSERWSKAEERTAELIACIQPNQPSEELRNAVADYVQRLIAKCFPCQVFTFGSV 85

Query: 61  PLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAE 116
           PLKTYLPD DIDL AFS +  LKDTWAH VRDMLENEEKNE+AEFRVKEVQYIQAE
Sbjct: 86  PLKTYLPDGDIDLTAFSKNPNLKDTWAHQVRDMLENEEKNENAEFRVKEVQYIQAE 141


>gi|297600524|ref|NP_001049344.2| Os03g0210800 [Oryza sativa Japonica Group]
 gi|255674303|dbj|BAF11258.2| Os03g0210800 [Oryza sativa Japonica Group]
          Length = 871

 Score =  161 bits (407), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 82/108 (75%), Positives = 88/108 (81%), Gaps = 1/108 (0%)

Query: 241 VTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQ-PFVSKHFNVIDPLRVNNNLG 299
           +TAEPPR D   LLLSKSFLD C YAYA  P  QE+QGQ PFVSKHFNVIDPLR NNNLG
Sbjct: 1   MTAEPPRMDAAELLLSKSFLDKCSYAYAVTPRIQESQGQQPFVSKHFNVIDPLRTNNNLG 60

Query: 300 RSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           RSVSKGNFFRIR+AF+F AK LA+LL+CP EDL  EVNQFF NT  RH
Sbjct: 61  RSVSKGNFFRIRSAFSFGAKRLAKLLECPKEDLIAEVNQFFTNTWIRH 108


>gi|224127915|ref|XP_002320195.1| predicted protein [Populus trichocarpa]
 gi|222860968|gb|EEE98510.1| predicted protein [Populus trichocarpa]
          Length = 145

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 93/117 (79%), Positives = 99/117 (84%)

Query: 1   SVIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSV 60
           SVIR LD  RW KAEE TAELI  IQP+  SEE RNAVA YV+RLI++CFPCQVFTFGSV
Sbjct: 26  SVIRVLDLDRWSKAEERTAELIDCIQPNQPSEELRNAVADYVQRLILKCFPCQVFTFGSV 85

Query: 61  PLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEV 117
           PLKTYLPD DIDL AFS +  LKDTWAH VRDMLENEEKNE+AEFRVKEVQYIQAE 
Sbjct: 86  PLKTYLPDGDIDLTAFSKNPNLKDTWAHQVRDMLENEEKNENAEFRVKEVQYIQAEA 142


>gi|301093296|ref|XP_002997496.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262110638|gb|EEY68690.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 782

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 112/375 (29%), Positives = 169/375 (45%), Gaps = 71/375 (18%)

Query: 21  LIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQV----FTFGSVPLKTYLPDRDIDLGAF 76
           LI  + P   ++  R  V ++V+R+I   FP       F  GS P+KTYLP  D+D+   
Sbjct: 251 LIEWMGPSDVADRVRQQVLSFVQRVITAHFPLAAAPLFFATGSYPMKTYLPGSDLDICLL 310

Query: 77  SDDQTLKDTWAHLVRDMLENEEKNEHAEF------------------------------- 105
              Q L+ +W ++V   L     +  A                                 
Sbjct: 311 VP-QELESSWYYIVTQALCVAGGSGGAGTVLDLGNSASSDVSGSSSPSGPAAASGGGPLL 369

Query: 106 ---RVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSI 162
               V+ V +I A+V+++KC VDN  VD   N++G L  +  LD +   +   HLFK+S+
Sbjct: 370 LTNTVRNVTFINADVRVVKCTVDNIPVDFTANRVGALGAVRLLDAMAARVGRQHLFKKSL 429

Query: 163 ILIKAWCYYESR---------------------ILGGHHGLISSYALVTLVLYIFHVFNG 201
           ILIKAWC +ESR                     ++G  HG +S+YA+ T+V+ +F+    
Sbjct: 430 ILIKAWCTHESRPFMQRASNEAGGSVPGSTPASVMGASHGALSTYAVNTIVMALFNQHGD 489

Query: 202 SFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLL---PDVTAEPPRKDGGVLLLSKS 258
           +   PL+ LY FL+  ++F W    L+L GPVP+S L   P       R       L  S
Sbjct: 490 ALTHPLQALYLFLDRLAEFPWHECALTLHGPVPLSRLASTPLNGTTSYRSKLKTAKLDAS 549

Query: 259 FLDSCRYAYADFPGG-----QENQGQP---FVSKHFNVIDPLRVNNNLGRSVSKGNFFRI 310
            +++ R   AD  G      + ++G P   F  +  N++DPL   NNL RSVS   F  +
Sbjct: 550 DVEAIRDTLADQFGAFDAALKSSKGTPTGLFPIRACNIVDPLDDKNNLARSVSAEGFPVM 609

Query: 311 RTAFTFRAKGLARLL 325
           + AF      LA +L
Sbjct: 610 KRAFRLARDQLAAML 624


>gi|403357215|gb|EJY78230.1| hypothetical protein OXYTRI_24618 [Oxytricha trifallax]
          Length = 831

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 101/336 (30%), Positives = 164/336 (48%), Gaps = 33/336 (9%)

Query: 22  IARIQPDPFSEERRNAVAAYVRRLIIQCF----PCQVFTFGSVPLKTYLPDRDIDLGAFS 77
           + +I P   SE +R  +   V+ LI +         V  +GS PLKTYLPD DID+    
Sbjct: 29  LNKIGPTQESERKRVKIFEQVKFLIEKALGGKSQVMVIRYGSDPLKTYLPDSDIDITVIR 88

Query: 78  DD--------QTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV 128
            D        Q    T   L++  +E   + ++ +  VK +  I QA+V+IIK    N  
Sbjct: 89  RDYLQGNQTNQLTALTQLKLIKKEIEIFGETQNGKNFVKSMVLIDQADVEIIKLNFQNTF 148

Query: 129 VDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYAL 188
           VDI+  Q+GG+CTL F++ +   I +  L K+SIIL+KAW  Y++ ILG     +++YAL
Sbjct: 149 VDISIKQVGGICTLYFMNYMAKRIGKQQLLKKSIILLKAWFTYDASILGSQAACMATYAL 208

Query: 189 VTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRK 248
             +VL+I + F      P++V+  F + +S FDW+N  ++++GP+  S   +   E    
Sbjct: 209 YVMVLFILNNFYDELNSPMDVIMMFFKVWSHFDWENNIVTIFGPIKSSGFYERLKECQFD 268

Query: 249 DGGVLLLSKSFLDSCRY--------------------AYADFPGGQENQGQPFVSKHFNV 288
              + +L +S     +Y                      +D         + F +K+FN+
Sbjct: 269 IDRLTMLDRSLHQEYQYRKLLVTPDELSFLNLQFSGVRLSDVSSYNLANKKSFNTKYFNI 328

Query: 289 IDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKGLARL 324
           IDP    NNLG+S+SK N  RI+     +   + ++
Sbjct: 329 IDPTFSKNNLGKSISKLNSSRIKQVLRLQNMKMRQI 364


>gi|325189429|emb|CCA23919.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 1193

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 103/334 (30%), Positives = 167/334 (50%), Gaps = 46/334 (13%)

Query: 13   KAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQV----FTFGSVPLKTYLPD 68
            + E    +LI  + P   +++ R  + AY+R L+   FP       F  GS P KTYLPD
Sbjct: 721  RVETSVKKLIHALSPTHEADQARCNILAYLRHLLELQFPRSSSILFFPTGSFPCKTYLPD 780

Query: 69   RDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNE-HAEFR--------------------- 106
             D+D+      ++++ TW   V  ML     N+ HAE +                     
Sbjct: 781  ADLDVCLLVP-RSMEPTWFFSVVQMLCFAATNDVHAEPKHSLESVQAPSWMNSTSSTGNT 839

Query: 107  VKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIK 166
            V+ V +I A+V+++KC +DN  VDI  N++G L  L  LD  D  +  +HLFK+S++LIK
Sbjct: 840  VRNVTFINADVRVVKCTIDNVAVDITVNRVGALGALVLLDTFDLRVGRHHLFKQSLVLIK 899

Query: 167  AWCYYE-------SRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSK 219
            AWC  +         +LG  +G  S+YA+ T+V+ +F+ +      PLE L+ FL+  ++
Sbjct: 900  AWCALDCLEGGQGCGVLGSKNGAFSTYAVNTMVMTLFNRWGYRIQHPLEALHLFLDIMTQ 959

Query: 220  FDWDNFCLSLWGPVPIS-LLPDVTAE--PP--RKDGGVLLLSKSFLDSCRYAYADFPG-- 272
            F W     +++GPV  + L  ++++   PP         L+++  ++  R    ++ G  
Sbjct: 960  FPWQECAWTIFGPVLFTQLYQNLSSRIVPPGWETASANCLITREDIEQIRVCLNEYFGSF 1019

Query: 273  ----GQENQGQPFVSKHFNVIDPLRVNNNLGRSV 302
                G E     F  + FN+IDPL++ NNL RSV
Sbjct: 1020 DVSLGTETNA-VFPLRSFNMIDPLQLGNNLARSV 1052


>gi|348683529|gb|EGZ23344.1| hypothetical protein PHYSODRAFT_485178 [Phytophthora sojae]
          Length = 793

 Score =  140 bits (354), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 107/381 (28%), Positives = 165/381 (43%), Gaps = 77/381 (20%)

Query: 21  LIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQV----FTFGSVPLKTYLPDRDIDLGAF 76
           LI  + P   ++  R  V ++V+++I   FP       F  GS P+KTYLP  D+D+   
Sbjct: 255 LIEWMGPSDAADRVRQQVLSFVQQVITAHFPLAAAPLFFATGSYPMKTYLPGSDLDICLL 314

Query: 77  SDDQTLKDTWAHLVRDMLENEEKNEHAEF------------------------------- 105
              Q L+ +W  +V   L     +  A                                 
Sbjct: 315 VP-QELESSWYFIVTQALCIAGGSGGAGTVLDVGNPGGSVDGSGSSSPSGPAVGSGSSGA 373

Query: 106 -----RVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKR 160
                 V+ V +I A+V+++KC VDN  VD   N++G L  +  LD +   +   HLFK+
Sbjct: 374 LLLTNTVRNVTFINADVRVVKCTVDNIPVDFTANRVGALGAVRLLDAMAVRVGRQHLFKK 433

Query: 161 SIILIKAWCYYES-------------------------RILGGHHGLISSYALVTLVLYI 195
           S+ILIKAWC +ES                          ++G  HG +S+YA+ T+V+ +
Sbjct: 434 SLILIKAWCTHESSPFMQAASVECGGLGPSVVPGSTPTSVMGASHGALSTYAVNTIVMAL 493

Query: 196 FHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLL---PDVTAEPPRKDGGV 252
           F+    +   PL+ LY FL+  ++F W    L+L G VP+S L   P     P +     
Sbjct: 494 FNQHGDALTHPLQALYLFLDRLAEFPWHEAALTLHGAVPLSRLATTPLNGTTPSKSKLKA 553

Query: 253 LLLSKSFLDSCRYAYAD----FPGG-QENQGQP---FVSKHFNVIDPLRVNNNLGRSVSK 304
             L    +++ R   +D    F  G +  +  P   F  +  N++DPL   NNL RSVS 
Sbjct: 554 AKLDAGDVEAIRDTLSDQFGAFDAGLRSGKSAPTGLFPIRACNIVDPLDDKNNLARSVSA 613

Query: 305 GNFFRIRTAFTFRAKGLARLL 325
             F  ++ AF      LA +L
Sbjct: 614 EGFPVMKRAFRLARDQLAAML 634


>gi|253744327|gb|EET00549.1| Topoisomerase I-related protein [Giardia intestinalis ATCC 50581]
          Length = 511

 Score =  130 bits (328), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 70/224 (31%), Positives = 119/224 (53%), Gaps = 5/224 (2%)

Query: 25  IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
           + P   S   R  +  Y+R  +   FP  Q+  +GS   + +LPD D+DL     +    
Sbjct: 45  LAPSEDSISCRYQIIKYIRDELHSIFPELQLIPYGSFVTRIFLPDGDVDLSIIVAEDDAN 104

Query: 84  DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLC 143
           D ++     + E     EHA F++  +  IQAE+ II+ +++   +DI+  +  GL T  
Sbjct: 105 DVFSQFYTHLKEIASSQEHATFKITNLSKIQAEMSIIRLVINGIFIDISAARPTGLVTSL 164

Query: 144 FLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSF 203
           ++  ++  I  N+L KRS+IL++AW  YE+ ILG H  +++SYAL  +  +I    +   
Sbjct: 165 YIQLLNDSIGRNNLLKRSVILVQAWSLYEAHILGSHSQMLNSYALRVMTAFIL-TNSPEL 223

Query: 204 AGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPR 247
             PL+VL++F  F+S FD+ N  ++ +G +P S   +V    PR
Sbjct: 224 VHPLQVLFKFFAFYSTFDFTNNTITAFGVIPNS---EVDGSDPR 264


>gi|308159127|gb|EFO61675.1| Topoisomerase I-related protein [Giardia lamblia P15]
          Length = 512

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 71/213 (33%), Positives = 113/213 (53%), Gaps = 2/213 (0%)

Query: 25  IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
           + P   S   R  +  Y+R  +   FP  Q+  +GS   + +LPD DIDL     +    
Sbjct: 45  LAPTEDSITYRYQIIKYIRDKLHDLFPELQLIPYGSFVTRIFLPDGDIDLAIIVGEDDAA 104

Query: 84  DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLC 143
           D        + +     E   FRV  +  IQAEV II+ +++   +DI+  +  GL T  
Sbjct: 105 DVLTQFYIHLKDIVASQEDTPFRVTNLSKIQAEVPIIRLVINGIFIDISSARPVGLVTSL 164

Query: 144 FLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSF 203
           +L  ++  I  N+L KRS+ILI+AWC YE+ ILG H  +++SYAL  + ++I    +   
Sbjct: 165 YLQLLNDAIGRNNLLKRSVILIQAWCLYEAHILGSHSQMLNSYALRVMTIFIL-TNSPEL 223

Query: 204 AGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPIS 236
             PL+VL++F  F+S FD+ N  ++ +G +P S
Sbjct: 224 VHPLQVLFKFFAFYSAFDFTNNTITAFGVIPNS 256


>gi|159108047|ref|XP_001704297.1| Topoisomerase I-related protein [Giardia lamblia ATCC 50803]
 gi|157432356|gb|EDO76623.1| Topoisomerase I-related protein [Giardia lamblia ATCC 50803]
          Length = 512

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 70/203 (34%), Positives = 111/203 (54%), Gaps = 2/203 (0%)

Query: 35  RNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDM 93
           R  +  Y+R  +   FP  Q+  +GS   + +LPD DIDL     +    D  A     +
Sbjct: 55  RYQIIKYIRDKLHSLFPELQLIPYGSFVTRIFLPDGDIDLAIIVGEDDAADVLAQFYIYL 114

Query: 94  LENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLIN 153
            E    +E   F++  +  IQAEV II+ +++   +DI+  +  GL T  +L  ++  I 
Sbjct: 115 KEVAASHEDTPFKLTNLSKIQAEVPIIRLVINGVFIDISSARPVGLVTSLYLQLLNDAIG 174

Query: 154 ENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRF 213
            N+L KRS+ILI+AWC YE+ ILG H  +++SYAL  +  +I    +     PL+VL++F
Sbjct: 175 RNNLLKRSVILIQAWCLYEAHILGSHSQMLNSYALRVMTTFIL-TNSPELVHPLQVLFKF 233

Query: 214 LEFFSKFDWDNFCLSLWGPVPIS 236
             F+S FD+ N  ++ +G VP S
Sbjct: 234 FAFYSAFDFTNNTITAFGVVPNS 256


>gi|242051292|ref|XP_002463390.1| hypothetical protein SORBIDRAFT_02g042970 [Sorghum bicolor]
 gi|241926767|gb|EER99911.1| hypothetical protein SORBIDRAFT_02g042970 [Sorghum bicolor]
          Length = 208

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 59/126 (46%), Positives = 88/126 (69%), Gaps = 1/126 (0%)

Query: 24  RIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
           R+ P   +E RR  V +Y+RRLI     C+VF FGSVPL+TYLPD D+D+    +   L 
Sbjct: 83  RVHPTQEAERRRQDVISYLRRLIGSSLGCEVFAFGSVPLRTYLPDGDVDITVLGNTW-LN 141

Query: 84  DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLC 143
            T+   VR MLE+E++N  AEF++  + +I AEVK+IKC+++N +VD++FNQ+GG+ T C
Sbjct: 142 STFIDDVRSMLESEQENCDAEFKLTGLHFINAEVKLIKCIIENIIVDVSFNQIGGVSTFC 201

Query: 144 FLDEVD 149
           FL+ ++
Sbjct: 202 FLELIN 207


>gi|253742434|gb|EES99267.1| Hypothetical protein GL50581_3482 [Giardia intestinalis ATCC 50581]
          Length = 711

 Score =  124 bits (310), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 74/239 (30%), Positives = 121/239 (50%), Gaps = 23/239 (9%)

Query: 18  TAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAF 76
           T  +I+ + PD  SEE R  +  ++ ++I    P   +  +GS   K YLP  D+D+  +
Sbjct: 50  TDYIISLVSPDRASEEFRLKIFTFISKVIDVVLPNTLIVPYGSFISKIYLPSSDLDICCY 109

Query: 77  S-------------------DDQTLKDTWAHL--VRDMLENEEKNEHAEFRVKEVQYIQA 115
           +                    D  L+ T   +  V   L N   +      ++ +++I A
Sbjct: 110 NHSIDEIPLLQKILEALMVFSDPNLQSTGTRVSPVVSQLINSHISADERLELENIEFIMA 169

Query: 116 EVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRI 175
           +V +IKC V    VDI+  Q G L T   ++++   I  N+L KRS +LI++WC YE+RI
Sbjct: 170 KVSLIKCTVCGLGVDISAAQPGSLVTSLLIEKLSQSIGRNNLLKRSFLLIQSWCLYEARI 229

Query: 176 LGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVP 234
           +GGH  ++SSYAL  +++ I  +       P +VLY FL ++S FD+D   +   GP+P
Sbjct: 230 VGGHSQMLSSYALRVMIINIL-INCKDIYTPFQVLYVFLAYYSNFDYDRNIIHPSGPLP 287



 Score = 47.4 bits (111), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 26/71 (36%), Positives = 38/71 (53%), Gaps = 6/71 (8%)

Query: 244 EPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVS 303
           EP   D  VL L K +  +  +          +  Q F+  + +++DPL+V NNLGRSVS
Sbjct: 546 EPVLNDHDVLFLLKRYFSTSTFTDV------LSDSQVFLPSYISIVDPLQVTNNLGRSVS 599

Query: 304 KGNFFRIRTAF 314
           + NF RI  +F
Sbjct: 600 EPNFMRITRSF 610


>gi|159115240|ref|XP_001707843.1| Hypothetical protein GL50803_17166 [Giardia lamblia ATCC 50803]
 gi|157435951|gb|EDO80169.1| hypothetical protein GL50803_17166 [Giardia lamblia ATCC 50803]
          Length = 731

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 74/241 (30%), Positives = 123/241 (51%), Gaps = 33/241 (13%)

Query: 21  LIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFS-- 77
           +++ + PD  SEE R  +  ++ ++I    P   +  +GS   K YLP  D+D+  F+  
Sbjct: 68  IVSLVSPDKASEEFRLKIFTFISKVIEAVLPNTLIVPYGSFISKIYLPSSDLDICCFNHG 127

Query: 78  -----------------DDQTLKDTW-------AHLVRDMLENEEKNEHAEFRVKEVQYI 113
                             D +L+ T        + L+   +  EE+ E     ++ +++I
Sbjct: 128 LDEIPLLQKILEALTVFSDPSLRPTGVRVPPAVSQLINSRIPTEERLE-----LENIEFI 182

Query: 114 QAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYES 173
            A+V +IKC V    VDI+  Q G L T   ++++   I  N+L KRS +LI++WC YE+
Sbjct: 183 MAKVSLIKCTVCGLGVDISAAQPGSLVTSLLIEKLSQSIGRNNLLKRSFLLIQSWCLYEA 242

Query: 174 RILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPV 233
           RI+GGH  ++SSYAL  +V+ I       +  P + LY FL ++S FD+D   +   GP+
Sbjct: 243 RIVGGHSQMLSSYALRVMVINILLNCRDIYT-PFQALYVFLAYYSSFDYDRDIVHPSGPL 301

Query: 234 P 234
           P
Sbjct: 302 P 302



 Score = 45.1 bits (105), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 26/71 (36%), Positives = 40/71 (56%), Gaps = 6/71 (8%)

Query: 244 EPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVS 303
           EP   D  +LLL K +     ++    P    +  + F+  + +++DPL+V NNLGRSVS
Sbjct: 568 EPNLNDHDILLLFKRY-----FSMGTLPNVLSD-SRAFLPSYISIVDPLQVINNLGRSVS 621

Query: 304 KGNFFRIRTAF 314
           + NF RI  +F
Sbjct: 622 EPNFMRITRSF 632


>gi|308163112|gb|EFO65472.1| Hypothetical protein GLP15_5146 [Giardia lamblia P15]
          Length = 719

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 72/236 (30%), Positives = 118/236 (50%), Gaps = 23/236 (9%)

Query: 21  LIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFS-- 77
           +++ + PD  SEE R  +  ++ R+I    P   +  +GS   K YLP  D+D+  ++  
Sbjct: 53  IVSLVSPDKASEEFRLKIFTFISRVIEAVLPNTLIVPYGSFISKIYLPSSDLDICCYNHG 112

Query: 78  -----------------DDQTLKDTWAHL--VRDMLENEEKNEHAEFRVKEVQYIQAEVK 118
                             D +L+ T   +      L N   +      ++ +++I A+V 
Sbjct: 113 LDEIPLLQKILEALTIFSDPSLRPTGVRVSPAVSQLINSRISAEERLELENIEFIMAKVS 172

Query: 119 IIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGG 178
           +IKC V    VDI+  Q G L T   ++++   I  N+L KRS +LI++WC YE+RI+GG
Sbjct: 173 LIKCTVCGLGVDISAAQPGSLVTSLLIEKLSQSIGRNNLLKRSFLLIQSWCLYEARIVGG 232

Query: 179 HHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVP 234
           H  ++SSYAL  +V+ I       +  P + LY FL ++S FD+D   +   GP P
Sbjct: 233 HSQMLSSYALRVMVINILLNCKDIYT-PFQALYVFLAYYSTFDYDKNIVHPSGPFP 287



 Score = 46.2 bits (108), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 27/71 (38%), Positives = 41/71 (57%), Gaps = 6/71 (8%)

Query: 244 EPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVS 303
           EP   D  +LLL K +     ++   FP    +  + F+  + +++DPL+V NNLGRSVS
Sbjct: 556 EPVINDHDILLLLKRY-----FSMGTFPDVLSD-SRVFLPSYISIVDPLQVINNLGRSVS 609

Query: 304 KGNFFRIRTAF 314
           + NF RI  +F
Sbjct: 610 EPNFMRITRSF 620


>gi|2651305|gb|AAB87585.1| hypothetical protein [Arabidopsis thaliana]
          Length = 384

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 83/228 (36%), Positives = 116/228 (50%), Gaps = 56/228 (24%)

Query: 5   PLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQ---------------- 48
           P++   WL AE    E++  IQP+  +E  RN + + ++ L+ +                
Sbjct: 27  PIEAEVWLIAEARAQEILCAIQPNYLAERSRNKIISNLQTLLWERLGIEVRTFLLLLDEL 86

Query: 49  CFPCQ------VFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEH 102
            F  Q      V+ FGS+PLKTYLPD DIDL   +   + +D  A  V  +LE E  N  
Sbjct: 87  SFSLQRIRNAKVYLFGSMPLKTYLPDGDIDLTVLTHHASEEDC-ARAVCCVLEAEMGN-- 143

Query: 103 AEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSI 162
           ++ +V  VQY+QA+            VD AF +                   +HLFK+SI
Sbjct: 144 SDLQVTGVQYVQAK------------VDKAFGR-------------------DHLFKKSI 172

Query: 163 ILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVL 210
           IL+KAWC+YESRILG + GLIS+YAL  LVL I ++   S +GPL  L
Sbjct: 173 ILVKAWCFYESRILGANSGLISTYALAILVLNIVNMSYSSLSGPLAKL 220


>gi|224135259|ref|XP_002322023.1| predicted protein [Populus trichocarpa]
 gi|222869019|gb|EEF06150.1| predicted protein [Populus trichocarpa]
          Length = 85

 Score =  113 bits (283), Expect = 1e-22,   Method: Composition-based stats.
 Identities = 52/63 (82%), Positives = 54/63 (85%)

Query: 154 ENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRF 213
           +NHLFKRSIILIKAWCYYESRILG HHGLIS+YAL TLVLYIFHVFN  FAGPLEV   F
Sbjct: 1   QNHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHVFNNKFAGPLEVSTAF 60

Query: 214 LEF 216
             F
Sbjct: 61  WNF 63


>gi|224135265|ref|XP_002322024.1| predicted protein [Populus trichocarpa]
 gi|222869020|gb|EEF06151.1| predicted protein [Populus trichocarpa]
          Length = 122

 Score =  111 bits (278), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 63/117 (53%), Positives = 73/117 (62%), Gaps = 30/117 (25%)

Query: 1   SVIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSV 60
           SV + L+P RW  AEE TAELIA IQP+  SEERRNAV  YV+RLI+ CFPCQ       
Sbjct: 25  SVTQALEPERWATAEERTAELIACIQPNQPSEERRNAVLCYVQRLIMNCFPCQ------- 77

Query: 61  PLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEV 117
                                  +TWA+ VRD+LE+EEKNE+AEF VKEVQYIQAEV
Sbjct: 78  -----------------------ETWANEVRDILEHEEKNENAEFHVKEVQYIQAEV 111


>gi|261333426|emb|CBH16421.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
           DAL972]
          Length = 1120

 Score =  107 bits (266), Expect = 1e-20,   Method: Composition-based stats.
 Identities = 54/155 (34%), Positives = 87/155 (56%), Gaps = 26/155 (16%)

Query: 113 IQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYE 172
           + AEV+++K ++D    DI   QLGG+  + FL E+D  I  NHL KR+++L+KAWC YE
Sbjct: 483 VVAEVRVLKLVMDGSSYDITVGQLGGVSCIRFLHEMDMKIGCNHLLKRTLLLMKAWCCYE 542

Query: 173 SRILGGHHGLISSYALVTLVLYIFHVFN------------------------GSFA--GP 206
           + +L G  G ISSYA   +++ + +                           G +    P
Sbjct: 543 AHVLSGQGGYISSYAATVMIISMINTVEFLEDVEREERGGEGDGKHLDERQRGEYQHISP 602

Query: 207 LEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDV 241
           L++  RFL++FS FD++++CL+L+GPVP   + +V
Sbjct: 603 LQLFARFLKYFSYFDFESYCLTLFGPVPCDKINNV 637


>gi|71748824|ref|XP_823467.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|70833135|gb|EAN78639.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 1120

 Score =  107 bits (266), Expect = 1e-20,   Method: Composition-based stats.
 Identities = 54/155 (34%), Positives = 87/155 (56%), Gaps = 26/155 (16%)

Query: 113 IQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYE 172
           + AEV+++K ++D    DI   QLGG+  + FL E+D  I  NHL KR+++L+KAWC YE
Sbjct: 483 VVAEVRVLKLVMDGSSYDITVGQLGGVSCIRFLHEMDMKIGCNHLLKRTLLLMKAWCCYE 542

Query: 173 SRILGGHHGLISSYALVTLVLYIFHVFN------------------------GSFA--GP 206
           + +L G  G ISSYA   +++ + +                           G +    P
Sbjct: 543 AHVLSGQGGYISSYAATVMIISMINTVEFLEDVEREERGGEGDGKHLEERQRGEYQHISP 602

Query: 207 LEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDV 241
           L++  RFL++FS FD++++CL+L+GPVP   + +V
Sbjct: 603 LQLFARFLKYFSYFDFESYCLTLFGPVPCDKINNV 637


>gi|298707565|emb|CBJ30149.1| nucleotidyltransferase family protein [Ectocarpus siliculosus]
          Length = 1301

 Score =  106 bits (265), Expect = 1e-20,   Method: Composition-based stats.
 Identities = 56/140 (40%), Positives = 83/140 (59%), Gaps = 4/140 (2%)

Query: 97  EEKNEHAEFRVKEVQYIQ-AEVKIIKCLVDNFV-VDIAFNQLGGLCTLCFLDEVDHLINE 154
           +E+      R+  V +I    V+ IKC+VDN V VDI  NQ+G + T+  L+E D L+ +
Sbjct: 745 KEEGSSYRHRLSNVNFINMGRVQKIKCVVDNQVAVDIGANQVGDIATVALLEETDQLLGK 804

Query: 155 NHLFKRSIILIKAWCYYESRILGGHHGL--ISSYALVTLVLYIFHVFNGSFAGPLEVLYR 212
           +HLFKRS++LIK+W  YESR   G + L  I+  AL T+VL + +  +     PL+V+  
Sbjct: 805 DHLFKRSLLLIKSWWVYESRAYTGSNMLSRITESALATMVLAVVNQHHARLHTPLQVMAL 864

Query: 213 FLEFFSKFDWDNFCLSLWGP 232
           F +  S FDW  +C  + GP
Sbjct: 865 FFQMHSHFDWSRYCWCIEGP 884



 Score = 39.7 bits (91), Expect = 1.9,   Method: Composition-based stats.
 Identities = 17/57 (29%), Positives = 31/57 (54%)

Query: 20  ELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDIDLGAF 76
           +L+  ++P P +E  R +V  +V R + +    Q F  G   ++ YLPD ++ + AF
Sbjct: 621 DLLRLLRPAPRAEGYRRSVFRFVTRQVKRALGAQCFPVGGYAIQAYLPDEEVGISAF 677


>gi|298710234|emb|CBJ26309.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 1317

 Score =  105 bits (263), Expect = 2e-20,   Method: Composition-based stats.
 Identities = 60/143 (41%), Positives = 87/143 (60%), Gaps = 8/143 (5%)

Query: 101 EHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKR 160
           E A+  ++ V  I A   I+  +V N VVD+  NQ G +     L+E D+LI  NHLFKR
Sbjct: 111 ETAKPEIRNVSLINARTPIVTMVVGNVVVDLTENQGGSVAASALLEEADNLIQRNHLFKR 170

Query: 161 SIILIKAWCYYES------RILGGHHGLISSYALVTLVLYIFHVFNGSFA--GPLEVLYR 212
           S++L+KAW + E+      R+LG   G ++SY L  +VL++F     + A   PL+VL R
Sbjct: 171 SLLLLKAWAWCETPRLVGNRVLGARKGGLTSYGLSVMVLHLFAASASADALVHPLDVLIR 230

Query: 213 FLEFFSKFDWDNFCLSLWGPVPI 235
           F E +S+FDW  +CL+L GPVP+
Sbjct: 231 FFEVYSEFDWARYCLTLDGPVPL 253


>gi|224114896|ref|XP_002316887.1| predicted protein [Populus trichocarpa]
 gi|222859952|gb|EEE97499.1| predicted protein [Populus trichocarpa]
          Length = 199

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 60/78 (76%), Positives = 68/78 (87%)

Query: 70  DIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVV 129
           DIDL AFS++  LKDTWA  V DMLENEE NE+AEF VKEV+YIQAEVKIIKCLV+N VV
Sbjct: 19  DIDLTAFSENPNLKDTWAPQVCDMLENEENNENAEFGVKEVEYIQAEVKIIKCLVENIVV 78

Query: 130 DIAFNQLGGLCTLCFLDE 147
           DI+FNQLGGL TLCFL++
Sbjct: 79  DISFNQLGGLFTLCFLEK 96



 Score = 80.9 bits (198), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 36/50 (72%), Positives = 42/50 (84%)

Query: 255 LSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSK 304
           LSK FL++C   YA  P GQ+NQGQPF+SKHFNVIDPLR+NNNLG SV+K
Sbjct: 106 LSKLFLEACSAIYAVLPAGQDNQGQPFLSKHFNVIDPLRINNNLGHSVNK 155


>gi|342184813|emb|CCC94295.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
          Length = 1108

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 50/148 (33%), Positives = 82/148 (55%), Gaps = 26/148 (17%)

Query: 113 IQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYE 172
           + AEV+++K ++D    D+   QLGG+  + FL E+D  +   HL KR+++L+KAWC YE
Sbjct: 472 VVAEVRVLKLVMDGSSYDVTVGQLGGVSCIRFLHEMDMRVGCEHLLKRTLLLMKAWCCYE 531

Query: 173 SRILGGHHGLISSYALVTLVLYIFHVF--------NGSFA------------------GP 206
           + +L G  G +SSYA   +++ + +           GS                     P
Sbjct: 532 AHVLSGQGGYMSSYAATVMLITMINTVEFLEDVEAEGSDGKTCSNCPEGHKSEGHVQISP 591

Query: 207 LEVLYRFLEFFSKFDWDNFCLSLWGPVP 234
           L++  RFL+++S FD+D +CL+L+GPVP
Sbjct: 592 LQLFARFLKYYSYFDFDRYCLTLFGPVP 619


>gi|452823525|gb|EME30535.1| nucleotidyltransferase [Galdieria sulphuraria]
          Length = 1412

 Score = 97.4 bits (241), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 69/263 (26%), Positives = 112/263 (42%), Gaps = 57/263 (21%)

Query: 27  PDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDIDLGAF--SDDQTLKD 84
           P  FSE RR AV   V  +I +    Q F +GS   KTY  D  +++GAF    + T  +
Sbjct: 710 PTSFSELRREAVFRVVASIIKRSIGAQAFCYGSFATKTYHADSILEIGAFLVGKNDTAAE 769

Query: 85  TWAHLVRDMLEN-----EEKNEHAEFR--------------VKEVQYIQAEVKIIKC--- 122
             A L+  + E+     +  +   EF               V+ + Y + +     C   
Sbjct: 770 WSAKLMAALCEDATLASDHSSSSLEFSYLSLIQQKHPVPLPVRNISYFRPKPTPSGCQPP 829

Query: 123 -----------------------------LVDNFVVDIAFNQLGGLCTLCFLDEVDHLIN 153
                                        +  N  V +  N + G+ T C L+E DH + 
Sbjct: 830 PAVTFTVNWPIEDPRSGLVALDTNSTERDIAPNVRVSVTLNHVAGIHTACVLEEFDHAMG 889

Query: 154 ENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRF 213
            NHLFKRS++L++ W  Y  ++      ++ S A+  LV+++ + F+ S   P ++LYRF
Sbjct: 890 RNHLFKRSLLLVRTWVDYGVKLT----DILPSRAVEVLVVFVANCFHSSIETPFDLLYRF 945

Query: 214 LEFFSKFDWDNFCLSLWGPVPIS 236
           L +F  FDW  F L   G + ++
Sbjct: 946 LTYFVHFDWRKFGLCETGIIDLA 968


>gi|340057832|emb|CCC52183.1| conserved hypothetical protein [Trypanosoma vivax Y486]
          Length = 1145

 Score = 94.0 bits (232), Expect = 9e-17,   Method: Composition-based stats.
 Identities = 48/148 (32%), Positives = 76/148 (51%), Gaps = 28/148 (18%)

Query: 121 KCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHH 180
           K ++D    D+   QLGG+  + FL +VD  I   HL KR+++L+KAWC YE+ +L G  
Sbjct: 516 KLIMDGNSYDVTVGQLGGVSCIRFLHQVDTKIGCGHLLKRTLLLMKAWCCYEAHVLSGQG 575

Query: 181 GLISSYALVTLVLYIF-----------------------HVFNGSFAG-----PLEVLYR 212
           G +SSYA   +++ +                        H   G         PL++  R
Sbjct: 576 GYMSSYAATVMLIAMINTIEFLEDAESEACTELEEPARTHALEGRLGALNGVSPLQLFAR 635

Query: 213 FLEFFSKFDWDNFCLSLWGPVPISLLPD 240
           FL++FS FD++ +C++L+GPVP   + D
Sbjct: 636 FLKYFSCFDFERYCVTLFGPVPCEKIND 663


>gi|224064842|ref|XP_002301578.1| predicted protein [Populus trichocarpa]
 gi|222843304|gb|EEE80851.1| predicted protein [Populus trichocarpa]
          Length = 60

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 37/47 (78%), Positives = 42/47 (89%)

Query: 105 FRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHL 151
           FRVK+V+YIQAEVKIIKCLV N VVDI+FNQLGGL TLCFL++V  L
Sbjct: 13  FRVKKVEYIQAEVKIIKCLVKNIVVDISFNQLGGLFTLCFLEKVSAL 59


>gi|389601018|ref|XP_001564070.2| conserved hypothetical protein [Leishmania braziliensis
            MHOM/BR/75/M2904]
 gi|322504611|emb|CAM38122.2| conserved hypothetical protein [Leishmania braziliensis
            MHOM/BR/75/M2904]
          Length = 2016

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 33/85 (38%), Positives = 55/85 (64%)

Query: 113  IQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYE 172
            + AEV+++K  ++    DI   Q GG+  + FL E+D +I + H+ KR+++L+KAWC YE
Sbjct: 1055 VMAEVRVLKLAMEGCNYDITIGQFGGVNCVRFLHEMDAVIGDQHVLKRTLLLLKAWCCYE 1114

Query: 173  SRILGGHHGLISSYALVTLVLYIFH 197
            + ILGG  G I SYA   +++ + +
Sbjct: 1115 AHILGGQAGYIGSYAATVMLISMLN 1139


>gi|398013931|ref|XP_003860157.1| hypothetical protein, conserved [Leishmania donovani]
 gi|322498376|emb|CBZ33450.1| hypothetical protein, conserved [Leishmania donovani]
          Length = 2047

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 33/85 (38%), Positives = 55/85 (64%)

Query: 113  IQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYE 172
            + AEV+++K  ++    DI   Q GG+  + FL E+D +I + H+ KR+++L+KAWC YE
Sbjct: 1058 VMAEVRVLKLAMEGCNYDITIGQFGGVNCVRFLHEMDAVIGDQHVLKRTLLLLKAWCCYE 1117

Query: 173  SRILGGHHGLISSYALVTLVLYIFH 197
            + ILGG  G I SYA   +++ + +
Sbjct: 1118 AHILGGQAGYIGSYAATVMLISMLN 1142



 Score = 42.4 bits (98), Expect = 0.37,   Method: Composition-based stats.
 Identities = 16/40 (40%), Positives = 28/40 (70%), Gaps = 3/40 (7%)

Query: 206  PLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEP 245
            PL +  RFL+F++ FD+D +C++ +GP+P   L  +T+ P
Sbjct: 1269 PLTLFARFLKFYAYFDFDRYCVTAFGPLP---LHKITSTP 1305


>gi|401419332|ref|XP_003874156.1| conserved hypothetical protein [Leishmania mexicana
            MHOM/GT/2001/U1103]
 gi|322490390|emb|CBZ25650.1| conserved hypothetical protein [Leishmania mexicana
            MHOM/GT/2001/U1103]
          Length = 2020

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 33/85 (38%), Positives = 55/85 (64%)

Query: 113  IQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYE 172
            + AEV+++K  ++    DI   Q GG+  + FL E+D +I + H+ KR+++L+KAWC YE
Sbjct: 1051 VMAEVRVLKLAMEGCNYDITIGQFGGVNCVRFLHEMDAVIGDQHVLKRTLLLLKAWCCYE 1110

Query: 173  SRILGGHHGLISSYALVTLVLYIFH 197
            + ILGG  G I SYA   +++ + +
Sbjct: 1111 AHILGGQAGYIGSYAATVMLISMLN 1135



 Score = 42.4 bits (98), Expect = 0.37,   Method: Composition-based stats.
 Identities = 16/40 (40%), Positives = 28/40 (70%), Gaps = 3/40 (7%)

Query: 206  PLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEP 245
            PL +  RFL+F++ FD+D +C++ +GP+P   L  +T+ P
Sbjct: 1265 PLTLFARFLKFYAYFDFDRYCVTAFGPLP---LHKITSTP 1301


>gi|339897903|ref|XP_001464956.2| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|321399300|emb|CAM67197.2| conserved hypothetical protein [Leishmania infantum JPCM5]
          Length = 2047

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 33/85 (38%), Positives = 55/85 (64%)

Query: 113  IQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYE 172
            + AEV+++K  ++    DI   Q GG+  + FL E+D +I + H+ KR+++L+KAWC YE
Sbjct: 1058 VMAEVRVLKLAMEGCNYDITIGQFGGVNCVRFLHEMDAVIGDQHVLKRTLLLLKAWCCYE 1117

Query: 173  SRILGGHHGLISSYALVTLVLYIFH 197
            + ILGG  G I SYA   +++ + +
Sbjct: 1118 AHILGGQAGYIGSYAATVMLISMLN 1142



 Score = 42.4 bits (98), Expect = 0.37,   Method: Composition-based stats.
 Identities = 16/40 (40%), Positives = 28/40 (70%), Gaps = 3/40 (7%)

Query: 206  PLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEP 245
            PL +  RFL+F++ FD+D +C++ +GP+P   L  +T+ P
Sbjct: 1269 PLTLFARFLKFYAYFDFDRYCVTAFGPLP---LHKITSTP 1305


>gi|157868001|ref|XP_001682554.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|68126008|emb|CAJ04245.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 1964

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 33/85 (38%), Positives = 55/85 (64%)

Query: 113  IQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYE 172
            + AEV+++K  ++    DI   Q GG+  + FL E+D +I + H+ KR+++L+KAWC YE
Sbjct: 971  VMAEVRVLKLAMEGCNYDITIGQFGGVNCVRFLHEMDAVIGDQHVLKRTLLLLKAWCCYE 1030

Query: 173  SRILGGHHGLISSYALVTLVLYIFH 197
            + ILGG  G I SYA   +++ + +
Sbjct: 1031 AHILGGQAGYIGSYAATVMLISMLN 1055



 Score = 42.4 bits (98), Expect = 0.37,   Method: Composition-based stats.
 Identities = 16/40 (40%), Positives = 28/40 (70%), Gaps = 3/40 (7%)

Query: 206  PLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEP 245
            PL +  RFL+F++ FD+D +C++ +GP+P   L  +T+ P
Sbjct: 1184 PLTLFARFLKFYAYFDFDRYCVTAFGPLP---LHKITSTP 1220


>gi|71652853|ref|XP_815075.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70880102|gb|EAN93224.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 1276

 Score = 71.2 bits (173), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 52/152 (34%), Positives = 86/152 (56%), Gaps = 26/152 (17%)

Query: 113 IQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYE 172
           + AEV+++K +++    DI   QLGG+  + FL E+D LI   HL KR+++L+KAWC YE
Sbjct: 565 VVAEVRVLKLVMEGSCFDITVGQLGGVVCVRFLQEMDMLIGCQHLLKRTLLLLKAWCCYE 624

Query: 173 SRILGGHHGLISSYALVTLVLYIFHVFN-----GSFA---------------------GP 206
           + IL G  G +SSYA   +++ + +        GS                        P
Sbjct: 625 AHILSGQGGYLSSYAATIMLISMMNTVEFLEDLGSVEEREEDGEAHLGCEPHESLKNISP 684

Query: 207 LEVLYRFLEFFSKFDWDNFCLSLWGPVPISLL 238
           L++  RFL+FFS FD++++C++++GP+P + L
Sbjct: 685 LQLFARFLKFFSFFDFEHYCVTVFGPLPCACL 716


>gi|407407321|gb|EKF31173.1| hypothetical protein MOQ_004991 [Trypanosoma cruzi marinkellei]
          Length = 1349

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 52/150 (34%), Positives = 85/150 (56%), Gaps = 26/150 (17%)

Query: 115 AEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESR 174
           AEV+++K +++    DI   QLGG+  + FL E+D LI   HL KR+++L+KAWC YE+ 
Sbjct: 596 AEVRVLKLVMEGSCFDITVGQLGGVECVRFLQEMDMLIGCQHLLKRTLLLLKAWCCYEAH 655

Query: 175 ILGGHHGLISSYALVTLVLYIFHVF------------------------NGSFA--GPLE 208
           IL G  G +SSYA   +++ + +                            SF    PL+
Sbjct: 656 ILSGQGGYLSSYAATIMLIAMMNTVEFLEDVGSVEERDEDGEGRLGCEPQASFKNISPLQ 715

Query: 209 VLYRFLEFFSKFDWDNFCLSLWGPVPISLL 238
           +  RFL+FFS FD++++C++++GP+P + L
Sbjct: 716 LFARFLKFFSFFDFEHYCVTIFGPLPCACL 745


>gi|71408844|ref|XP_806800.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70870651|gb|EAN84949.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 1239

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 52/152 (34%), Positives = 85/152 (55%), Gaps = 26/152 (17%)

Query: 113 IQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYE 172
           + AEV+++K +++    DI   QLGG+  + FL E+D LI   HL KR+++L+KAWC YE
Sbjct: 568 VVAEVRVLKLVMEGGCFDITVGQLGGVVCVRFLQEMDMLIGCQHLLKRTLLLLKAWCCYE 627

Query: 173 SRILGGHHGLISSYALVTLVLYIFHVFN-----GSFA---------------------GP 206
           + IL G  G +SSYA   +++ + +        GS                        P
Sbjct: 628 AHILSGQGGYLSSYAATIMLIAMMNTVEFVEDVGSVEEREEDGEGHLGCEPQEFFKNISP 687

Query: 207 LEVLYRFLEFFSKFDWDNFCLSLWGPVPISLL 238
           L++  RFL+FFS FD++++C++++GP+P   L
Sbjct: 688 LQLFARFLKFFSFFDFEHYCVTIFGPLPCDCL 719


>gi|407846652|gb|EKG02680.1| hypothetical protein TCSYLVIO_006286 [Trypanosoma cruzi]
          Length = 893

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 52/152 (34%), Positives = 86/152 (56%), Gaps = 26/152 (17%)

Query: 113 IQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYE 172
           + AEV+++K +++    DI   QLGG+  + FL E+D LI   HL KR+++L+KAWC YE
Sbjct: 222 VVAEVRVLKLVMEGSCFDITVGQLGGVVCVRFLQEMDMLIGCQHLLKRTLLLLKAWCCYE 281

Query: 173 SRILGGHHGLISSYALVTLVLYIFHVFN-----GSFA---------------------GP 206
           + IL G  G +SSYA   +++ + +        GS                        P
Sbjct: 282 AHILSGQGGYLSSYAATIMLISMMNTVEFLEDLGSVEEREEDGEAHLGCEPNESLKNISP 341

Query: 207 LEVLYRFLEFFSKFDWDNFCLSLWGPVPISLL 238
           L++  RFL+FFS FD++++C++++GP+P + L
Sbjct: 342 LQLFARFLKFFSFFDFEHYCVTVFGPLPCACL 373


>gi|449533401|ref|XP_004173664.1| PREDICTED: uncharacterized LOC101209112 [Cucumis sativus]
          Length = 831

 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 29/43 (67%), Positives = 35/43 (81%)

Query: 305 GNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           GNFFRIR+AF F AK LARL +CP ED+  E+NQFF+NT +RH
Sbjct: 5   GNFFRIRSAFAFGAKRLARLFECPREDILAELNQFFLNTWERH 47


>gi|302691928|ref|XP_003035643.1| hypothetical protein SCHCODRAFT_104957 [Schizophyllum commune H4-8]
 gi|300109339|gb|EFJ00741.1| hypothetical protein SCHCODRAFT_104957, partial [Schizophyllum
           commune H4-8]
          Length = 671

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 60/220 (27%), Positives = 100/220 (45%), Gaps = 22/220 (10%)

Query: 25  IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSD--DQT 81
           I P P  +E R+ +   + R+I   FP  +V  FGS   K YLP  DIDL   S+  +Q 
Sbjct: 170 ISPTPAEDEVRSMIVLLIARIIQDKFPDAEVRPFGSYGTKLYLPHGDIDLVVQSNTLEQN 229

Query: 82  LKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLV----DNFVVDIAFNQLG 137
            K T    + D++ +      A     +VQ I A V IIK +       F +DI+ NQ  
Sbjct: 230 NKKTVLQRLADLIRS------ARLSSGKVQVIGARVPIIKFITAAEYGRFQIDISVNQFS 283

Query: 138 GLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFH 197
           GL +   ++     + +  +  RS++LI      +  +   + G + SY++V LVL    
Sbjct: 284 GLVSSDIINGFQRGM-QCPIAIRSLVLILKLYLSQRGMNEVYTGGLGSYSIVCLVLSFLQ 342

Query: 198 VFNGSFAGPLE-------VLYRFLEFFSKF-DWDNFCLSL 229
           +      G ++       +L  F E + K+ +++   +SL
Sbjct: 343 MHPKIRNGEIDPERNLGVLLLEFFELYGKYHNYEEVGVSL 382


>gi|147787660|emb|CAN69576.1| hypothetical protein VITISV_028613 [Vitis vinifera]
          Length = 192

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 25/40 (62%), Positives = 28/40 (70%)

Query: 220 FDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVLLLSKSF 259
            DWD+FC+SLWGPVPIS LPD T EPPR+    LLL    
Sbjct: 147 IDWDSFCVSLWGPVPISSLPDATTEPPRQGSRELLLDSGI 186


>gi|403419742|emb|CCM06442.1| predicted protein [Fibroporia radiculosa]
          Length = 1487

 Score = 57.8 bits (138), Expect = 8e-06,   Method: Composition-based stats.
 Identities = 52/172 (30%), Positives = 82/172 (47%), Gaps = 11/172 (6%)

Query: 25  IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
           I P P   E R+ V A + R + Q FP  +V  FGS   K YLP  DIDL   S  Q++ 
Sbjct: 167 ISPTPEENEVRSLVVALITRAVTQAFPDAEVHPFGSYDTKLYLPVGDIDLVVHS--QSMA 224

Query: 84  DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIK--CLVDNFVVDIAFNQLGGLCT 141
            +    V   + N  K      RV+ +   +A+V I+K   L  N  VDI+ NQ  G+  
Sbjct: 225 YSKKEAVLHSIANTMKRAGITDRVRIIS--KAKVPIVKFVTLHGNIPVDISINQGNGVTA 282

Query: 142 LCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVL 193
                 + H + E    +  ++++K++    S +   + G + SY++V LV+
Sbjct: 283 GTM---IKHFLAELPALRSLVLIVKSFLSQRS-MNEVYTGGLGSYSIVCLVI 330


>gi|390597612|gb|EIN07011.1| Nucleotidyltransferase [Punctularia strigosozonata HHB-11173 SS5]
          Length = 464

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 50/183 (27%), Positives = 80/183 (43%), Gaps = 23/183 (12%)

Query: 25  IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL------GAFS 77
           I P P  +E R  +   + R + Q FP  QV  FGS   K YLP  DIDL       A+S
Sbjct: 152 ISPTPAEDEIRGLIVQLISRAVTQAFPDAQVLPFGSYETKLYLPLGDIDLVIQSPSMAYS 211

Query: 78  DDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDN--FVVDIAFNQ 135
           D  T+    A+ +R           A    +     +A+V IIK +  +  F VDI+ NQ
Sbjct: 212 DKVTVLHALANTMR----------RAGITDRVTIVAKAKVPIIKFITTHGRFAVDISLNQ 261

Query: 136 LGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYI 195
             G+        ++  + E    +  +++ KA+    S +   + G + SY++V L +  
Sbjct: 262 TNGVAAGKM---INRYLRELPALRGLVMITKAFLSQRS-MNEVYTGGLGSYSIVCLAISF 317

Query: 196 FHV 198
             +
Sbjct: 318 LQM 320


>gi|392567029|gb|EIW60204.1| hypothetical protein TRAVEDRAFT_164816 [Trametes versicolor
           FP-101664 SS1]
          Length = 660

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 84/316 (26%), Positives = 127/316 (40%), Gaps = 64/316 (20%)

Query: 19  AELIARIQ---------PDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPD 68
           AE+ ARI+         P P  +E R+ V A V R + + +   QV  FGS   K YLP 
Sbjct: 164 AEMYARIEVEAFVKYISPTPIEDEVRSLVVALVSRAVTRTYTDAQVLPFGSYETKLYLPL 223

Query: 69  RDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDN-- 126
            DIDL  +S      D  + L    L N  K      RV  +   +A+V IIK +  +  
Sbjct: 224 GDIDLVIYSQSMARMDRVSVL--HSLANIVKRAGITDRVTII--AKAKVPIIKFVTTHGR 279

Query: 127 FVVDIAFNQLGGLCT----LCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGL 182
           F VDI+ NQ  G+        FL+E+  L        RS++LI      +  +     G 
Sbjct: 280 FSVDISINQGNGVTAGKMVKQFLEELPAL--------RSLVLIIKSFLSQRSMNEVFTGG 331

Query: 183 ISSYALVTLVLYIFH----VFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLL 238
           + SY++V L +        V  G       +    +EFF  +     C   +G V ISL 
Sbjct: 332 LGSYSIVCLAISFLQMHPKVRRGEIDPSKNMGVLVMEFFELYG----CYFNYGEVGISL- 386

Query: 239 PDVTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNL 298
                    +DGG      S+ +  +  + D+  GQ+        +   + DP    N++
Sbjct: 387 ---------RDGG------SYFNKTQRGWMDY--GQQ--------RLLCIEDPGDPTNDI 421

Query: 299 GRSVSKGNFFRIRTAF 314
            R     N  ++RT  
Sbjct: 422 SRGSY--NIAKVRTTL 435


>gi|145533334|ref|XP_001452417.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124420105|emb|CAK85020.1| unnamed protein product [Paramecium tetraurelia]
          Length = 361

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 60/213 (28%), Positives = 99/213 (46%), Gaps = 32/213 (15%)

Query: 29  PFSEERRNAVAAYVR-RLIIQCFPCQV--FTFGSVPLKTYLPDRDIDLGAFSDDQTLKDT 85
           P SEE R    A +R    I+ F  +V    FGS   K YLP+ DID+       + K+ 
Sbjct: 77  PTSEEHRRREQAIMRVETFIKEFASEVDIQAFGSFKTKLYLPNADIDVVMIDKSMSAKEL 136

Query: 86  WAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKC--LVDNFVVDIAFNQLGGLCTL 142
           +  + + +++++        + + V  I  A+V IIK   +   +  DI+FNQ+ GL   
Sbjct: 137 YKKVAQSLMKSD--------KFENVNLIANAKVPIIKFFEVESQYQFDISFNQMDGLKQ- 187

Query: 143 CFLDEVDHLINENHLFKRSIILIKAWCYYESRILG-GHHGLISSYALVTLVL-YIFHVFN 200
             +DE+         FK  I+++K  C  + R L   + G I S+ L  ++L ++  +  
Sbjct: 188 --IDEIRKAFTIYPEFKYLIMILK--CMLKQRELNETYSGGIGSFLLFQMILAFLREIRK 243

Query: 201 GSFAGPL----------EVLYRFLEFF-SKFDW 222
            +FA             E + RFLEF+  KFD+
Sbjct: 244 EAFANKKQEQLKNITLGEYILRFLEFYGQKFDY 276


>gi|147825319|emb|CAN73261.1| hypothetical protein VITISV_003724 [Vitis vinifera]
          Length = 106

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 21/29 (72%), Positives = 24/29 (82%)

Query: 221 DWDNFCLSLWGPVPISLLPDVTAEPPRKD 249
           DWD FC+SL GPVPIS LPD T EPPR++
Sbjct: 47  DWDGFCVSLGGPVPISSLPDATTEPPRQE 75


>gi|67989518|ref|NP_001018181.1| poly(A) polymerase Cid14 [Schizosaccharomyces pombe 972h-]
 gi|81175166|sp|Q9UTN3.2|CID14_SCHPO RecName: Full=Poly(A) RNA polymerase cid14; Short=PAP; AltName:
           Full=Caffeine-induced death protein 14; AltName:
           Full=Polynucleotide adenylyltransferase cid14
 gi|62554069|emb|CAI79317.1| poly(A) polymerase Cid14 [Schizosaccharomyces pombe]
          Length = 684

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 65/219 (29%), Positives = 97/219 (44%), Gaps = 40/219 (18%)

Query: 22  IARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQ 80
           I  I P P     R  + + + + ++Q +P   ++ FGS   K YLP  D+DL   S + 
Sbjct: 249 IDYITPTPEEHAVRKTLVSRINQAVLQKWPDVSLYVFGSFETKLYLPTSDLDLVIISPEH 308

Query: 81  ----TLKDTW--AHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---VD 130
               T KD +  AH ++ +               EVQ I  A V IIK  VD      VD
Sbjct: 309 HYRGTKKDMFVLAHHLKKLK-----------LASEVQVITTANVPIIK-FVDPLTKVHVD 356

Query: 131 IAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESR---ILGGHHGLISSYA 187
           I+FNQ GGL T C +  V+  + +    +  +I+IK +    +     LGG    +SSYA
Sbjct: 357 ISFNQPGGLKT-CLV--VNGFMKKYPALRPLVIIIKHFLNMRALNEVFLGG----LSSYA 409

Query: 188 LVTLVLYIFHVFNGSFAGPLE-------VLYRFLEFFSK 219
           +V LV+    +      G +        +L  FLE + K
Sbjct: 410 IVCLVVSFLQLHPRLSTGSMREEDNFGVLLLEFLELYGK 448


>gi|147799779|emb|CAN72745.1| hypothetical protein VITISV_018734 [Vitis vinifera]
          Length = 258

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 21/29 (72%), Positives = 25/29 (86%)

Query: 220 FDWDNFCLSLWGPVPISLLPDVTAEPPRK 248
            DWD+FC+SLWGPVPIS LPD T +PPR+
Sbjct: 114 IDWDSFCVSLWGPVPISSLPDATTKPPRQ 142


>gi|336367333|gb|EGN95678.1| hypothetical protein SERLA73DRAFT_60289 [Serpula lacrymans var.
           lacrymans S7.3]
          Length = 538

 Score = 54.3 bits (129), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 51/202 (25%), Positives = 88/202 (43%), Gaps = 15/202 (7%)

Query: 25  IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTL- 82
           + P P  +E R  V + V + +   FP  QV  FGS   K YLPD DIDL   S+     
Sbjct: 196 MSPSPVEDEIRGLVISLVTKAVSSAFPDAQVLPFGSYETKLYLPDGDIDLVIQSESMAYS 255

Query: 83  -KDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDN--FVVDIAFNQLGGL 139
            K T  H + + L      + A+   K     +A+V I+K + ++    VDI+ NQ  G+
Sbjct: 256 NKVTVLHALANTL------KRAKITSKVTIIAKAKVPIVKFVTNHGRLNVDISINQGNGV 309

Query: 140 CTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVF 199
                ++     ++      RS+++I      +  +   + G + SY++V L +    + 
Sbjct: 310 IAGKIVNGFLKDMHGCGFALRSLVMITKAFLNQRGMNEVYTGGLGSYSIVCLAISFLQMH 369

Query: 200 NGSFAGPLEVLYRF----LEFF 217
               +G ++         +EFF
Sbjct: 370 PKIRSGEIDAEKNLGVLVMEFF 391


>gi|449547164|gb|EMD38132.1| hypothetical protein CERSUDRAFT_49354 [Ceriporiopsis subvermispora
           B]
          Length = 547

 Score = 54.3 bits (129), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 52/175 (29%), Positives = 79/175 (45%), Gaps = 17/175 (9%)

Query: 25  IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
           I P P  +E R+ V   +RR I + FP  QV  FGS   K YLP  DIDL   S+     
Sbjct: 183 ISPTPQEDEVRSLVVELIRRAITRQFPDAQVLPFGSYETKLYLPLGDIDLVIHSNTMAYS 242

Query: 84  DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIK--CLVDNFVVDIAFNQLGGLCT 141
           D     V   L N  +       VK +   +A+V I+K   +   F VDI+ NQ  G+  
Sbjct: 243 DK--ENVLRALANTLRRAGITDNVKII--AKAKVPIVKFVTIHGRFSVDISINQGNGVAA 298

Query: 142 LCFLDEVDHLINENHLFKRSIILIKAWCYYESR---ILGGHHGLISSYALVTLVL 193
                 ++H ++E    +  + ++K++    S      GG    + SY++V L +
Sbjct: 299 GKM---INHFLSELPALRALVFVVKSFLSQRSMNEVFTGG----LGSYSIVCLAI 346


>gi|213403316|ref|XP_002172430.1| Poly(A) RNA polymerase cid14 [Schizosaccharomyces japonicus yFS275]
 gi|212000477|gb|EEB06137.1| Poly(A) RNA polymerase cid14 [Schizosaccharomyces japonicus yFS275]
          Length = 667

 Score = 54.3 bits (129), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 58/185 (31%), Positives = 85/185 (45%), Gaps = 21/185 (11%)

Query: 22  IARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQ 80
           I  ++P P     R ++   + R I   +P   V+ FGS   + YLP  DID+   S D 
Sbjct: 240 INYLEPTPQEHAVRKSLITKLDRAIRAKWPEVTVYVFGSFETRLYLPTSDIDMVVMSSDT 299

Query: 81  TLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---VDIAFNQL 136
             + T  H+    L    KN        E+Q I  A V IIK  VD F    VD++FNQ 
Sbjct: 300 VHRGTKKHMYS--LARHLKNCKL---ATEIQVITTANVPIIK-FVDPFTRIHVDVSFNQP 353

Query: 137 GGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESR---ILGGHHGLISSYALVTLVL 193
           GGL T C +  V+  + +    +   +L+K +    +     LGG    +SSYA+V LV+
Sbjct: 354 GGLKT-CLV--VNGFLKKFPAVRPLTMLVKHFLNMRALNEVFLGG----LSSYAIVCLVV 406

Query: 194 YIFHV 198
               +
Sbjct: 407 SFLQM 411


>gi|336380050|gb|EGO21204.1| hypothetical protein SERLADRAFT_476100 [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 592

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 51/202 (25%), Positives = 88/202 (43%), Gaps = 15/202 (7%)

Query: 25  IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTL- 82
           + P P  +E R  V + V + +   FP  QV  FGS   K YLPD DIDL   S+     
Sbjct: 196 MSPSPVEDEIRGLVISLVTKAVSSAFPDAQVLPFGSYETKLYLPDGDIDLVIQSESMAYS 255

Query: 83  -KDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDN--FVVDIAFNQLGGL 139
            K T  H + + L      + A+   K     +A+V I+K + ++    VDI+ NQ  G+
Sbjct: 256 NKVTVLHALANTL------KRAKITSKVTIIAKAKVPIVKFVTNHGRLNVDISINQGNGV 309

Query: 140 CTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVF 199
                ++     ++      RS+++I      +  +   + G + SY++V L +    + 
Sbjct: 310 IAGKIVNGFLKDMHGCGFALRSLVMITKAFLNQRGMNEVYTGGLGSYSIVCLAISFLQMH 369

Query: 200 NGSFAGPLEVLYRF----LEFF 217
               +G ++         +EFF
Sbjct: 370 PKIRSGEIDAEKNLGVLVMEFF 391


>gi|406604992|emb|CCH43591.1| Poly(A) RNA polymerase protein 1 [Wickerhamomyces ciferrii]
          Length = 624

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 63/237 (26%), Positives = 99/237 (41%), Gaps = 40/237 (16%)

Query: 11  WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
           WL  E    + IA I P     E RN     +R  I++ +P C+V  FGS     YLP  
Sbjct: 211 WLTLE--IKDFIAYISPSKEEIELRNNTVRKLREAIMELWPDCEVHVFGSYATDLYLPGS 268

Query: 70  DIDL------GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKC 122
           DID+      G +    +L    + L R  L             K V+ I +A+V IIK 
Sbjct: 269 DIDMVIVSEHGGYESRNSLYSLSSFLKRKNL------------AKNVEVIAKAKVPIIKF 316

Query: 123 L--VDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGG-H 179
                N  +D++F +  G+        +   I E    +  ++++K   +  SR L   H
Sbjct: 317 TESTSNIHIDVSFERTNGIDA---AKTIRSWITETPGLREIVLIVKQ--FLSSRKLNNVH 371

Query: 180 HGLISSYALVTLVLYIFHVFNGSFA----GPLE----VLYRFLEFFSK-FDWDNFCL 227
            G +  Y+++ LV Y F + +   +     P E    +L  F E + K F +DN  +
Sbjct: 372 VGGLGGYSIICLV-YSFLILHPRLSTGNISPYENLGVLLIEFFELYGKNFGYDNVAI 427


>gi|170109615|ref|XP_001886014.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164638944|gb|EDR03218.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 397

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 61/238 (25%), Positives = 99/238 (41%), Gaps = 28/238 (11%)

Query: 14  AEEITAELIA---RIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
           AE + AE+ A    I P P  +E R  +   +   +   FP  +V  FGS   K YLP  
Sbjct: 99  AEMLHAEVKAFVHWISPSPVEDEVRGLIVTQISNTVKASFPDARVLPFGSYETKLYLPLG 158

Query: 70  DIDLGAFSDDQTL--KDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDN- 126
           DIDL   SD      K    H + + L+      H     K      A+V I+K +  + 
Sbjct: 159 DIDLVILSDSMAYSNKVNVLHALANTLKRSGVTSHVTIIAK------AKVPIVKFVTTHG 212

Query: 127 -FVVDIAFNQLGGLCTL----CFLDEV--DHLINENHLFKRSIILIKAWCYYESRILGGH 179
            F VDI+ NQ  GL +      FL ++  +    +  +  RS++++      +  +   +
Sbjct: 213 RFHVDISLNQSNGLLSGKIINGFLKDMHGNGAEGKGSMALRSLVMVTKAFLTQRSMNEVY 272

Query: 180 HGLISSYALVTLVLYIFHVF----NGSFAGPLEVLYRFLEFFS----KFDWDNFCLSL 229
            G + SY++V L +    +     NG       +    +EFF      F++D   +SL
Sbjct: 273 TGGLGSYSIVCLAVSFLQMHPKIRNGEIDPEKNLGVLAMEFFELYGCYFNYDEVGISL 330


>gi|451844986|gb|EMD58301.1| hypothetical protein COCSADRAFT_165704 [Cochliobolus sativus
           ND90Pr]
          Length = 642

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 70/266 (26%), Positives = 105/266 (39%), Gaps = 37/266 (13%)

Query: 7   DPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQC-FPCQ---VFTFGSVPL 62
           +P +WL  E +  +    + P PF  E+RN +   V   + Q  FP Q   V  FGS P 
Sbjct: 318 EPEKWLHNEIL--DFYGFVAPKPFEHEQRNRLVNRVNNALGQRRFPQQNGRVLCFGSFPA 375

Query: 63  KTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNE---HAEFRVKEV-------QY 112
             YLP  D+DL   SD           V DM +          A  R+K +       Q 
Sbjct: 376 GLYLPTADMDLVYVSDQYY---NGGPPVVDMSQRGANKSLLYKASNRLKSMGMDADGCQV 432

Query: 113 IQAEVKIIKCL--VDNFVVDIAFNQLGGLCTLC----FLDEVDHLINENHLFKRSIILIK 166
           I A+V IIK    +    VDI+F  L G+        +  E   +I    L K+ +++  
Sbjct: 433 IHAKVPIIKFQDRLTQLQVDISFENLSGVQAQATFAQWKQEYPDMIYMVALLKQFLVM-- 490

Query: 167 AWCYYESRILGGHHGLISSYALVTLVL-YIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNF 225
                   +   H G I  ++++ L++ YI H       G  E    FL ++  FD    
Sbjct: 491 ------HGLNEVHTGGIGGFSIICLIVSYIQHSDKHENLG--ECFLGFLRYYGDFDLSRK 542

Query: 226 CLSLWGPVPISLLP-DVTAEPPRKDG 250
            + ++ P  I      +   P R DG
Sbjct: 543 RIQMYPPAIIEKTAHGIDGRPERYDG 568


>gi|391346299|ref|XP_003747415.1| PREDICTED: PAP-associated domain-containing protein 5-like
           [Metaseiulus occidentalis]
          Length = 491

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 79/316 (25%), Positives = 127/316 (40%), Gaps = 61/316 (19%)

Query: 26  QPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKD 84
           QP+   + RR  V   VR  I + +P C V  FGS     YLP  DID+        ++ 
Sbjct: 98  QPNAADQSRREQVIEKVRAAIREKWPDCVVEVFGSYKTGLYLPTGDIDM-------VIQG 150

Query: 85  TWA------HLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL-VDNFV-VDIAFNQL 136
            W        L R ++E ++  E   F+V +    +A V +IK    D  + VD++FNQ 
Sbjct: 151 NWEIIPPLFDLERQLIE-KKVGEKNTFKVLD----KASVPLIKFKDADTEIRVDLSFNQA 205

Query: 137 GGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIF 196
                  F+ +            + I ++K +      +    HG ISSY+L  ++L   
Sbjct: 206 NCTEAAAFVKQCCRTFPP---LAKLIFVLKQYLSLHG-LNEVFHGGISSYSLTLMILSFL 261

Query: 197 H------VFNGSFAGPLEVLYRFLEFFS-KFDWDNFCLSLWGPVPISLLPDVTAEPPRKD 249
                  +         ++L  FLEF+  +F++D   + +                  +D
Sbjct: 262 QLHPEQEMVRSDKPETGKLLVEFLEFYGDRFEYDKMGIRI------------------RD 303

Query: 250 GGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFR 309
           GG  L+ K+ L  C  A     GG  + G    S    + DPL   N++ RS       R
Sbjct: 304 GGA-LVDKNQLRECLIAA----GGPPSSG----SNLLCIEDPLTPGNDVARSSYA--MSR 352

Query: 310 IRTAFTFRAKGLARLL 325
           +R AF      L++L+
Sbjct: 353 VRDAFKSAFTCLSKLV 368


>gi|403331574|gb|EJY64740.1| Poly(A) RNA polymerase putative [Oxytricha trifallax]
          Length = 316

 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 55/226 (24%), Positives = 105/226 (46%), Gaps = 23/226 (10%)

Query: 14  AEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDID 72
            E  T + +  + P    +E RN VA  +  +I   FP C VF FGS      LP+ DID
Sbjct: 15  TETSTHDFVNFVTPSKEDKEIRNKVATSIEEVIKGVFPDCHVFVFGSCATGLNLPNSDID 74

Query: 73  LGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQ-AEVKIIKCLVDNFV--V 129
           L  +  D + +      V D +  ++K        K +  ++  +V +IK     F   V
Sbjct: 75  LIVYQPDVS-ESRMITKVADAIVRQKK-------CKTIDVLKNTKVPLIKITDSEFGVNV 126

Query: 130 DIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGG-HHGLISSYAL 188
           DI+FN+  G+  +  + ++  +  E    K  ++++K  C+ +SR L   + G + S+ L
Sbjct: 127 DISFNRTNGVYCVKLVKQLLQMFPE---LKPLMMVLK--CFLKSRQLNEPYSGGVGSFLL 181

Query: 189 VTLVL-YIFHVFNGSFAGPLEVLYRFLEFF----SKFDWDNFCLSL 229
             +V  ++   +       L++  + L+FF    ++F++ +  +S+
Sbjct: 182 TMMVTSFLQRQYKLGNTNNLDLGKQLLDFFKLYGTEFNYQHVGISI 227


>gi|145525609|ref|XP_001448621.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124416176|emb|CAK81224.1| unnamed protein product [Paramecium tetraurelia]
          Length = 364

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 55/211 (26%), Positives = 88/211 (41%), Gaps = 29/211 (13%)

Query: 29  PFSEERRNAVAAYVR--RLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDT 85
           P  +E +  V AY+R  + +    P  Q+ +FGS   + YLP+ DID+       T K  
Sbjct: 77  PSDQEHKRRVTAYLRVEKYLQDIAPEAQIESFGSFKTRMYLPNADIDIVMIETSCTQKQL 136

Query: 86  WAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKC--LVDNFVVDIAFNQLGGLCTLC 143
           +  +   M++   K E+            A+V IIK   +   +  D++FNQL GL  + 
Sbjct: 137 FKKVAARMMKQTNKFENVNL------IANAKVPIIKFVEVESQYHFDLSFNQLDGLKQIE 190

Query: 144 FLDEVDHLINENHLFKRSIILIKAWCYYESRILG-GHHGLISSYALVTLVLYIFHVFNGS 202
            L++   L  E        +L+   C    R L   + G + S+ L  ++L     F   
Sbjct: 191 ELEKAFELYPE-----LKFLLMTLKCVLRQRDLNETYSGGVGSFLLFQMILAFLREFRKD 245

Query: 203 F-----------AGPLEVLYRFLEFFS-KFD 221
           F               E + +FLEF+  KFD
Sbjct: 246 FFQHNKEDQIKNVTLGEYMIKFLEFYGIKFD 276


>gi|212645230|ref|NP_492446.3| Protein GLD-4 [Caenorhabditis elegans]
 gi|403399397|sp|G5EFL0.1|GLD4_CAEEL RecName: Full=Poly(A) RNA polymerase gld-4; AltName: Full=Defective
           in germ line development protein 4; AltName:
           Full=Germline development defective-4
 gi|194686198|emb|CAB02138.3| Protein GLD-4 [Caenorhabditis elegans]
 gi|226972859|gb|ACO95123.1| germline defective-4 [Caenorhabditis elegans]
          Length = 845

 Score = 51.2 bits (121), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 80/320 (25%), Positives = 132/320 (41%), Gaps = 73/320 (22%)

Query: 11  WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRD 70
           W+K  EI + L  ++    F + R + +  + ++ I      ++  FGS+    +LP  D
Sbjct: 90  WIKPNEIESRLRTKV----FEKVRDSVLRRWKQKTI------KISMFGSLRTNLFLPTSD 139

Query: 71  IDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQ-YIQAEVKIIKCLVD---N 126
           ID+    DD      W     D L    +   A+   + V  Y  A V I+K +VD    
Sbjct: 140 IDVLVECDD------WVGTPGDWLAETARGLEADNIAESVMVYGGAFVPIVK-MVDRDTR 192

Query: 127 FVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSY 186
             +DI+FN + G+    ++ +V     E  L +  ++L+K + +Y + +     G +SSY
Sbjct: 193 LSIDISFNTVQGVRAASYIAKVKE---EFPLIEPLVLLLKQFLHYRN-LNQTFTGGLSSY 248

Query: 187 ALVTLVLYIFHVF-----------NGSFAGPLEVLYRFLEFFS-KFDWDNFCLSLWGPVP 234
            LV L++  F ++            G   G L  L RFLE +S +F+++   +S      
Sbjct: 249 GLVLLLVNFFQLYALNMRSRTIYDRGVNLGHL--LLRFLELYSLEFNFEEMGIS------ 300

Query: 235 ISLLPDVTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRV 294
                          G    + KS     RY +         Q QP    +  + DPL  
Sbjct: 301 --------------PGQCCYIPKS-ASGARYGHK--------QAQP---GNLALEDPLLT 334

Query: 295 NNNLGRSVSKGNFFRIRTAF 314
            N++GRS    NF  I  AF
Sbjct: 335 ANDVGRSTY--NFSSIANAF 352


>gi|365758533|gb|EHN00370.1| Pap2p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
          Length = 514

 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 54/240 (22%), Positives = 99/240 (41%), Gaps = 30/240 (12%)

Query: 11  WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
           WL  E    + +A I P     E RN   + +R  + Q +P   +  FGS     YLP  
Sbjct: 180 WLTFE--IKDFVAYISPSREEIEIRNQTISTIREALKQLWPDADLHVFGSYSTDLYLPGS 237

Query: 70  DID------LGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL 123
           DID      LG       L    +HL ++ L  E +   A+ RV  +++++   +I    
Sbjct: 238 DIDCVVNSELGGKESRNNLYSLASHLKKNNLATEIE-VVAKARVPIIKFVEPHSRI---- 292

Query: 124 VDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLI 183
                +D++F +  GL     + E    +N+    +  ++++K + +   R+   H G +
Sbjct: 293 ----HIDVSFERTNGLEAAKLIRE---WLNDTPGLRELVLIVKQFLHAR-RLNNVHTGGL 344

Query: 184 SSYALVTLVLYIFHVFNGSFAGPLE-------VLYRFLEFFSK-FDWDNFCLSLWGPVPI 235
             ++++ LV    H+        ++       +L  F E + K F +D+  L      PI
Sbjct: 345 GGFSIICLVFSFLHMHPRIITKEIDSKDNLGVLLIEFFELYGKNFGYDDVALGSSDGYPI 404


>gi|426199822|gb|EKV49746.1| hypothetical protein AGABI2DRAFT_63272 [Agaricus bisporus var.
           bisporus H97]
          Length = 481

 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 53/218 (24%), Positives = 86/218 (39%), Gaps = 19/218 (8%)

Query: 25  IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTL- 82
           + P P  +E R      + R I   F   +VF FGS   K YLP  DIDL   SD     
Sbjct: 154 MAPTPIEDEIRELTVQMISRAITTAFSGSKVFPFGSYETKLYLPSGDIDLVIVSDSMAYS 213

Query: 83  -KDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLV--DNFVVDIAFNQLGGL 139
            K +  H +  +L        A          +A+V I+K +     F VDI+ NQ  G+
Sbjct: 214 NKSSVLHSLASVL------RRAGIASNVTVIAKAKVPIVKFVTIHGRFNVDISINQTNGI 267

Query: 140 CTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVF 199
                +      +    L  RS++LI      +  +     G + SY++V L +    + 
Sbjct: 268 VGGQVIKGFLQNLVTGGLALRSLVLITKLFLSQRSMNEVFTGGLGSYSIVCLAISFLQMH 327

Query: 200 NGSFAGPLE-------VLYRFLEFFS-KFDWDNFCLSL 229
                G ++       ++  F E +   F++D   +S+
Sbjct: 328 PKIRRGEIDPEKNLGVLVMEFFELYGCHFNYDEVGISV 365


>gi|71005312|ref|XP_757322.1| hypothetical protein UM01175.1 [Ustilago maydis 521]
 gi|46096726|gb|EAK81959.1| hypothetical protein UM01175.1 [Ustilago maydis 521]
          Length = 730

 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 65/240 (27%), Positives = 101/240 (42%), Gaps = 38/240 (15%)

Query: 14  AEEITAELIA---RIQPDPFSEERRNAVAAYVRRLIIQCF-PCQVFTFGSVPLKTYLPDR 69
           AE +  ELIA    + P     E R  V   + R I   F   +V+ FGS   K YLP  
Sbjct: 96  AEALHRELIAFDYWMTPTAAEHETRCMVIELISRAIKSQFRDAEVYPFGSQETKLYLPQG 155

Query: 70  DIDLGAFSDD-------QTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIK 121
           D+DL   S+          L+   A L R  L              +VQ I +A+V IIK
Sbjct: 156 DLDLVVVSNSMANLRVQSALRTMAACLRRHNL------------ATDVQVIAKAKVPIIK 203

Query: 122 CLVD--NFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGH 179
            +       VDI+ N   GL T  +++    L    H+  R +IL+  +   +  +    
Sbjct: 204 FVTTYARLKVDISLNHTNGLTTASYVNS--WLRKWPHI--RPLILVVKYLLMQRGMSEVF 259

Query: 180 HGLISSYALVTLVLYIFHVFNGSFAGPLE-------VLYRFLEFFSK-FDWDNFCLSLWG 231
            G + SY+++ +V+    +      G ++       +L  FLE + K F +DN  +S+ G
Sbjct: 260 SGGLGSYSVIIMVISFLQLHPKVQRGEIDADRSLGVLLLEFLELYGKNFGYDNCGISIRG 319


>gi|401623740|gb|EJS41828.1| trf4p [Saccharomyces arboricola H-6]
          Length = 573

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 58/242 (23%), Positives = 97/242 (40%), Gaps = 34/242 (14%)

Query: 11  WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
           WL  E    + +A I P     E RN   + +R  + Q +P   +  FGS     YLP  
Sbjct: 170 WLTFE--IKDFVAYISPSREEIEVRNQTISMIREAVKQLWPDADLHVFGSYSTDLYLPGS 227

Query: 70  DID------LGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL 123
           DID      LG       L    +HL       ++KN   E  V      +A V IIK +
Sbjct: 228 DIDCVITSELGGKESRNNLFSLASHL-------KKKNLATEIEV----VAKARVPIIKFV 276

Query: 124 VDN--FVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHG 181
             N    +D++F +  GL     + E    +N+    +  ++++K + +   R+   H G
Sbjct: 277 EPNSGIHIDVSFERTNGLEAAKLIRE---WLNDTPGLRELVLIVKQFLH-SRRLNNVHTG 332

Query: 182 LISSYALVTLVLYIFHVFNGSFAGPLE-------VLYRFLEFFSK-FDWDNFCLSLWGPV 233
            +  ++++ LV    H+        +E       +L  F E + K F +D+  L      
Sbjct: 333 GLGGFSIICLVFSFLHMHPRIITKEIEAKDNLGVLLIEFFELYGKNFGYDDVALGSSDGY 392

Query: 234 PI 235
           P+
Sbjct: 393 PV 394


>gi|409081996|gb|EKM82354.1| hypothetical protein AGABI1DRAFT_52475, partial [Agaricus bisporus
           var. burnettii JB137-S8]
          Length = 559

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 53/218 (24%), Positives = 86/218 (39%), Gaps = 19/218 (8%)

Query: 25  IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTL- 82
           + P P  +E R      + R I   F   +VF FGS   K YLP  DIDL   SD     
Sbjct: 153 MAPTPIEDEIRELTVQMISRAITTAFSGSKVFPFGSYETKLYLPSGDIDLVIVSDSMAYS 212

Query: 83  -KDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIK--CLVDNFVVDIAFNQLGGL 139
            K +  H +  +L        A          +A+V I+K   +   F VDI+ NQ  G+
Sbjct: 213 NKSSVLHSLASVL------RRAGIASNVTVIAKAKVPIVKFVTIHGRFNVDISINQTNGI 266

Query: 140 CTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVF 199
                +      +    L  RS++LI      +  +     G + SY++V L +    + 
Sbjct: 267 VGGQVIKGFLQNLVTGGLALRSLVLITKLFLSQRSMNEVFTGGLGSYSIVCLAISFLQMH 326

Query: 200 NGSFAGPLE-------VLYRFLEFFS-KFDWDNFCLSL 229
                G ++       ++  F E +   F++D   +S+
Sbjct: 327 PKIRRGEIDPEKNLGVLVMEFFELYGCHFNYDEVGISV 364


>gi|401837753|gb|EJT41641.1| PAP2-like protein [Saccharomyces kudriavzevii IFO 1802]
          Length = 592

 Score = 50.8 bits (120), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 54/240 (22%), Positives = 99/240 (41%), Gaps = 30/240 (12%)

Query: 11  WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
           WL  E    + +A I P     E RN   + +R  + Q +P   +  FGS     YLP  
Sbjct: 180 WLTFE--IKDFVAYISPSREEIEIRNQTISTIREALKQLWPDADLHVFGSYSTDLYLPGS 237

Query: 70  DID------LGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL 123
           DID      LG       L    +HL ++ L  E +   A+ RV  +++++   +I    
Sbjct: 238 DIDCVVNSELGGKESRNNLYSLASHLKKNNLATEIEVV-AKARVPIIKFVEPHSRI---- 292

Query: 124 VDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLI 183
                +D++F +  GL     + E    +N+    +  ++++K + +   R+   H G +
Sbjct: 293 ----HIDVSFERTNGLEAAKLIRE---WLNDTPGLRELVLIVKQFLHAR-RLNNVHTGGL 344

Query: 184 SSYALVTLVLYIFHVFNGSFAGPLE-------VLYRFLEFFSK-FDWDNFCLSLWGPVPI 235
             ++++ LV    H+        ++       +L  F E + K F +D+  L      PI
Sbjct: 345 GGFSIICLVFSFLHMHPRIITKEIDSKDNLGVLLIEFFELYGKNFGYDDVALGSSDGYPI 404


>gi|403213331|emb|CCK67833.1| hypothetical protein KNAG_0A01440 [Kazachstania naganishii CBS
           8797]
          Length = 526

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 57/244 (23%), Positives = 103/244 (42%), Gaps = 31/244 (12%)

Query: 11  WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
           WL  E    + I+ I P+    ++RN     +R  + + +P   +  FGS     YLP  
Sbjct: 139 WLTLE--VKDFISYISPNRVEIKQRNTTIGKIRAAVSELWPDADLHVFGSYATDLYLPGS 196

Query: 70  DIDL------GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL 123
           DID       G   +  +L     HL ++ L  E +   A+ RV  +++++ E +I    
Sbjct: 197 DIDCVVNSKGGDKENQSSLYKLATHLKKNGLATEIEI-IAKARVPIIKFVEPESRI---- 251

Query: 124 VDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLI 183
                +D++F ++ GL     + E      E+    R ++LI     +  R+   H G +
Sbjct: 252 ----HIDVSFERINGLEAAKLIRE----WLESTPGLRELVLIIKQFLHSRRLNNVHTGGL 303

Query: 184 SSYALVTLVLYIFHVFNGSFAGPLE-------VLYRFLEFFSK-FDWDNFCLSLWGPVPI 235
             ++++ LV     +        ++       +L  F E + K F +D+  +SL   VP 
Sbjct: 304 GGFSIICLVYSFLSMHPRVITNEIDPIDNLGVLLIDFFELYGKNFGYDDVAISLSNGVP- 362

Query: 236 SLLP 239
           S LP
Sbjct: 363 SYLP 366


>gi|449017212|dbj|BAM80614.1| hypothetical protein CYME_CMK272C [Cyanidioschyzon merolae strain
            10D]
          Length = 1647

 Score = 50.4 bits (119), Expect = 0.001,   Method: Composition-based stats.
 Identities = 43/160 (26%), Positives = 66/160 (41%), Gaps = 12/160 (7%)

Query: 119  IIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGG 178
            +++C  +        N    LC  CFL E D LI   HL  R +IL+K W +  S     
Sbjct: 1160 VVRCRTNGLTTQFLLNPAVALCRSCFLVECDELIGRRHLLIRCLILLKVW-WRHSLATAQ 1218

Query: 179  HHGLIS--SYALVTLVLYIFHVFNGSFAG-----PLEVLYRFLEFFS-KFDWDNFCLSLW 230
               L+S  S +LV+  L +  +   +  G     P  VL     F+    DW    +S++
Sbjct: 1219 ARALLSPLSGSLVSPFLALLLLSYLNCRGLPGDEPAHVLQGLFSFYGFDMDWSRCGMSIY 1278

Query: 231  GPVPI---SLLPDVTAEPPRKDGGVLLLSKSFLDSCRYAY 267
            GP  I   +L+  +T   P     +L   +    +CR  Y
Sbjct: 1279 GPFDIQSGALMTHLTTRQPLIPDAMLRAHQLEYATCRLRY 1318


>gi|145546801|ref|XP_001459083.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124426906|emb|CAK91686.1| unnamed protein product [Paramecium tetraurelia]
          Length = 364

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 52/213 (24%), Positives = 91/213 (42%), Gaps = 33/213 (15%)

Query: 29  PFSEERRNAVAAYVR--RLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDT 85
           P  +E +  V AY+R  + +    P  Q+ +FGS   + YLP+ DID+       T K  
Sbjct: 77  PSDQEHKRRVTAYMRVEKYLQDIAPEAQIESFGSFKTRMYLPNADIDMVMIETSCTQKQL 136

Query: 86  WAHLVRDMLENEEKNEH----AEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCT 141
           +  +   M++   K E+    A  +V  +++I+ E +        +  D++FNQL GL  
Sbjct: 137 FKKVAAKMMKQTNKFENVNLIANAKVPIIKFIEVESQ--------YHFDLSFNQLDGLKQ 188

Query: 142 LCFLDEVDHLINENHLFKRSIILIKAWCYYESRILG-GHHGLISSYALVTLVLYIFHVFN 200
           +  L++   +  E        +L+   C    R L   + G + S+ L  ++L     + 
Sbjct: 189 IEELEKAFEIYPE-----LKFLLMTLKCVLRQRDLNETYSGGVGSFLLFQMILAFLREYR 243

Query: 201 GSF-----------AGPLEVLYRFLEFFS-KFD 221
             F               E + +FLEF+  KFD
Sbjct: 244 KDFFQHNKQDQIKNVTLGEYMIKFLEFYGIKFD 276


>gi|50302781|ref|XP_451327.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|49640458|emb|CAH02915.1| KLLA0A07359p [Kluyveromyces lactis]
          Length = 684

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 63/260 (24%), Positives = 109/260 (41%), Gaps = 42/260 (16%)

Query: 11  WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFT-FGSVPLKTYLPDR 69
           WL  E    + ++ I P+    E+RN   A ++  +++ +P      FGS     YLP  
Sbjct: 190 WLTLE--IKDFVSYISPNRQEIEQRNQAIAKLKEAVVELWPDSSLNCFGSYATDLYLPGS 247

Query: 70  DIDL------GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL 123
           DID       G   +   L    + L R  L  + +   A+ RV  +++++ E KI    
Sbjct: 248 DIDCVVRSASGDKENRNALYSLASFLKRKQLATQVE-VIAKARVPIIKFVEPESKI---- 302

Query: 124 VDNFVVDIAFNQLGGL----CTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGH 179
                +D++F +  GL        +L+E   L        R ++LI     +  R+   H
Sbjct: 303 ----HIDVSFERTNGLEAARVIRGWLEEQPGL--------RELVLIVKQFLHARRLNNVH 350

Query: 180 HGLISSYALVTLVLYIFHVFNGSFAG---PLE----VLYRFLEFFSK-FDWDNFCLSLWG 231
            G +  Y+++ LV     +      G   PLE    +L  F E + K F +D+  +S+  
Sbjct: 351 TGGLGGYSIICLVYTFLKLHPRVLTGDIDPLENLGVLLIDFFELYGKNFGYDDVGISVSE 410

Query: 232 P----VPISLLPDVTAEPPR 247
                +P +  PD++A  PR
Sbjct: 411 HEARYIPKNEHPDLSAGRPR 430


>gi|443895250|dbj|GAC72596.1| DNA polymerase sigma [Pseudozyma antarctica T-34]
          Length = 689

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 67/240 (27%), Positives = 102/240 (42%), Gaps = 28/240 (11%)

Query: 11  WLK----AEEITAELIA---RIQPDPFSEERRNAVAAYVRRLIIQCF-PCQVFTFGSVPL 62
           W K    AE +  EL+A    + P     E R  V   + R I   F   +V  FGS   
Sbjct: 88  WAKCQNGAEALHRELMAFDHWMAPTAAEHETRCMVIELISRAIKSQFRDAEVHPFGSQET 147

Query: 63  KTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIK 121
           K YLP  D+DL   S       T + L R M     ++  A     +VQ I +A+V IIK
Sbjct: 148 KLYLPQGDLDLVVVSRSMANLRTQSAL-RTMAACLRRHNLA----TDVQVIAKAKVPIIK 202

Query: 122 CLVD--NFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGH 179
            +       VDI+ N   GL T  F++    L    H+  R +I++      +  +    
Sbjct: 203 FVTTYARLKVDISLNHTNGLTTASFVNS--WLRKWPHI--RPLIIVVKHLLMQRGMSEVF 258

Query: 180 HGLISSYALVTLVLYIFHVFNGSFAGPLE-------VLYRFLEFFSK-FDWDNFCLSLWG 231
            G + SY+++ +V+    +      G +E       +L  FLE + K F +DN  +S+ G
Sbjct: 259 SGGLGSYSIIIMVISFLQLHPKVQRGEIEPGRSLGVLLLEFLELYGKNFGYDNCGISIRG 318


>gi|268566431|ref|XP_002639720.1| Hypothetical protein CBG12446 [Caenorhabditis briggsae]
          Length = 897

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 83/321 (25%), Positives = 133/321 (41%), Gaps = 75/321 (23%)

Query: 11  WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRD 70
           W+K  EI   L  ++      E+ R++V+        Q  P ++  FGS+    +LP  D
Sbjct: 92  WIKPNEIEVRLRTKVY-----EKVRDSVSQR-----WQHKPIKISMFGSLRTNLFLPTSD 141

Query: 71  IDLGAFSDD--QTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVD--- 125
           ID+    DD   T  D W       LEN+   E          +  A V I+K +VD   
Sbjct: 142 IDVLVECDDWVGTPGD-WLGETARGLENDNIAESVTV------FGGAFVPIVK-MVDRDT 193

Query: 126 NFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISS 185
              +DI+FN + G+    ++ +V     E  L +  ++L+K + +Y + +     G +SS
Sbjct: 194 RLSIDISFNTVQGVRAASYIAKVKE---EFPLIEPLVLLLKQFLHYRN-LNQTFTGGLSS 249

Query: 186 YALVTLVLYIFHVF-----------NGSFAGPLEVLYRFLEFFS-KFDWDNFCLSLWGPV 233
           Y LV L++  F ++           +G   G L  L RFLE +S +F+++   +S     
Sbjct: 250 YGLVLLLVNFFQLYALNMRHRTIYDSGVNLGHL--LLRFLEVYSMEFNYEEIGIS----- 302

Query: 234 PISLLPDVTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLR 293
                           G    +SKS     RY +         + QP    +  + DPL 
Sbjct: 303 ---------------PGQCCYISKSAA-GARYGH--------KRAQP---GNLALEDPLL 335

Query: 294 VNNNLGRSVSKGNFFRIRTAF 314
             N++GRS    NF  I  AF
Sbjct: 336 TANDVGRSTY--NFSSIANAF 354


>gi|388580693|gb|EIM21006.1| Nucleotidyltransferase, partial [Wallemia sebi CBS 633.66]
          Length = 360

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 58/222 (26%), Positives = 98/222 (44%), Gaps = 27/222 (12%)

Query: 25  IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAF--SDDQT 81
           I P     + R      +RR I   +   +VF FGS   + YLPD DIDL     S +Q 
Sbjct: 86  ISPSLTEHKTREYTIECIRRCITSRWADAEVFAFGSFETRLYLPDGDIDLVVMRKSVNQY 145

Query: 82  LKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIK--CLVDNFVVDIAFNQLGG 138
            K +  H +  ML             + +Q I +A V IIK       + +DI+ NQ  G
Sbjct: 146 NKQSMLHTMASMLRQAN-------LAQSIQVISKARVPIIKFTSSFGGYPIDISLNQTNG 198

Query: 139 LCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGG-HHGLISSYALVTLVLYIFH 197
           +     ++E+   ++     +   +L+K  C+   R +   + G +SSY+++ LV+    
Sbjct: 199 VDAGRMVNEI---LDRYPAARPLSMLLK--CFLSQRSMNEVYTGGVSSYSVICLVVSFLQ 253

Query: 198 VFNGSFAG---PLE----VLYRFLEFFSK-FDWDNFCLSLWG 231
           +      G   PL+    +L   LE + + F++D   +S+ G
Sbjct: 254 MHPKVRRGDINPLDNLGVLLVDLLELYGRNFNYDVTGISIEG 295


>gi|395333834|gb|EJF66211.1| hypothetical protein DICSQDRAFT_152192 [Dichomitus squalens
           LYAD-421 SS1]
          Length = 647

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 60/220 (27%), Positives = 94/220 (42%), Gaps = 27/220 (12%)

Query: 25  IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
           + P P  +E R+     + R I + +P  +V  FGS   K YLP  DIDL  +S      
Sbjct: 169 MSPTPIEDEVRSLSVQLIARAISKSYPDAKVLPFGSYETKLYLPSGDIDLVIYSHSMMRM 228

Query: 84  DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDN--FVVDIAFNQLGGLCT 141
           D  + L    L N  K      RV  +   +A+V IIK +  +  F VDI+ NQ  G+ T
Sbjct: 229 DKVSVL--HSLANIMKRAGITDRVTII--AKAKVPIIKFVTAHGRFSVDISVNQGNGVDT 284

Query: 142 ----LCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFH 197
                 FL E+  L        RS++LI      +  +     G + SY++V L +    
Sbjct: 285 GKMVKQFLRELPAL--------RSLVLIIKNFLSQRSMNEVFTGGLGSYSIVCLAISFLQ 336

Query: 198 VFNGSFAGPLE-------VLYRFLEFF-SKFDWDNFCLSL 229
           +      G ++       ++  F E + S F++    +SL
Sbjct: 337 MHPKIRRGEIDPSKNLGVLVMEFFELYGSYFNYQEVGISL 376


>gi|393216777|gb|EJD02267.1| hypothetical protein FOMMEDRAFT_141374 [Fomitiporia mediterranea
           MF3/22]
          Length = 732

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 63/217 (29%), Positives = 95/217 (43%), Gaps = 21/217 (9%)

Query: 25  IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
           + P P   E R  V   +   I + +   +V  FGS   K YLP  DIDL   S  +TL 
Sbjct: 160 VSPTPVEHEVRWMVVQLISSSIKRVYSDSEVLPFGSFGTKLYLPQGDIDLVVQS--RTLA 217

Query: 84  DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIK--CLVDNFVVDIAFNQLGGLCT 141
                     L N  K      +V  +   QA V IIK   L   F VDI+ NQ  G+ T
Sbjct: 218 SFEKVTALKSLANIVKRTGLADKVTIIS--QARVPIIKFTTLYGRFAVDISMNQSNGVKT 275

Query: 142 LCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGG-HHGLISSYALVTLVLYIFHVFN 200
               D ++  +NE    +  ++++K+  + + R L   + G + SYA+V L +    +  
Sbjct: 276 ---GDMINRFLNEFPALRAIVLIVKS--FLKQRNLNEVYSGGLGSYAIVCLAVSHLQMHP 330

Query: 201 G------SFAGPLEVLY-RFLEFFSK-FDWDNFCLSL 229
                  + A  L VL   F E + K F+++N  +SL
Sbjct: 331 KVRRAEINSAKNLGVLTLEFFELYGKYFNYNNTGISL 367


>gi|388851758|emb|CCF54564.1| related to TRF4-topoisomerase I-related protein [Ustilago hordei]
          Length = 701

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 67/233 (28%), Positives = 100/233 (42%), Gaps = 24/233 (10%)

Query: 14  AEEITAELIARIQ---PDPFSEERRNAVAAYVRRLIIQCF-PCQVFTFGSVPLKTYLPDR 69
           AE +  ELIA  Q   P     E R  V   + R I   F   +V  FGS   K YLP  
Sbjct: 96  AEALHRELIAFDQWMAPTGAEHETRCMVIELIARAIKSQFRDAEVRPFGSQETKLYLPQG 155

Query: 70  DIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVD--N 126
           D+DL   S       T + L R M     ++  A     +VQ I +A+V IIK +     
Sbjct: 156 DLDLVVVSRSMANLRTQSAL-RTMAACLRRHNLA----TDVQVIAKAKVPIIKFVTTYAR 210

Query: 127 FVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSY 186
             VDI+ N   GL T  +++    L    H+  R +IL+      +  +     G + SY
Sbjct: 211 LKVDISLNHTNGLTTASYVN--GWLRKWPHI--RPLILVIKHLLMQRGMSEVFSGGLGSY 266

Query: 187 ALVTLVLYIFHVFNGSFAGPLE-------VLYRFLEFFSK-FDWDNFCLSLWG 231
           +++ +V+    +      G +E       +L  FLE + K F +DN  +S+ G
Sbjct: 267 SVIIMVISFLQLHPKLQRGEIEPGRSLGVLLLEFLELYGKNFGYDNCGISIRG 319


>gi|451992975|gb|EMD85451.1| hypothetical protein COCHEDRAFT_1148848 [Cochliobolus
           heterostrophus C5]
          Length = 624

 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 69/266 (25%), Positives = 105/266 (39%), Gaps = 37/266 (13%)

Query: 7   DPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQC-FPCQ---VFTFGSVPL 62
           +P +WL  E +  +    + P PF  E+RN +   V   + Q  FP Q   V  FGS P 
Sbjct: 300 EPEKWLHNEIL--DFYDFVAPKPFEHEQRNRLVNRVNNALGQRRFPQQNGRVLCFGSFPA 357

Query: 63  KTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNE---HAEFRVKEV-------QY 112
             YLP  D+DL   SD           V DM +          A  R+K +       Q 
Sbjct: 358 GLYLPTADMDLVYVSDQYY---NGGPPVVDMSQRGANKSLLYKASNRLKSMGMDADGCQV 414

Query: 113 IQAEVKIIK--CLVDNFVVDIAFNQLGGLCTLC----FLDEVDHLINENHLFKRSIILIK 166
           I A+V IIK    +    VDI+F  L G+        +  +   +I    L K+ +++  
Sbjct: 415 IHAKVPIIKFQDRLTQLQVDISFENLSGVQAQATFAQWKQDYPDMIYMVALLKQFLVM-- 472

Query: 167 AWCYYESRILGGHHGLISSYALVTLVL-YIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNF 225
                   +   H G I  ++++ L++ YI H       G  E    FL+++  FD    
Sbjct: 473 ------HGLNEVHTGGIGGFSIICLIVSYIQHSDKHENLG--ECFLGFLKYYGDFDLSRK 524

Query: 226 CLSLWGPVPISLLP-DVTAEPPRKDG 250
            + +  P  I      +   P R DG
Sbjct: 525 RIQMHPPAIIEKTAHGIDGRPERYDG 550


>gi|357491469|ref|XP_003616022.1| hypothetical protein MTR_5g075260 [Medicago truncatula]
 gi|355517357|gb|AES98980.1| hypothetical protein MTR_5g075260 [Medicago truncatula]
          Length = 490

 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 21/43 (48%), Positives = 31/43 (72%)

Query: 305 GNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
           GNF+RIR+AF + A+ L  +L  P + + +E+N+FF NT DRH
Sbjct: 11  GNFYRIRSAFKYGARKLGWILMLPEDRIADELNRFFANTLDRH 53


>gi|341895116|gb|EGT51051.1| hypothetical protein CAEBREN_16945 [Caenorhabditis brenneri]
          Length = 901

 Score = 47.8 bits (112), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 59/223 (26%), Positives = 101/223 (45%), Gaps = 32/223 (14%)

Query: 25  IQPDPFSEERRNAVAAYVRRLIIQCF---PCQVFTFGSVPLKTYLPDRDIDLGAFSDDQT 81
           I+P+      R  V   VR  +++ +   P +V  FGS+    +LP  DID+    D+  
Sbjct: 99  IKPNEIESRLRYKVYEKVRLSLLERWKHKPIKVSMFGSLRTTLFLPTSDIDVLVECDE-- 156

Query: 82  LKDTWAHLVRDMLENEEKNEHAEFRVKEVQ-YIQAEVKIIKCLVD---NFVVDIAFNQLG 137
               W     D L    +    +   + V  Y  A V I+K +VD      +DI+FN + 
Sbjct: 157 ----WIGTPGDWLTETARGLEIDNIAESVSVYGGAFVPIVK-MVDRDTRLSIDISFNTVQ 211

Query: 138 GLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFH 197
           G+    ++D+V     E  L +  ++L+K + +Y + +     G +SSY LV L++  F 
Sbjct: 212 GVRAASYIDKVKE---EFPLIEPLVLLLKQFLHYRN-LNQTFTGGLSSYGLVLLLVNFFQ 267

Query: 198 VF-----------NGSFAGPLEVLYRFLEFFS-KFDWDNFCLS 228
           ++            G   G L  L RFLE +S +F+++   +S
Sbjct: 268 LYALNMRHRTIYDRGVNLGHL--LLRFLEVYSLEFNYEEIGIS 308


>gi|341883718|gb|EGT39653.1| hypothetical protein CAEBREN_22894 [Caenorhabditis brenneri]
          Length = 901

 Score = 47.8 bits (112), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 59/223 (26%), Positives = 101/223 (45%), Gaps = 32/223 (14%)

Query: 25  IQPDPFSEERRNAVAAYVRRLIIQCF---PCQVFTFGSVPLKTYLPDRDIDLGAFSDDQT 81
           I+P+      R  V   VR  +++ +   P +V  FGS+    +LP  DID+    D+  
Sbjct: 99  IKPNEIESRLRYKVYEKVRLSLLERWKHKPIKVSMFGSLRTTLFLPTSDIDVLVECDE-- 156

Query: 82  LKDTWAHLVRDMLENEEKNEHAEFRVKEVQ-YIQAEVKIIKCLVD---NFVVDIAFNQLG 137
               W     D L    +    +   + V  Y  A V I+K +VD      +DI+FN + 
Sbjct: 157 ----WIGTPGDWLTETARGLEIDNIAESVSVYGGAFVPIVK-MVDRDTRLSIDISFNTVQ 211

Query: 138 GLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFH 197
           G+    ++D+V     E  L +  ++L+K + +Y + +     G +SSY LV L++  F 
Sbjct: 212 GVRAASYIDKVKE---EFPLIEPLVLLLKQFLHYRN-LNQTFTGGLSSYGLVLLLVNFFQ 267

Query: 198 VF-----------NGSFAGPLEVLYRFLEFFS-KFDWDNFCLS 228
           ++            G   G L  L RFLE +S +F+++   +S
Sbjct: 268 LYALNMRHRTIYDRGVNLGHL--LLRFLEVYSLEFNYEEIGIS 308


>gi|409045762|gb|EKM55242.1| hypothetical protein PHACADRAFT_93478 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 478

 Score = 47.8 bits (112), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 71/304 (23%), Positives = 122/304 (40%), Gaps = 59/304 (19%)

Query: 25  IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
           I P    +E R+ +   + R + + FP  +V  FGS   K YLP  DIDL   SD     
Sbjct: 164 ISPTQEEDEIRSLIVESISRAVTKAFPDARVLPFGSYETKLYLPLGDIDLVIESDSM--- 220

Query: 84  DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDN--FVVDIAFNQLGGLCT 141
             + + V  +       + A    K     +A+V IIK +  +  F VDI+ NQ+ G+  
Sbjct: 221 -AYNNKVNVLQALATTMKRAGITDKVTIIAKAKVPIIKFVTRHGRFSVDISLNQMNGVKA 279

Query: 142 LC----FLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFH 197
                 FLD +  L        ++++LI      +  +     G + SY++V L +    
Sbjct: 280 GTMIKRFLDHIPAL--------QALVLITKSFLSQRSMNEVFTGGLGSYSIVCLAISFLQ 331

Query: 198 ----VFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVL 253
               +  G       +    +EFF  +     C   +  V IS+          +DGG  
Sbjct: 332 MHPKIRRGEIDSSKNLGVLVMEFFELYG----CYFNYREVGISV----------RDGG-- 375

Query: 254 LLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNF--FRIR 311
               S+ +  +  +AD+         PF+    ++ DP   +N+    +S+G+F   ++R
Sbjct: 376 ----SYYNKAQRGWADYK-------SPFL---LSIEDPGDPSND----ISRGSFGIVKVR 417

Query: 312 TAFT 315
           T   
Sbjct: 418 TTLA 421


>gi|389748468|gb|EIM89645.1| Nucleotidyltransferase [Stereum hirsutum FP-91666 SS1]
          Length = 479

 Score = 47.4 bits (111), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 39/120 (32%), Positives = 52/120 (43%), Gaps = 11/120 (9%)

Query: 25  IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
           I P P  +E R+ V   ++R I   FP  +V +FGS   K YLP  DIDL   S      
Sbjct: 114 ISPTPVEDEIRSLVVLQIQRCISSKFPDAKVRSFGSYETKLYLPLGDIDLVIISKSMAYS 173

Query: 84  D--TWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLV--DNFVVDIAFNQLGGL 139
           D  T  H V + L      +      K      A+V I+K +     F VDI+ N   G+
Sbjct: 174 DRVTVLHAVANTLRTAGITDRVSVIAK------AKVPIVKFVTTFGRFAVDISINMSNGV 227


>gi|256271045|gb|EEU06149.1| Pap2p [Saccharomyces cerevisiae JAY291]
          Length = 584

 Score = 47.0 bits (110), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 55/244 (22%), Positives = 95/244 (38%), Gaps = 38/244 (15%)

Query: 11  WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
           WL  E    + +A I P     E RN   + +R  + Q +P   +  FGS     YLP  
Sbjct: 178 WLTFE--IKDFVAYISPSREEIEIRNKTISTIREAVKQLWPDADLHVFGSYSTDLYLPGS 235

Query: 70  DID------LGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL 123
           DID      LG       L    +HL +  L  E +   A+ RV  +++++    I    
Sbjct: 236 DIDCVVTSKLGGKESRNNLYSLASHLKKKKLATEVEVV-AKARVPIIKFVEPHSGI---- 290

Query: 124 VDNFVVDIAFNQLGGLCTLC----FLDEVDHLINENHLFKRSIILIKAWCYYESRILGGH 179
                +D++F +  G+        +LD+   L        R ++LI     +  R+   H
Sbjct: 291 ----HIDVSFERTNGIEAAKLIREWLDDTPGL--------RELVLIVKQFLHARRLNNVH 338

Query: 180 HGLISSYALVTLVLYIFHVFNGSFAGPLE-------VLYRFLEFFSK-FDWDNFCLSLWG 231
            G +  ++++ LV    H+        ++       +L  F E + K F +D+  L    
Sbjct: 339 TGGLGGFSIICLVFSFLHMHPRIITNEIDPKDNLGVLLIEFFELYGKNFGYDDVALGSSD 398

Query: 232 PVPI 235
             P+
Sbjct: 399 GYPV 402


>gi|396490001|ref|XP_003843230.1| hypothetical protein LEMA_P073400.1 [Leptosphaeria maculans JN3]
 gi|312219809|emb|CBX99751.1| hypothetical protein LEMA_P073400.1 [Leptosphaeria maculans JN3]
          Length = 717

 Score = 47.0 bits (110), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 61/237 (25%), Positives = 104/237 (43%), Gaps = 36/237 (15%)

Query: 7   DPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLI-IQCFP---CQVFTFGSVPL 62
           DP +WL  E +  +    + P P+  E+RN +   V+ ++    FP    ++  FGS P 
Sbjct: 348 DPEKWLHNEIL--DFYDFVAPKPYEHEQRNLLVQRVQSVLGYHRFPQDNGRILCFGSFPA 405

Query: 63  KTYLPDRDIDLGAFSD----------DQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQY 112
             YLP  D+DL   SD          D T ++  A L++  + N  +  +  F      Y
Sbjct: 406 GLYLPTADMDLVYTSDRHFNGGPPVMDVTARNATAPLLKG-VRNVLQRRNMAFGAISCIY 464

Query: 113 IQAEVKIIKCL--VDNFVVDIAFNQLGGLCTLC----FLDEVDHLINENHLFKRSIILIK 166
             A+V ++K    V    VDI+F  L G+        + D+   +I    L K+ +++  
Sbjct: 465 -GAKVPLVKFTDSVTRLQVDISFENLSGMQAQATFAQWKDKYPDMIYMVALLKQFLVM-- 521

Query: 167 AWCYYESRILGG-HHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFF-SKFD 221
                  R L   H G I  +A++ L+++  H   G      E+   FL+++ +KFD
Sbjct: 522 -------RGLNEVHTGGIGGFAIICLIVHYIHQA-GKAENLAELFKGFLDYYGNKFD 570


>gi|124481633|gb|AAI33102.1| LOC568678 protein [Danio rerio]
          Length = 535

 Score = 47.0 bits (110), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 40/119 (33%), Positives = 60/119 (50%), Gaps = 12/119 (10%)

Query: 25  IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
           I P P  E+ R+ V A ++R+I   +P  +V  FGS     YLP  DIDL  F + +TL 
Sbjct: 61  ISPRPEEEQMRHEVVARIQRVIKDLWPNAEVCVFGSFSTGLYLPTSDIDLVVFGNWETLP 120

Query: 84  DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFV---VDIAFNQLGGL 139
             W   + + L   +  +    +V +    +A V IIK L+D+     VDI+FN   G+
Sbjct: 121 -LWT--LEEALRKRKVADENSIKVLD----KATVPIIK-LMDSHTEVKVDISFNVQSGV 171


>gi|164656242|ref|XP_001729249.1| hypothetical protein MGL_3716 [Malassezia globosa CBS 7966]
 gi|159103139|gb|EDP42035.1| hypothetical protein MGL_3716 [Malassezia globosa CBS 7966]
          Length = 527

 Score = 47.0 bits (110), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 53/212 (25%), Positives = 93/212 (43%), Gaps = 33/212 (15%)

Query: 38  VAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLEN 96
           V + ++R +   +P  +V++FGS   + YLP  DIDL   S+          ++ DM   
Sbjct: 2   VISLLQRALCSKWPDARVYSFGSQDTQLYLPQGDIDLVVLSN----------VMNDMPRE 51

Query: 97  EEKNEHA------EFRVKEVQYIQAEVKIIK--CLVDNFVVDIAFNQLGGLCTLCFLDEV 148
              +E A      +  +      +A+V IIK  C    F VDI+ NQ  GL    F   V
Sbjct: 52  ITLSEMAACLRSYQLAIHVQVLARAKVPIIKFVCPYGQFNVDISINQANGLQASKF---V 108

Query: 149 DHLINENHLFKRSIILIKAWCYYESRILGG-HHGLISSYALVTLVLYIFHVFNGSFAGPL 207
           +  + +    +  +++IK   + + R L   + G + SY++  +VL    +      G +
Sbjct: 109 NGWLKKQPAIRPLVMVIKQ--FLQQRALSEVYTGGLGSYSVTLMVLSFLQLHPKLQRGEM 166

Query: 208 E-------VLYRFLEFFSK-FDWDNFCLSLWG 231
                   +L  FLE + K + +D   +S+ G
Sbjct: 167 SADKNLGTLLMEFLELYGKNYGYDECAISVRG 198


>gi|151945519|gb|EDN63760.1| DNA polymerase sigma [Saccharomyces cerevisiae YJM789]
          Length = 584

 Score = 47.0 bits (110), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 55/244 (22%), Positives = 95/244 (38%), Gaps = 38/244 (15%)

Query: 11  WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
           WL  E    + +A I P     E RN   + +R  + Q +P   +  FGS     YLP  
Sbjct: 178 WLTFE--IKDFVAYISPSREEIEIRNKTISTIREAVKQLWPDADLHVFGSYSTDLYLPGS 235

Query: 70  DID------LGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL 123
           DID      LG       L    +HL +  L  E +   A+ RV  +++++    I    
Sbjct: 236 DIDCVVTSKLGGKESRNNLYSLASHLKKKNLATEVEVV-AKARVPIIKFVEPHSGI---- 290

Query: 124 VDNFVVDIAFNQLGGLCTLC----FLDEVDHLINENHLFKRSIILIKAWCYYESRILGGH 179
                +D++F +  G+        +LD+   L        R ++LI     +  R+   H
Sbjct: 291 ----HIDVSFERTNGIEAAKLIREWLDDTPGL--------RELVLIVKQFLHARRLNNVH 338

Query: 180 HGLISSYALVTLVLYIFHVFNGSFAGPLE-------VLYRFLEFFSK-FDWDNFCLSLWG 231
            G +  ++++ LV    H+        ++       +L  F E + K F +D+  L    
Sbjct: 339 TGGLGGFSIICLVFSFLHMHPRIITNEIDPKDNLGVLLIEFFELYGKNFGYDDVALGSSD 398

Query: 232 PVPI 235
             P+
Sbjct: 399 GYPV 402


>gi|395335008|gb|EJF67384.1| hypothetical protein DICSQDRAFT_77074 [Dichomitus squalens LYAD-421
           SS1]
          Length = 592

 Score = 46.6 bits (109), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 60/240 (25%), Positives = 97/240 (40%), Gaps = 34/240 (14%)

Query: 13  KAEEITAELIA---RIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPD 68
           K + +  E++A    I P P     R  V A V  L+ + FP   V TFGSV    YLPD
Sbjct: 101 KEQRLHDEIVAFFQYISPTPEEAHARAMVIAKVSSLVTRRFPQGAVDTFGSVAQNLYLPD 160

Query: 69  RDIDL-----GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL 123
            D D+       + D +T K T   L   M     +N      V+ +   +  V   + +
Sbjct: 161 GDTDMVVTMPPQYDDPETKKRTLFQLAALM-----RNNRVTPHVQVIHRARVPVISFQTV 215

Query: 124 VD--NFVVDIAFNQLGGLCTLCFLDEV-DHLINENHLFKRSIILIKAWCYYESRILGGHH 180
            D  +  +D++ N   GL  +  L    D +    HL    ++ +KA       +     
Sbjct: 216 PDLGSLKIDVSLNATDGLKAVPILRSYFDRMPALRHL----VLCLKALLSRHG-LNSASF 270

Query: 181 GLISSYALVTLVLYIFHVFNGS-----FAGPLE------VLYRFLEFFS-KFDWDNFCLS 228
           G +SSYAL+ L +    +            P+E      +L  FLE++  K+ ++   +S
Sbjct: 271 GGLSSYALICLAISFLQLNPMGRPKELIDAPVENESLGVLLMDFLEYYGHKYKYETGVVS 330


>gi|68363844|ref|XP_697115.1| PREDICTED: PAP-associated domain-containing protein 5 [Danio rerio]
          Length = 653

 Score = 46.6 bits (109), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 40/119 (33%), Positives = 60/119 (50%), Gaps = 12/119 (10%)

Query: 25  IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
           I P P  E+ R+ V A ++R+I   +P  +V  FGS     YLP  DIDL  F + +TL 
Sbjct: 179 ISPRPEEEQMRHEVVARIQRVIKDLWPNAEVCVFGSFSTGLYLPTSDIDLVVFGNWETLP 238

Query: 84  DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFV---VDIAFNQLGGL 139
             W   + + L   +  +    +V +    +A V IIK L+D+     VDI+FN   G+
Sbjct: 239 -LWT--LEEALRKRKVADENSIKVLD----KATVPIIK-LMDSHTEVKVDISFNVQSGV 289


>gi|190407236|gb|EDV10503.1| DNA polymerase sigma [Saccharomyces cerevisiae RM11-1a]
 gi|259149371|emb|CAY86175.1| Pap2p [Saccharomyces cerevisiae EC1118]
          Length = 584

 Score = 46.6 bits (109), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 55/244 (22%), Positives = 95/244 (38%), Gaps = 38/244 (15%)

Query: 11  WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
           WL  E    + +A I P     E RN   + +R  + Q +P   +  FGS     YLP  
Sbjct: 178 WLTFE--IKDFVAYISPSREEIEIRNKTISTIREAVKQLWPDADLHVFGSYSTDLYLPGS 235

Query: 70  DID------LGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL 123
           DID      LG       L    +HL +  L  E +   A+ RV  +++++    I    
Sbjct: 236 DIDCVVTSKLGGKESRNNLYSLASHLKKKNLATEVEVV-AKARVPIIKFVEPHSGI---- 290

Query: 124 VDNFVVDIAFNQLGGLCTLC----FLDEVDHLINENHLFKRSIILIKAWCYYESRILGGH 179
                +D++F +  G+        +LD+   L        R ++LI     +  R+   H
Sbjct: 291 ----HIDVSFERTNGIEAAKLIREWLDDTPGL--------RELVLIVKQFLHARRLNNVH 338

Query: 180 HGLISSYALVTLVLYIFHVFNGSFAGPLE-------VLYRFLEFFSK-FDWDNFCLSLWG 231
            G +  ++++ LV    H+        ++       +L  F E + K F +D+  L    
Sbjct: 339 TGGLGGFSIICLVFSFLHMHPRIITNEIDPKDNLGVLLIEFFELYGKNFGYDDVALGSSD 398

Query: 232 PVPI 235
             P+
Sbjct: 399 GYPV 402


>gi|6324457|ref|NP_014526.1| non-canonical poly(A) polymerase PAP2 [Saccharomyces cerevisiae
           S288c]
 gi|1717744|sp|P53632.1|PAP2_YEAST RecName: Full=Poly(A) RNA polymerase protein 2; AltName: Full=DNA
           polymerase kappa; AltName: Full=DNA polymerase sigma;
           AltName: Full=Topoisomerase 1-related protein TRF4
 gi|663237|emb|CAA88145.1| ORF [Saccharomyces cerevisiae]
 gi|950226|gb|AAC49091.1| Trf4p [Saccharomyces cerevisiae]
 gi|1419987|emb|CAA99134.1| TRF4 [Saccharomyces cerevisiae]
 gi|51830518|gb|AAU09782.1| YOL115W [Saccharomyces cerevisiae]
 gi|285814775|tpg|DAA10668.1| TPA: non-canonical poly(A) polymerase PAP2 [Saccharomyces
           cerevisiae S288c]
 gi|392296670|gb|EIW07772.1| Pap2p [Saccharomyces cerevisiae CEN.PK113-7D]
          Length = 584

 Score = 46.6 bits (109), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 57/247 (23%), Positives = 94/247 (38%), Gaps = 44/247 (17%)

Query: 11  WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
           WL  E    + +A I P     E RN   + +R  + Q +P   +  FGS     YLP  
Sbjct: 178 WLTFE--IKDFVAYISPSREEIEIRNQTISTIREAVKQLWPDADLHVFGSYSTDLYLPGS 235

Query: 70  DID------LGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKC 122
           DID      LG       L    +HL +  L              EV+ + +A V IIK 
Sbjct: 236 DIDCVVTSELGGKESRNNLYSLASHLKKKNL------------ATEVEVVAKARVPIIKF 283

Query: 123 LV--DNFVVDIAFNQLGGLCTLC----FLDEVDHLINENHLFKRSIILIKAWCYYESRIL 176
           +       +D++F +  G+        +LD+   L        R ++LI     +  R+ 
Sbjct: 284 VEPHSGIHIDVSFERTNGIEAAKLIREWLDDTPGL--------RELVLIVKQFLHARRLN 335

Query: 177 GGHHGLISSYALVTLVLYIFHVFNGSFAGPLE-------VLYRFLEFFSK-FDWDNFCLS 228
             H G +  ++++ LV    H+        ++       +L  F E + K F +D+  L 
Sbjct: 336 NVHTGGLGGFSIICLVFSFLHMHPRIITNEIDPKDNLGVLLIEFFELYGKNFGYDDVALG 395

Query: 229 LWGPVPI 235
                P+
Sbjct: 396 SSDGYPV 402


>gi|349581056|dbj|GAA26214.1| K7_Pap2p [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 584

 Score = 46.6 bits (109), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 57/247 (23%), Positives = 94/247 (38%), Gaps = 44/247 (17%)

Query: 11  WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
           WL  E    + +A I P     E RN   + +R  + Q +P   +  FGS     YLP  
Sbjct: 178 WLTFE--IKDFVAYISPSREEIEIRNQTISTIREAVKQLWPDADLHVFGSYSTDLYLPGS 235

Query: 70  DID------LGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKC 122
           DID      LG       L    +HL +  L              EV+ + +A V IIK 
Sbjct: 236 DIDCVVTSELGGKESRNNLYSLASHLKKKNL------------ATEVEVVAKARVPIIKF 283

Query: 123 LV--DNFVVDIAFNQLGGLCTLC----FLDEVDHLINENHLFKRSIILIKAWCYYESRIL 176
           +       +D++F +  G+        +LD+   L        R ++LI     +  R+ 
Sbjct: 284 VEPHSGIHIDVSFERTNGIEAAKLIREWLDDTPGL--------RELVLIVKQFLHARRLN 335

Query: 177 GGHHGLISSYALVTLVLYIFHVFNGSFAGPLE-------VLYRFLEFFSK-FDWDNFCLS 228
             H G +  ++++ LV    H+        ++       +L  F E + K F +D+  L 
Sbjct: 336 NVHTGGLGGFSIICLVFSFLHMHPRIITNEIDPKDNLGVLLIEFFELYGKNFGYDDVALG 395

Query: 229 LWGPVPI 235
                P+
Sbjct: 396 SSDGYPV 402


>gi|366992111|ref|XP_003675821.1| hypothetical protein NCAS_0C04670 [Naumovozyma castellii CBS 4309]
 gi|342301686|emb|CCC69457.1| hypothetical protein NCAS_0C04670 [Naumovozyma castellii CBS 4309]
          Length = 586

 Score = 46.2 bits (108), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 57/246 (23%), Positives = 103/246 (41%), Gaps = 35/246 (14%)

Query: 11  WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
           WL  E   A+ ++ I P     E RN   + VR  + Q +P   +  FGS     YLP  
Sbjct: 176 WLTLE--IADFVSYISPSREEIESRNQTISKVRNAVKQLWPDADLHVFGSYATDLYLPGS 233

Query: 70  DID--LGAFSDDQTLKDTWAHLVRDMLENEEKNE---HAEFRVKEVQYIQAEVKIIKCLV 124
           DID  + + + D+  +++   L   + +     +    A+ RV  +++++ E        
Sbjct: 234 DIDCVINSKAGDKENRNSLYSLASFLKQQGLATQIEVIAKTRVPIIKFVEPE-------- 285

Query: 125 DNFVVDIAFNQLGGLCTLCFLDEVDHLINE---NHLFKRSIILIKAWCYYESRILGGHHG 181
            N  +D++F +  GL       E   LI E   +    R ++LI     +  R+   H G
Sbjct: 286 SNIHIDVSFERTNGL-------EAAKLIREWLQDTPGLRELVLIIKQFLHSRRLNNVHTG 338

Query: 182 LISSYALVTLVLYIFHVFNGSFAG---PLE----VLYRFLEFFSK-FDWDNFCLSLWGPV 233
            +  ++++ +V     +          P+E    +L  F E + K F +D+  +S+    
Sbjct: 339 GLGGFSIICIVFSFLQMHPRIITNEIDPMENLGVLLIEFFELYGKNFGYDDVAISVTDGY 398

Query: 234 PISLLP 239
           P S LP
Sbjct: 399 P-SYLP 403


>gi|50286703|ref|XP_445781.1| hypothetical protein [Candida glabrata CBS 138]
 gi|49525087|emb|CAG58700.1| unnamed protein product [Candida glabrata]
          Length = 485

 Score = 45.8 bits (107), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 55/241 (22%), Positives = 100/241 (41%), Gaps = 28/241 (11%)

Query: 11  WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
           WL  E +  + +A I P     E RN     +R  + + +P   +  FGS     YLP  
Sbjct: 102 WLNYEIL--DFVAYISPSKEEIETRNRTIGSIRSAVKELWPDADLHVFGSYATDLYLPGS 159

Query: 70  DID--LGAFSDDQTLKDTWAHLVRDMLENEEKNE---HAEFRVKEVQYIQAEVKIIKCLV 124
           DID  + +   D+  ++    L   + + E   E    A+ RV  +++++ E +      
Sbjct: 160 DIDCVVNSKQGDKQSRNNLYKLANFLKKKEIATEIEVVAKARVPIIKFVEVESRT----- 214

Query: 125 DNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLIS 184
               +DI+F +L GL     +   D L +   L  R ++L+     +  R+   H G + 
Sbjct: 215 ---HMDISFERLNGLEAAKLI--RDWLASTPGL--RELVLVVKQFLHSRRLNNVHSGGLG 267

Query: 185 SYALVTLVLYIFHVFNGSFAG---PLE----VLYRFLEFFSK-FDWDNFCLSLWGPVPIS 236
            ++++ LV     +          PLE    +L  F E + K F +D+  + +    PI 
Sbjct: 268 GFSIICLVYSFLRMHPRIITAEIDPLENLGVLLIEFFELYGKNFGYDDVAIGVQDGSPIY 327

Query: 237 L 237
           +
Sbjct: 328 M 328


>gi|146184040|ref|XP_001027646.2| Chitinase class I family protein [Tetrahymena thermophila]
 gi|146143378|gb|EAS07404.2| Chitinase class I family protein [Tetrahymena thermophila SB210]
          Length = 463

 Score = 45.8 bits (107), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 53/222 (23%), Positives = 94/222 (42%), Gaps = 30/222 (13%)

Query: 20  ELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSD 78
           EL   + P     E R      ++++++   P C+V TFGS   + YLP+ DID+    D
Sbjct: 166 ELTDYLSPTKQEHEIRLKSMERLKKILLDAVPGCEVKTFGSFSTELYLPNSDIDMVIVKD 225

Query: 79  DQTLKDTWAHLVRDMLENEEKNEHAEF----RVKEVQYIQAEVKIIKCLVDNFVVDIAFN 134
           D   K  +  +   ++  ++  E+       +V  +++++ E +I      NF  DI+FN
Sbjct: 226 DIQNKSLYKKVADKIMNCDDIYENINLVTNAKVPIIKFVEKETQI------NF--DISFN 277

Query: 135 QLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRIL-GGHHGLISSYALVTLVL 193
           +  G+  L  + +   L  E    K  I+++K  C    R L   + G I S+ L  ++L
Sbjct: 278 KEDGVKQLSEVKKGLELYPE---MKYLIMVMK--CILRQRDLHETYSGGIGSFLLFCMIL 332

Query: 194 YIFHVFNGSFAGPL-----------EVLYRFLEFFSKFDWDN 224
                    +               E L +  +F+  FD DN
Sbjct: 333 AFLRDLRRQYEKENRVQEIQNITLGEYLLKMFKFYGFFDVDN 374


>gi|328860813|gb|EGG09918.1| hypothetical protein MELLADRAFT_115680 [Melampsora larici-populina
           98AG31]
          Length = 987

 Score = 45.4 bits (106), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 44/139 (31%), Positives = 69/139 (49%), Gaps = 12/139 (8%)

Query: 17  ITAEL---IARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDID 72
           +TAE+   +A I+P    +E R  +   +R+ +   +P   V  FGS   K YLP  DID
Sbjct: 234 LTAEIGSFVAYIRPTREEDELRLMIIEMIRKAVTMQWPDADVVPFGSFGTKLYLPGGDID 293

Query: 73  LGAFSDDQTLKDTWAHLVRDMLE-NEEKNEHAEFRVKEVQYIQAEVKII--KCLVDNFVV 129
           L   S  + +KD  + ++  +     E+N   +     V   +A+V II  K +  NF V
Sbjct: 294 LVILS-TRMMKDAKSKILYRLAPLLREQNIGQDV----VVIAKAKVPIIKFKTIFGNFQV 348

Query: 130 DIAFNQLGGLCTLCFLDEV 148
           DI+ NQ  GL  L  ++E+
Sbjct: 349 DISINQSNGLVALEKVNEL 367


>gi|242212981|ref|XP_002472321.1| predicted protein [Postia placenta Mad-698-R]
 gi|220728598|gb|EED82489.1| predicted protein [Postia placenta Mad-698-R]
          Length = 1512

 Score = 45.4 bits (106), Expect = 0.038,   Method: Composition-based stats.
 Identities = 70/287 (24%), Positives = 114/287 (39%), Gaps = 59/287 (20%)

Query: 21  LIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDD 79
            +  I P P  +E R+ V   + R + + FP  QV  FGS   K YLP  +         
Sbjct: 167 FVKYISPTPEEDEVRSLVVTLISRAVTRAFPDAQVLPFGSYETKLYLPIGN--------- 217

Query: 80  QTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVD--NFVVDIAFNQLG 137
              K++  H     L N  K      RVK +   +A+V I+K +    +F VDI+ NQ  
Sbjct: 218 ---KESVLH----ALANTVKRAGITDRVKIIA--KAKVPIVKFVTTHGHFSVDISVNQGN 268

Query: 138 GLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFH 197
           G+        + H + E    +  I++IK++    S +   + G + SY++V L +    
Sbjct: 269 GVTA---GKMIKHYLAELPALRSLILVIKSFLSQRS-MNEVYTGGLGSYSIVCLAISFLQ 324

Query: 198 ----VFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVL 253
               +  G       +    +EFF  +     C   +  V ISLL          DGG  
Sbjct: 325 MHPKIRRGEIDPSRNLGVLVMEFFELYG----CYFNYHEVGISLL----------DGG-- 368

Query: 254 LLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGR 300
               ++ +     + D+       GQP   K  ++ DP    N++ R
Sbjct: 369 ----TYFNKAERGWLDY-------GQP---KLLSIEDPGDPTNDISR 401


>gi|313241181|emb|CBY33472.1| unnamed protein product [Oikopleura dioica]
          Length = 422

 Score = 45.4 bits (106), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 54/193 (27%), Positives = 85/193 (44%), Gaps = 37/193 (19%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
           EEI  + I  +QP    +  R+ V   +R+++ + +P  ++ TFGS     YLPD DID+
Sbjct: 90  EEI-EDFIKFMQPTESEQAMRDDVVWRIRQVVKELWPSAKLETFGSYNTGLYLPDGDIDM 148

Query: 74  GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI----QAEVKIIKCLVDNFV- 128
                   ++  W  L    L    +N+  E R+   + I    +A V IIK +  N + 
Sbjct: 149 -------VIQGQWEQLPMWQL----RNKLVERRIAREENITVIEKAVVPIIKLIESNTLV 197

Query: 129 -VDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGL----- 182
            VDI+FN   G         V   + E    K+ ++L+K         +  H GL     
Sbjct: 198 HVDISFNTSNGREAAAL---VKKYMAEYPNLKQLVVLLK--------YILNHRGLNEVWK 246

Query: 183 --ISSYALVTLVL 193
             + SYAL  LV+
Sbjct: 247 GGLGSYALTLLVV 259


>gi|145475559|ref|XP_001423802.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124390863|emb|CAK56404.1| unnamed protein product [Paramecium tetraurelia]
          Length = 354

 Score = 45.1 bits (105), Expect = 0.050,   Method: Compositional matrix adjust.
 Identities = 61/245 (24%), Positives = 109/245 (44%), Gaps = 38/245 (15%)

Query: 29  PFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAH 88
           P  EE R    A++R   ++ F   V  F  +     LP+ DID+     + + K+ +  
Sbjct: 77  PTIEEHRKREQAFMR---VETFIKGV-CFRILRQNFNLPNADIDVVMIDKNMSAKELYKK 132

Query: 89  LVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKC--LVDNFVVDIAFNQLGGLCTLCFLD 146
           + ++++++ +K E+     K      A+V IIK   +  ++  DI+FNQ+ G+     +D
Sbjct: 133 VAQNLMKS-DKFENVNLIAK------AKVPIIKFFEIESSYQFDISFNQMDGIRQ---ID 182

Query: 147 EVDHLINENHLFKRSIILIKAWCYYESRILG-GHHGLISSYALVTLVL-YIFHVFNGSFA 204
           E+         FK  I+++K  C  + R L   + G I S+ L  ++L ++  V   +FA
Sbjct: 183 EIQKAFTIYPEFKYLIMILK--CILKQRDLNETYSGGIGSFLLFQMILAFLREVRKEAFA 240

Query: 205 GPL----------EVLYRFLEFF-SKFDWDNFCLSLWG-------PVPISLLPDVTAEPP 246
                        E + RFLEF+ SKFD+    + +         P P      ++ + P
Sbjct: 241 NKKQEQLKNITLGEYILRFLEFYGSKFDYQKKRILMVNGGSIVNKPTPDDKFSLISPQDP 300

Query: 247 RKDGG 251
             D G
Sbjct: 301 DHDIG 305


>gi|50294195|ref|XP_449509.1| hypothetical protein [Candida glabrata CBS 138]
 gi|49528823|emb|CAG62485.1| unnamed protein product [Candida glabrata]
          Length = 626

 Score = 44.7 bits (104), Expect = 0.067,   Method: Compositional matrix adjust.
 Identities = 55/230 (23%), Positives = 99/230 (43%), Gaps = 26/230 (11%)

Query: 11  WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
           WL AE    + +A I P     E RN   A +RR + + +    +  FGS     YLP  
Sbjct: 186 WLTAE--IRDFVAYISPSREEIETRNKTIAKIRRSVKRLWTDADLQVFGSYATDMYLPGS 243

Query: 70  DID--LGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL--VD 125
           DID  + + S D+  +     L R +     KN+    RV+ +   ++ V IIK +    
Sbjct: 244 DIDCVVNSKSGDKENRQYLYELARHL-----KNDGLATRVEVI--AKSRVPIIKFVEPES 296

Query: 126 NFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISS 185
           +  +D++F +  GL     + E    I +    +   +++K + +   R+   H G +  
Sbjct: 297 DIHIDVSFERSNGLEAAKLIRE---WIGDTPGLRELTLVVKQFLHAR-RLNDVHTGGLGG 352

Query: 186 YALVTLVLYIFHVFNGSFAG---PLE----VLYRFLEFFSK-FDWDNFCL 227
           ++++ LV     +      G   PL+    +L  F E + K F +D+  +
Sbjct: 353 FSIICLVFSFLRLHPRIITGDIDPLDNLGVLLIEFFELYGKNFAYDDVAI 402


>gi|440296452|gb|ELP89279.1| PAP-associated domain containing protein, putative [Entamoeba
           invadens IP1]
          Length = 344

 Score = 44.7 bits (104), Expect = 0.067,   Method: Compositional matrix adjust.
 Identities = 54/218 (24%), Positives = 92/218 (42%), Gaps = 12/218 (5%)

Query: 3   IRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVP 61
           +R + P   L   E        I   P  +E R      V +L+   +P C+V  +GS  
Sbjct: 3   LRSVCPTDKLTLTEEIKLFTRYISLTPNEQELRQISYQKVSQLLTNRYPGCEVTIYGSYV 62

Query: 62  LKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIK 121
               LP  DIDL     ++  K+    L+  +      ++    RV++V    A+V IIK
Sbjct: 63  SGFSLPSSDIDLVLSFSEEVSKNQVKKLLFKISTICRSSKF--LRVEDV-ITNAKVPIIK 119

Query: 122 CL-VDNFV-VDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGH 179
            L +D  + +D++ N  GG+ +      + H +  +  F + I L   +  +++ +   +
Sbjct: 120 LLDLDTTISIDLSINCEGGIDS----SALTHSLLTSSQFTQEIALFVKYLVFQNNLNEPY 175

Query: 180 HGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFF 217
           HG I SYA+V L       +     G    L  FL F+
Sbjct: 176 HGGIGSYAIVLLTATFLKFYPQHSLG--RALVEFLNFY 211


>gi|330805693|ref|XP_003290813.1| hypothetical protein DICPUDRAFT_81531 [Dictyostelium purpureum]
 gi|325079023|gb|EGC32644.1| hypothetical protein DICPUDRAFT_81531 [Dictyostelium purpureum]
          Length = 892

 Score = 44.3 bits (103), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 53/224 (23%), Positives = 100/224 (44%), Gaps = 25/224 (11%)

Query: 5   PLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPC-QVFTFGSVPLK 63
           P     ++   E+ AE    ++ +  S +R+N     +   +   FP  +++ +GS   +
Sbjct: 619 PESKSEFINYLELKAE---TLKENSNSLQRKNNSFNTLENFLKNEFPTGKLYKYGSFVTR 675

Query: 64  TYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIK-C 122
              PD DID+      Q       ++V   L+N+ + ++ E R        A+V II+ C
Sbjct: 676 LSSPDSDIDVTLIDSSQPY-----NMVLQKLKNKPRYDNFETRP------DAKVPIIRFC 724

Query: 123 LVDNFV-VDIAFNQLG--GLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGH 179
              N V  D++FN +G     +  F+ E    + +    K  I+L+K +   ++ I    
Sbjct: 725 DKINLVKFDLSFN-IGEPNQNSNFFISE----LKDKKYLKELILLVKHYT-EKANIKDAS 778

Query: 180 HGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWD 223
            G  SS+AL  + +Y +     S     ++L+ F  F+ KFD++
Sbjct: 779 QGYFSSHALTIMAIYFYKTLVRSNLNIHKLLHSFFLFYIKFDYN 822


>gi|452845518|gb|EME47451.1| hypothetical protein DOTSEDRAFT_69399 [Dothistroma septosporum
           NZE10]
          Length = 610

 Score = 44.3 bits (103), Expect = 0.082,   Method: Compositional matrix adjust.
 Identities = 49/215 (22%), Positives = 81/215 (37%), Gaps = 32/215 (14%)

Query: 46  IIQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEF 105
           I+Q    ++FT+GS  LK + P  DID    +     ++ +   + DM+      E    
Sbjct: 77  ILQQAGGKIFTYGSYRLKVFGPGSDIDALMIAPRHVTREDFFKYMPDMIRQSTPTEQL-- 134

Query: 106 RVKEVQYIQAEVKIIKCLVDNFVVDIAFN-----------QLGGLCTLCFLDEVD----- 149
             + V    A V IIK  +D   VD+ F+           QL     L  L E D     
Sbjct: 135 -TELVPVEAANVPIIKTEIDGVAVDLIFSTLHMASVPKDLQLKDSNLLRGLSETDLRCVN 193

Query: 150 ---------HLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFN 200
                     L+ E   F+ ++  IK W      + G  +G         +V+ +  ++ 
Sbjct: 194 GTRVTDRLLQLVPETKTFRLALRAIKLWASRRG-VYGNVYGFPGGVGYAMMVVRMCQLYP 252

Query: 201 GSFAGPLEVLYRFLEFFSKFDW-DNFCLSLWGPVP 234
            + A P+ ++ +F     K+ W D   L    P P
Sbjct: 253 RA-AAPV-IVNKFFMVMGKWRWPDPVTLCKREPAP 285


>gi|384485719|gb|EIE77899.1| hypothetical protein RO3G_02603 [Rhizopus delemar RA 99-880]
          Length = 494

 Score = 43.9 bits (102), Expect = 0.099,   Method: Compositional matrix adjust.
 Identities = 56/211 (26%), Positives = 92/211 (43%), Gaps = 44/211 (20%)

Query: 43  RRLIIQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEH 102
           +R  I+CF   +  FGS  L  Y+ D DIDL      Q L+  +      +L+ +     
Sbjct: 57  KRGDIECF---LSPFGSYALGGYIRDADIDLVLVCPIQVLRKYFFKFFPQLLKQQT---- 109

Query: 103 AEFRVKEVQYIQ-AEVKIIKCLVDNFVVDIAFNQLG---GLCTLCFLD-----EVDHL-- 151
               V  V+ IQ A V IIKC +DN  +DI+F +L        + FLD     ++D    
Sbjct: 110 ---LVSNVESIQKANVPIIKCTIDNISIDISFVRLKVERVAQNINFLDDSLLKDIDETCL 166

Query: 152 -------INE---NHLFKRSIIL-------IKAWCYYE---SRILGGHHGLISSYALVTL 191
                  +N+   N ++++ + L       IK W       S+ +G  +G  SS+ L+ +
Sbjct: 167 ASMDGPRVNQFCKNQIYRQHVRLFQVCLQCIKHWATQRGIYSKPIGYLNG--SSWTLLLV 224

Query: 192 VLYIFHVFNGSFAGPLEVLYRFLEFFSKFDW 222
             Y+  + N        +L RF   +S++ W
Sbjct: 225 KAYM-SIKNKELLSVTMILSRFFSMWSQWPW 254


>gi|392595411|gb|EIW84734.1| hypothetical protein CONPUDRAFT_47123 [Coniophora puteana
           RWD-64-598 SS2]
          Length = 663

 Score = 43.9 bits (102), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 47/218 (21%), Positives = 91/218 (41%), Gaps = 19/218 (8%)

Query: 25  IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTL- 82
           + P    +E R  V   V + +   FP  +V  FGS   K YLP  DIDL   SD     
Sbjct: 162 MSPTSIEDEIRGLVVKLVGKAVTSAFPDAKVLPFGSYGTKLYLPSGDIDLVIESDSMQYV 221

Query: 83  -KDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLV--DNFVVDIAFNQLGGL 139
            K++  H + ++L      + A    K     +A+V I+K +       VDI+ NQ  GL
Sbjct: 222 PKNSVLHSLANVL------KRAGIADKVTIIAKAKVPIVKFITRHGRLNVDISINQSNGL 275

Query: 140 CTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVF 199
                ++     +       R+++++      +  +   + G + SY++V + +    + 
Sbjct: 276 VAGQIVNGFLADMRGCGRALRALVMVAKAFLGQRGMNEVYTGGLGSYSIVCMAISFLQMH 335

Query: 200 NGSFAGPLE-------VLYRFLEFFSK-FDWDNFCLSL 229
                G ++       ++  F E + + F+++   +S+
Sbjct: 336 PKIRRGEIDAERNLGVLVMEFFELYGRYFNYEQVGISI 373


>gi|348500306|ref|XP_003437714.1| PREDICTED: PAP-associated domain-containing protein 5-like
           [Oreochromis niloticus]
          Length = 672

 Score = 43.9 bits (102), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 42/129 (32%), Positives = 58/129 (44%), Gaps = 16/129 (12%)

Query: 25  IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
           I P P  E+ R  V   ++ +I   +P  +V  FGS     YLP  DIDL  F       
Sbjct: 191 ISPRPEEEKMRLEVVDRIKEVIHDLWPSAEVEVFGSFSTGLYLPTSDIDLVVFG------ 244

Query: 84  DTWAHLVRDMLEN--EEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFV---VDIAFNQLGG 138
             W  L    LE    +KN   E  +K +   +A V IIK L D++    VDI+FN + G
Sbjct: 245 -KWESLPLWTLEEALRKKNVADENSIKVLD--KATVPIIK-LTDSYTEVKVDISFNVMSG 300

Query: 139 LCTLCFLDE 147
           +     + E
Sbjct: 301 VKAARLIKE 309


>gi|302148910|pdb|3NYB|A Chain A, Structure And Function Of The Polymerase Core Of Tramp, A
           Rna Surveillance Complex
          Length = 323

 Score = 43.9 bits (102), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 55/243 (22%), Positives = 94/243 (38%), Gaps = 36/243 (14%)

Query: 11  WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
           WL  E    + +A I P     E RN   + +R  + Q +P   +  FGS     YLP  
Sbjct: 20  WLTFE--IKDFVAYISPSREEIEIRNQTISTIREAVKQLWPDADLHVFGSYSTDLYLPGS 77

Query: 70  DID------LGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL 123
           DID      LG       L    +HL +  L  E +   A+ RV  +++++    I    
Sbjct: 78  DIDCVVTSELGGKESRNNLYSLASHLKKKNLATEVEVV-AKARVPIIKFVEPHSGI---- 132

Query: 124 VDNFVVDIAFNQLGGLCTLCFLDEVDHLINE---NHLFKRSIILIKAWCYYESRILGGHH 180
                + ++F +  G+       E   LI E   +    R ++LI     +  R+   H 
Sbjct: 133 ----HIAVSFERTNGI-------EAAKLIREWLDDTPGLRELVLIVKQFLHARRLNNVHT 181

Query: 181 GLISSYALVTLVLYIFHVFNGSFAGPLE-------VLYRFLEFFSK-FDWDNFCLSLWGP 232
           G +  ++++ LV    H+        ++       +L  F E + K F +D+  L     
Sbjct: 182 GGLGGFSIICLVFSFLHMHPRIITNEIDPKDNLGVLLIEFFELYGKNFGYDDVALGSSDG 241

Query: 233 VPI 235
            P+
Sbjct: 242 YPV 244


>gi|299752783|ref|XP_002911796.1| Trf5 [Coprinopsis cinerea okayama7#130]
 gi|298409998|gb|EFI28302.1| Trf5 [Coprinopsis cinerea okayama7#130]
          Length = 816

 Score = 43.5 bits (101), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 49/183 (26%), Positives = 76/183 (41%), Gaps = 22/183 (12%)

Query: 25  IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
           I P P  +E R  +   +   +   FP   V  FGS   K YLP  DIDL   S+     
Sbjct: 289 ISPTPVEDEIRGLIVKQIAVTVQSKFPDASVLPFGSYETKLYLPMGDIDLVILSESMAYS 348

Query: 84  DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDN--FVVDIAFNQLGGLCT 141
           +  +  V   L N  K      RV  +   +A V I+K +  +  F VDI+ NQ  GL +
Sbjct: 349 NKVS--VLHTLANTLKRAGITSRVTVI--AKARVPIVKFVTTHGRFNVDISINQENGLVS 404

Query: 142 LCFLDE-VDHLIN--------------ENHLFKRSIILIKAWCYYESRILGGHHGLISSY 186
              ++  + HL N              +  L  RS++LI      +  +   + G + SY
Sbjct: 405 GNIINGFLRHLHNPTSNTPEFDANGNPKTSLALRSLVLITKAFLAQRSMNEVYTGGLGSY 464

Query: 187 ALV 189
           +++
Sbjct: 465 SIM 467


>gi|196004468|ref|XP_002112101.1| hypothetical protein TRIADDRAFT_23436 [Trichoplax adhaerens]
 gi|190586000|gb|EDV26068.1| hypothetical protein TRIADDRAFT_23436 [Trichoplax adhaerens]
          Length = 289

 Score = 43.5 bits (101), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 55/214 (25%), Positives = 92/214 (42%), Gaps = 20/214 (9%)

Query: 25  IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
           I P P  +  R  V   V+ +I+  +P  QV  FGS     YLP  DIDL  F  D   K
Sbjct: 21  ISPRPEEKNMRETVVEGVKEVILTLWPHVQVEVFGSFRTGLYLPTSDIDLVIFGIDG--K 78

Query: 84  DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVD--NFVVDIAFNQLGGLCT 141
             +  L + ++++E  +      +K +    A V IIK      N+ +DI FN    + +
Sbjct: 79  GAFEDLEKALMQHEVCDRD---NIKCIH--NAMVPIIKLTEKTCNYKMDIEFNIENSVKS 133

Query: 142 LCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNG 201
               D +   I +    K  ++++K +  ++  +     G +SSY LV +V+    +   
Sbjct: 134 ---ADIIQTYIRKYEPLKYLVLVLKQFL-FQRELNEVFSGGVSSYTLVMMVVNFLQLHPR 189

Query: 202 SFAGPLEVLYRFL--EFFS----KFDWDNFCLSL 229
            +    E  Y  L  EFF      F++   C+ +
Sbjct: 190 RYTDHPEANYGVLLIEFFELYGRHFNYHTTCIRV 223


>gi|156837261|ref|XP_001642660.1| hypothetical protein Kpol_1076p8 [Vanderwaltozyma polyspora DSM
           70294]
 gi|156113216|gb|EDO14802.1| hypothetical protein Kpol_1076p8 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 524

 Score = 43.1 bits (100), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 59/238 (24%), Positives = 98/238 (41%), Gaps = 38/238 (15%)

Query: 11  WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
           WL  E    + ++ I P+    E RN     +R  +   +P   +  FGS     YLP  
Sbjct: 114 WLTLE--IRDFVSYISPNRKEIELRNQTIGKLRDAVQHHWPDANLHVFGSYATDLYLPGS 171

Query: 70  DIDL---GAFSDDQT---LKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL 123
           DID        D Q+   L    +HL ++ L  E+    A+ RV  +++++   KI    
Sbjct: 172 DIDCVVNSKAGDKQSRNCLYSLASHLKKEGLA-EDIEIIAKARVPIIKFVEPLSKI---- 226

Query: 124 VDNFVVDIAFNQLGGLCTL----CFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGH 179
                VD++F +  GL        +LD  + L        R ++LI        R+   H
Sbjct: 227 ----HVDVSFERTNGLEAAKLIRGWLDSTNGL--------RELVLIVKQFLQARRLNKVH 274

Query: 180 HGLISSYALVTLVLYIFHVFNGSFA---GPLE----VLYRFLEFFSK-FDWDNFCLSL 229
            G +  ++++ LV    H+     A    P+E    +L  F E + K F +D+  LS+
Sbjct: 275 TGGLGGFSIICLVYSFLHLHPRILANEINPIENLGVLLIDFFELYGKNFGYDHVALSV 332


>gi|313232447|emb|CBY24115.1| unnamed protein product [Oikopleura dioica]
          Length = 887

 Score = 43.1 bits (100), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 77/335 (22%), Positives = 130/335 (38%), Gaps = 51/335 (15%)

Query: 2   VIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSV 60
           +  PL  G     EEI  +    I+  P     R+ V   V   I Q FP  QV  FGS 
Sbjct: 133 ITSPLSKGMEGLHEEII-DFHNWIRSTPEEYTMRHDVVLRVEEAIKQEFPGAQVEVFGSF 191

Query: 61  PLKTYLPDRDIDLGAFSDD-----QTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQA 115
               YLP  DID+    +         ++   + ++D L  +   E    +V +     A
Sbjct: 192 QTGLYLPTSDIDMVVLGEKIEPRYGNPQNGPHYRLQDRLLKQGIAERYSIKVID----SA 247

Query: 116 EVKIIKC--LVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYES 173
            V IIK   ++ +  VDI+FN   G+  +     V   I +    +  ++++K +   + 
Sbjct: 248 AVPIIKMRDMITDIKVDISFNMKTGVTAIGL---VKGYIRQFPALRYLVLVLKQFL-LQR 303

Query: 174 RILGGHHGLISSYALVTLVLYIFHVFNGSFAGP---LEVLY-RFLEFFS-KFDWDNFCLS 228
            +     G ISSY L+ +V+           G    L VL  +FL F+  +F++   C+ 
Sbjct: 304 DMNEVWTGGISSYGLILMVVSFLQHQGADNTGDDVNLGVLLIKFLRFYGMEFEYSKCCIR 363

Query: 229 LWGPVPISLLPDVTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNV 288
           +                  K+GG  +  +      + A           G  +V    ++
Sbjct: 364 V------------------KNGGQFIKKEEMATQMKEAPT---------GPKYVPNFLSI 396

Query: 289 IDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKGLAR 323
            DPL  +N++GR+        ++ AF F  + L R
Sbjct: 397 EDPLTPSNDVGRASHGAE--NVKDAFLFAYRVLDR 429


>gi|320164013|gb|EFW40912.1| PAP associated domain containing 5 [Capsaspora owczarzaki ATCC
           30864]
          Length = 558

 Score = 42.7 bits (99), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 49/185 (26%), Positives = 82/185 (44%), Gaps = 22/185 (11%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
           E+   + +  I+P P   + R  +   +R +I   +   +V  FGS     YLP  DID+
Sbjct: 213 EQEMYDFVEFIKPTPLEHQMREEIVQRIREVITGAWKHARVEVFGSFATGLYLPMSDIDI 272

Query: 74  GAFSD-DQTLKDTWAHLVRDMLENEEKNEHAEFRV-KEVQYI-QAEVKIIKC--LVDNFV 128
             F + DQ    T   L+             E R+ K V+ I +  V IIK    +    
Sbjct: 273 VVFGNWDQIPLFTLGKLLE------------ESRIAKNVKVIDKTSVPIIKLADALSGVF 320

Query: 129 VDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYAL 188
           VDI+FN   GL T+ F   +   ++E  +      +IK +   + ++   + G + SY++
Sbjct: 321 VDISFNLESGLRTVEF---IRACVDEYRMLYHLTFVIKQFL-AQRQLNEPYSGGLGSYSV 376

Query: 189 VTLVL 193
           V LV+
Sbjct: 377 VLLVV 381


>gi|449707156|gb|EMD46861.1| PAPassociated domain containing protein [Entamoeba histolytica
           KU27]
          Length = 400

 Score = 42.4 bits (98), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 53/224 (23%), Positives = 102/224 (45%), Gaps = 36/224 (16%)

Query: 10  RWLKAEEITAELIARIQ-----PDPFSEE---RRNAVAAYVRRLIIQCFPCQVFTFGSVP 61
           +WLK+ E   +L   +Q      +P   E   R   +  Y +  I++     V  FGS  
Sbjct: 5   QWLKSFEGELDLNQEVQLFIKFIEPNKNEYKIREELLTKYSK--ILEKEGYNVMAFGSTQ 62

Query: 62  LKTYLPDRDIDLGAFSDD----QTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEV 117
            K +LP  DID    +++    + L    + L   +LE++++N             +A +
Sbjct: 63  SKLFLPTSDIDFSVLTNEYNTRKVLNSVSSILSSYVLEDQKRN------------FKASI 110

Query: 118 KIIKCLVDN---FVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKA-WCYYES 173
            ++K L D     V+DI+ N   G  T+ F++EV   I ++   ++ ++LIK+  C Y+ 
Sbjct: 111 PVLK-LTDKKTLIVLDISHNNTSGTKTVNFIEEV---IKKDDRIRKLVLLIKSILCCYDF 166

Query: 174 RILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFF 217
                 +G + +Y++  +V    +  N +     E+L  FL+++
Sbjct: 167 H--QPANGGLGTYSVFVMVYCYINNNNITTHDYGELLKGFLKYY 208


>gi|241955483|ref|XP_002420462.1| poly(A) polymerase, putative; polynucleotide adenylyltransferase,
           putative [Candida dubliniensis CD36]
 gi|223643804|emb|CAX41541.1| poly(A) polymerase, putative [Candida dubliniensis CD36]
          Length = 558

 Score = 42.4 bits (98), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 41/189 (21%), Positives = 75/189 (39%), Gaps = 22/189 (11%)

Query: 53  QVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEH---------- 102
           +VFTFGS  L  Y P  DID          +D +  +  D++    + E           
Sbjct: 82  KVFTFGSYRLGVYGPGSDIDTLVVVPKHVTRDDFFSVFADIIRKRPELEEIACVPDAYVP 141

Query: 103 ---AEFRVKEVQYIQAEVKIIKCLVD------NFVVDIAFNQLGGLCTLCFLDEVDHLIN 153
               EF    +  I A++ I +  +D      N + ++    L  L      DE+  L+ 
Sbjct: 142 IIKIEFDGISIDLIMAKLNIPRVPLDLTLDDKNLLKNLDEKDLRSLNGTRVTDEILQLVP 201

Query: 154 ENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRF 213
           +  +FK ++  IK W    + + G   G     A   LV  I  ++  + +    ++ +F
Sbjct: 202 KPTVFKHALRCIKMWAQQRA-VYGNIFGFPGGVAWAMLVARICQLYPNAVSSV--IVEKF 258

Query: 214 LEFFSKFDW 222
              ++K++W
Sbjct: 259 FNIYTKWNW 267


>gi|68482706|ref|XP_714750.1| hypothetical protein CaO19.10713 [Candida albicans SC5314]
 gi|3334283|sp|O42617.1|PAP_CANAL RecName: Full=Poly(A) polymerase PAPalpha; AltName:
           Full=Polynucleotide adenylyltransferase alpha
 gi|2696030|dbj|BAA23802.1| poly A polymerase [Candida albicans]
 gi|5771514|gb|AAD51412.1| unknown [Candida albicans]
 gi|46436342|gb|EAK95706.1| hypothetical protein CaO19.10713 [Candida albicans SC5314]
          Length = 558

 Score = 42.4 bits (98), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 41/189 (21%), Positives = 74/189 (39%), Gaps = 22/189 (11%)

Query: 53  QVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHA--------- 103
           +VFTFGS  L  Y P  DID          +D +  +  D++    + E           
Sbjct: 82  KVFTFGSYRLGVYGPGSDIDTLVVVPKHVTRDDFFSVFADIIRKRPELEEIACVPDAYVP 141

Query: 104 ----EFRVKEVQYIQAEVKIIKCLVD------NFVVDIAFNQLGGLCTLCFLDEVDHLIN 153
               EF    +  I A + I +  +D      N + ++    L  L      DE+  L+ 
Sbjct: 142 IIKLEFDGISIDLIMARLNIPRVPLDLTLDDKNLLKNLDEKDLRSLNGTRVTDEILQLVP 201

Query: 154 ENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRF 213
           +  +FK ++  IK W    + + G   G     A   LV  I  ++  + +    ++ +F
Sbjct: 202 KPTVFKHALRCIKLWAQQRA-VYGNIFGFPGGVAWAMLVARICQLYPNAVSSA--IVEKF 258

Query: 214 LEFFSKFDW 222
              ++K++W
Sbjct: 259 FNIYTKWNW 267


>gi|403373923|gb|EJY86891.1| Poly(A) RNA polymerase putative [Oxytricha trifallax]
          Length = 403

 Score = 42.4 bits (98), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 44/205 (21%), Positives = 94/205 (45%), Gaps = 17/205 (8%)

Query: 32  EERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLV 90
           ++ R  V + + +++ +CF   +V  FGS      LP+ D+DL  +  DQ  ++    L 
Sbjct: 125 QQARRKVVSRIHKIVKECFSQAKVMIFGSCATGLDLPNSDVDLLVYYPDQREQNMINRLA 184

Query: 91  RDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDH 150
             ++++         +  +V  I+ + K   C      +DI+FN+  G+  +     V  
Sbjct: 185 GSLMKSGICKSIEAIKHAKVPIIKLQDKETSC-----NIDISFNRTNGIYCVKL---VKT 236

Query: 151 LINENHLFKRSIILIKAWCYYESRILG-GHHGLISSYALVTLVL-YIFHVFNGSFAGPLE 208
           L+ +    +  +I++KA  + + R L   + G ISS+ L  L   Y+   +       ++
Sbjct: 237 LMIKYPELRPLMIVLKA--FLKCRGLNETYSGGISSFLLTMLATSYLQMAYKSGKTDKMD 294

Query: 209 VLYRFLEFF----SKFDWDNFCLSL 229
           +    ++FF    +KF+++   +S+
Sbjct: 295 LGKHLIDFFELYGTKFNYEQIGISI 319


>gi|238882575|gb|EEQ46213.1| Poly(A) polymerase [Candida albicans WO-1]
          Length = 558

 Score = 42.4 bits (98), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 41/189 (21%), Positives = 74/189 (39%), Gaps = 22/189 (11%)

Query: 53  QVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHA--------- 103
           +VFTFGS  L  Y P  DID          +D +  +  D++    + E           
Sbjct: 82  KVFTFGSYRLGVYGPGSDIDTLVVVPKHVTRDDFFSVFADIIRKRPELEEIACVPDAYVP 141

Query: 104 ----EFRVKEVQYIQAEVKIIKCLVD------NFVVDIAFNQLGGLCTLCFLDEVDHLIN 153
               EF    +  I A + I +  +D      N + ++    L  L      DE+  L+ 
Sbjct: 142 IIKLEFDGISIDLIMARLNIPRVPLDLTLDDKNLLKNLDEKDLRSLNGTRVTDEILQLVP 201

Query: 154 ENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRF 213
           +  +FK ++  IK W    + + G   G     A   LV  I  ++  + +    ++ +F
Sbjct: 202 KPTVFKHALRCIKLWAQQRA-VYGNIFGFPGGVAWAMLVARICQLYPNAVSSA--IVEKF 258

Query: 214 LEFFSKFDW 222
              ++K++W
Sbjct: 259 FNIYTKWNW 267


>gi|407039791|gb|EKE39813.1| topoisomerase, putative [Entamoeba nuttalli P19]
          Length = 400

 Score = 42.4 bits (98), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 49/197 (24%), Positives = 88/197 (44%), Gaps = 35/197 (17%)

Query: 10  RWLKAEEITAELIARIQ-----PDPFSEE---RRNAVAAYVRRLIIQCFPCQVFTFGSVP 61
           +WLK+ E   +L   +Q      +P   E   R   +  Y +  I++     V  FGS  
Sbjct: 5   QWLKSFEGELDLNQEVQLFIKFIEPNKNEYKIREELLTKYSK--ILEKEGYNVMAFGSTQ 62

Query: 62  LKTYLPDRDIDLGAFSDD----QTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEV 117
            K +LP  DID    +++    + L    + L   +LE++++N             +A +
Sbjct: 63  SKLFLPTSDIDFSVITNEYNTRKVLNSVSSILSSYVLEDQKRN------------FKASI 110

Query: 118 KIIKCLVDN---FVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAW--CYYE 172
            ++K L D     V+DI+ N   G  T+ F++EV   I ++   ++ ++LIK+   CY  
Sbjct: 111 PVLK-LTDKKTLIVLDISHNNTNGTKTVNFIEEV---IKKDDRIRKLVLLIKSLLCCYDF 166

Query: 173 SRILGGHHGLISSYALV 189
            +   G  G  S + +V
Sbjct: 167 HQPANGGLGTYSVFVMV 183


>gi|260948920|ref|XP_002618757.1| hypothetical protein CLUG_02216 [Clavispora lusitaniae ATCC 42720]
 gi|238848629|gb|EEQ38093.1| hypothetical protein CLUG_02216 [Clavispora lusitaniae ATCC 42720]
          Length = 567

 Score = 42.0 bits (97), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 61/236 (25%), Positives = 91/236 (38%), Gaps = 34/236 (14%)

Query: 11  WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
           WL  E    + +  I P       RN V   ++R I + +P  Q   FGS     YLP  
Sbjct: 157 WLTME--IKDFVNYISPSKEEIVVRNTVIRRLKRRIAEFWPQTQAHVFGSCATDLYLPGS 214

Query: 70  DIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVD--- 125
           DID+   S       T  +  R  L             K ++ I  A+V IIK  VD   
Sbjct: 215 DIDMVVIS------TTGDYEQRGKLYQLSSFLRTNKLAKNIEVIATAKVPIIK-FVDPQY 267

Query: 126 NFVVDIAFNQLGGLCTL----CFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHG 181
           N  VDI+F +  GL        +LD +  L        R ++LI        R+   H G
Sbjct: 268 NIHVDISFERTNGLDAARRIRKWLDSMPGL--------RELVLIVKQFLRSRRLNNVHVG 319

Query: 182 LISSYALVTLVLYIFHVFNGSFAGPLEV-------LYRFLEFFSK-FDWDNFCLSL 229
            +  YA + L+ +   +      G + V       L  F E + + F +D+  ++L
Sbjct: 320 GLGGYATIILMYHFLRLHPRVSTGNISVMENLGTLLIEFFELYGRNFSYDHLIVAL 375


>gi|387196341|gb|AFJ68755.1| DNA polymerase sigma subunit, partial [Nannochloropsis gaditana
           CCMP526]
          Length = 419

 Score = 42.0 bits (97), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 52/211 (24%), Positives = 85/211 (40%), Gaps = 20/211 (9%)

Query: 33  ERRNAVAAYVRRLIIQCFPC-QVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVR 91
           E R  V       + + +P   V  FGS   K +LPD DID+          D   H +R
Sbjct: 86  EARQKVTRISADTVKKLWPSFDVHVFGSEATKVFLPDSDIDMVVLPP----TDLPLHQIR 141

Query: 92  DMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDH 150
             L    +    E  V  ++ I QA V I+K    N  VDI+F+   GL +  ++ E   
Sbjct: 142 KNLFTLAEAFKQEESVSGMEIISQARVPIVKLRFQNLQVDISFSSDSGLKSARYMLEK-- 199

Query: 151 LINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVL-YIFHVFNGSFAGPLEV 209
              E     R +IL+  +   +  +   + G   S+ L  +V+ Y+ H    +       
Sbjct: 200 --MEAMPPLRPLILVLKYFLAQRELNQTYMGGCGSFLLQLMVIAYLQHAQKEADKASRSE 257

Query: 210 LYR--------FLEFFS-KFDWDNFCLSLWG 231
             R        FL F+  +F+++   +S+ G
Sbjct: 258 RTRNLGSLFLGFLRFYGHQFNYEEVGISVLG 288


>gi|67465021|ref|XP_648697.1| topoisomerase [Entamoeba histolytica HM-1:IMSS]
 gi|56464936|gb|EAL43308.1| topoisomerase, putative [Entamoeba histolytica HM-1:IMSS]
          Length = 400

 Score = 42.0 bits (97), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 49/197 (24%), Positives = 88/197 (44%), Gaps = 35/197 (17%)

Query: 10  RWLKAEEITAELIARIQ-----PDPFSEE---RRNAVAAYVRRLIIQCFPCQVFTFGSVP 61
           +WLK+ E   +L   +Q      +P   E   R   +  Y +  I++     V  FGS  
Sbjct: 5   QWLKSFEGELDLNQEVQLFIKFIEPNKNEYKIREELLTKYSK--ILEKEGYNVMAFGSTQ 62

Query: 62  LKTYLPDRDIDLGAFSDD----QTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEV 117
            K +LP  DID    +++    + L    + L   +LE++++N             +A +
Sbjct: 63  SKLFLPTSDIDFSVLTNEYNTRKVLNSVSSILSSYVLEDQKRN------------FKASI 110

Query: 118 KIIKCLVDN---FVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKA--WCYYE 172
            ++K L D     V+DI+ N   G  T+ F++EV   I ++   ++ ++LIK+   CY  
Sbjct: 111 PVLK-LTDKKTLIVLDISHNNTSGTKTVNFIEEV---IKKDDRIRKLVLLIKSILCCYDF 166

Query: 173 SRILGGHHGLISSYALV 189
            +   G  G  S + +V
Sbjct: 167 HQPANGGLGTYSVFVMV 183


>gi|256078812|ref|XP_002575688.1| hypothetical protein [Schistosoma mansoni]
 gi|360044186|emb|CCD81733.1| hypothetical protein Smp_145600 [Schistosoma mansoni]
          Length = 672

 Score = 42.0 bits (97), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 37/121 (30%), Positives = 53/121 (43%), Gaps = 18/121 (14%)

Query: 25  IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
           I P P  +  R  V A V+ ++   +P CQV  FGS     YLP  DID+  F       
Sbjct: 67  ISPSPAEQFAREVVVAKVKDIVYSLWPNCQVDVFGSFKTGLYLPTSDIDMVIFGK----- 121

Query: 84  DTWAHLVRDMLENE--EKNEHAEFRVKEVQYIQAEVKIIKCLVDN---FVVDIAFNQLGG 138
             W  L    LE    +    +E +V +    +A V I+K + D      VDI+FN +  
Sbjct: 122 --WDALPLHTLEQALFKSGISSEIKVLD----KATVPIVK-MTDKETELRVDISFNMINS 174

Query: 139 L 139
           +
Sbjct: 175 V 175


>gi|334311788|ref|XP_003339660.1| PREDICTED: PAP-associated domain-containing protein 5-like
           [Monodelphis domestica]
          Length = 809

 Score = 42.0 bits (97), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 46/137 (33%), Positives = 63/137 (45%), Gaps = 16/137 (11%)

Query: 8   PGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYL 66
           PG +L  EEI+ +    + P P  E+ R  V   +  +I + +P   V  FGS     YL
Sbjct: 353 PGTYLH-EEIS-DFYEYMSPRPEEEKMRMEVVNRIENVIKELWPSADVQIFGSFKTGLYL 410

Query: 67  PDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVD 125
           P  DIDL  F         W +L    LE E   +H       V+ + +A V IIK L D
Sbjct: 411 PTSDIDLVVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTD 461

Query: 126 NFV---VDIAFNQLGGL 139
           +F    VDI+FN   G+
Sbjct: 462 SFTEVKVDISFNVQNGV 478


>gi|367014043|ref|XP_003681521.1| hypothetical protein TDEL_0E00670 [Torulaspora delbrueckii]
 gi|359749182|emb|CCE92310.1| hypothetical protein TDEL_0E00670 [Torulaspora delbrueckii]
          Length = 663

 Score = 42.0 bits (97), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 51/232 (21%), Positives = 100/232 (43%), Gaps = 26/232 (11%)

Query: 11  WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
           WL  E    + +A I P+    E RN   + +R  + + +P   +  FGS     YLP  
Sbjct: 206 WLTME--IKDFVAYISPNRQEIEIRNKTISKIRAAVRELWPDADLQVFGSYATDLYLPGS 263

Query: 70  DID--LGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL--VD 125
           DID  + +   D+  +++   L    L+++E     E   K      A V IIK +    
Sbjct: 264 DIDCVVNSKGRDKENRNSLYSLAS-FLKSKELATRVEVIAK------ARVPIIKFVEPQS 316

Query: 126 NFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISS 185
              +D++F ++ GL     + E    + E    +  ++++K + +   R+   H G +  
Sbjct: 317 QIHIDVSFERINGLEAARLIRE---WLEETPGLRELVLIVKQFLHSR-RLNNVHTGGLGG 372

Query: 186 YALVTLVLYIFHVFNGSFAGPLE-------VLYRFLEFFSK-FDWDNFCLSL 229
           ++++ LV    H+     +  ++       +L  F E + K F +D+  L++
Sbjct: 373 FSIICLVYSFLHLHPRVVSDEIDPLDNLGVLLIDFFELYGKNFGYDDVGLTV 424


>gi|313242854|emb|CBY39607.1| unnamed protein product [Oikopleura dioica]
          Length = 833

 Score = 42.0 bits (97), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 75/335 (22%), Positives = 134/335 (40%), Gaps = 51/335 (15%)

Query: 2   VIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSV 60
           +  PL  G     EEI  +    I+  P     R+ V   V   I Q FP  QV  FGS 
Sbjct: 79  ITSPLSKGMEGLHEEII-DFHNWIRSTPEEYTMRHDVVLRVEEAIKQEFPGAQVEVFGSF 137

Query: 61  PLKTYLPDRDIDLGAFSDD-----QTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQA 115
               YLP  DID+    +         ++   + ++D L  +   E    +V +     A
Sbjct: 138 QTGLYLPTSDIDMVVLGEKIEPRYGNPQNGPHYRLQDRLLKQGIAERYSIKVID----SA 193

Query: 116 EVKIIKC--LVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYES 173
            V IIK   ++ +  VDI+FN   G+  +     V   I +    +  ++++K +   + 
Sbjct: 194 AVPIIKMRDMITDIKVDISFNMKTGVTAIGL---VKGYIRQFPALRYLVLVLKQFL-LQR 249

Query: 174 RILGGHHGLISSYALVTLVL-YIFHVFNGSFAGPLE---VLYRFLEFFS-KFDWDNFCLS 228
            +     G ISSY L+ +V+ ++ H    + A  +    +L +FL F+  +F++   C+ 
Sbjct: 250 DMNEVWTGGISSYGLILMVVSFLQHQGADNTADDVNLGVLLIKFLRFYGMEFEYSKCCIR 309

Query: 229 LWGPVPISLLPDVTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNV 288
           +                  K+GG  +  +      + +           G  +V    ++
Sbjct: 310 V------------------KNGGQFIKKEEMATQMKESPT---------GPKYVPNFLSI 342

Query: 289 IDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKGLAR 323
            DPL  +N++GR+        ++ AF F  + L R
Sbjct: 343 EDPLTPSNDVGRASHGAE--NVKDAFLFAYRVLDR 375


>gi|149237693|ref|XP_001524723.1| Poly(A) polymerase PAPa [Lodderomyces elongisporus NRRL YB-4239]
 gi|146451320|gb|EDK45576.1| Poly(A) polymerase PAPa [Lodderomyces elongisporus NRRL YB-4239]
          Length = 587

 Score = 42.0 bits (97), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 48/189 (25%), Positives = 80/189 (42%), Gaps = 22/189 (11%)

Query: 53  QVFTFGSVPLKTYLPDRDID-LGAFSDDQTLKD---TWAHLVRDMLENEEKNE------- 101
           ++FTFGS  L  Y P  DID L  F    T +D    +  L+R+  E EE N        
Sbjct: 84  KLFTFGSYRLGVYGPSSDIDALVVFPRYITREDFFTEFEKLLRERPELEEINSVREAFVP 143

Query: 102 --HAEFRVKEVQYIQAEVKIIKCLVD------NFVVDIAFNQLGGLCTLCFLDEVDHLIN 153
               EF    +  I A++ I +   D      N + +I    L  L      DE+ +L+ 
Sbjct: 144 IIKLEFDGISIDLIFAKLDIPRIPKDLTLTDKNLLKNIDEKDLRALNGTRVTDEILNLVP 203

Query: 154 ENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRF 213
           +  +FK ++  IK W   +  I    +G     A   LV  I  ++  + +    ++ +F
Sbjct: 204 KPTVFKHALRFIKMWA-QQRAIYANVYGFPGGVAWAMLVARICQLYPNAVSS--YIVEKF 260

Query: 214 LEFFSKFDW 222
            + +S++ W
Sbjct: 261 FQIYSQWSW 269


>gi|255732153|ref|XP_002551000.1| Poly(A) polymerase PAPalpha [Candida tropicalis MYA-3404]
 gi|240131286|gb|EER30846.1| Poly(A) polymerase PAPalpha [Candida tropicalis MYA-3404]
          Length = 558

 Score = 41.6 bits (96), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 39/189 (20%), Positives = 75/189 (39%), Gaps = 22/189 (11%)

Query: 53  QVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHA--------- 103
           +VFTFGS  L  Y P  DID          +D +  +  D++    + E           
Sbjct: 83  KVFTFGSYRLGVYGPGSDIDTLVVVPKHVTRDDFFSVFPDIIRKRPELEEIACVPDAFVP 142

Query: 104 ----EFRVKEVQYIQAEVKIIKCLVD------NFVVDIAFNQLGGLCTLCFLDEVDHLIN 153
               EF    +  I A + + +  ++      N + ++    L  L      DE+  L+ 
Sbjct: 143 IIKLEFDGISIDLIMARLNVPRVPLEMTLDDKNLLKNLDEKDLRSLNGTRVTDEILQLVP 202

Query: 154 ENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRF 213
           +  +FK ++  IK W    + + G  +G     A   LV  I  ++  + +    ++ +F
Sbjct: 203 KPTVFKHALRCIKLWAQQRA-VYGNVYGFPGGVAWAMLVARICQLYPNAVSA--VIVEKF 259

Query: 214 LEFFSKFDW 222
              ++K++W
Sbjct: 260 FSIYTKWNW 268


>gi|198427134|ref|XP_002121817.1| PREDICTED: similar to PAP-associated domain-containing protein 5
           (Topoisomerase-related function protein 4-2) (TRF4-2)
           [Ciona intestinalis]
          Length = 391

 Score = 41.6 bits (96), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 70/309 (22%), Positives = 128/309 (41%), Gaps = 65/309 (21%)

Query: 29  PFSEER--RNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDT 85
           P  EER  R  V   V  ++++ +P C++  FGS     YLP  DID+  F +       
Sbjct: 92  PTEEERQMREYVIKSVEEVVLELWPTCKLDVFGSFRTDLYLPTSDIDIVLFGE------- 144

Query: 86  WAHLVRDMLENE--EKNEHAEFRVKEVQYIQAEVKIIKCLVDNFV--VDIAFNQLGGLCT 141
           W HL    L+     K+  AE  VK +   +A V +IK      +  VDI+FN   G+ +
Sbjct: 145 WEHLPLWSLQKALVSKDIVAEGSVKVLD--RAAVPLIKFQHKETLVKVDISFNIQSGVQS 202

Query: 142 LCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNG 201
           +  + +    + +     + I ++K +      +     G +SSY+L+ + +        
Sbjct: 203 VELIKD---FMKKYPALPKLIFVLKQFLLVRE-LNEVWTGGLSSYSLILMAISFLQTHPR 258

Query: 202 SFAGPLE-----VLYRFLEFFSK-FDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVLLL 255
           S +  +      +L  FLE + + F++ + C+ +                  K+ G +  
Sbjct: 259 SDSRDITNNLGVMLLEFLELYGRHFNYQSLCICV------------------KNKGYI-- 298

Query: 256 SKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNF--FRIRTA 313
                        +F    +N  QP +    ++ DPL + N+LGR    G++   +++ A
Sbjct: 299 ----------TKEEFRKQMDNGCQPSL---LSIEDPLTLGNDLGR----GSYAVMQVKQA 341

Query: 314 FTFRAKGLA 322
           F F  + L 
Sbjct: 342 FEFSFRTLT 350


>gi|195115910|ref|XP_002002499.1| GI12386 [Drosophila mojavensis]
 gi|193913074|gb|EDW11941.1| GI12386 [Drosophila mojavensis]
          Length = 348

 Score = 41.6 bits (96), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 67/259 (25%), Positives = 105/259 (40%), Gaps = 42/259 (16%)

Query: 27  PDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDT 85
           P P     R  + + V R+I   +P   V  FGS  L   LP+ DIDL        +   
Sbjct: 39  PTPTEHAARIELLSRVERVIQGLWPEALVEIFGSFRLGINLPNSDIDL-------VVLGC 91

Query: 86  WAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL--VDNFVVDIAFNQLGGLCTLC 143
           W HL    LE+E ++              A V II+      +  VDI+FN   G+ +  
Sbjct: 92  WEHLPLRSLESELRSSGIVLPGTLQVVDTAAVPIIRFTDCETHLKVDISFNMPNGIDSSE 151

Query: 144 FLDEVDHLINENHLFKRSIILIKAWCYYESRILGGH-HGLISSYALVTLVLYIFHVF--- 199
            + +  H   E+ +  + ++++K   + E R L    +G ISSY L+ + +    +    
Sbjct: 152 LIKKFLH---EHPVLGKLVLVLKQ--FLEQRNLNSTLNGGISSYNLIIMCINFLQMHPRQ 206

Query: 200 NGSFAGPLEVLYRFLEFFS----KFDWDNFCLSLW-----------GPVPISLLPDVTAE 244
               +  L VL   LEFF      F++    +S+W           G    SL  D    
Sbjct: 207 RSPESTNLGVL--LLEFFELYGLSFNYAQIGISIWNGYVRKENILVGSRTPSLYIDDPLL 264

Query: 245 PPRKDGGVLLLSKSFLDSC 263
           P R++      S+SF+ SC
Sbjct: 265 PGRQN------SRSFIASC 277


>gi|147905450|ref|NP_001089116.1| poly(A) polymerase gamma [Xenopus laevis]
 gi|73671771|gb|AAZ80291.1| SRP 3'-adenylating protein [Xenopus laevis]
          Length = 751

 Score = 41.6 bits (96), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 45/201 (22%), Positives = 80/201 (39%), Gaps = 22/201 (10%)

Query: 46  IIQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEE--KN--- 100
           II     ++FTFGS  L  +    DID    +     +  +     + L+ ++  KN   
Sbjct: 87  IISAAGGKIFTFGSYRLGVHTKGADIDSLCVAPRHVERSDFFQTFSEKLKQQDGIKNLRA 146

Query: 101 -EHAEFRVKEVQYIQAEVKII------KCLVDNFVV--DIAFNQLGGLCTLCF-----LD 146
            E A   V + +++  E+ ++      +C+ DN  +  D     L   C          D
Sbjct: 147 VEDAFVPVIKFEFMNTEIDLVFARLPLQCIPDNLDLRDDSRLRNLDIRCIRSLNGCRVTD 206

Query: 147 EVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGP 206
           E+ HL+     F+ ++  IK W      I     G +   +   LV     ++  + A  
Sbjct: 207 EILHLVPNKENFRLTLRAIKLWAKRRG-IYSNMLGFLGGVSWAMLVARTCQLYPNAIAST 265

Query: 207 LEVLYRFLEFFSKFDWDNFCL 227
           L  +++F   FSK++W N  L
Sbjct: 266 L--VHKFFLVFSKWEWPNPVL 284


>gi|213625101|gb|AAI69821.1| LOC733387 protein [Xenopus laevis]
          Length = 750

 Score = 41.6 bits (96), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 45/201 (22%), Positives = 80/201 (39%), Gaps = 22/201 (10%)

Query: 46  IIQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEE--KN--- 100
           II     ++FTFGS  L  +    DID    +     +  +     + L+ ++  KN   
Sbjct: 86  IISAAGGKIFTFGSYRLGVHTKGADIDSLCVAPRHVERSDFFQTFSEKLKQQDGIKNLRA 145

Query: 101 -EHAEFRVKEVQYIQAEVKII------KCLVDNFVV--DIAFNQLGGLCTLCF-----LD 146
            E A   V + +++  E+ ++      +C+ DN  +  D     L   C          D
Sbjct: 146 VEDAFVPVIKFEFMNTEIDLVFARLPLQCIPDNLDLRDDSRLRNLDIRCIRSLNGCRVTD 205

Query: 147 EVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGP 206
           E+ HL+     F+ ++  IK W      I     G +   +   LV     ++  + A  
Sbjct: 206 EILHLVPNKENFRLTLRAIKLWAKRRG-IYSNMLGFLGGVSWAMLVARTCQLYPNAIAST 264

Query: 207 LEVLYRFLEFFSKFDWDNFCL 227
           L  +++F   FSK++W N  L
Sbjct: 265 L--VHKFFLVFSKWEWPNPVL 283


>gi|54020874|ref|NP_001005684.1| poly(A) polymerase gamma [Xenopus (Silurana) tropicalis]
 gi|49522894|gb|AAH75107.1| poly(A) polymerase beta (testis specific) [Xenopus (Silurana)
           tropicalis]
          Length = 752

 Score = 41.2 bits (95), Expect = 0.66,   Method: Compositional matrix adjust.
 Identities = 45/201 (22%), Positives = 80/201 (39%), Gaps = 22/201 (10%)

Query: 46  IIQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEE--KN--- 100
           II     ++FTFGS  L  +    DID    +     +  +     + L+ ++  KN   
Sbjct: 87  IISAVGGKIFTFGSYRLGVHTKGADIDSLCVAPRHVERSDFFQSFSEKLKQQDGIKNLRA 146

Query: 101 -EHAEFRVKEVQYIQAEVKII------KCLVDNFVV--DIAFNQLGGLCTLCF-----LD 146
            E A   V + +++  E+ ++      +C+ DN  +  D     L   C          D
Sbjct: 147 VEDAFVPVIKFEFMNTEIDLVFARLPLQCIPDNLDLRDDSRLRNLDIRCIRSLNGCRVTD 206

Query: 147 EVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGP 206
           E+ HL+     F+ ++  IK W      I     G +   +   LV     ++  + A  
Sbjct: 207 EILHLVPNKENFRLTLRAIKLWAKRRG-IYSNMLGFLGGVSWAMLVARTCQLYPNAIAST 265

Query: 207 LEVLYRFLEFFSKFDWDNFCL 227
           L  +++F   FSK++W N  L
Sbjct: 266 L--VHKFFLVFSKWEWPNPVL 284


>gi|365982357|ref|XP_003668012.1| hypothetical protein NDAI_0A06140 [Naumovozyma dairenensis CBS 421]
 gi|343766778|emb|CCD22769.1| hypothetical protein NDAI_0A06140 [Naumovozyma dairenensis CBS 421]
          Length = 684

 Score = 41.2 bits (95), Expect = 0.72,   Method: Compositional matrix adjust.
 Identities = 66/265 (24%), Positives = 108/265 (40%), Gaps = 42/265 (15%)

Query: 11  WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
           WL  E    + ++ I P     E RN   + +R+ + + +   Q+  FGS     YLP  
Sbjct: 204 WLTLE--IKDFVSYISPSREEIELRNKTISKLRKAVKELWSDSQLHIFGSYATDLYLPGS 261

Query: 70  DID------LGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL 123
           DID      +G     Q L D   HL +  L ++ +   A+ RV  +++++   +I    
Sbjct: 262 DIDCVVNSKMGDKEQRQYLYDLARHLKQKGLTSQVE-VIAKARVPIIKFVEKSSQI---- 316

Query: 124 VDNFVVDIAFNQLGGLCTLCFLDEVDHLINE---NHLFKRSIILIKAWCYYESRILGGHH 180
                +D++F +  G+       E   LI E        R +ILI        R+   H 
Sbjct: 317 ----HIDVSFERTNGV-------EAAKLIREWLSATPGLRELILIVKQFLSARRLNDVHT 365

Query: 181 GLISSYALVTLV---LYIFHVFNGSFAGPLE----VLYRFLEFFSKFDWDNFCLSLWGPV 233
           G +  + ++ LV   L +      +   PLE    +L  F E + K    NF   L   V
Sbjct: 366 GGLGGFTIICLVYSFLSMHPRIKTNDIDPLENLGVLLIEFFELYGK----NFAYDL---V 418

Query: 234 PISLLPDVTAEPPRKDGGVLLLSKS 258
            ISLL    +  P+ +   LL ++S
Sbjct: 419 AISLLDGYPSYIPKSEWRSLLPTRS 443


>gi|351712688|gb|EHB15607.1| PAP-associated domain-containing protein 5, partial [Heterocephalus
           glaber]
          Length = 599

 Score = 41.2 bits (95), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 60/130 (46%), Gaps = 15/130 (11%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
           EEI ++    + P P  E+ R  V + +  +I + +P   V  FGS     YLP  DIDL
Sbjct: 102 EEI-SDFYEYMSPRPEEEKMRMEVVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 160

Query: 74  GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
             F         W +L    LE E   +H       V+ + +A V IIK L D+F    V
Sbjct: 161 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 211

Query: 130 DIAFNQLGGL 139
           DI+FN   G+
Sbjct: 212 DISFNVQNGV 221


>gi|218186296|gb|EEC68723.1| hypothetical protein OsI_37216 [Oryza sativa Indica Group]
          Length = 112

 Score = 41.2 bits (95), Expect = 0.81,   Method: Composition-based stats.
 Identities = 20/44 (45%), Positives = 27/44 (61%)

Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQV 54
          W   E     ++ARIQP+P SE+RR AV AYV+ L+     CQ+
Sbjct: 30 WDPLEAAAGAVVARIQPNPPSEDRRAAVIAYVQGLLRFNVGCQM 73


>gi|238609344|ref|XP_002397464.1| hypothetical protein MPER_02102 [Moniliophthora perniciosa FA553]
 gi|215471952|gb|EEB98394.1| hypothetical protein MPER_02102 [Moniliophthora perniciosa FA553]
          Length = 174

 Score = 41.2 bits (95), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 34/116 (29%), Positives = 51/116 (43%), Gaps = 17/116 (14%)

Query: 25  IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
           I P P  +E R+ +   +   I   +P  +V  FGS   K YLP  DID+   S   T+ 
Sbjct: 27  ISPSPVEDEIRSLLVQLISSAIKTRYPDAEVHPFGSYATKLYLPTGDIDIVVLSRTHTI- 85

Query: 84  DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGL 139
             +   V   L        A+ RV  V+++       +  +    VDI+FNQ GG+
Sbjct: 86  -AFRCFVTAKL--------AKARVPIVKFVT------RVELGGIPVDISFNQPGGV 126


>gi|431914108|gb|ELK15367.1| PAP-associated domain-containing protein 5 [Pteropus alecto]
          Length = 530

 Score = 40.8 bits (94), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 46/143 (32%), Positives = 65/143 (45%), Gaps = 17/143 (11%)

Query: 2   VIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSV 60
           ++R    GR    EEI+ +    + P P  E+ R  V + +  +I + +P   V  FGS 
Sbjct: 22  LVRSAQTGRL--HEEIS-DFYEYMSPRPEEEKMRMEVVSRIESVIKELWPSADVQIFGSF 78

Query: 61  PLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKI 119
               YLP  DIDL  F         W +L    LE E   +H       V+ + +A V I
Sbjct: 79  KTGLYLPTSDIDLVVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPI 130

Query: 120 IKCLVDNFV---VDIAFNQLGGL 139
           IK L D+F    VDI+FN   G+
Sbjct: 131 IK-LTDSFTEVKVDISFNVQNGV 152


>gi|254573058|ref|XP_002493638.1| Catalytic subunit of TRAMP (Trf4/Pap2p-Mtr4p-Air1p/2p)
           [Komagataella pastoris GS115]
 gi|238033437|emb|CAY71459.1| Catalytic subunit of TRAMP (Trf4/Pap2p-Mtr4p-Air1p/2p)
           [Komagataella pastoris GS115]
 gi|328354535|emb|CCA40932.1| DNA polymerase sigma subunit [Komagataella pastoris CBS 7435]
          Length = 601

 Score = 40.8 bits (94), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 56/238 (23%), Positives = 99/238 (41%), Gaps = 25/238 (10%)

Query: 4   RPLDPGRWLKAEEITAELIARIQPDPFS-EERRNAVAAYVRRLIIQCFP-CQVFTFGSVP 61
           + L+   WL  E    + I  I P     E R NAV    + +    +P C V  FGS  
Sbjct: 127 KQLELSDWLTLE--IKDFINYISPSIAEIEARNNAVKRLRKEITTNLWPDCYVNVFGSFA 184

Query: 62  LKTYLPDRDIDLGAFSDD-QTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKII 120
              YLP  DID+   SD  +    ++ + +   L ++    + E         +A+V II
Sbjct: 185 TDLYLPGSDIDMVITSDSGKYCAKSYLYQLSSFLRSKNLGVNIE------TIARAKVPII 238

Query: 121 KCLV--DNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGG 178
           K +       +D++F +  GL      + +   + E    +  ++++K +     R+   
Sbjct: 239 KFIEPRSKIHIDVSFEKTNGLRA---AERIQGWLRETPGLRELVLIVKQFLAVR-RMNNV 294

Query: 179 HHGLISSYALVTLV---LYIFHVFNGSFAGPLE----VLYRFLEFFS-KFDWDNFCLS 228
           HHG +  ++++ LV   L +      +   PL+    +L  F E +   F +DN  LS
Sbjct: 295 HHGGLGGFSIICLVHSFLSLHPRLITNSIDPLDNLGVLLIEFFELYGYNFGYDNVILS 352


>gi|403159818|ref|XP_003320384.2| hypothetical protein PGTG_01296 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375168256|gb|EFP75965.2| hypothetical protein PGTG_01296 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 876

 Score = 40.8 bits (94), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 39/138 (28%), Positives = 65/138 (47%), Gaps = 10/138 (7%)

Query: 17  ITAEL---IARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDID 72
           +TAE+   +A IQP     + R  +   +R+ +   +P   V  FGS   K YLP  DID
Sbjct: 72  LTAEIGSFVAYIQPTHEEHQLRQMIIQMIRKTVHSRWPDADVEPFGSFGTKLYLPAGDID 131

Query: 73  LGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKII--KCLVDNFVVD 130
           L   S  Q + +  + ++  +     +N   +     V   +A+V II  K +  N  VD
Sbjct: 132 LVIIS-TQMMNEQKSRILYKLAPLIRENNIGQ---DVVVIAKAKVPIIKFKTIFGNINVD 187

Query: 131 IAFNQLGGLCTLCFLDEV 148
           I+ NQ  G+  +  ++E+
Sbjct: 188 ISINQTNGIVAMKKVNEL 205


>gi|427795543|gb|JAA63223.1| Putative pap-associated domain-containing protein 5, partial
           [Rhipicephalus pulchellus]
          Length = 627

 Score = 40.8 bits (94), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 54/201 (26%), Positives = 84/201 (41%), Gaps = 23/201 (11%)

Query: 25  IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
           +QP P   + R  V   ++ +I+  +P  +V  FGS     YLP  DID+      +TL 
Sbjct: 153 MQPTPAEHQMRLGVIQRIKDVILGLWPQAEVEIFGSFRTGLYLPTSDIDVVVLGKWETLP 212

Query: 84  DTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVD---NFVVDIAFNQLGGL 139
             W  L + +L       H     + ++ + +A V I+K L D      VDI+FN   G+
Sbjct: 213 -MWT-LEKALL------SHGIAEPQSIKVLDKASVPIVK-LTDAKTTVKVDISFNMNNGV 263

Query: 140 CTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALV--TLVLYIFH 197
            + C +        E       ++L+      +  +     G ISSY+L+  T+     H
Sbjct: 264 KSACLIQS----FKEKFPALPKLVLVLKQFLLQRDLNEVFTGGISSYSLILMTVSFLQLH 319

Query: 198 VFNGSFAGP-LEVLYRFLEFF 217
              G    P L  L   LEFF
Sbjct: 320 PRGGDAPSPNLGTL--LLEFF 338


>gi|256818784|ref|NP_001157969.1| PAP-associated domain-containing protein 5 isoform a [Mus musculus]
 gi|256818786|ref|NP_001157970.1| PAP-associated domain-containing protein 5 isoform a [Mus musculus]
          Length = 680

 Score = 40.8 bits (94), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 60/130 (46%), Gaps = 15/130 (11%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
           EEI+ +    + P P  E+ R  V + +  +I + +P   V  FGS     YLP  DIDL
Sbjct: 183 EEIS-DFYEYMSPRPEEEKMRMEVVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 241

Query: 74  GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
             F         W +L    LE E   +H       V+ + +A V IIK L D+F    V
Sbjct: 242 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 292

Query: 130 DIAFNQLGGL 139
           DI+FN   G+
Sbjct: 293 DISFNVQNGV 302


>gi|395839409|ref|XP_003792582.1| PREDICTED: PAP-associated domain-containing protein 5 [Otolemur
           garnettii]
          Length = 629

 Score = 40.8 bits (94), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 60/130 (46%), Gaps = 15/130 (11%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
           EEI+ +    + P P  E+ R  V + +  +I + +P   V  FGS     YLP  DIDL
Sbjct: 132 EEIS-DFYEYMSPRPEEEKMRMEVVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 190

Query: 74  GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
             F         W +L    LE E   +H       V+ + +A V IIK L D+F    V
Sbjct: 191 VVFG-------KWENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 241

Query: 130 DIAFNQLGGL 139
           DI+FN   G+
Sbjct: 242 DISFNVQNGV 251


>gi|354474676|ref|XP_003499556.1| PREDICTED: PAP-associated domain-containing protein 5-like
           [Cricetulus griseus]
          Length = 464

 Score = 40.8 bits (94), Expect = 0.96,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 60/130 (46%), Gaps = 15/130 (11%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
           EEI ++    + P P  E+ R  V + +  +I + +P   V  FGS     YLP  DIDL
Sbjct: 14  EEI-SDFYEYMSPRPEEEKMRMEVVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 72

Query: 74  GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
             F         W +L    LE E   +H       V+ + +A V IIK L D+F    V
Sbjct: 73  VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 123

Query: 130 DIAFNQLGGL 139
           DI+FN   G+
Sbjct: 124 DISFNVQNGV 133


>gi|60392891|sp|Q68ED3.2|PAPD5_MOUSE RecName: Full=PAP-associated domain-containing protein 5; AltName:
           Full=Topoisomerase-related function protein 4-2;
           Short=TRF4-2
 gi|148878177|gb|AAI45738.1| Papd5 protein [Mus musculus]
 gi|219519562|gb|AAI44797.1| Papd5 protein [Mus musculus]
          Length = 633

 Score = 40.8 bits (94), Expect = 1.00,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 60/130 (46%), Gaps = 15/130 (11%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
           EEI+ +    + P P  E+ R  V + +  +I + +P   V  FGS     YLP  DIDL
Sbjct: 136 EEIS-DFYEYMSPRPEEEKMRMEVVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 194

Query: 74  GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
             F         W +L    LE E   +H       V+ + +A V IIK L D+F    V
Sbjct: 195 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 245

Query: 130 DIAFNQLGGL 139
           DI+FN   G+
Sbjct: 246 DISFNVQNGV 255


>gi|239835858|gb|ACS29269.1| pap1 [Meyerozyma guilliermondii]
          Length = 564

 Score = 40.8 bits (94), Expect = 1.00,   Method: Compositional matrix adjust.
 Identities = 42/206 (20%), Positives = 80/206 (38%), Gaps = 31/206 (15%)

Query: 53  QVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEH---------- 102
           ++FTFGS  L  Y P  DID         +++ +  +   M+    + E           
Sbjct: 82  KIFTFGSYRLGVYGPGSDIDTLIVVPKHVVREDFFTIFDQMIRQRPELEEITAVPDAFVP 141

Query: 103 ---AEFRVKEVQYIQAEVKIIKCLVD------NFVVDIAFNQLGGLCTLCFLDEVDHLIN 153
               EF    +  I A + + +  +D      N + +I  N +  L      D++  L+ 
Sbjct: 142 IIMIEFSGISIDLIFARLNVSRVPLDMTLEDNNLLKNIDENDMRALNGTRVTDQILQLVP 201

Query: 154 ENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRF 213
           +  +FK ++  IK W    + + G   G     A   LV  I  ++  + +    ++ +F
Sbjct: 202 KVTVFKHALRCIKLWAQQRA-VYGNMFGFPGGVAWAMLVARICQLYPNAVSA--VIVEKF 258

Query: 214 LEFFSKFDWDNFCLSLWGPVPISLLP 239
              ++K++W         P P+ L P
Sbjct: 259 FNIYTKWNW---------PQPVLLKP 275


>gi|51328369|gb|AAH80314.1| Papd5 protein [Mus musculus]
          Length = 583

 Score = 40.8 bits (94), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 60/130 (46%), Gaps = 15/130 (11%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
           EEI+ +    + P P  E+ R  V + +  +I + +P   V  FGS     YLP  DIDL
Sbjct: 86  EEIS-DFYEYMSPRPEEEKMRMEVVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 144

Query: 74  GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
             F         W +L    LE E   +H       V+ + +A V IIK L D+F    V
Sbjct: 145 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 195

Query: 130 DIAFNQLGGL 139
           DI+FN   G+
Sbjct: 196 DISFNVQNGV 205


>gi|340506956|gb|EGR32991.1| hypothetical protein IMG5_064460 [Ichthyophthirius multifiliis]
          Length = 347

 Score = 40.8 bits (94), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 54/219 (24%), Positives = 90/219 (41%), Gaps = 30/219 (13%)

Query: 20  ELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSD 78
           EL   + P     E R      + ++I    P C+V TFGS   K YLP+ DID+    +
Sbjct: 54  ELTEYLAPTKEEHELRIKSFENLTQIIKSVIPDCEVKTFGSFSSKLYLPNSDIDIVIVKE 113

Query: 79  DQTLKDTWAHLVRDMLENEEKNEHAEF----RVKEVQYIQAEVKIIKCLVDNFVVDIAFN 134
            ++ K  +  +   +L  E+  E+  F    +V  +++++      K    NF  DI+FN
Sbjct: 114 GESNKYLYKKVADVVLTCEDIYENISFITNAKVPLIKFVE------KSTQTNF--DISFN 165

Query: 135 QLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILG-GHHGLISSYALVTLVL 193
           +  G+     L EV   +      K  I ++K  C    R L   + G I S+ L  ++L
Sbjct: 166 KEDGVKQ---LPEVQKCLQIYPEIKYLIFIMK--CILRQRDLNETYTGGIGSFLLFCMIL 220

Query: 194 YIFHVFNGSFAGPLEV-----------LYRFLEFFSKFD 221
                    +    +V           L +  +F+S FD
Sbjct: 221 AFLRELRKEYKDNNKVSEIKNITLGEYLLKMFKFYSNFD 259


>gi|47209824|emb|CAF91228.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 964

 Score = 40.4 bits (93), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 39/116 (33%), Positives = 52/116 (44%), Gaps = 16/116 (13%)

Query: 25  IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
           I P P  E  R  V   + ++I   +P  QV  FGS     YLP  DIDL  F       
Sbjct: 461 ISPRPEEEAMRRDVVNRIEKVIKDLWPTAQVEIFGSFSTGLYLPTSDIDLVVFGK----- 515

Query: 84  DTWAHLVRDMLEN--EEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFV---VDIAFN 134
             W H     LE   +++N    + +K +   +A V IIK L D+     VDI+FN
Sbjct: 516 --WDHPPLQELEQALKKRNVAGPYPIKVLD--KATVPIIK-LTDHETEVKVDISFN 566


>gi|440291374|gb|ELP84643.1| PAP-associated domain containing protein, putative [Entamoeba
           invadens IP1]
          Length = 475

 Score = 40.4 bits (93), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 61/244 (25%), Positives = 106/244 (43%), Gaps = 42/244 (17%)

Query: 10  RWLKAE--EITAE-----LIARIQPDPFSEERRNAVAAYVRRLII--QCFPCQVFTFGSV 60
           +WL+ E  +IT +     L   ++P+P   E R  V     R+I   +    +V  FGS 
Sbjct: 5   KWLEYEGGDITLDDEFDILYHYVEPNPIEYEIRRYVLEKYTRVIENDKKSEIKVVPFGST 64

Query: 61  PLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDM----LENEEKNEHAEFRVKEVQYIQAE 116
             K +LP  DID    +           + R +    +E+E++             ++A 
Sbjct: 65  QSKLFLPSSDIDFTVVTKGGKTNMVLNSVARILSLYTMEDEKR------------ALRAT 112

Query: 117 VKIIKCLVD---NFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKA-WCYYE 172
           V +IK L D     V+DI+ N   G+ T+ ++++    +  N L +  + +IK     YE
Sbjct: 113 VPVIK-LTDRETGIVLDISHNNESGVDTVRWMEKE---MKSNALIRPLLFIIKTVLSSYE 168

Query: 173 SRI--LGGHHGLISSYALVTLVLYIFHVFNGSFAGPL--EVLYRFLEFF-SKFDWDNFCL 227
             +  LGG    + +Y+L  +V   F              +L RFL+++ ++FD   F L
Sbjct: 169 LNLPALGG----LGTYSLFMMVFCFFREKGSDLKDKRGGAILLRFLKYYATEFDSRKFGL 224

Query: 228 SLWG 231
           S+ G
Sbjct: 225 SVTG 228


>gi|256818788|ref|NP_001157971.1| PAP-associated domain-containing protein 5 isoform b [Mus musculus]
          Length = 637

 Score = 40.4 bits (93), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 60/130 (46%), Gaps = 15/130 (11%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
           EEI+ +    + P P  E+ R  V + +  +I + +P   V  FGS     YLP  DIDL
Sbjct: 183 EEIS-DFYEYMSPRPEEEKMRMEVVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 241

Query: 74  GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
             F         W +L    LE E   +H       V+ + +A V IIK L D+F    V
Sbjct: 242 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 292

Query: 130 DIAFNQLGGL 139
           DI+FN   G+
Sbjct: 293 DISFNVQNGV 302


>gi|219518398|gb|AAI44798.1| Papd5 protein [Mus musculus]
          Length = 590

 Score = 40.4 bits (93), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 60/130 (46%), Gaps = 15/130 (11%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
           EEI+ +    + P P  E+ R  V + +  +I + +P   V  FGS     YLP  DIDL
Sbjct: 136 EEIS-DFYEYMSPRPEEEKMRMEVVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 194

Query: 74  GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
             F         W +L    LE E   +H       V+ + +A V IIK L D+F    V
Sbjct: 195 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 245

Query: 130 DIAFNQLGGL 139
           DI+FN   G+
Sbjct: 246 DISFNVQNGV 255


>gi|426382139|ref|XP_004057678.1| PREDICTED: PAP-associated domain-containing protein 5 isoform 2
           [Gorilla gorilla gorilla]
          Length = 664

 Score = 40.4 bits (93), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
           EEI+ +    + P P  E+ R  V   +  +I + +P   V  FGS     YLP  DIDL
Sbjct: 214 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 272

Query: 74  GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
             F         W +L    LE E   +H       V+ + +A V IIK L D+F    V
Sbjct: 273 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 323

Query: 130 DIAFNQLGGL 139
           DI+FN   G+
Sbjct: 324 DISFNVQNGV 333


>gi|410905163|ref|XP_003966061.1| PREDICTED: DNA polymerase sigma-like [Takifugu rubripes]
          Length = 778

 Score = 40.4 bits (93), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 39/116 (33%), Positives = 52/116 (44%), Gaps = 16/116 (13%)

Query: 25  IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
           I P P  E  R  V   + R+I   +P  +V  FGS     YLP  DIDL  F       
Sbjct: 248 ISPRPEEEAMRRDVVNRIERVIKDLWPTARVEIFGSFSTGLYLPTSDIDLVVFGK----- 302

Query: 84  DTWAHLVRDMLEN--EEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFV---VDIAFN 134
             W H     LE   +++N    + +K +   +A V IIK L D+     VDI+FN
Sbjct: 303 --WDHPPLQELEQALKKRNVAGPYPIKVLD--KATVPIIK-LTDHETEVKVDISFN 353


>gi|441597299|ref|XP_003263084.2| PREDICTED: PAP-associated domain-containing protein 5 isoform 2
           [Nomascus leucogenys]
          Length = 666

 Score = 40.4 bits (93), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
           EEI+ +    + P P  E+ R  V   +  +I + +P   V  FGS     YLP  DIDL
Sbjct: 216 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 274

Query: 74  GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
             F         W +L    LE E   +H       V+ + +A V IIK L D+F    V
Sbjct: 275 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 325

Query: 130 DIAFNQLGGL 139
           DI+FN   G+
Sbjct: 326 DISFNVQNGV 335


>gi|296231051|ref|XP_002760982.1| PREDICTED: PAP-associated domain-containing protein 5 isoform 2
           [Callithrix jacchus]
          Length = 664

 Score = 40.0 bits (92), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
           EEI+ +    + P P  E+ R  V   +  +I + +P   V  FGS     YLP  DIDL
Sbjct: 214 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 272

Query: 74  GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
             F         W +L    LE E   +H       V+ + +A V IIK L D+F    V
Sbjct: 273 VVFG-------KWENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 323

Query: 130 DIAFNQLGGL 139
           DI+FN   G+
Sbjct: 324 DISFNVQNGV 333


>gi|402550493|pdb|4FHX|A Chain A, Crystal Structures Of The Cid1 Poly (U) Polymerase Reveal
           The Mechanism For Utp Selectivity - H336n Mutant Bound
           To Mgatp
          Length = 349

 Score = 40.0 bits (92), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 50/184 (27%), Positives = 77/184 (41%), Gaps = 23/184 (12%)

Query: 24  RIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTL 82
           +I    F E+R  A    +R  + +  P  ++  FGS+     L + D+DL    D +  
Sbjct: 28  KISDKEFKEKR--AALDTLRLCLKRISPDAELVAFGSLESGLALKNSDMDLCVLMDSRVQ 85

Query: 83  KDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVD-------NFVVDIAFNQ 135
            DT A      L+  E+     F  K +Q  +A + IIK   D       +F  DI FN 
Sbjct: 86  SDTIA------LQFYEELIAEGFEGKFLQ--RARIPIIKLTSDTKNGFGASFQCDIGFNN 137

Query: 136 LGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVL-Y 194
              +     L     L   +   K  ++L+K W     +I   + G +SSY  V +VL Y
Sbjct: 138 RLAIHNTLLLSSYTKL---DARLKPMVLLVKHWA-KRKQINSPYFGTLSSYGYVLMVLYY 193

Query: 195 IFHV 198
           + HV
Sbjct: 194 LIHV 197


>gi|63101121|gb|AAY33178.1| PAPa [Candida parapsilosis]
 gi|354544642|emb|CCE41367.1| hypothetical protein CPAR2_303560 [Candida parapsilosis]
          Length = 552

 Score = 40.0 bits (92), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 39/189 (20%), Positives = 78/189 (41%), Gaps = 22/189 (11%)

Query: 53  QVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKD----TWAHLVRDMLENEEKN--EHAEFR 106
           ++FTFGS  L  Y P  DID          +D     +  ++R   E EE N  + A   
Sbjct: 82  KIFTFGSYKLGVYGPSSDIDALVVVPRHVTRDDFFTVFEKILRGRQELEEINCVKEAFVP 141

Query: 107 VKEVQYIQAEVKIIKCLVD-------------NFVVDIAFNQLGGLCTLCFLDEVDHLIN 153
           + ++++    + ++   +D             N + +I    +  L      DE+  L+ 
Sbjct: 142 IIKLEFAGISIDLLFAKLDIPRVPHDLTLDDKNLLKNIDEKDMRALNGTRVTDEILRLVP 201

Query: 154 ENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRF 213
           ++ +FK ++  +K W    + I    +G     A   LV  I  ++  + +    +L +F
Sbjct: 202 KSTVFKNALRFVKMWAQQRA-IYANVYGFPGGVAWAMLVARICQLYPNAVSAV--ILEKF 258

Query: 214 LEFFSKFDW 222
            + +S++ W
Sbjct: 259 FQIYSQWSW 267


>gi|389738915|gb|EIM80110.1| hypothetical protein STEHIDRAFT_126102 [Stereum hirsutum FP-91666
           SS1]
          Length = 1326

 Score = 40.0 bits (92), Expect = 1.6,   Method: Composition-based stats.
 Identities = 51/196 (26%), Positives = 86/196 (43%), Gaps = 29/196 (14%)

Query: 20  ELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAF-- 76
           + + ++ P P     +  V   + RLI    P  ++ +FGS      L + D+DL     
Sbjct: 47  DFVIQLLPTPEELSVKEDVRKLLERLIRTIEPDSRLLSFGSTANGFSLRNSDMDLCCLID 106

Query: 77  SDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVD-------NFVV 129
           SD++        ++ D+LE E K     F VK + +  A + I+K  +D           
Sbjct: 107 SDERLSAADLVTMLGDLLERETK-----FHVKPLPH--ARIPIVKLSLDPSPGLPLGIAC 159

Query: 130 DIAF-NQLGGLCT---LCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISS 185
           DI F N+L    T    C+      +I+   + +  ++ +K WC    +I   + G +SS
Sbjct: 160 DIGFENRLALENTRLLYCYA-----MIDPTRV-RTLVLFLKVWCK-RRKINSPYQGTLSS 212

Query: 186 YALVTLVLY-IFHVFN 200
           Y  V LV+Y + HV N
Sbjct: 213 YGYVLLVIYFLVHVKN 228


>gi|402550488|pdb|4FH3|A Chain A, Crystal Structures Of The Cid1 Poly (U) Polymerase Reveal
           The Mechanism For Utp Selectivity
 gi|402550489|pdb|4FH5|A Chain A, Crystal Structures Of The Cid1 Poly (U) Polymerase Reveal
           The Mechanism For Utp Selectivity - Mgutp Bound
 gi|402550490|pdb|4FHP|A Chain A, Crystal Structures Of The Cid1 Poly (U) Polymerase Reveal
           The Mechanism For Utp Selectivity - Cautp Bound
 gi|402550491|pdb|4FHV|A Chain A, Crystal Structures Of The Cid1 Poly (U) Polymerase Reveal
           The Mechanism For Utp Selectivity - Mgctp Bound
 gi|402550492|pdb|4FHW|A Chain A, Crystal Structures Of The Cid1 Poly (U) Polymerase Reveal
           The Mechanism For Utp Selectivity - Mggtp Bound
 gi|402550494|pdb|4FHY|A Chain A, Crystal Structures Of The Cid1 Poly (U) Polymerase Reveal
           The Mechanism For Utp Selectivity - Mg 3'-Datp Bound
          Length = 349

 Score = 40.0 bits (92), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 50/184 (27%), Positives = 77/184 (41%), Gaps = 23/184 (12%)

Query: 24  RIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTL 82
           +I    F E+R  A    +R  + +  P  ++  FGS+     L + D+DL    D +  
Sbjct: 28  KISDKEFKEKR--AALDTLRLCLKRISPDAELVAFGSLESGLALKNSDMDLCVLMDSRVQ 85

Query: 83  KDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVD-------NFVVDIAFNQ 135
            DT A      L+  E+     F  K +Q  +A + IIK   D       +F  DI FN 
Sbjct: 86  SDTIA------LQFYEELIAEGFEGKFLQ--RARIPIIKLTSDTKNGFGASFQCDIGFNN 137

Query: 136 LGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVL-Y 194
              +     L     L   +   K  ++L+K W     +I   + G +SSY  V +VL Y
Sbjct: 138 RLAIHNTLLLSSYTKL---DARLKPMVLLVKHWA-KRKQINSPYFGTLSSYGYVLMVLYY 193

Query: 195 IFHV 198
           + HV
Sbjct: 194 LIHV 197


>gi|320039014|gb|EFW20949.1| hypothetical protein CPSG_02791 [Coccidioides posadasii str.
           Silveira]
          Length = 1241

 Score = 40.0 bits (92), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 22/101 (21%), Positives = 48/101 (47%), Gaps = 2/101 (1%)

Query: 125 DNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLIS 184
           D+ + D+++  L  L     LD +   I +   F+++   I AW  +    L    G + 
Sbjct: 770 DDPIFDLSYQALTKLQAFRDLDYIRRSIPDLAAFRKAHRFITAWAKHRGVYLS-RFGYLG 828

Query: 185 SYALVTLVLYIFHVFNGSF-AGPLEVLYRFLEFFSKFDWDN 224
              +  ++  +F +F G       +++YRF ++++ FDW++
Sbjct: 829 GIHITMMLSRVFKLFCGEVRVTSTDMIYRFFQYYADFDWEH 869


>gi|390136629|pdb|4EP7|A Chain A, Functional Implications From The Cid1 Poly(U) Polymerase
           Crystal Structure
 gi|390136630|pdb|4EP7|B Chain B, Functional Implications From The Cid1 Poly(U) Polymerase
           Crystal Structure
          Length = 340

 Score = 40.0 bits (92), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 50/184 (27%), Positives = 77/184 (41%), Gaps = 23/184 (12%)

Query: 24  RIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTL 82
           +I    F E+R  A    +R  + +  P  ++  FGS+     L + D+DL    D +  
Sbjct: 19  KISDKEFKEKR--AALDTLRLCLKRISPDAELVAFGSLESGLALKNSDMDLCVLMDSRVQ 76

Query: 83  KDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVD-------NFVVDIAFNQ 135
            DT A      L+  E+     F  K +Q  +A + IIK   D       +F  DI FN 
Sbjct: 77  SDTIA------LQFYEELIAEGFEGKFLQ--RARIPIIKLTSDTKNGFGASFQCDIGFNN 128

Query: 136 LGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVL-Y 194
              +     L     L   +   K  ++L+K W     +I   + G +SSY  V +VL Y
Sbjct: 129 RLAIHNTLLLSSYTKL---DARLKPMVLLVKHWA-KRKQINSPYFGTLSSYGYVLMVLYY 184

Query: 195 IFHV 198
           + HV
Sbjct: 185 LIHV 188


>gi|335308290|ref|XP_003361170.1| PREDICTED: PAP-associated domain-containing protein 5-like [Sus
           scrofa]
          Length = 511

 Score = 40.0 bits (92), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 44/139 (31%), Positives = 60/139 (43%), Gaps = 17/139 (12%)

Query: 9   GRW---LKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKT 64
           G W   L   E  ++    + P P  E+ R  V   +  +I + +P   V  FGS     
Sbjct: 4   GGWTGSLGLHEEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGL 63

Query: 65  YLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCL 123
           YLP  DIDL  F         W +L    LE E   +H       V+ + +A V IIK L
Sbjct: 64  YLPTSDIDLVVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-L 114

Query: 124 VDNFV---VDIAFNQLGGL 139
            D+F    VDI+FN   G+
Sbjct: 115 TDSFTEVKVDISFNVQNGV 133


>gi|440900205|gb|ELR51393.1| PAP-associated domain-containing protein 5, partial [Bos grunniens
           mutus]
          Length = 563

 Score = 40.0 bits (92), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
           EEI ++    + P P  E+ R  V   +  +I + +P   V  FGS     YLP  DIDL
Sbjct: 66  EEI-SDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 124

Query: 74  GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
             F         W +L    LE E   +H       V+ + +A V IIK L D+F    V
Sbjct: 125 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 175

Query: 130 DIAFNQLGGL 139
           DI+FN   G+
Sbjct: 176 DISFNVQNGV 185


>gi|119910013|ref|XP_001256516.1| PREDICTED: PAP-associated domain-containing protein 5 [Bos taurus]
 gi|297485254|ref|XP_002694925.1| PREDICTED: PAP-associated domain-containing protein 5 [Bos taurus]
 gi|296478153|tpg|DAA20268.1| TPA: DNA polymerase sigma-like [Bos taurus]
          Length = 467

 Score = 40.0 bits (92), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
           EEI ++    + P P  E+ R  V   +  +I + +P   V  FGS     YLP  DIDL
Sbjct: 17  EEI-SDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 75

Query: 74  GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
             F         W +L    LE E   +H       V+ + +A V IIK L D+F    V
Sbjct: 76  VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 126

Query: 130 DIAFNQLGGL 139
           DI+FN   G+
Sbjct: 127 DISFNVQNGV 136


>gi|397498213|ref|XP_003819879.1| PREDICTED: PAP-associated domain-containing protein 5, partial [Pan
           paniscus]
          Length = 593

 Score = 40.0 bits (92), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
           EEI+ +    + P P  E+ R  V   +  +I + +P   V  FGS     YLP  DIDL
Sbjct: 96  EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 154

Query: 74  GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
             F         W +L    LE E   +H       V+ + +A V IIK L D+F    V
Sbjct: 155 VVFG-------KWENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 205

Query: 130 DIAFNQLGGL 139
           DI+FN   G+
Sbjct: 206 DISFNVQNGV 215


>gi|303317898|ref|XP_003068951.1| Endonuclease/Exonuclease/phosphatase family protein [Coccidioides
           posadasii C735 delta SOWgp]
 gi|240108632|gb|EER26806.1| Endonuclease/Exonuclease/phosphatase family protein [Coccidioides
           posadasii C735 delta SOWgp]
          Length = 1241

 Score = 40.0 bits (92), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 22/101 (21%), Positives = 48/101 (47%), Gaps = 2/101 (1%)

Query: 125 DNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLIS 184
           D+ + D+++  L  L     LD +   I +   F+++   I AW  +    L    G + 
Sbjct: 770 DDPIFDLSYQALTKLQAFRDLDYIRRSIPDLAAFRKAHRFITAWAKHRGVYLS-RFGYLG 828

Query: 185 SYALVTLVLYIFHVFNGSF-AGPLEVLYRFLEFFSKFDWDN 224
              +  ++  +F +F G       +++YRF ++++ FDW++
Sbjct: 829 GIHITMMLSRVFKLFCGEVRVTSTDMIYRFFQYYADFDWEH 869


>gi|291410211|ref|XP_002721395.1| PREDICTED: DNA polymerase sigma-like [Oryctolagus cuniculus]
          Length = 522

 Score = 40.0 bits (92), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
           EEI+ +    + P P  E+ R  V   +  +I + +P   V  FGS     YLP  DIDL
Sbjct: 25  EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 83

Query: 74  GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
             F         W +L    LE E   +H       V+ + +A V IIK L D+F    V
Sbjct: 84  VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 134

Query: 130 DIAFNQLGGL 139
           DI+FN   G+
Sbjct: 135 DISFNVQNGV 144


>gi|59800139|sp|Q8NDF8.2|PAPD5_HUMAN RecName: Full=PAP-associated domain-containing protein 5; AltName:
           Full=Terminal uridylyltransferase 3; Short=TUTase 3;
           AltName: Full=Topoisomerase-related function protein
           4-2; Short=TRF4-2
          Length = 572

 Score = 40.0 bits (92), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
           EEI+ +    + P P  E+ R  V   +  +I + +P   V  FGS     YLP  DIDL
Sbjct: 122 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 180

Query: 74  GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
             F         W +L    LE E   +H       V+ + +A V IIK L D+F    V
Sbjct: 181 VVFG-------KWENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 231

Query: 130 DIAFNQLGGL 139
           DI+FN   G+
Sbjct: 232 DISFNVQNGV 241


>gi|345325980|ref|XP_001507597.2| PREDICTED: PAP-associated domain-containing protein 5-like
           [Ornithorhynchus anatinus]
          Length = 578

 Score = 40.0 bits (92), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 40/119 (33%), Positives = 58/119 (48%), Gaps = 12/119 (10%)

Query: 25  IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
           + P P  E+ R  V   +  +I + +P   V  FGS     YLP  DIDL  F   + L 
Sbjct: 90  MSPRPEEEKMRMEVVNRIENVIKELWPTADVQIFGSFKTGLYLPTSDIDLVVFGKWENLP 149

Query: 84  DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFV---VDIAFNQLGGL 139
             W  L   + +++  +EH+   VK +   +A V IIK L D+F    VDI+FN   G+
Sbjct: 150 -LWT-LEEALRKHKVADEHS---VKVLD--KATVPIIK-LTDSFTEVKVDISFNVQNGV 200


>gi|355710188|gb|EHH31652.1| hypothetical protein EGK_12764, partial [Macaca mulatta]
          Length = 564

 Score = 40.0 bits (92), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
           EEI ++    + P P  E+ R  V   +  +I + +P   V  FGS     YLP  DIDL
Sbjct: 67  EEI-SDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 125

Query: 74  GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
             F         W +L    LE E   +H       V+ + +A V IIK L D+F    V
Sbjct: 126 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 176

Query: 130 DIAFNQLGGL 139
           DI+FN   G+
Sbjct: 177 DISFNVQNGV 186


>gi|426382137|ref|XP_004057677.1| PREDICTED: PAP-associated domain-containing protein 5 isoform 1
           [Gorilla gorilla gorilla]
          Length = 631

 Score = 39.7 bits (91), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
           EEI+ +    + P P  E+ R  V   +  +I + +P   V  FGS     YLP  DIDL
Sbjct: 134 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 192

Query: 74  GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
             F         W +L    LE E   +H       V+ + +A V IIK L D+F    V
Sbjct: 193 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 243

Query: 130 DIAFNQLGGL 139
           DI+FN   G+
Sbjct: 244 DISFNVQNGV 253


>gi|403292555|ref|XP_003937307.1| PREDICTED: PAP-associated domain-containing protein 5 [Saimiri
           boliviensis boliviensis]
          Length = 631

 Score = 39.7 bits (91), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
           EEI+ +    + P P  E+ R  V   +  +I + +P   V  FGS     YLP  DIDL
Sbjct: 134 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 192

Query: 74  GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
             F         W +L    LE E   +H       V+ + +A V IIK L D+F    V
Sbjct: 193 VVFG-------KWENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 243

Query: 130 DIAFNQLGGL 139
           DI+FN   G+
Sbjct: 244 DISFNVQNGV 253


>gi|390477686|ref|XP_002760981.2| PREDICTED: PAP-associated domain-containing protein 5 isoform 1
           [Callithrix jacchus]
          Length = 631

 Score = 39.7 bits (91), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
           EEI+ +    + P P  E+ R  V   +  +I + +P   V  FGS     YLP  DIDL
Sbjct: 134 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 192

Query: 74  GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
             F         W +L    LE E   +H       V+ + +A V IIK L D+F    V
Sbjct: 193 VVFG-------KWENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 243

Query: 130 DIAFNQLGGL 139
           DI+FN   G+
Sbjct: 244 DISFNVQNGV 253


>gi|256818782|ref|NP_001035375.2| PAP-associated domain-containing protein 5 isoform b [Homo sapiens]
          Length = 651

 Score = 39.7 bits (91), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
           EEI+ +    + P P  E+ R  V   +  +I + +P   V  FGS     YLP  DIDL
Sbjct: 201 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 259

Query: 74  GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
             F         W +L    LE E   +H       V+ + +A V IIK L D+F    V
Sbjct: 260 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 310

Query: 130 DIAFNQLGGL 139
           DI+FN   G+
Sbjct: 311 DISFNVQNGV 320


>gi|359319041|ref|XP_535307.4| PREDICTED: PAP-associated domain-containing protein 5 [Canis lupus
           familiaris]
          Length = 641

 Score = 39.7 bits (91), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
           EEI+ +    + P P  E+ R  V   +  +I + +P   V  FGS     YLP  DIDL
Sbjct: 144 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 202

Query: 74  GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
             F         W +L    LE E   +H       V+ + +A V IIK L D+F    V
Sbjct: 203 VVFG-------KWENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 253

Query: 130 DIAFNQLGGL 139
           DI+FN   G+
Sbjct: 254 DISFNVQNGV 263


>gi|441597295|ref|XP_003263083.2| PREDICTED: PAP-associated domain-containing protein 5 isoform 1
           [Nomascus leucogenys]
 gi|348031139|emb|CCB84642.1| PAP associated domain containing 5 [Homo sapiens]
          Length = 631

 Score = 39.7 bits (91), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
           EEI+ +    + P P  E+ R  V   +  +I + +P   V  FGS     YLP  DIDL
Sbjct: 134 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 192

Query: 74  GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
             F         W +L    LE E   +H       V+ + +A V IIK L D+F    V
Sbjct: 193 VVFG-------KWENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 243

Query: 130 DIAFNQLGGL 139
           DI+FN   G+
Sbjct: 244 DISFNVQNGV 253


>gi|119603156|gb|EAW82750.1| PAP associated domain containing 5, isoform CRA_c [Homo sapiens]
          Length = 371

 Score = 39.7 bits (91), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
           EEI+ +    + P P  E+ R  V   +  +I + +P   V  FGS     YLP  DIDL
Sbjct: 30  EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 88

Query: 74  GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
             F         W +L    LE E   +H       V+ + +A V IIK L D+F    V
Sbjct: 89  VVFG-------KWENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 139

Query: 130 DIAFNQLGGL 139
           DI+FN   G+
Sbjct: 140 DISFNVQNGV 149


>gi|19115813|ref|NP_594901.1| poly(A) polymerase Cid1 [Schizosaccharomyces pombe 972h-]
 gi|15213942|sp|O13833.2|CID1_SCHPO RecName: Full=Poly(A) RNA polymerase protein cid1; AltName:
           Full=Caffeine-induced death protein 1
 gi|393715400|pdb|4E7X|A Chain A, Structural Basis For The Activity Of A Cytoplasmic Rna
           Terminal U- Transferase
 gi|393715401|pdb|4E7X|B Chain B, Structural Basis For The Activity Of A Cytoplasmic Rna
           Terminal U- Transferase
 gi|393715402|pdb|4E7X|C Chain C, Structural Basis For The Activity Of A Cytoplasmic Rna
           Terminal U- Transferase
 gi|393715403|pdb|4E7X|D Chain D, Structural Basis For The Activity Of A Cytoplasmic Rna
           Terminal U- Transferase
 gi|393715405|pdb|4E80|A Chain A, Structural Basis For The Activity Of A Cytoplasmic Rna
           Terminal U- Transferase
 gi|393715406|pdb|4E80|B Chain B, Structural Basis For The Activity Of A Cytoplasmic Rna
           Terminal U- Transferase
 gi|393715407|pdb|4E80|C Chain C, Structural Basis For The Activity Of A Cytoplasmic Rna
           Terminal U- Transferase
 gi|393715408|pdb|4E80|D Chain D, Structural Basis For The Activity Of A Cytoplasmic Rna
           Terminal U- Transferase
 gi|393715409|pdb|4E8F|A Chain A, Structural Basis For The Activity Of A Cytoplasmic Rna
           Terminal U- Transferase
 gi|393715410|pdb|4E8F|B Chain B, Structural Basis For The Activity Of A Cytoplasmic Rna
           Terminal U- Transferase
 gi|4324457|gb|AAD16889.1| caffeine-induced death protein 1 [Schizosaccharomyces pombe]
 gi|5524947|emb|CAB50789.1| poly(A) polymerase Cid1 [Schizosaccharomyces pombe]
          Length = 405

 Score = 39.7 bits (91), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 50/184 (27%), Positives = 76/184 (41%), Gaps = 23/184 (12%)

Query: 24  RIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTL 82
           +I    F E+R  A    +R  + +  P  ++  FGS+     L + D+DL    D +  
Sbjct: 56  KISDKEFKEKR--AALDTLRLCLKRISPDAELVAFGSLESGLALKNSDMDLCVLMDSRVQ 113

Query: 83  KDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVD-------NFVVDIAFNQ 135
            DT A    + L  E       F  K +Q  +A + IIK   D       +F  DI FN 
Sbjct: 114 SDTIALQFYEELIAE------GFEGKFLQ--RARIPIIKLTSDTKNGFGASFQCDIGFNN 165

Query: 136 LGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVL-Y 194
              +     L     L   +   K  ++L+K W     +I   + G +SSY  V +VL Y
Sbjct: 166 RLAIHNTLLLSSYTKL---DARLKPMVLLVKHWA-KRKQINSPYFGTLSSYGYVLMVLYY 221

Query: 195 IFHV 198
           + HV
Sbjct: 222 LIHV 225


>gi|380798533|gb|AFE71142.1| PAP-associated domain-containing protein 5 isoform a, partial
           [Macaca mulatta]
          Length = 618

 Score = 39.7 bits (91), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
           EEI+ +    + P P  E+ R  V   +  +I + +P   V  FGS     YLP  DIDL
Sbjct: 121 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 179

Query: 74  GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
             F         W +L    LE E   +H       V+ + +A V IIK L D+F    V
Sbjct: 180 VVFG-------KWENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 230

Query: 130 DIAFNQLGGL 139
           DI+FN   G+
Sbjct: 231 DISFNVQNGV 240


>gi|301756837|ref|XP_002914273.1| PREDICTED: PAP-associated domain-containing protein 5-like, partial
           [Ailuropoda melanoleuca]
          Length = 593

 Score = 39.7 bits (91), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
           EEI+ +    + P P  E+ R  V   +  +I + +P   V  FGS     YLP  DIDL
Sbjct: 143 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 201

Query: 74  GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
             F         W +L    LE E   +H       V+ + +A V IIK L D+F    V
Sbjct: 202 VVFG-------KWENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 252

Query: 130 DIAFNQLGGL 139
           DI+FN   G+
Sbjct: 253 DISFNVQNGV 262


>gi|256818780|ref|NP_001035374.2| PAP-associated domain-containing protein 5 isoform a [Homo sapiens]
 gi|194374871|dbj|BAG62550.1| unnamed protein product [Homo sapiens]
          Length = 698

 Score = 39.7 bits (91), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
           EEI+ +    + P P  E+ R  V   +  +I + +P   V  FGS     YLP  DIDL
Sbjct: 201 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 259

Query: 74  GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
             F         W +L    LE E   +H       V+ + +A V IIK L D+F    V
Sbjct: 260 VVFG-------KWENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 310

Query: 130 DIAFNQLGGL 139
           DI+FN   G+
Sbjct: 311 DISFNVQNGV 320


>gi|402908342|ref|XP_003916909.1| PREDICTED: PAP-associated domain-containing protein 5 [Papio
           anubis]
          Length = 605

 Score = 39.7 bits (91), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
           EEI+ +    + P P  E+ R  V   +  +I + +P   V  FGS     YLP  DIDL
Sbjct: 108 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 166

Query: 74  GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
             F         W +L    LE E   +H       V+ + +A V IIK L D+F    V
Sbjct: 167 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 217

Query: 130 DIAFNQLGGL 139
           DI+FN   G+
Sbjct: 218 DISFNVQNGV 227


>gi|426243516|ref|XP_004015600.1| PREDICTED: PAP-associated domain-containing protein 5 [Ovis aries]
          Length = 588

 Score = 39.7 bits (91), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
           EEI ++    + P P  E+ R  V   +  +I + +P   V  FGS     YLP  DIDL
Sbjct: 138 EEI-SDFYEYMSPRPEEEKMRMEVVNRIEGVIKELWPSADVQIFGSFKTGLYLPTSDIDL 196

Query: 74  GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
             F         W +L    LE E   +H       V+ + +A V IIK L D+F    V
Sbjct: 197 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 247

Query: 130 DIAFNQLGGL 139
           DI+FN   G+
Sbjct: 248 DISFNVQNGV 257


>gi|432853107|ref|XP_004067543.1| PREDICTED: PAP-associated domain-containing protein 5-like [Oryzias
           latipes]
          Length = 679

 Score = 39.7 bits (91), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 38/126 (30%), Positives = 55/126 (43%), Gaps = 10/126 (7%)

Query: 25  IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
           I P P  E+ R  V   ++ +I   +P  +V  FGS     YLP  DIDL  F   +TL 
Sbjct: 197 ISPRPEEEKMRLEVVDRIKGVIHDLWPSAEVQVFGSFSTGLYLPTSDIDLVVFGKWETLP 256

Query: 84  DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL--VDNFVVDIAFNQLGGLCT 141
             W   + + L      + +  +V +    +A V IIK    V    VDI+FN   G+  
Sbjct: 257 -LWT--LEEALRKRNVADKSAIKVLD----KATVPIIKLTDSVTEVKVDISFNVESGVKA 309

Query: 142 LCFLDE 147
              + E
Sbjct: 310 ARLIKE 315


>gi|344301689|gb|EGW31994.1| Poly(A) polymerase PAPalpha [Spathaspora passalidarum NRRL Y-27907]
          Length = 556

 Score = 39.7 bits (91), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 45/206 (21%), Positives = 79/206 (38%), Gaps = 31/206 (15%)

Query: 53  QVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKD----TWAHLVRDMLENEEKNE------- 101
           +VFTFGS  L  Y P  DID          ++     +  ++R   E +E          
Sbjct: 82  KVFTFGSYRLGVYGPGSDIDTLVVVPKHVTREDFFTVFEQIIRKRPELQEIASVPDAFVP 141

Query: 102 --HAEFRVKEVQYIQAEVKIIKCLVD------NFVVDIAFNQLGGLCTLCFLDEVDHLIN 153
               EF    +  I A + + +  +D      N + +I    L  L      DE+  L+ 
Sbjct: 142 IIKIEFDGISIDLILARLNVPRVPLDMTLDDKNLLKNIDERDLRSLNGTRVTDEILQLVP 201

Query: 154 ENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRF 213
           +  +FK ++  IK W    + + G   G     A   LV  I  ++  + +    ++ +F
Sbjct: 202 KPTVFKHALRCIKLWAQQRA-VYGNVFGFPGGVAWAMLVARICQLYPNAVSA--VIVEKF 258

Query: 214 LEFFSKFDWDNFCLSLWGPVPISLLP 239
              ++K++W         P P+ L P
Sbjct: 259 FNIYTKWNW---------PQPVLLKP 275


>gi|332845909|ref|XP_003315148.1| PREDICTED: PAP-associated domain-containing protein 5 [Pan
           troglodytes]
          Length = 586

 Score = 39.7 bits (91), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
           EEI+ +    + P P  E+ R  V   +  +I + +P   V  FGS     YLP  DIDL
Sbjct: 89  EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 147

Query: 74  GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
             F         W +L    LE E   +H       V+ + +A V IIK L D+F    V
Sbjct: 148 VVFG-------KWENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 198

Query: 130 DIAFNQLGGL 139
           DI+FN   G+
Sbjct: 199 DISFNVQNGV 208


>gi|281338901|gb|EFB14485.1| hypothetical protein PANDA_002140 [Ailuropoda melanoleuca]
          Length = 632

 Score = 39.7 bits (91), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
           EEI+ +    + P P  E+ R  V   +  +I + +P   V  FGS     YLP  DIDL
Sbjct: 135 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 193

Query: 74  GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
             F         W +L    LE E   +H       V+ + +A V IIK L D+F    V
Sbjct: 194 VVFG-------KWENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 244

Query: 130 DIAFNQLGGL 139
           DI+FN   G+
Sbjct: 245 DISFNVQNGV 254


>gi|119603155|gb|EAW82749.1| PAP associated domain containing 5, isoform CRA_b [Homo sapiens]
          Length = 374

 Score = 39.7 bits (91), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
           EEI+ +    + P P  E+ R  V   +  +I + +P   V  FGS     YLP  DIDL
Sbjct: 30  EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 88

Query: 74  GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
             F         W +L    LE E   +H       V+ + +A V IIK L D+F    V
Sbjct: 89  VVFG-------KWENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 139

Query: 130 DIAFNQLGGL 139
           DI+FN   G+
Sbjct: 140 DISFNVQNGV 149


>gi|395505923|ref|XP_003757286.1| PREDICTED: PAP-associated domain-containing protein 5, partial
           [Sarcophilus harrisii]
          Length = 615

 Score = 39.7 bits (91), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
           EEI+ +    + P P  E+ R  V   +  +I + +P   V  FGS     YLP  DIDL
Sbjct: 118 EEIS-DFYEYMSPRPEEEKMRMEVVNRIENVIKELWPSADVQIFGSFKTGLYLPTSDIDL 176

Query: 74  GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
             F         W +L    LE E   +H       V+ + +A V IIK L D+F    V
Sbjct: 177 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 227

Query: 130 DIAFNQLGGL 139
           DI+FN   G+
Sbjct: 228 DISFNVQNGV 237


>gi|297283970|ref|XP_002802516.1| PREDICTED: PAP-associated domain-containing protein 5 [Macaca
           mulatta]
          Length = 653

 Score = 39.7 bits (91), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
           EEI+ +    + P P  E+ R  V   +  +I + +P   V  FGS     YLP  DIDL
Sbjct: 203 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 261

Query: 74  GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
             F         W +L    LE E   +H       V+ + +A V IIK L D+F    V
Sbjct: 262 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 312

Query: 130 DIAFNQLGGL 139
           DI+FN   G+
Sbjct: 313 DISFNVQNGV 322


>gi|344289184|ref|XP_003416325.1| PREDICTED: PAP-associated domain-containing protein 5-like
           [Loxodonta africana]
          Length = 595

 Score = 39.7 bits (91), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
           EEI ++    + P P  E+ R  V   +  +I + +P   V  FGS     YLP  DIDL
Sbjct: 98  EEI-SDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 156

Query: 74  GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
             F         W +L    LE E   +H       V+ + +A V IIK L D+F    V
Sbjct: 157 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 207

Query: 130 DIAFNQLGGL 139
           DI+FN   G+
Sbjct: 208 DISFNVQNGV 217


>gi|119603153|gb|EAW82747.1| PAP associated domain containing 5, isoform CRA_a [Homo sapiens]
 gi|119603154|gb|EAW82748.1| PAP associated domain containing 5, isoform CRA_a [Homo sapiens]
          Length = 527

 Score = 39.7 bits (91), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
           EEI+ +    + P P  E+ R  V   +  +I + +P   V  FGS     YLP  DIDL
Sbjct: 30  EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 88

Query: 74  GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
             F         W +L    LE E   +H       V+ + +A V IIK L D+F    V
Sbjct: 89  VVFG-------KWENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 139

Query: 130 DIAFNQLGGL 139
           DI+FN   G+
Sbjct: 140 DISFNVQNGV 149


>gi|410983511|ref|XP_003998082.1| PREDICTED: PAP-associated domain-containing protein 5 [Felis catus]
          Length = 514

 Score = 39.7 bits (91), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
           EEI+ +    + P P  E+ R  V   +  +I + +P   V  FGS     YLP  DIDL
Sbjct: 64  EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 122

Query: 74  GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
             F         W +L    LE E   +H       V+ + +A V IIK L D+F    V
Sbjct: 123 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 173

Query: 130 DIAFNQLGGL 139
           DI+FN   G+
Sbjct: 174 DISFNVQNGV 183


>gi|297283968|ref|XP_001083145.2| PREDICTED: PAP-associated domain-containing protein 5 isoform 2
           [Macaca mulatta]
          Length = 700

 Score = 39.7 bits (91), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
           EEI+ +    + P P  E+ R  V   +  +I + +P   V  FGS     YLP  DIDL
Sbjct: 203 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 261

Query: 74  GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
             F         W +L    LE E   +H       V+ + +A V IIK L D+F    V
Sbjct: 262 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 312

Query: 130 DIAFNQLGGL 139
           DI+FN   G+
Sbjct: 313 DISFNVQNGV 322


>gi|162317662|gb|AAI56330.1| PAP associated domain containing 5 [synthetic construct]
 gi|162318878|gb|AAI57080.1| PAP associated domain containing 5 [synthetic construct]
          Length = 442

 Score = 39.7 bits (91), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 40/120 (33%), Positives = 54/120 (45%), Gaps = 14/120 (11%)

Query: 25  IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
           + P P  E+ R  V   +  +I + +P   V  FGS     YLP  DIDL  F       
Sbjct: 1   MSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLVVFGK----- 55

Query: 84  DTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---VDIAFNQLGGL 139
             W +L    LE E   +H       V+ + +A V IIK L D+F    VDI+FN   G+
Sbjct: 56  --WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKVDISFNVQNGV 111


>gi|297698707|ref|XP_002826459.1| PREDICTED: PAP-associated domain-containing protein 5 [Pongo
           abelii]
          Length = 588

 Score = 39.3 bits (90), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)

Query: 15  EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
           EEI+ +    + P P  E+ R  V   +  +I + +P   V  FGS     YLP  DIDL
Sbjct: 91  EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 149

Query: 74  GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
             F         W +L    LE E   +H       V+ + +A V IIK L D+F    V
Sbjct: 150 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 200

Query: 130 DIAFNQLGGL 139
           DI+FN   G+
Sbjct: 201 DISFNVQNGV 210


>gi|324975490|gb|ADY62673.1| PAPa [Candida orthopsilosis]
          Length = 372

 Score = 39.3 bits (90), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 41/189 (21%), Positives = 76/189 (40%), Gaps = 22/189 (11%)

Query: 53  QVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKD----TWAHLVRDMLENEEKNE------- 101
           ++FTFGS  L  Y P  DID          ++     +  ++R   E EE N        
Sbjct: 82  KIFTFGSYRLGVYGPSSDIDALVVVPRHVTREDFFTVFEKILRGRPELEEINSVKEAFVP 141

Query: 102 --HAEFRVKEVQYIQAEVKIIKCLVD------NFVVDIAFNQLGGLCTLCFLDEVDHLIN 153
               EF    +  + A++ I +   D      N + +I    +  L      DE+  L+ 
Sbjct: 142 IIKLEFAGISIDLLFAKLDIPRVPHDLTLDDKNLLKNIDEKDMRALNGTRVTDEILRLVP 201

Query: 154 ENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRF 213
           +  +FK ++  IK W    + +    +G     A   LV  I  ++  + +    +L +F
Sbjct: 202 KPTVFKNALRFIKMWAQQRA-VYANVYGFPGGVAWAMLVARICQLYPNAVSA--VILEKF 258

Query: 214 LEFFSKFDW 222
            + +S+++W
Sbjct: 259 FQIYSQWNW 267


>gi|241855549|ref|XP_002416033.1| PAP-associated domain-containing protein, putative [Ixodes
           scapularis]
 gi|215510247|gb|EEC19700.1| PAP-associated domain-containing protein, putative [Ixodes
           scapularis]
          Length = 347

 Score = 39.3 bits (90), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 51/199 (25%), Positives = 82/199 (41%), Gaps = 19/199 (9%)

Query: 25  IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
           +QP P   E R  V   ++ +I+  +P  +V  FGS     YLP  DID+      +TL 
Sbjct: 12  MQPSPAEHEMRLGVIQRIKEVILSLWPQAEVEIFGSFRTGLYLPTSDIDVVVLGKWETLP 71

Query: 84  DTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVD---NFVVDIAFNQLGGL 139
             W  L + +L       H     + ++ + +A V I+K L D      VDI+FN   G+
Sbjct: 72  -MWT-LEKALL------THGIAEPRSIKVLDKASVPIVK-LTDARTTVKVDISFNMNNGV 122

Query: 140 CTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVF 199
            +   +        E       ++L+      +  +     G ISSY+L+ + +    + 
Sbjct: 123 KSARLIKS----FKEKFPALAKLVLVLKQFLLQRDLNEVFTGGISSYSLILMTVSFLQLH 178

Query: 200 NGSFAGPLEVLYR-FLEFF 217
                GP   L    LEFF
Sbjct: 179 PRGGDGPNPNLGTLLLEFF 197


>gi|324975502|gb|ADY62684.1| PAPa [Candida orthopsilosis]
          Length = 547

 Score = 39.3 bits (90), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 44/189 (23%), Positives = 78/189 (41%), Gaps = 22/189 (11%)

Query: 53  QVFTFGSVPLKTYLPDRDID-LGAFSDDQTLKD---TWAHLVRDMLENEEKNEHA----- 103
           ++FTFGS  L  Y P  DID L       T +D   T+  ++R   E +E N  +     
Sbjct: 81  KIFTFGSYRLGVYGPSSDIDALVVVPRHVTREDFFTTFDKIIRQRSELQEINGVSDAFVP 140

Query: 104 ----EFRVKEVQYIQAEVKIIKCLVD------NFVVDIAFNQLGGLCTLCFLDEVDHLIN 153
               EF    +  I A + + +  +D      N + ++    L  L      DE+  L+ 
Sbjct: 141 IIKLEFDGISLDLIMARLNVPRVPLDMTLDDKNLLKNLDERDLRSLNGTRVTDEILQLVP 200

Query: 154 ENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRF 213
           +  +FK ++  IK W   E  + G   G     A   L   I  ++  + +    ++ +F
Sbjct: 201 KPGVFKHALRCIKLWA-QERAVYGNVFGFPGGVAWAMLTARICQLYPNAVSAV--IVEKF 257

Query: 214 LEFFSKFDW 222
              ++K++W
Sbjct: 258 FNIYTKWNW 266


>gi|328772133|gb|EGF82172.1| hypothetical protein BATDEDRAFT_23561 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 752

 Score = 39.3 bits (90), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 32/120 (26%), Positives = 55/120 (45%), Gaps = 10/120 (8%)

Query: 25  IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
           ++P       R    A VR+++ Q +   +V  FGS   K YLP  D+D+    D   L 
Sbjct: 189 VRPTEAEHSLRKLTIARVRKIVKQIWADAEVHVFGSFQTKLYLPSSDVDIVVVGDSCVLP 248

Query: 84  DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL--VDNFVVDIAFNQLGGLCT 141
                L +     E+ +  +   V E    + +V IIK +  + +F +DI+FN + G+ +
Sbjct: 249 KCLRQLAKAF---EKADTLSRMEVIE----KTKVPIIKGVDKLTHFSLDISFNMVNGIKS 301


>gi|402890991|ref|XP_003908748.1| PREDICTED: poly(A) polymerase gamma [Papio anubis]
          Length = 700

 Score = 39.3 bits (90), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 38/183 (20%), Positives = 72/183 (39%), Gaps = 22/183 (12%)

Query: 46  IIQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEF 105
           ++     ++FTFGS  L  +    DID    +     +  +     + L++         
Sbjct: 88  VVATVGGKIFTFGSYRLGVHTKGADIDALCVAPRHVERSDFFQSFFEKLKH--------- 138

Query: 106 RVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLC-FLDEVDHLINENHLFKRSIIL 164
                   Q  ++ ++ + D FV  I F +  G+   C   DE+ HL+     F+ ++  
Sbjct: 139 --------QDGIRNLRAVEDAFVPVIKF-EFDGIEVRCRVTDEILHLVPNKETFRLTLRA 189

Query: 165 IKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDN 224
           +K W      I     G +   +   LV     ++  + A  L  +++F   FSK++W N
Sbjct: 190 VKLWAKRRG-IYSNMLGFLGGVSWAMLVARTCQLYPNAAASTL--VHKFFLVFSKWEWPN 246

Query: 225 FCL 227
             L
Sbjct: 247 PVL 249


>gi|324975520|gb|ADY62700.1| PAPa [Candida orthopsilosis]
          Length = 547

 Score = 39.3 bits (90), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 44/189 (23%), Positives = 78/189 (41%), Gaps = 22/189 (11%)

Query: 53  QVFTFGSVPLKTYLPDRDID-LGAFSDDQTLKD---TWAHLVRDMLENEEKNEHA----- 103
           ++FTFGS  L  Y P  DID L       T +D   T+  ++R   E +E N  +     
Sbjct: 81  KIFTFGSYRLGVYGPSSDIDALVVVPRHVTREDFFTTFDKIIRQRPELQEINGVSDAFVP 140

Query: 104 ----EFRVKEVQYIQAEVKIIKCLVD------NFVVDIAFNQLGGLCTLCFLDEVDHLIN 153
               EF    +  I A + + +  +D      N + ++    L  L      DE+  L+ 
Sbjct: 141 IIKLEFDGISLDLIMARLNVPRVPLDMTLDDKNLLKNLDERDLRSLNGTRVTDEILQLVP 200

Query: 154 ENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRF 213
           +  +FK ++  IK W   E  + G   G     A   L   I  ++  + +    ++ +F
Sbjct: 201 KPGVFKHALRCIKLWA-QERAVYGNVFGFPGGVAWAMLTARICQLYPNAVSAV--IVEKF 257

Query: 214 LEFFSKFDW 222
              ++K++W
Sbjct: 258 FNIYTKWNW 266


>gi|324975487|gb|ADY62671.1| PAPa [Candida metapsilosis]
          Length = 547

 Score = 39.3 bits (90), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 44/189 (23%), Positives = 78/189 (41%), Gaps = 22/189 (11%)

Query: 53  QVFTFGSVPLKTYLPDRDID-LGAFSDDQTLKD---TWAHLVRDMLENEEKNEHA----- 103
           ++FTFGS  L  Y P  DID L       T +D   T+  ++R   E +E N  +     
Sbjct: 81  KIFTFGSYRLGVYGPSSDIDALVVVPRHVTREDFFTTFDQIIRKRPELQEINGVSDAFVP 140

Query: 104 ----EFRVKEVQYIQAEVKIIKCLVD------NFVVDIAFNQLGGLCTLCFLDEVDHLIN 153
               EF    +  I A + + +  +D      N + ++    L  L      DE+  L+ 
Sbjct: 141 IIKLEFDGISLDLIMARLNVPRVPLDMTLDDKNLLKNLDERDLRSLNGTRVTDEILQLVP 200

Query: 154 ENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRF 213
           +  +FK ++  IK W   E  + G   G     A   L   I  ++  + +    ++ +F
Sbjct: 201 KPGVFKHALRCIKLWA-QERAVYGNVFGFPGGVAWAMLTARICQLYPNAVSSV--IVEKF 257

Query: 214 LEFFSKFDW 222
              ++K++W
Sbjct: 258 FSIYTKWNW 266


>gi|224064673|ref|XP_002197521.1| PREDICTED: PAP-associated domain-containing protein 5 isoform 1
           [Taeniopygia guttata]
          Length = 443

 Score = 39.3 bits (90), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 38/119 (31%), Positives = 53/119 (44%), Gaps = 12/119 (10%)

Query: 25  IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
           + P P  E  R  V   +  +I + +P   V  FGS     YLP  DIDL  F   +TL 
Sbjct: 1   MSPRPEEETMRMEVVNRIENVIKELWPNADVQIFGSFKTGLYLPTSDIDLVVFGKWETLP 60

Query: 84  DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFV---VDIAFNQLGGL 139
             W   + + L      +    +V +    +A V IIK L D+F    VDI+FN   G+
Sbjct: 61  -LWT--LEEALRKHNVADENSVKVLD----KATVPIIK-LTDSFTEVKVDISFNVQNGV 111


>gi|344305107|gb|EGW35339.1| hypothetical protein SPAPADRAFT_48344 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 615

 Score = 39.3 bits (90), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 55/216 (25%), Positives = 89/216 (41%), Gaps = 26/216 (12%)

Query: 29  PFSEE--RRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDT 85
           P S+E   RN V   ++  I + +P  +   FGS     YLP  DID+   S      +T
Sbjct: 197 PSSDEIVTRNTVVNRLKTQIAKFWPGTEAHVFGSCATDLYLPGSDIDMVVIS------ET 250

Query: 86  WAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVD---NFVVDIAFNQLGGLCT 141
             +  R  L        ++   K V+ I  A+V IIK  VD      +D++F +  G+  
Sbjct: 251 GDYENRSRLYQLSSFLRSKKLAKNVEVIANAKVPIIK-FVDPESEIHIDVSFERTNGIDA 309

Query: 142 LCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNG 201
              + +   LI    L  R ++LI        R+   H G +  YA + +  +   +   
Sbjct: 310 AKRIRK--WLITTPGL--RELVLIVKQFLRSRRLNNVHVGGLGGYATIIMCYHFLRLHPK 365

Query: 202 SFAGPLEVLYR----FLEFFS----KFDWDNFCLSL 229
              G +++L       +EFF      F +DN  +SL
Sbjct: 366 VSTGSIDILDNLGVLLIEFFELYGRNFSYDNLIISL 401


>gi|49899785|gb|AAH76872.1| LOC445836 protein, partial [Xenopus laevis]
          Length = 563

 Score = 39.3 bits (90), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 40/120 (33%), Positives = 54/120 (45%), Gaps = 14/120 (11%)

Query: 25  IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
           + P P  E+ R  V   +  +I + +P   V  FGS     YLP  DIDL  F       
Sbjct: 80  MSPRPEEEKMRMEVVNRIENVIKELWPNADVQIFGSFKTGLYLPTSDIDLVVFGK----- 134

Query: 84  DTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---VDIAFNQLGGL 139
             W +L    LE E   +H       V+ + +A V IIK L D+F    VDI+FN   G+
Sbjct: 135 --WENLPLWTLE-EALRKHNVADENSVKVLDKATVPIIK-LTDSFTEVKVDISFNVQNGV 190


>gi|410077415|ref|XP_003956289.1| hypothetical protein KAFR_0C01610 [Kazachstania africana CBS 2517]
 gi|372462873|emb|CCF57154.1| hypothetical protein KAFR_0C01610 [Kazachstania africana CBS 2517]
          Length = 537

 Score = 39.3 bits (90), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 54/245 (22%), Positives = 100/245 (40%), Gaps = 33/245 (13%)

Query: 11  WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
           WL  E    + ++ I P     E RN   + +R  + + +P   +  FGS     YLP  
Sbjct: 144 WLTLE--MKDFVSYISPSSTEIEDRNITISRIRDAVKELWPDADLHVFGSYSTDLYLPGS 201

Query: 70  DIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLV--DN 126
           DID    + ++  KD+     ++ L    K    +    +V+ + +A V IIK +     
Sbjct: 202 DIDC-VVNSERGNKDS-----KNCLYQLAKFLTTKKLATDVEVVSKARVPIIKFVEPHTG 255

Query: 127 FVVDIAFNQLGGLCTL----CFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGL 182
             +D++F +  GL        +LD    L        R ++L+     +  R+   H G 
Sbjct: 256 IHIDVSFERTNGLEAAKLIRSWLDSTAGL--------RELVLVIKQFLHARRLNNVHTGG 307

Query: 183 ISSYALVTLVLYIFHVFNGSFAGPLE-------VLYRFLEFFSK-FDWDNFCLSLWGPVP 234
           +  ++++ LV    H+        ++       +L  F E + K F +D+  +S+    P
Sbjct: 308 LGGFSIICLVFTFLHMHPRIITNEIDPIDNLGVLLIDFFELYGKNFGYDDVAISVLNGHP 367

Query: 235 ISLLP 239
            S +P
Sbjct: 368 -SYIP 371


>gi|449282422|gb|EMC89255.1| PAP-associated domain-containing protein 5, partial [Columba livia]
          Length = 501

 Score = 39.3 bits (90), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 38/119 (31%), Positives = 53/119 (44%), Gaps = 12/119 (10%)

Query: 25  IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
           + P P  E  R  V   +  +I + +P   V  FGS     YLP  DIDL  F   +TL 
Sbjct: 12  MSPRPEEERMRMEVVNRIENVIKELWPNADVQIFGSFKTGLYLPTSDIDLVVFGKWETLP 71

Query: 84  DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFV---VDIAFNQLGGL 139
             W   + + L      +    +V +    +A V IIK L D+F    VDI+FN   G+
Sbjct: 72  -LWT--LEEALRKHNVADENSVKVLD----KATVPIIK-LTDSFTEVKVDISFNVQNGV 122


>gi|190345571|gb|EDK37480.2| hypothetical protein PGUG_01578 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 588

 Score = 39.3 bits (90), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 56/232 (24%), Positives = 95/232 (40%), Gaps = 26/232 (11%)

Query: 11  WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
           WL  E    + +  I P       RN V   ++  I + +P  +V  FGS     YLP  
Sbjct: 167 WLTLE--IKDFVNYISPSKLEITTRNNVIGRLKSTITKFWPDTEVHVFGSSATDLYLPGS 224

Query: 70  DIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVD--- 125
           DID+   S D   +       R  L     +  ++   K ++ I +A+V I+K  VD   
Sbjct: 225 DIDMVVISRDGDREQ------RSRLYQLSTHLRSKKLAKNIEVIAKAKVPIVK-FVDPDS 277

Query: 126 NFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISS 185
           N  +D++F +  G+     + E   L +   L  R ++L+        R+   H G +  
Sbjct: 278 NIHIDVSFERSNGIDAAIKIREW--LASTPGL--RELVLVVKQFLRSRRLNNVHVGGLGG 333

Query: 186 YALVTLVLYIFHVF------NGSFAGPL-EVLYRFLEFFSK-FDWDNFCLSL 229
           Y+ + L  +   +       N S    L  +L  F E + + F +DN  L++
Sbjct: 334 YSTIILCYHFLKLHPRVATENMSILDNLGSLLIEFFELYGRNFSYDNLILAI 385


>gi|254579541|ref|XP_002495756.1| ZYRO0C02332p [Zygosaccharomyces rouxii]
 gi|238938647|emb|CAR26823.1| ZYRO0C02332p [Zygosaccharomyces rouxii]
          Length = 531

 Score = 38.9 bits (89), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 54/241 (22%), Positives = 93/241 (38%), Gaps = 34/241 (14%)

Query: 11  WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
           WL  E    + +A I P     E RN     +R  + + +P   +  FGS     YLP  
Sbjct: 100 WLTLE--IRDFVAYISPSRQEIELRNKTIRTLRHAVRKLWPGADLQVFGSYATDLYLPGS 157

Query: 70  DID--LGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL--VD 125
           DID  + + + D+  + +   L    L+N +     E   K      A V IIK +    
Sbjct: 158 DIDCVINSKTGDKENRSSLYELAH-FLKNRKLATQVEVIAK------ARVPIIKFVEPTS 210

Query: 126 NFVVDIAFNQLGGLCTL----CFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHG 181
              VD++F +  GL        +L +   L        R ++LI     +  R+   H G
Sbjct: 211 QIHVDVSFERTNGLEAAKLIRSWLQQTPGL--------RELVLIVKQFLHARRLNNVHTG 262

Query: 182 LISSYALVTLVLYIFHVFNGSFAGPLEVLYR-------FLEFFSK-FDWDNFCLSLWGPV 233
            +  ++++ LV    ++      G ++  Y        F E + K F +D+  + +    
Sbjct: 263 GLGGFSIICLVYAFLNLHPRIVTGEIDARYNLGVLLIDFFELYGKNFGYDDVAVVVADQR 322

Query: 234 P 234
           P
Sbjct: 323 P 323


>gi|146419896|ref|XP_001485907.1| hypothetical protein PGUG_01578 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 588

 Score = 38.9 bits (89), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 56/232 (24%), Positives = 95/232 (40%), Gaps = 26/232 (11%)

Query: 11  WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
           WL  E    + +  I P       RN V   ++  I + +P  +V  FGS     YLP  
Sbjct: 167 WLTLE--IKDFVNYISPSKLEITTRNNVIGRLKSTITKFWPDTEVHVFGSSATDLYLPGS 224

Query: 70  DIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVD--- 125
           DID+   S D   +       R  L     +  ++   K ++ I +A+V I+K  VD   
Sbjct: 225 DIDMVVISRDGDREQ------RSRLYQLSTHLRSKKLAKNIEVIAKAKVPIVK-FVDPDS 277

Query: 126 NFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISS 185
           N  +D++F +  G+     + E   L +   L  R ++L+        R+   H G +  
Sbjct: 278 NIHIDVSFERSNGIDAAIKIREW--LASTPGL--RELVLVVKQFLRSRRLNNVHVGGLGG 333

Query: 186 YALVTLVLYIFHVF------NGSFAGPL-EVLYRFLEFFSK-FDWDNFCLSL 229
           Y+ + L  +   +       N S    L  +L  F E + + F +DN  L++
Sbjct: 334 YSTIILCYHFLKLHPRVATENMSILDNLGSLLIEFFELYGRNFSYDNLILAI 385


>gi|444720754|gb|ELW61529.1| HEAT repeat-containing protein 3 [Tupaia chinensis]
          Length = 1047

 Score = 38.9 bits (89), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 40/120 (33%), Positives = 54/120 (45%), Gaps = 14/120 (11%)

Query: 25  IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
           + P P  E+ R  V   +  +I + +P   V  FGS     YLP  DIDL  F       
Sbjct: 657 MSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLVVFG------ 710

Query: 84  DTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---VDIAFNQLGGL 139
             W +L    LE E   +H       V+ + +A V IIK L D+F    VDI+FN   G+
Sbjct: 711 -KWENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKVDISFNVQNGV 767


>gi|327278603|ref|XP_003224050.1| PREDICTED: PAP-associated domain-containing protein 5-like [Anolis
           carolinensis]
          Length = 665

 Score = 38.9 bits (89), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 37/119 (31%), Positives = 53/119 (44%), Gaps = 12/119 (10%)

Query: 25  IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
           + P P  +  R  V   +  +I + +P   V  FGS     YLP  DIDL  F   +TL 
Sbjct: 180 MSPRPEEQRMRMEVVNRIENVIKELWPNADVQIFGSFKTGLYLPTSDIDLVVFGKWETLP 239

Query: 84  DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFV---VDIAFNQLGGL 139
             W   + + L      +    +V +    +A V IIK L D+F    VDI+FN   G+
Sbjct: 240 -LWT--LEEALRKHNVADKGSVKVLD----KATVPIIK-LTDSFTEVKVDISFNVQNGV 290


>gi|449472874|ref|XP_004176276.1| PREDICTED: PAP-associated domain-containing protein 5 isoform 2
           [Taeniopygia guttata]
          Length = 490

 Score = 38.9 bits (89), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 38/119 (31%), Positives = 53/119 (44%), Gaps = 12/119 (10%)

Query: 25  IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
           + P P  E  R  V   +  +I + +P   V  FGS     YLP  DIDL  F   +TL 
Sbjct: 1   MSPRPEEETMRMEVVNRIENVIKELWPNADVQIFGSFKTGLYLPTSDIDLVVFGKWETLP 60

Query: 84  DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFV---VDIAFNQLGGL 139
             W   + + L      +    +V +    +A V IIK L D+F    VDI+FN   G+
Sbjct: 61  -LWT--LEEALRKHNVADENSVKVLD----KATVPIIK-LTDSFTEVKVDISFNVQNGV 111


>gi|389744511|gb|EIM85694.1| poly-A polymerase [Stereum hirsutum FP-91666 SS1]
          Length = 628

 Score = 38.9 bits (89), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 45/195 (23%), Positives = 72/195 (36%), Gaps = 24/195 (12%)

Query: 53  QVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQY 112
           ++FTFGS  L  + P  DID         L++ +  +   ML   E        V E   
Sbjct: 84  KIFTFGSYRLGVHGPGSDIDTLCVVPKHVLREDFFDVFEQMLRETEGVTECSG-VPEAYV 142

Query: 113 IQAEVKIIKCLVDNFVVDIAF------------NQLGGLCTLCF--------LDEVDHLI 152
              +VKI    +D  +  +A             N L  L   C          DE+  L+
Sbjct: 143 PIVKVKISGIPIDFLMARLALSTIPDDLSLQDDNLLRNLDDRCIRSLGGSRVTDEILRLV 202

Query: 153 NENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYR 212
              ++F+ S+  IK W    + I    +G +   A   LV  I  ++  + AG   ++ R
Sbjct: 203 PNVNVFRDSLRCIKLWAQRRA-IYSNVNGFLGGVAWAMLVARICQLYPNAIAG--AIVSR 259

Query: 213 FLEFFSKFDWDNFCL 227
           F     ++ W    L
Sbjct: 260 FFIIMYQWSWPQPVL 274


>gi|367040851|ref|XP_003650806.1| hypothetical protein THITE_2110633 [Thielavia terrestris NRRL 8126]
 gi|346998067|gb|AEO64470.1| hypothetical protein THITE_2110633 [Thielavia terrestris NRRL 8126]
          Length = 759

 Score = 38.9 bits (89), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 52/237 (21%), Positives = 98/237 (41%), Gaps = 22/237 (9%)

Query: 11  WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
           WL  E +  +    I+P  F E  R  +  +++    + F   +V+ FGS P   YLP  
Sbjct: 392 WLHKEVV--DFYEYIKPRDFEERLRGELVEHLKTFCRKTFKDAEVYPFGSFPSGLYLPTA 449

Query: 70  DIDLGAFSDDQTLKDTWAHLVRDML---ENEEKNEHAEFRVKEVQYIQAEVKIIKCLV-- 124
           D+DL   SD         +  +  L    ++ KN    +  +    + A+V ++K +   
Sbjct: 450 DMDLAFISDSYAKGGVPRYGTKSFLYRFRSQLKNHRIAWEDEIELIVGAKVPLVKFIEHR 509

Query: 125 DNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSII-LIKAWCYYESRILGGHHGLI 183
               VDI+F    GL  +    E      E +    +++ LIK +      +    +G I
Sbjct: 510 TGLKVDISFENRTGLTAI----ETFKAWREQYPGMPALVTLIKHFLLMRG-LNEPVNGGI 564

Query: 184 SSYALVTLVLYIFHVFNGSFAGPLEVLYR----FLEFF----SKFDWDNFCLSLWGP 232
             ++++ LV+ +  +     +G L+  +      L FF    +KF++    +S+  P
Sbjct: 565 GGFSVICLVVSMLQMMPEVQSGNLDTRHHLGQLLLHFFDLYGNKFNYQTVAISMNPP 621


>gi|238879008|gb|EEQ42646.1| conserved hypothetical protein [Candida albicans WO-1]
          Length = 603

 Score = 38.9 bits (89), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 56/215 (26%), Positives = 91/215 (42%), Gaps = 24/215 (11%)

Query: 29  PFSEE--RRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDT 85
           P SEE   RN V + +++ I + +P  +   FGS     YLP  DID+   S      +T
Sbjct: 182 PSSEEIVTRNNVISTLKKEIGKFWPGTETHVFGSCATDLYLPGSDIDMVVVS------ET 235

Query: 86  WAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCL--VDNFVVDIAFNQLGGLCTL 142
             +  R  L         +   K V+ I  A+V IIK +  V    +D++F +  GL   
Sbjct: 236 GDYENRSRLYQLSTFLRTKKLAKNVEVIASAKVPIIKFVDPVSELHIDVSFERTNGLDAA 295

Query: 143 CFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHV---F 199
             +     LI+   L  R ++L+        R+   H G +  YA + +  +   +    
Sbjct: 296 KRIRR--WLISTPGL--RELVLVIKQFLRSRRLNNVHVGGLGGYATIIMCYHFLRLHPKL 351

Query: 200 NGSFAGPLE----VLYRFLEFFSK-FDWDNFCLSL 229
           + S    L+    +L  F E + + F +DN  LSL
Sbjct: 352 STSSMDALDNLGVLLIEFFELYGRNFSYDNLILSL 386


>gi|167384281|ref|XP_001736885.1| PAP-associated domain-containing protein [Entamoeba dispar SAW760]
 gi|165900593|gb|EDR26889.1| PAP-associated domain-containing protein, putative [Entamoeba
           dispar SAW760]
          Length = 400

 Score = 38.9 bits (89), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 47/197 (23%), Positives = 86/197 (43%), Gaps = 35/197 (17%)

Query: 10  RWLKAEEITAELIARIQ-----PDPFSEE---RRNAVAAYVRRLIIQCFPCQVFTFGSVP 61
           +WLK+ E   +L   +Q      +P   E   R   +  Y +  I++     +  FGS  
Sbjct: 5   QWLKSFEGELDLNQEVQLFIKFIEPNKNEYKIREELLTKYSK--ILEKEGYNIMPFGSTQ 62

Query: 62  LKTYLPDRDIDLGAFSDDQTLKDTWAHLVRD----MLENEEKNEHAEFRVKEVQYIQAEV 117
            K +LP  DID    +++   +     +       +LE++++N             +A V
Sbjct: 63  SKLFLPTSDIDFSVITNEYNTRKVLNSISSILSSYVLEDQKRN------------FKASV 110

Query: 118 KIIKCLVDN---FVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAW--CYYE 172
            ++K L D     V+DI+ N   G  T+ F++E+   I ++   +R ++LIK+   CY  
Sbjct: 111 PVLK-LTDKQTLIVLDISHNNTSGTKTVDFIEEI---IKKDDRIRRLVLLIKSILCCYDF 166

Query: 173 SRILGGHHGLISSYALV 189
            +   G  G  S + +V
Sbjct: 167 HQPANGGLGTYSVFVMV 183


>gi|391342828|ref|XP_003745717.1| PREDICTED: PAP-associated domain-containing protein 5-like, partial
           [Metaseiulus occidentalis]
          Length = 512

 Score = 38.5 bits (88), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 69/289 (23%), Positives = 108/289 (37%), Gaps = 46/289 (15%)

Query: 26  QPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKD 84
           +P     + R  V   V+ ++ Q +P  Q   FGS     YLP  DIDL    D +TL  
Sbjct: 106 KPTRTEHQVRQEVVNRVKEVVRQLWPQAQCEVFGSFCTGLYLPTSDIDLVILGDWETLPM 165

Query: 85  TWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL--VDNFVVDIAFNQLGGLCTL 142
              H     L  E+    +  +V +    +A V I+K      N  VDI+FNQ  G+ + 
Sbjct: 166 FTLH---KALIQEKIASASTIKVLD----RASVPIVKFTEQSTNVKVDISFNQKNGVKSA 218

Query: 143 CFLDEVDHLINENHLFKRSIILIKAWCYYESRILGG-HHGLISSYALVTLVLYIF--HVF 199
             + +            + + ++K   Y   R L     G ISSY+L+ LV+     H+ 
Sbjct: 219 KLIKDFCKTFPP---LPKLVFVLKQ--YLLQRDLNEVFTGGISSYSLILLVVSFLQRHLR 273

Query: 200 NGSFAGPLE------VLYRFLEFFSK-FDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGV 252
                 PL       +L  F E + + F++    + +                  KDGG 
Sbjct: 274 IKELQSPLSNVNLGVLLLEFFELYGRYFNYAEVGIRI------------------KDGGS 315

Query: 253 LLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRS 301
            +  ++       A     G          S    + DPL   N++GRS
Sbjct: 316 YMSKEALQREMATAQGQTSGAGVIHD---TSSILCIEDPLTPGNDIGRS 361


>gi|324975506|gb|ADY62687.1| PAPa [Candida orthopsilosis]
          Length = 552

 Score = 38.5 bits (88), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 44/189 (23%), Positives = 78/189 (41%), Gaps = 22/189 (11%)

Query: 53  QVFTFGSVPLKTYLPDRDID-LGAFSDDQTLKD---TWAHLVRDMLENEEKNE------- 101
           ++FTFGS  L  Y P  DID L       T +D    +  ++R   E EE N        
Sbjct: 82  KIFTFGSYRLGVYGPSSDIDALVVVPRHVTREDFFTVFEKILRGRPELEEINSVKEAFVP 141

Query: 102 --HAEFRVKEVQYIQAEVKIIKCLVD------NFVVDIAFNQLGGLCTLCFLDEVDHLIN 153
               EF    +  + A++ I +   D      N + +I    +  L      DE+  L+ 
Sbjct: 142 IIKLEFAGISIDLLFAKLDIPRVPHDLTLDDKNLLKNIDEKDMRALNGTRVTDEILRLVP 201

Query: 154 ENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRF 213
           +  +FK ++  IK W    + +    +G     A   LV  I  ++  + +    +L +F
Sbjct: 202 KPTVFKNALRFIKMWAQQRA-VYANVYGFPGGVAWAMLVARICQLYPNAVSAV--ILEKF 258

Query: 214 LEFFSKFDW 222
            + +S+++W
Sbjct: 259 FQIYSQWNW 267


>gi|94490330|gb|ABF29402.1| nonribosomal peptide synthetase [Xylaria sp. BCC 1067]
          Length = 6744

 Score = 38.5 bits (88), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 23/74 (31%), Positives = 33/74 (44%), Gaps = 11/74 (14%)

Query: 179 HHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGP------ 232
           HH L   +AL  L+  +  V+ GS + PL     F+++ +K D+  F    WG       
Sbjct: 834 HHALSDGWALPLLLQQVSAVYEGSISLPLRPFNHFIDYMTKMDYKTF----WGRYFDDLQ 889

Query: 233 -VPISLLPDVTAEP 245
                LLP VT  P
Sbjct: 890 VAAFPLLPSVTYTP 903


>gi|68480208|ref|XP_715914.1| hypothetical protein CaO19.8059 [Candida albicans SC5314]
 gi|68480321|ref|XP_715864.1| hypothetical protein CaO19.429 [Candida albicans SC5314]
 gi|46437507|gb|EAK96852.1| hypothetical protein CaO19.429 [Candida albicans SC5314]
 gi|46437559|gb|EAK96903.1| hypothetical protein CaO19.8059 [Candida albicans SC5314]
          Length = 603

 Score = 38.5 bits (88), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 56/215 (26%), Positives = 91/215 (42%), Gaps = 24/215 (11%)

Query: 29  PFSEE--RRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDT 85
           P SEE   RN V + +++ I + +P  +   FGS     YLP  DID+   S      +T
Sbjct: 182 PSSEEIVTRNNVISTLKKEIGKFWPGTETHVFGSCATDLYLPGSDIDMVVVS------ET 235

Query: 86  WAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCL--VDNFVVDIAFNQLGGLCTL 142
             +  R  L         +   K V+ I  A+V IIK +  V    +D++F +  GL   
Sbjct: 236 GDYENRSRLYQLSTFLRTKKLAKNVEVIASAKVPIIKFVDPVSELHIDVSFERTNGLDAA 295

Query: 143 CFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHV---F 199
             +     LI+   L  R ++L+        R+   H G +  YA + +  +   +    
Sbjct: 296 KRIRR--WLISTPGL--RELVLVIKQFLRSRRLNNVHVGGLGGYATIIMCYHFLRLHPKL 351

Query: 200 NGSFAGPLE----VLYRFLEFFSK-FDWDNFCLSL 229
           + S    L+    +L  F E + + F +DN  LSL
Sbjct: 352 STSSMDALDNLGVLLIEFFELYGRNFSYDNLILSL 386


>gi|448519050|ref|XP_003868035.1| non-canonical poly(A) polymerase [Candida orthopsilosis Co 90-125]
 gi|380352374|emb|CCG22600.1| non-canonical poly(A) polymerase [Candida orthopsilosis]
          Length = 604

 Score = 38.5 bits (88), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 56/232 (24%), Positives = 90/232 (38%), Gaps = 26/232 (11%)

Query: 11  WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
           WL  E    + ++ I P       RN V   ++R +   +P  +   FGS     YLP  
Sbjct: 164 WLTME--MKDFVSYISPSRAEIVTRNNVINTLKREVSSFWPGTEAHVFGSCATDLYLPGS 221

Query: 70  DIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVD--- 125
           DID+   S       T  +  R  L        A+   K V+ I  A+V IIK  VD   
Sbjct: 222 DIDMVVIS------STGDYENRSRLYQLSSFLRAKNLAKNVEVIASAKVPIIK-FVDPES 274

Query: 126 NFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISS 185
           N  +DI+F +  GL     +     L+    L  R ++L+        ++   H G +  
Sbjct: 275 NLPIDISFERTNGLDAARRIRRW--LLATPGL--RELVLVVKQFLRSRKLNNVHVGGLGG 330

Query: 186 YALVTLVLYIFH----VFNGSFAGPLEVLYRFLEFFS----KFDWDNFCLSL 229
           YA + +  +       +   +   P  +    +EFF      F +DN  +S+
Sbjct: 331 YATIIMCYHFMQLHPKISTNTMNAPDNLGVLLIEFFELYGRNFSYDNLIISI 382


>gi|448531596|ref|XP_003870285.1| Pap1 poly(A) polymerase [Candida orthopsilosis Co 90-125]
 gi|380354639|emb|CCG24155.1| Pap1 poly(A) polymerase [Candida orthopsilosis]
          Length = 557

 Score = 38.5 bits (88), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 44/189 (23%), Positives = 78/189 (41%), Gaps = 22/189 (11%)

Query: 53  QVFTFGSVPLKTYLPDRDID-LGAFSDDQTLKD---TWAHLVRDMLENEEKNE------- 101
           ++FTFGS  L  Y P  DID L       T +D    +  ++R   E EE N        
Sbjct: 87  KIFTFGSYRLGVYGPSSDIDALVVVPRHVTREDFFTVFEKILRGRPELEEINSVKEAFVP 146

Query: 102 --HAEFRVKEVQYIQAEVKIIKCLVD------NFVVDIAFNQLGGLCTLCFLDEVDHLIN 153
               EF    +  + A++ I +   D      N + +I    +  L      DE+  L+ 
Sbjct: 147 IIKLEFAGISIDLLFAKLDIPRVPHDLTLDDKNLLKNIDEKDMRALNGTRVTDEILRLVP 206

Query: 154 ENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRF 213
           +  +FK ++  IK W    + +    +G     A   LV  I  ++  + +    +L +F
Sbjct: 207 KPTVFKNALRFIKMWAQQRA-VYANVYGFPGGVAWAMLVARICQLYPNAVSAV--ILEKF 263

Query: 214 LEFFSKFDW 222
            + +S+++W
Sbjct: 264 FQIYSQWNW 272


>gi|336276454|ref|XP_003352980.1| hypothetical protein SMAC_03298 [Sordaria macrospora k-hell]
 gi|380092465|emb|CCC09742.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 781

 Score = 38.1 bits (87), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 41/143 (28%), Positives = 63/143 (44%), Gaps = 14/143 (9%)

Query: 9   GRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLP 67
           G WL  E I  +    I+P  F +  R  V   + R +   FP   V+ FGS P   YLP
Sbjct: 446 GHWLHKEII--DFYEYIKPRAFEKRIRQEVLDEINRFVRSTFPDAGVYPFGSFPSGLYLP 503

Query: 68  DRDIDLGAFSDD-----QTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQ-AEVKIIK 121
             D+D+   SD      +   DT   + R  L +  K +   F+  EV+ I  A+V ++K
Sbjct: 504 TGDMDMVLCSDQYKRNYRAKYDTRRTMYR--LSDALKQQKLAFQ-NEVEIIAFAKVPLVK 560

Query: 122 CLVD--NFVVDIAFNQLGGLCTL 142
            +       +D++F    GL  +
Sbjct: 561 WVDSRTGLKIDVSFENDTGLQAI 583


>gi|452823485|gb|EME30495.1| DNA polymerase sigma subunit [Galdieria sulphuraria]
          Length = 417

 Score = 38.1 bits (87), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 42/159 (26%), Positives = 66/159 (41%), Gaps = 16/159 (10%)

Query: 33  ERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVR 91
           ++R  +   V  +I Q +P   V  FGS     YLP  DIDL   S  +       HL+ 
Sbjct: 116 KQRKQLIERVTEIIRQIWPNSSVHVFGSFATNLYLPTSDIDLCILSSPENGSKRELHLLA 175

Query: 92  DMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCL--VDNFVVDIAFNQLGGLCTLCFLDEV 148
           D+L  +        +++ V  I +A V IIK          DI+F +  G+     +  +
Sbjct: 176 DVLRRKTN------KMRRVMAIDKARVPIIKVTDRETGIQCDISFGRTNGIEN---VRHI 226

Query: 149 DHLINENHLFKRSIILIKAWCYYESRILGG-HHGLISSY 186
              +      +  +++IK  C+   R L   H G I SY
Sbjct: 227 QKYLKRYPSLRPLMMVIK--CFLHQRALNEVHEGGIGSY 263


>gi|401837953|gb|EJT41787.1| TRF5-like protein [Saccharomyces kudriavzevii IFO 1802]
          Length = 642

 Score = 37.7 bits (86), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 54/247 (21%), Positives = 102/247 (41%), Gaps = 33/247 (13%)

Query: 9   GRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLP 67
             WL +E    + +  I P     + RN     +R+ + Q +    +  FGS     YLP
Sbjct: 173 AEWLTSE--IKDFVHYISPSKSEIKCRNRTIDKLRQAVKQLWSDADLHVFGSFATDLYLP 230

Query: 68  DRDID--LGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL-- 123
             DID  + +   D+  ++    L R +     KNE    R++ +  ++  V IIK +  
Sbjct: 231 GSDIDCVINSRHHDKEDRNYIYELARYL-----KNEGLAIRMEVI--VRTRVPIIKFIEP 283

Query: 124 VDNFVVDIAFNQLGGLCTLCFLDEVDHLINE---NHLFKRSIILIKAWCYYESRILGGHH 180
           +    +D++F +  GL       E   LI E   +    R ++L+     +  R+   H 
Sbjct: 284 LSQLHIDVSFERTNGL-------EAARLIREWLRDSPGLRELVLVIKQFLHSRRLNNVHT 336

Query: 181 GLISSYALVTLVLYIFHVFNGSFAGPLE-------VLYRFLEFFSK-FDWDNFCLSLWGP 232
           G +  + ++ LV    ++     +  ++       +L  F E + K F +D+  +S+   
Sbjct: 337 GGLGGFTVICLVYSFLNMHPRIKSNDIDTPDNLGVLLIDFFELYGKNFGYDDVAISISDD 396

Query: 233 VPISLLP 239
            P S +P
Sbjct: 397 HP-SYIP 402


>gi|343427054|emb|CBQ70582.1| related to TRF4-topoisomerase I-related protein [Sporisorium
           reilianum SRZ2]
          Length = 697

 Score = 37.7 bits (86), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 45/140 (32%), Positives = 61/140 (43%), Gaps = 12/140 (8%)

Query: 14  AEEITAELIA---RIQPDPFSEERRNAVAAYVRRLIIQCF-PCQVFTFGSVPLKTYLPDR 69
           AE +  ELIA    + P     E R  V   + R I   F   +V  FGS   K YLP  
Sbjct: 98  AEALHRELIAFDDWMAPTVAEHETRCMVVELISRAIKSQFRDAEVHPFGSQETKLYLPQG 157

Query: 70  DIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVD--N 126
           D+DL   S       T + L R M     ++  A     +VQ I +A+V IIK +     
Sbjct: 158 DLDLVVVSQSMANLRTQSAL-RTMAACLRRHNLAT----DVQVIAKAKVPIIKFVTTYAR 212

Query: 127 FVVDIAFNQLGGLCTLCFLD 146
             VDI+ N   GL T  +++
Sbjct: 213 LKVDISLNHTNGLTTASYVN 232


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.326    0.143    0.449 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,666,672,430
Number of Sequences: 23463169
Number of extensions: 241218603
Number of successful extensions: 726104
Number of sequences better than 100.0: 348
Number of HSP's better than 100.0 without gapping: 158
Number of HSP's successfully gapped in prelim test: 190
Number of HSP's that attempted gapping in prelim test: 725528
Number of HSP's gapped (non-prelim): 495
length of query: 347
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 204
effective length of database: 9,003,962,200
effective search space: 1836808288800
effective search space used: 1836808288800
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 77 (34.3 bits)