BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 038458
(347 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|449443945|ref|XP_004139736.1| PREDICTED: uncharacterized protein LOC101209112 [Cucumis sativus]
Length = 1341
Score = 608 bits (1567), Expect = e-171, Method: Compositional matrix adjust.
Identities = 289/347 (83%), Positives = 315/347 (90%)
Query: 1 SVIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSV 60
+V+R LD RW KAEE TAELIA IQP+P SEERRNAVA YV+RLI++CFPCQVFTFGSV
Sbjct: 26 TVMRMLDSERWSKAEERTAELIACIQPNPPSEERRNAVADYVQRLIMKCFPCQVFTFGSV 85
Query: 61 PLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKII 120
PLKTYLPD DIDL AFS +Q LK+TWAH VRDMLE+EEKNE+AEFRVKEVQYI+AEVKII
Sbjct: 86 PLKTYLPDGDIDLTAFSKNQNLKETWAHQVRDMLESEEKNENAEFRVKEVQYIKAEVKII 145
Query: 121 KCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHH 180
KCLV+N VVDI+F+QLGGLCTLCFL+EVDHLIN+NHLFKRSIILIKAWCYYESRILG HH
Sbjct: 146 KCLVENIVVDISFDQLGGLCTLCFLEEVDHLINQNHLFKRSIILIKAWCYYESRILGAHH 205
Query: 181 GLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPD 240
GLIS+YAL TLVLYIFHVFN SFAGPLEVLYRFLEFFSKFDWDNFC+SLWGPVPIS LPD
Sbjct: 206 GLISTYALETLVLYIFHVFNNSFAGPLEVLYRFLEFFSKFDWDNFCVSLWGPVPISSLPD 265
Query: 241 VTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGR 300
VTAEPPRKDGG LLLSK FL++C YA FPGGQENQGQPFVSKHFNVIDPLRVNNNLGR
Sbjct: 266 VTAEPPRKDGGELLLSKLFLEACSAVYAVFPGGQENQGQPFVSKHFNVIDPLRVNNNLGR 325
Query: 301 SVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
SVSKGNFFRIR+AF F AK LARL +CP ED+ E+NQFF+NT +RH
Sbjct: 326 SVSKGNFFRIRSAFAFGAKRLARLFECPREDILAELNQFFLNTWERH 372
>gi|356520288|ref|XP_003528795.1| PREDICTED: uncharacterized protein LOC100809742 [Glycine max]
Length = 1331
Score = 605 bits (1561), Expect = e-171, Method: Composition-based stats.
Identities = 294/349 (84%), Positives = 317/349 (90%), Gaps = 2/349 (0%)
Query: 1 SVIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQV--FTFG 58
SVI+ LD RWLKAE+ TAELIA IQP+P SEERRNAVA YV+RLI++CFPCQV FTFG
Sbjct: 26 SVIQVLDSERWLKAEQRTAELIACIQPNPPSEERRNAVADYVQRLIMKCFPCQVGVFTFG 85
Query: 59 SVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVK 118
SVPLKTYLPD DIDL AFS +Q LKD+WAH VRDMLENEEKNE+AEF VKEVQYIQAEVK
Sbjct: 86 SVPLKTYLPDGDIDLTAFSKNQNLKDSWAHQVRDMLENEEKNENAEFHVKEVQYIQAEVK 145
Query: 119 IIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGG 178
IIKCLV+N VVDI+FNQLGGLCTLCFL+EVD+LIN+NHLFKRSIILIKAWCYYESRILG
Sbjct: 146 IIKCLVENIVVDISFNQLGGLCTLCFLEEVDNLINQNHLFKRSIILIKAWCYYESRILGA 205
Query: 179 HHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLL 238
HHGLIS+YAL TLVLYIFHVFN SFAGPLEVLYRFLEFFSKFDW+NFC+SLWGPVPIS L
Sbjct: 206 HHGLISTYALETLVLYIFHVFNNSFAGPLEVLYRFLEFFSKFDWENFCVSLWGPVPISSL 265
Query: 239 PDVTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNL 298
PDVTAEPPRKDGG LLLSK FLD+C YA FPGGQENQGQPFVSKHFNVIDPLRVNNNL
Sbjct: 266 PDVTAEPPRKDGGDLLLSKLFLDACSSVYAVFPGGQENQGQPFVSKHFNVIDPLRVNNNL 325
Query: 299 GRSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
GRSVSKGNFFRIR+AF F AK LARLLDCP E+L++EVNQFF NT +RH
Sbjct: 326 GRSVSKGNFFRIRSAFAFGAKKLARLLDCPEEELFSEVNQFFFNTWERH 374
>gi|356560284|ref|XP_003548423.1| PREDICTED: uncharacterized protein LOC100800527 [Glycine max]
Length = 1337
Score = 600 bits (1546), Expect = e-169, Method: Composition-based stats.
Identities = 292/349 (83%), Positives = 316/349 (90%), Gaps = 2/349 (0%)
Query: 1 SVIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQV--FTFG 58
SVI+ LD RWLKAE+ TAELIA IQP+P SEERRNAVA YV+RLI++CFPCQV FTFG
Sbjct: 26 SVIQVLDSERWLKAEQRTAELIACIQPNPPSEERRNAVADYVQRLIMKCFPCQVRVFTFG 85
Query: 59 SVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVK 118
SVPLKTYLPD DIDL AFS +Q LKD+WAH VRDMLENEEKNE+AEF VKEVQYIQAEVK
Sbjct: 86 SVPLKTYLPDGDIDLTAFSKNQNLKDSWAHQVRDMLENEEKNENAEFHVKEVQYIQAEVK 145
Query: 119 IIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGG 178
IIKCLV+N VVDI+FNQLGGLCTLCFL+EVD+LIN+NHLFKRSIILIKAWCYYESRILG
Sbjct: 146 IIKCLVENIVVDISFNQLGGLCTLCFLEEVDNLINQNHLFKRSIILIKAWCYYESRILGA 205
Query: 179 HHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLL 238
HHGLIS+YAL TLVLYIFHVFN SFAGPLEVLYRFLEFFSKFDW+NFC+SLWGPVPIS L
Sbjct: 206 HHGLISTYALETLVLYIFHVFNNSFAGPLEVLYRFLEFFSKFDWENFCVSLWGPVPISSL 265
Query: 239 PDVTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNL 298
PDVTAEPPRKDGG LLLSK FLD+C YA FPGGQENQGQPFVSKHFNVIDPLRVNNNL
Sbjct: 266 PDVTAEPPRKDGGDLLLSKLFLDACSSVYAVFPGGQENQGQPFVSKHFNVIDPLRVNNNL 325
Query: 299 GRSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
GRSVSKGNFFRIR+AF F AK LARLLDC ++L++EVNQFF NT +RH
Sbjct: 326 GRSVSKGNFFRIRSAFAFGAKRLARLLDCSEDELFSEVNQFFFNTWERH 374
>gi|225454502|ref|XP_002277075.1| PREDICTED: uncharacterized protein LOC100241322 [Vitis vinifera]
Length = 1295
Score = 581 bits (1498), Expect = e-163, Method: Compositional matrix adjust.
Identities = 282/347 (81%), Positives = 306/347 (88%), Gaps = 1/347 (0%)
Query: 1 SVIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSV 60
S IR LD RWL AEE TAELIA IQP+ SEE RNAVA YV+R+++QCFPCQVFTFGSV
Sbjct: 25 SAIRVLDTERWLIAEERTAELIACIQPNQPSEELRNAVADYVQRIVVQCFPCQVFTFGSV 84
Query: 61 PLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKII 120
PLKTYLPD DIDL AFS++Q LKDTWA+ VRDML++EEKNE+AEFRVKEVQYIQAEVKII
Sbjct: 85 PLKTYLPDGDIDLTAFSNNQNLKDTWANQVRDMLQSEEKNENAEFRVKEVQYIQAEVKII 144
Query: 121 KCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHH 180
KCLV+N VVDI+FNQLGGLCTLCFL+EVDHLIN+NHLFKRSIILIKAWCYYESRILG HH
Sbjct: 145 KCLVENIVVDISFNQLGGLCTLCFLEEVDHLINQNHLFKRSIILIKAWCYYESRILGAHH 204
Query: 181 GLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPD 240
GLIS+YAL TLVLYIFHVFN SF GPLEVLYRFLEFFS FDWDNFC+SLWGPVPIS LPD
Sbjct: 205 GLISTYALETLVLYIFHVFNNSFTGPLEVLYRFLEFFSSFDWDNFCVSLWGPVPISSLPD 264
Query: 241 VTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGR 300
VTAEPPR+D G LLLSK FLD+C YA FP GQE QGQ F+SKHFNVIDPLRVNNNLGR
Sbjct: 265 VTAEPPRQDSGELLLSKLFLDACSSVYAVFPHGQEKQGQSFISKHFNVIDPLRVNNNLGR 324
Query: 301 SVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
SVSKGNFFRIR+AF F AK LARLLD P E++ EVNQ FMNT +RH
Sbjct: 325 SVSKGNFFRIRSAFAFGAKRLARLLD-PKENIIFEVNQLFMNTWERH 370
>gi|297745424|emb|CBI40504.3| unnamed protein product [Vitis vinifera]
Length = 1229
Score = 580 bits (1494), Expect = e-163, Method: Compositional matrix adjust.
Identities = 282/347 (81%), Positives = 306/347 (88%), Gaps = 1/347 (0%)
Query: 1 SVIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSV 60
S IR LD RWL AEE TAELIA IQP+ SEE RNAVA YV+R+++QCFPCQVFTFGSV
Sbjct: 25 SAIRVLDTERWLIAEERTAELIACIQPNQPSEELRNAVADYVQRIVVQCFPCQVFTFGSV 84
Query: 61 PLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKII 120
PLKTYLPD DIDL AFS++Q LKDTWA+ VRDML++EEKNE+AEFRVKEVQYIQAEVKII
Sbjct: 85 PLKTYLPDGDIDLTAFSNNQNLKDTWANQVRDMLQSEEKNENAEFRVKEVQYIQAEVKII 144
Query: 121 KCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHH 180
KCLV+N VVDI+FNQLGGLCTLCFL+EVDHLIN+NHLFKRSIILIKAWCYYESRILG HH
Sbjct: 145 KCLVENIVVDISFNQLGGLCTLCFLEEVDHLINQNHLFKRSIILIKAWCYYESRILGAHH 204
Query: 181 GLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPD 240
GLIS+YAL TLVLYIFHVFN SF GPLEVLYRFLEFFS FDWDNFC+SLWGPVPIS LPD
Sbjct: 205 GLISTYALETLVLYIFHVFNNSFTGPLEVLYRFLEFFSSFDWDNFCVSLWGPVPISSLPD 264
Query: 241 VTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGR 300
VTAEPPR+D G LLLSK FLD+C YA FP GQE QGQ F+SKHFNVIDPLRVNNNLGR
Sbjct: 265 VTAEPPRQDSGELLLSKLFLDACSSVYAVFPHGQEKQGQSFISKHFNVIDPLRVNNNLGR 324
Query: 301 SVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
SVSKGNFFRIR+AF F AK LARLLD P E++ EVNQ FMNT +RH
Sbjct: 325 SVSKGNFFRIRSAFAFGAKRLARLLD-PKENIIFEVNQLFMNTWERH 370
>gi|255564741|ref|XP_002523365.1| hypothetical protein RCOM_0719270 [Ricinus communis]
gi|223537453|gb|EEF39081.1| hypothetical protein RCOM_0719270 [Ricinus communis]
Length = 1334
Score = 569 bits (1467), Expect = e-160, Method: Compositional matrix adjust.
Identities = 284/347 (81%), Positives = 308/347 (88%)
Query: 1 SVIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSV 60
SVIR LD RW KAEE TAELI I+P+ SE RRNAVA YV RLI +CFPC+VFTFGSV
Sbjct: 19 SVIRVLDSERWAKAEERTAELIDCIKPNEPSERRRNAVADYVERLITKCFPCRVFTFGSV 78
Query: 61 PLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKII 120
PLKTYLPD DIDL AFS+ Q++K+TWAH VRD+LENEEKNE+AEFRVKEVQYIQAEVKII
Sbjct: 79 PLKTYLPDGDIDLTAFSEGQSMKETWAHQVRDVLENEEKNENAEFRVKEVQYIQAEVKII 138
Query: 121 KCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHH 180
KCLV+N VVDI+F+QLGGLCTLCFL+EVDHLIN++HLFK+SIILIKAWCYYESRILG HH
Sbjct: 139 KCLVENIVVDISFDQLGGLCTLCFLEEVDHLINQDHLFKKSIILIKAWCYYESRILGAHH 198
Query: 181 GLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPD 240
GLIS+YAL TLVLYIFHVFN SFAGPLEVLYRFLEFFSKFDWDNFC+SLWGPVPIS LPD
Sbjct: 199 GLISTYALETLVLYIFHVFNNSFAGPLEVLYRFLEFFSKFDWDNFCVSLWGPVPISSLPD 258
Query: 241 VTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGR 300
VTAEPPRKDGG LLLSK FL +C YA PGG E+QGQ F SKHFNVIDPLRVNNNLGR
Sbjct: 259 VTAEPPRKDGGELLLSKLFLKACGAVYAVSPGGPESQGQTFTSKHFNVIDPLRVNNNLGR 318
Query: 301 SVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
SVSKGNFFRIR+AF F AK LARLLDCP ED++ EVNQFFMNT DRH
Sbjct: 319 SVSKGNFFRIRSAFAFGAKRLARLLDCPKEDIHFEVNQFFMNTWDRH 365
>gi|302143676|emb|CBI22537.3| unnamed protein product [Vitis vinifera]
Length = 1359
Score = 566 bits (1458), Expect = e-159, Method: Composition-based stats.
Identities = 266/347 (76%), Positives = 295/347 (85%)
Query: 1 SVIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSV 60
SV R LD R AEE T +LIA IQP+ SEERR AVA+YV+ LI++CF C+VF FGSV
Sbjct: 25 SVTRALDQERLSLAEERTKQLIACIQPNQPSEERREAVASYVKSLIMKCFSCKVFPFGSV 84
Query: 61 PLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKII 120
PLKTYLPD DIDL AFS LKDTWA+ VRD+LE EEK+ AEFRVKEVQYIQAEVKII
Sbjct: 85 PLKTYLPDGDIDLTAFSKSPNLKDTWANEVRDILEREEKSGDAEFRVKEVQYIQAEVKII 144
Query: 121 KCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHH 180
KCLV+N VVDI+FNQLGGLCTLCFL+EVDHLI++ HLFKRSIILIKAWCYYESRILG HH
Sbjct: 145 KCLVENIVVDISFNQLGGLCTLCFLEEVDHLISQKHLFKRSIILIKAWCYYESRILGAHH 204
Query: 181 GLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPD 240
GLIS+YAL TLVLYIF VFN SFAGPLEVLYRFLEFFSKFDW+N+C+SLWGPVPIS LPD
Sbjct: 205 GLISTYALETLVLYIFRVFNNSFAGPLEVLYRFLEFFSKFDWENYCVSLWGPVPISSLPD 264
Query: 241 VTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGR 300
VTA+PPRKD G LLLSK FLD+C YA P GQEN QPF+SK+FNVIDPLR NNNLGR
Sbjct: 265 VTADPPRKDSGELLLSKLFLDACSSVYAVLPVGQENPEQPFISKYFNVIDPLRTNNNLGR 324
Query: 301 SVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
SVSKGNFFRIR+AF F A+ LARLLDCP +++ EVNQFFMNT +RH
Sbjct: 325 SVSKGNFFRIRSAFAFGAQRLARLLDCPKDNVIAEVNQFFMNTWERH 371
>gi|225462743|ref|XP_002268106.1| PREDICTED: uncharacterized protein LOC100248390 [Vitis vinifera]
Length = 1353
Score = 565 bits (1456), Expect = e-158, Method: Composition-based stats.
Identities = 266/347 (76%), Positives = 295/347 (85%)
Query: 1 SVIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSV 60
SV R LD R AEE T +LIA IQP+ SEERR AVA+YV+ LI++CF C+VF FGSV
Sbjct: 25 SVTRALDQERLSLAEERTKQLIACIQPNQPSEERREAVASYVKSLIMKCFSCKVFPFGSV 84
Query: 61 PLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKII 120
PLKTYLPD DIDL AFS LKDTWA+ VRD+LE EEK+ AEFRVKEVQYIQAEVKII
Sbjct: 85 PLKTYLPDGDIDLTAFSKSPNLKDTWANEVRDILEREEKSGDAEFRVKEVQYIQAEVKII 144
Query: 121 KCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHH 180
KCLV+N VVDI+FNQLGGLCTLCFL+EVDHLI++ HLFKRSIILIKAWCYYESRILG HH
Sbjct: 145 KCLVENIVVDISFNQLGGLCTLCFLEEVDHLISQKHLFKRSIILIKAWCYYESRILGAHH 204
Query: 181 GLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPD 240
GLIS+YAL TLVLYIF VFN SFAGPLEVLYRFLEFFSKFDW+N+C+SLWGPVPIS LPD
Sbjct: 205 GLISTYALETLVLYIFRVFNNSFAGPLEVLYRFLEFFSKFDWENYCVSLWGPVPISSLPD 264
Query: 241 VTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGR 300
VTA+PPRKD G LLLSK FLD+C YA P GQEN QPF+SK+FNVIDPLR NNNLGR
Sbjct: 265 VTADPPRKDSGELLLSKLFLDACSSVYAVLPVGQENPEQPFISKYFNVIDPLRTNNNLGR 324
Query: 301 SVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
SVSKGNFFRIR+AF F A+ LARLLDCP +++ EVNQFFMNT +RH
Sbjct: 325 SVSKGNFFRIRSAFAFGAQRLARLLDCPKDNVIAEVNQFFMNTWERH 371
>gi|42566126|ref|NP_191728.2| nucleotidyltransferase [Arabidopsis thaliana]
gi|332646720|gb|AEE80241.1| nucleotidyltransferase [Arabidopsis thaliana]
Length = 1303
Score = 561 bits (1445), Expect = e-157, Method: Composition-based stats.
Identities = 271/348 (77%), Positives = 303/348 (87%), Gaps = 1/348 (0%)
Query: 1 SVIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGS 59
SV RPLD RW KAE+ TA+LIA IQP+P SE+RRNAVA+YVRRLI++CFP Q+F FGS
Sbjct: 29 SVTRPLDAERWAKAEDRTAKLIACIQPNPPSEDRRNAVASYVRRLIMECFPQVQIFMFGS 88
Query: 60 VPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKI 119
VPLKTYLPD DIDL AFS +Q LKD+WA+LVRDMLE EEKNE+AEF VKEVQYIQAEVKI
Sbjct: 89 VPLKTYLPDGDIDLTAFSANQNLKDSWANLVRDMLEKEEKNENAEFHVKEVQYIQAEVKI 148
Query: 120 IKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGH 179
IKCLV+N VVDI+FNQ+GGLCTLCFL+EVDH IN+NHLFKRSIILIKAWCYYESRILG H
Sbjct: 149 IKCLVENIVVDISFNQIGGLCTLCFLEEVDHYINQNHLFKRSIILIKAWCYYESRILGAH 208
Query: 180 HGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLP 239
HGLIS+YAL TLVLYIF++FN SF+GPLEVLYRFLEFFSKFDW NFCLSLWGPVP+S LP
Sbjct: 209 HGLISTYALETLVLYIFYLFNNSFSGPLEVLYRFLEFFSKFDWQNFCLSLWGPVPVSSLP 268
Query: 240 DVTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLG 299
DVTAEPPR+D G L +S++F +C YA QE QGQPFVSKHFNVIDPLR NNNLG
Sbjct: 269 DVTAEPPRRDVGELRVSEAFYRACSRVYAVNIAPQEIQGQPFVSKHFNVIDPLRENNNLG 328
Query: 300 RSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
RSVSKGNFFRIR+AFT AK L RLL+CP E+L +EVNQFFMNT +RH
Sbjct: 329 RSVSKGNFFRIRSAFTLGAKKLTRLLECPKENLIHEVNQFFMNTWERH 376
>gi|242036527|ref|XP_002465658.1| hypothetical protein SORBIDRAFT_01g043240 [Sorghum bicolor]
gi|241919512|gb|EER92656.1| hypothetical protein SORBIDRAFT_01g043240 [Sorghum bicolor]
Length = 1333
Score = 551 bits (1420), Expect = e-154, Method: Compositional matrix adjust.
Identities = 264/346 (76%), Positives = 294/346 (84%)
Query: 2 VIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVP 61
V R LDP RW AE+ TAELIARIQP+ +SE RR AV YV+RLI+ C CQVFTFGSVP
Sbjct: 16 VTRRLDPERWAVAEDRTAELIARIQPNAYSEGRRLAVYHYVQRLIMNCLSCQVFTFGSVP 75
Query: 62 LKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIK 121
LKTYLPD DID+ AFS+ + LK+ WA+LVRD LE EEKNE+AEF VKEVQYIQAEVKIIK
Sbjct: 76 LKTYLPDGDIDVTAFSNSEELKEIWANLVRDALEREEKNENAEFHVKEVQYIQAEVKIIK 135
Query: 122 CLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHG 181
CLV+N VVDI+FNQ+GGLCTLCFL+E+D+LI+ NHLFKRSIILIKAWC+YESRILG HHG
Sbjct: 136 CLVENIVVDISFNQVGGLCTLCFLEEIDNLISRNHLFKRSIILIKAWCFYESRILGAHHG 195
Query: 182 LISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDV 241
LIS+YAL TLVLYIFH+FN SF GPLEVLYRFLEFFS FDW+ FCLSLWGPVPIS LPD+
Sbjct: 196 LISTYALETLVLYIFHIFNNSFTGPLEVLYRFLEFFSNFDWEKFCLSLWGPVPISSLPDM 255
Query: 242 TAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRS 301
TAEPPR D G LLL+KSFLD+C AY P QENQGQPFVSKHFNVIDPLR NNNLGRS
Sbjct: 256 TAEPPRMDSGELLLNKSFLDTCSSAYGVVPRTQENQGQPFVSKHFNVIDPLRANNNLGRS 315
Query: 302 VSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
VSKGNFFRIR+AF + AK L +LL+CP EDL E+NQFF NT RH
Sbjct: 316 VSKGNFFRIRSAFAYGAKRLGKLLECPKEDLIAELNQFFTNTWIRH 361
>gi|218192316|gb|EEC74743.1| hypothetical protein OsI_10487 [Oryza sativa Indica Group]
Length = 1316
Score = 550 bits (1417), Expect = e-154, Method: Compositional matrix adjust.
Identities = 272/347 (78%), Positives = 293/347 (84%), Gaps = 1/347 (0%)
Query: 2 VIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVP 61
V R LD RW AE TAELIARIQP+ SE RR AV YVRRLI C CQVFTFGSVP
Sbjct: 14 VTRRLDGERWAAAEVRTAELIARIQPNADSERRRRAVYDYVRRLITNCLSCQVFTFGSVP 73
Query: 62 LKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIK 121
LKTYLPD DID+ AFSD + LKDTWA+LVRD LE+EEK+E+AEFRVKEVQYIQAEVKIIK
Sbjct: 74 LKTYLPDGDIDVTAFSDSEELKDTWANLVRDALEHEEKSENAEFRVKEVQYIQAEVKIIK 133
Query: 122 CLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHG 181
CLVDN VVDI+FNQ+GGLCTLCFL+EVD LI++NHLFKRSIILIKAWC+YESRILG HHG
Sbjct: 134 CLVDNIVVDISFNQVGGLCTLCFLEEVDALISQNHLFKRSIILIKAWCFYESRILGAHHG 193
Query: 182 LISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDV 241
LIS+YAL TLVLYIFHVFN F GPLEVLYRFLEFFS FDW+ FCLSL GPVPIS LPD+
Sbjct: 194 LISTYALETLVLYIFHVFNNCFTGPLEVLYRFLEFFSNFDWEKFCLSLSGPVPISSLPDM 253
Query: 242 TAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQG-QPFVSKHFNVIDPLRVNNNLGR 300
TAEPPR D LLLSKSFLD C YAYA P QE+QG QPFVSKHFNVIDPLR NNNLGR
Sbjct: 254 TAEPPRMDAAELLLSKSFLDKCSYAYAVTPRIQESQGQQPFVSKHFNVIDPLRTNNNLGR 313
Query: 301 SVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
SVSKGNFFRIR+AF+F AK LA+LL+CP EDL EVNQFF NT RH
Sbjct: 314 SVSKGNFFRIRSAFSFGAKRLAKLLECPKEDLIAEVNQFFTNTWIRH 360
>gi|357113459|ref|XP_003558520.1| PREDICTED: uncharacterized protein LOC100841269 [Brachypodium
distachyon]
Length = 1305
Score = 550 bits (1417), Expect = e-154, Method: Compositional matrix adjust.
Identities = 264/346 (76%), Positives = 290/346 (83%)
Query: 2 VIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVP 61
V R LDP RW AE TAELIARIQP+ SE RR AV YVRRLI+ C C+VFTFGSVP
Sbjct: 14 VTRRLDPERWAVAESRTAELIARIQPNAHSEGRRLAVYNYVRRLIMNCLSCEVFTFGSVP 73
Query: 62 LKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIK 121
LKTYLPD DID+ AFS+ + LKDTWA+LVRD LE+EEK+E+AEF VKEVQYIQAEVKIIK
Sbjct: 74 LKTYLPDGDIDVTAFSNSEELKDTWANLVRDALEHEEKSENAEFCVKEVQYIQAEVKIIK 133
Query: 122 CLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHG 181
CLVDN VVDI+FNQ+GGLCTLCFL+EVD+LIN +HLFKRSIIL+KAWC+YESRILG HHG
Sbjct: 134 CLVDNIVVDISFNQVGGLCTLCFLEEVDNLINHSHLFKRSIILVKAWCFYESRILGAHHG 193
Query: 182 LISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDV 241
LIS+YAL TLVLYIFHVFN SF GPLEVLYRFLEFF FDW+ FCLSLWGPVPIS LPD+
Sbjct: 194 LISTYALETLVLYIFHVFNNSFTGPLEVLYRFLEFFGNFDWEKFCLSLWGPVPISSLPDM 253
Query: 242 TAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRS 301
TAEPPR D G LLL K FLD+C AY P QE QGQPFVSKHFNVIDPLR NNNLGRS
Sbjct: 254 TAEPPRMDTGELLLGKPFLDNCNQAYGVMPRTQETQGQPFVSKHFNVIDPLRTNNNLGRS 313
Query: 302 VSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
V KGN+FRIR+AF F AK LA+LL+CP ED+ EVNQFF NT RH
Sbjct: 314 VGKGNYFRIRSAFCFGAKKLAKLLECPKEDIITEVNQFFTNTLTRH 359
>gi|108706800|gb|ABF94595.1| Nucleotidyltransferase domain containing protein, expressed [Oryza
sativa Japonica Group]
Length = 1316
Score = 550 bits (1417), Expect = e-154, Method: Compositional matrix adjust.
Identities = 272/347 (78%), Positives = 293/347 (84%), Gaps = 1/347 (0%)
Query: 2 VIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVP 61
V R LD RW AE TAELIARIQP+ SE RR AV YVRRLI C CQVFTFGSVP
Sbjct: 14 VTRRLDGERWAAAEVRTAELIARIQPNADSERRRRAVYDYVRRLITNCLSCQVFTFGSVP 73
Query: 62 LKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIK 121
LKTYLPD DID+ AFSD + LKDTWA+LVRD LE+EEK+E+AEFRVKEVQYIQAEVKIIK
Sbjct: 74 LKTYLPDGDIDVTAFSDSEELKDTWANLVRDALEHEEKSENAEFRVKEVQYIQAEVKIIK 133
Query: 122 CLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHG 181
CLVDN VVDI+FNQ+GGLCTLCFL+EVD LI++NHLFKRSIILIKAWC+YESRILG HHG
Sbjct: 134 CLVDNIVVDISFNQVGGLCTLCFLEEVDALISQNHLFKRSIILIKAWCFYESRILGAHHG 193
Query: 182 LISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDV 241
LIS+YAL TLVLYIFHVFN F GPLEVLYRFLEFFS FDW+ FCLSL GPVPIS LPD+
Sbjct: 194 LISTYALETLVLYIFHVFNNCFTGPLEVLYRFLEFFSNFDWEKFCLSLSGPVPISSLPDM 253
Query: 242 TAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQG-QPFVSKHFNVIDPLRVNNNLGR 300
TAEPPR D LLLSKSFLD C YAYA P QE+QG QPFVSKHFNVIDPLR NNNLGR
Sbjct: 254 TAEPPRMDAAELLLSKSFLDKCSYAYAVTPRIQESQGQQPFVSKHFNVIDPLRTNNNLGR 313
Query: 301 SVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
SVSKGNFFRIR+AF+F AK LA+LL+CP EDL EVNQFF NT RH
Sbjct: 314 SVSKGNFFRIRSAFSFGAKRLAKLLECPKEDLIAEVNQFFTNTWIRH 360
>gi|414865287|tpg|DAA43844.1| TPA: hypothetical protein ZEAMMB73_609786 [Zea mays]
gi|414865288|tpg|DAA43845.1| TPA: hypothetical protein ZEAMMB73_609786 [Zea mays]
Length = 1332
Score = 548 bits (1412), Expect = e-153, Method: Compositional matrix adjust.
Identities = 262/346 (75%), Positives = 292/346 (84%)
Query: 2 VIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVP 61
V R LDP RW AE TAELIARIQP+ +SE RR AV YV+RLI+ C CQVFTFGSVP
Sbjct: 16 VTRRLDPERWAVAEGRTAELIARIQPNAYSEGRRLAVYHYVQRLIMNCLSCQVFTFGSVP 75
Query: 62 LKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIK 121
LKTYLPD DID+ AFS+ + LK+ WA+LVRD LE EEKNE+AEF VKEVQYIQAEVKIIK
Sbjct: 76 LKTYLPDGDIDVTAFSNSEELKEIWANLVRDALEREEKNENAEFHVKEVQYIQAEVKIIK 135
Query: 122 CLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHG 181
CLV+N VVDI+FNQ+GGLCTLCFL+E+D+LI+ENHLFKRSIILIKAWC+YESRILG HHG
Sbjct: 136 CLVENIVVDISFNQVGGLCTLCFLEEIDNLISENHLFKRSIILIKAWCFYESRILGAHHG 195
Query: 182 LISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDV 241
LIS+YAL TLVLYIFH+FN SF GPLEVLYRFLEFFS FDW+ FCLSLWGPVPIS LPD+
Sbjct: 196 LISTYALETLVLYIFHIFNNSFTGPLEVLYRFLEFFSNFDWEKFCLSLWGPVPISSLPDM 255
Query: 242 TAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRS 301
TAEPPR D G LLL+KSFLD+C AY P QEN QPF+SKHFNVIDPLR NNNLGRS
Sbjct: 256 TAEPPRIDSGELLLNKSFLDTCSSAYGVVPHTQENHSQPFISKHFNVIDPLRTNNNLGRS 315
Query: 302 VSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
VSKGNFFRIR+AF + AK L +LL+CP EDL E+NQFF NT RH
Sbjct: 316 VSKGNFFRIRSAFAYGAKRLGKLLECPKEDLIGELNQFFTNTWIRH 361
>gi|414865289|tpg|DAA43846.1| TPA: hypothetical protein ZEAMMB73_609786 [Zea mays]
Length = 1348
Score = 548 bits (1412), Expect = e-153, Method: Compositional matrix adjust.
Identities = 262/346 (75%), Positives = 292/346 (84%)
Query: 2 VIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVP 61
V R LDP RW AE TAELIARIQP+ +SE RR AV YV+RLI+ C CQVFTFGSVP
Sbjct: 16 VTRRLDPERWAVAEGRTAELIARIQPNAYSEGRRLAVYHYVQRLIMNCLSCQVFTFGSVP 75
Query: 62 LKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIK 121
LKTYLPD DID+ AFS+ + LK+ WA+LVRD LE EEKNE+AEF VKEVQYIQAEVKIIK
Sbjct: 76 LKTYLPDGDIDVTAFSNSEELKEIWANLVRDALEREEKNENAEFHVKEVQYIQAEVKIIK 135
Query: 122 CLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHG 181
CLV+N VVDI+FNQ+GGLCTLCFL+E+D+LI+ENHLFKRSIILIKAWC+YESRILG HHG
Sbjct: 136 CLVENIVVDISFNQVGGLCTLCFLEEIDNLISENHLFKRSIILIKAWCFYESRILGAHHG 195
Query: 182 LISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDV 241
LIS+YAL TLVLYIFH+FN SF GPLEVLYRFLEFFS FDW+ FCLSLWGPVPIS LPD+
Sbjct: 196 LISTYALETLVLYIFHIFNNSFTGPLEVLYRFLEFFSNFDWEKFCLSLWGPVPISSLPDM 255
Query: 242 TAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRS 301
TAEPPR D G LLL+KSFLD+C AY P QEN QPF+SKHFNVIDPLR NNNLGRS
Sbjct: 256 TAEPPRIDSGELLLNKSFLDTCSSAYGVVPHTQENHSQPFISKHFNVIDPLRTNNNLGRS 315
Query: 302 VSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
VSKGNFFRIR+AF + AK L +LL+CP EDL E+NQFF NT RH
Sbjct: 316 VSKGNFFRIRSAFAYGAKRLGKLLECPKEDLIGELNQFFTNTWIRH 361
>gi|6850860|emb|CAB71099.1| putative protein [Arabidopsis thaliana]
Length = 1388
Score = 542 bits (1397), Expect = e-152, Method: Compositional matrix adjust.
Identities = 271/348 (77%), Positives = 303/348 (87%), Gaps = 1/348 (0%)
Query: 1 SVIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGS 59
SV RPLD RW KAE+ TA+LIA IQP+P SE+RRNAVA+YVRRLI++CFP Q+F FGS
Sbjct: 29 SVTRPLDAERWAKAEDRTAKLIACIQPNPPSEDRRNAVASYVRRLIMECFPQVQIFMFGS 88
Query: 60 VPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKI 119
VPLKTYLPD DIDL AFS +Q LKD+WA+LVRDMLE EEKNE+AEF VKEVQYIQAEVKI
Sbjct: 89 VPLKTYLPDGDIDLTAFSANQNLKDSWANLVRDMLEKEEKNENAEFHVKEVQYIQAEVKI 148
Query: 120 IKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGH 179
IKCLV+N VVDI+FNQ+GGLCTLCFL+EVDH IN+NHLFKRSIILIKAWCYYESRILG H
Sbjct: 149 IKCLVENIVVDISFNQIGGLCTLCFLEEVDHYINQNHLFKRSIILIKAWCYYESRILGAH 208
Query: 180 HGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLP 239
HGLIS+YAL TLVLYIF++FN SF+GPLEVLYRFLEFFSKFDW NFCLSLWGPVP+S LP
Sbjct: 209 HGLISTYALETLVLYIFYLFNNSFSGPLEVLYRFLEFFSKFDWQNFCLSLWGPVPVSSLP 268
Query: 240 DVTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLG 299
DVTAEPPR+D G L +S++F +C YA QE QGQPFVSKHFNVIDPLR NNNLG
Sbjct: 269 DVTAEPPRRDVGELRVSEAFYRACSRVYAVNIAPQEIQGQPFVSKHFNVIDPLRENNNLG 328
Query: 300 RSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
RSVSKGNFFRIR+AFT AK L RLL+CP E+L +EVNQFFMNT +RH
Sbjct: 329 RSVSKGNFFRIRSAFTLGAKKLTRLLECPKENLIHEVNQFFMNTWERH 376
>gi|297817502|ref|XP_002876634.1| nucleotidyltransferase [Arabidopsis lyrata subsp. lyrata]
gi|297322472|gb|EFH52893.1| nucleotidyltransferase [Arabidopsis lyrata subsp. lyrata]
Length = 1302
Score = 540 bits (1392), Expect = e-151, Method: Compositional matrix adjust.
Identities = 271/348 (77%), Positives = 302/348 (86%), Gaps = 1/348 (0%)
Query: 1 SVIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGS 59
SV R LD RW KAE+ TA+LIA IQP+P SE+RRNAVA+YVRRLI++CFP Q+F FGS
Sbjct: 29 SVTRQLDAERWAKAEDRTAKLIACIQPNPPSEDRRNAVASYVRRLIMECFPQVQIFMFGS 88
Query: 60 VPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKI 119
VPLKTYLPD DIDL AFS +Q LKD+WA+LVRDMLE EEKNE+AEF VKEVQYIQAEVKI
Sbjct: 89 VPLKTYLPDGDIDLTAFSANQNLKDSWANLVRDMLEKEEKNENAEFHVKEVQYIQAEVKI 148
Query: 120 IKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGH 179
IKCLV+N VVDI+FNQ+GGLCTLCFL+EVDH IN+NHLFKRSIILIKAWCYYESRILG H
Sbjct: 149 IKCLVENIVVDISFNQIGGLCTLCFLEEVDHYINQNHLFKRSIILIKAWCYYESRILGAH 208
Query: 180 HGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLP 239
HGLIS+YAL TLVLYIF++FN SF+GPLEVLYRFLEFFSKFDW NFCLSLWGPVP+S LP
Sbjct: 209 HGLISTYALETLVLYIFYLFNNSFSGPLEVLYRFLEFFSKFDWQNFCLSLWGPVPVSSLP 268
Query: 240 DVTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLG 299
DVTA PPRKD G L +S++F +C YA QE QGQPFVSKHFNVIDPLR NNNLG
Sbjct: 269 DVTAAPPRKDVGELRVSEAFYRACSKVYAVNIAPQEIQGQPFVSKHFNVIDPLRENNNLG 328
Query: 300 RSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
RSVSKGNFFRIR+AFT AK LARLL+CP E+L +EVNQFFMNT +RH
Sbjct: 329 RSVSKGNFFRIRSAFTLGAKKLARLLECPKENLIHEVNQFFMNTWERH 376
>gi|413956606|gb|AFW89255.1| hypothetical protein ZEAMMB73_893455 [Zea mays]
Length = 1316
Score = 538 bits (1387), Expect = e-150, Method: Compositional matrix adjust.
Identities = 259/346 (74%), Positives = 292/346 (84%)
Query: 2 VIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVP 61
+ R LDP RW AE+ TAELIA IQP+ +SE RR AV YV+RLI+ C CQVFTFGSVP
Sbjct: 16 MTRRLDPERWAVAEDRTAELIACIQPNVYSEGRRLAVYHYVQRLIMNCLSCQVFTFGSVP 75
Query: 62 LKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIK 121
LKTYLPD DID+ AFS+ + LK+ WA+LVRD LE EEK+E+AEF VKEVQYIQAEVKIIK
Sbjct: 76 LKTYLPDGDIDVTAFSNSEELKEIWANLVRDALEREEKDENAEFHVKEVQYIQAEVKIIK 135
Query: 122 CLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHG 181
CLV+N VVDI+FNQ+GGLCTLCFL+E+D+LI++NHLFKRSIILIKAWC+YESRILG HHG
Sbjct: 136 CLVENIVVDISFNQVGGLCTLCFLEEIDNLISQNHLFKRSIILIKAWCFYESRILGAHHG 195
Query: 182 LISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDV 241
LIS+YAL TLVLYIFH+FN SF GPLEVLYRFLEFFS FDW+ FCLSLWGPVPIS LPD+
Sbjct: 196 LISTYALETLVLYIFHIFNNSFTGPLEVLYRFLEFFSNFDWEKFCLSLWGPVPISSLPDM 255
Query: 242 TAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRS 301
TA PPR D G LLL+KSFLD+C AY P QENQGQPFVSKHFNVIDPLR NNNLGRS
Sbjct: 256 TAIPPRMDSGELLLNKSFLDTCSSAYGVVPHTQENQGQPFVSKHFNVIDPLRTNNNLGRS 315
Query: 302 VSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
VSKGNFFRIR+AF + AK L +LL+CP E L E+NQFF NT RH
Sbjct: 316 VSKGNFFRIRSAFAYGAKRLGKLLECPKEALIPELNQFFTNTWIRH 361
>gi|224118186|ref|XP_002317752.1| predicted protein [Populus trichocarpa]
gi|222858425|gb|EEE95972.1| predicted protein [Populus trichocarpa]
Length = 353
Score = 536 bits (1380), Expect = e-150, Method: Compositional matrix adjust.
Identities = 260/342 (76%), Positives = 293/342 (85%)
Query: 6 LDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTY 65
L+ RW AEE TAELIA IQP+ SEERR AV YV+RLI++CFPCQVFTFGSVPLKTY
Sbjct: 1 LELERWAIAEERTAELIACIQPNQPSEERRTAVLGYVQRLIMKCFPCQVFTFGSVPLKTY 60
Query: 66 LPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVD 125
LPD DID+ F++ Q LK TWA V+D+L++EEK+E+AEF VKEVQYIQAEVKIIKCLV+
Sbjct: 61 LPDGDIDITVFTESQDLKKTWADEVKDILQHEEKSENAEFHVKEVQYIQAEVKIIKCLVE 120
Query: 126 NFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISS 185
N VVDI+FNQLGGLCTLCFL+EVD LI++NHLFKRSIILIKAWCYYESRILG HHGLIS+
Sbjct: 121 NIVVDISFNQLGGLCTLCFLEEVDQLISQNHLFKRSIILIKAWCYYESRILGAHHGLIST 180
Query: 186 YALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEP 245
YAL TLVLYIFHVFN FAGPLEVLYRFLEFFSKFDW++FC+SLWGPVPIS LP+VTA
Sbjct: 181 YALETLVLYIFHVFNNRFAGPLEVLYRFLEFFSKFDWEHFCISLWGPVPISSLPNVTALS 240
Query: 246 PRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKG 305
PR+DGG +LLS+ FL+ C YA FP QENQ Q FVSK+FNVIDPLR NNNLGRSVSKG
Sbjct: 241 PREDGGQILLSQLFLEVCSSVYAVFPSQQENQEQSFVSKYFNVIDPLRTNNNLGRSVSKG 300
Query: 306 NFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
NF+RIR+AF F A+ LARLLDCP E+L E NQFFMNT DRH
Sbjct: 301 NFYRIRSAFAFGAQRLARLLDCPKENLLAEFNQFFMNTWDRH 342
>gi|147867191|emb|CAN79954.1| hypothetical protein VITISV_027426 [Vitis vinifera]
Length = 1388
Score = 491 bits (1263), Expect = e-136, Method: Composition-based stats.
Identities = 239/347 (68%), Positives = 265/347 (76%), Gaps = 31/347 (8%)
Query: 1 SVIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSV 60
SV R LD R AEE T +LIA IQP+ SEERR AVA+YV+ LI++CF C+VF FGSV
Sbjct: 25 SVTRALDQERLSLAEERTKQLIACIQPNQPSEERREAVASYVKSLIMKCFSCKVFPFGSV 84
Query: 61 PLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKII 120
PLKTYLPD DIDL AFS LKDTWA+ VRD+LE EEK+ AEFRVKEVQYIQAEV
Sbjct: 85 PLKTYLPDGDIDLTAFSKSPNLKDTWANEVRDILEREEKSGDAEFRVKEVQYIQAEV--- 141
Query: 121 KCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHH 180
DHLI++ HLFKRSIILIKAWCYYESRILG HH
Sbjct: 142 ----------------------------DHLISQKHLFKRSIILIKAWCYYESRILGAHH 173
Query: 181 GLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPD 240
GLIS+YAL TLVLYIF VFN SFAGPLEVLYRFLEFFSKFDW+N+C+SLWGPVPIS LPD
Sbjct: 174 GLISTYALETLVLYIFRVFNNSFAGPLEVLYRFLEFFSKFDWENYCVSLWGPVPISSLPD 233
Query: 241 VTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGR 300
VTA+PPRKD G LLLSK FLD+C YA P GQEN QPF+SK+FNVIDPLR NNNLGR
Sbjct: 234 VTADPPRKDSGELLLSKLFLDACSSVYAVLPVGQENPEQPFISKYFNVIDPLRTNNNLGR 293
Query: 301 SVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
SVSKGNFFRIR+AF F A+ LARLLDCP +++ EVNQFFMNT +RH
Sbjct: 294 SVSKGNFFRIRSAFAFGAQRLARLLDCPKDNVIAEVNQFFMNTWERH 340
>gi|302802985|ref|XP_002983246.1| hypothetical protein SELMODRAFT_43579 [Selaginella moellendorffii]
gi|300148931|gb|EFJ15588.1| hypothetical protein SELMODRAFT_43579 [Selaginella moellendorffii]
Length = 351
Score = 479 bits (1233), Expect = e-133, Method: Compositional matrix adjust.
Identities = 239/347 (68%), Positives = 279/347 (80%), Gaps = 11/347 (3%)
Query: 6 LDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTY 65
LD RWL+AE T ELI RIQP FSE+RR AVA YV RLI +CF C+VFTFGSVPL+TY
Sbjct: 1 LDDERWLQAENRTGELITRIQPTKFSEDRRRAVADYVERLIRKCFDCEVFTFGSVPLRTY 60
Query: 66 LPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEV-KIIKCLV 124
LPD DIDL AFS Q L+++WA+ VR +LE EE+++ AEFRVKEVQYIQAEV KIIKCLV
Sbjct: 61 LPDGDIDLTAFSGHQHLQESWANDVRAVLEAEERSKDAEFRVKEVQYIQAEVVKIIKCLV 120
Query: 125 DNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLIS 184
+N VVDI+FNQLGGLCTLCFL+EVD LI +HLFKRSIIL+KAWCYYESRILG HHGLIS
Sbjct: 121 ENIVVDISFNQLGGLCTLCFLEEVDRLIGRDHLFKRSIILVKAWCYYESRILGAHHGLIS 180
Query: 185 SYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAE 244
+YAL TLVLYIFHVF+ S GPL VLYRFLEFFS FDWD +CLSLWGP+P+S LPD+
Sbjct: 181 TYALETLVLYIFHVFHASLRGPLGVLYRFLEFFSNFDWDKYCLSLWGPIPLSALPDM--- 237
Query: 245 PPRKDGGVLLLSKSFLDSCRYAYADFPGGQEN----QGQPFVSKHFNVIDPLRVNNNLGR 300
+DGG LLL+K FLDSC AYA P G N Q + F SK+ NV+DPL+ NNLGR
Sbjct: 238 ---QDGGPLLLTKHFLDSCSRAYAVMPNGNINGSIVQSRVFGSKYLNVVDPLKTTNNLGR 294
Query: 301 SVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
SV+KGNF+RIR AF F A+ LAR+L+CP ED+ +EV++FF+NT DRH
Sbjct: 295 SVNKGNFYRIRNAFGFGARKLARILECPLEDVADEVDKFFLNTWDRH 341
>gi|302755776|ref|XP_002961312.1| hypothetical protein SELMODRAFT_70578 [Selaginella moellendorffii]
gi|300172251|gb|EFJ38851.1| hypothetical protein SELMODRAFT_70578 [Selaginella moellendorffii]
Length = 351
Score = 478 bits (1231), Expect = e-132, Method: Compositional matrix adjust.
Identities = 238/347 (68%), Positives = 279/347 (80%), Gaps = 11/347 (3%)
Query: 6 LDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTY 65
LD RW++AE T ELI RIQP FSE+RR AVA YV RLI +CF C+VFTFGSVPL+TY
Sbjct: 1 LDDERWVQAENRTGELITRIQPTKFSEDRRRAVADYVERLIRKCFDCEVFTFGSVPLRTY 60
Query: 66 LPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEV-KIIKCLV 124
LPD DIDL AFS Q L+++WA+ VR +LE EE+++ AEFRVKEVQYIQAEV KIIKCLV
Sbjct: 61 LPDGDIDLTAFSGHQHLQESWANDVRAVLEAEERSKDAEFRVKEVQYIQAEVVKIIKCLV 120
Query: 125 DNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLIS 184
+N VVDI+FNQLGGLCTLCFL+EVD LI +HLFKRSIIL+KAWCYYESRILG HHGLIS
Sbjct: 121 ENIVVDISFNQLGGLCTLCFLEEVDRLIGRDHLFKRSIILVKAWCYYESRILGAHHGLIS 180
Query: 185 SYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAE 244
+YAL TLVLYIFHVF+ S GPL VLYRFLEFFS FDWD +CLSLWGP+P+S LPD+
Sbjct: 181 TYALETLVLYIFHVFHASLRGPLGVLYRFLEFFSNFDWDKYCLSLWGPIPLSALPDM--- 237
Query: 245 PPRKDGGVLLLSKSFLDSCRYAYADFPGGQEN----QGQPFVSKHFNVIDPLRVNNNLGR 300
+DGG LLL+K FLDSC AYA P G N Q + F SK+ NV+DPL+ NNLGR
Sbjct: 238 ---QDGGPLLLTKHFLDSCSRAYAVMPNGNINGSIVQSRVFGSKYLNVVDPLKTTNNLGR 294
Query: 301 SVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
SV+KGNF+RIR AF F A+ LAR+L+CP ED+ +EV++FF+NT DRH
Sbjct: 295 SVNKGNFYRIRNAFGFGARKLARILECPLEDVADEVDKFFLNTWDRH 341
>gi|147820621|emb|CAN67650.1| hypothetical protein VITISV_005081 [Vitis vinifera]
Length = 1572
Score = 444 bits (1143), Expect = e-122, Method: Compositional matrix adjust.
Identities = 222/301 (73%), Positives = 242/301 (80%), Gaps = 29/301 (9%)
Query: 47 IQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFR 106
++C +VFTFGSVPLKTYLPD DIDL AFS++Q LKDTWA+
Sbjct: 222 VKC-ATRVFTFGSVPLKTYLPDGDIDLTAFSNNQNLKDTWAN------------------ 262
Query: 107 VKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIK 166
+VKIIKCLV+N VVDI+FNQLGGLCTLCFL+EVDHLIN+NHLFKRSIILIK
Sbjct: 263 ---------QVKIIKCLVENIVVDISFNQLGGLCTLCFLEEVDHLINQNHLFKRSIILIK 313
Query: 167 AWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFC 226
AWCYYESRILG HHGLIS+YAL TLVLYIFHVFN SF GPLEVLYRFLEFFS FDWDNFC
Sbjct: 314 AWCYYESRILGAHHGLISTYALETLVLYIFHVFNNSFTGPLEVLYRFLEFFSSFDWDNFC 373
Query: 227 LSLWGPVPISLLPDVTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHF 286
+SLWGPVPIS LPDVTAEPPR+D G LLLSK FLD+C YA FP GQE QGQ F+SKHF
Sbjct: 374 VSLWGPVPISSLPDVTAEPPRQDSGELLLSKLFLDACSSVYAVFPHGQEKQGQSFISKHF 433
Query: 287 NVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDR 346
NVIDPLRVNNNLGRSVSKGNFFRIR+AF F AK LARLLD P E++ EVNQ FMNT +R
Sbjct: 434 NVIDPLRVNNNLGRSVSKGNFFRIRSAFAFGAKRLARLLD-PKENIIFEVNQLFMNTWER 492
Query: 347 H 347
H
Sbjct: 493 H 493
>gi|358347363|ref|XP_003637727.1| hypothetical protein MTR_100s0017, partial [Medicago truncatula]
gi|355503662|gb|AES84865.1| hypothetical protein MTR_100s0017, partial [Medicago truncatula]
Length = 827
Score = 422 bits (1085), Expect = e-115, Method: Compositional matrix adjust.
Identities = 197/231 (85%), Positives = 210/231 (90%)
Query: 117 VKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRIL 176
VK++KCLV+N VVDI+FNQLGGLCTLCFL+EVD LIN NHLFKRSIILIKAWCYYESRIL
Sbjct: 109 VKLVKCLVENIVVDISFNQLGGLCTLCFLEEVDGLINHNHLFKRSIILIKAWCYYESRIL 168
Query: 177 GGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPIS 236
G HHGLIS+YAL TLVLYIFHVFN SFAGPLEVLYRFLEFFSKFDWDNFC+SLWGPVPIS
Sbjct: 169 GAHHGLISTYALETLVLYIFHVFNNSFAGPLEVLYRFLEFFSKFDWDNFCVSLWGPVPIS 228
Query: 237 LLPDVTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNN 296
LPDVTAEPPRKD G LLL KSFLD+C YA FPGG ENQGQPFVSKHFNVIDPLRVNN
Sbjct: 229 SLPDVTAEPPRKDAGELLLHKSFLDACSTVYAVFPGGPENQGQPFVSKHFNVIDPLRVNN 288
Query: 297 NLGRSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
NLGRSVSKGNFFRIR+AF F AK LARLLDCP ++L+ EVNQFF+NT DRH
Sbjct: 289 NLGRSVSKGNFFRIRSAFAFGAKKLARLLDCPKDELFLEVNQFFLNTWDRH 339
>gi|449449962|ref|XP_004142733.1| PREDICTED: uncharacterized protein LOC101207419 [Cucumis sativus]
Length = 898
Score = 406 bits (1044), Expect = e-111, Method: Compositional matrix adjust.
Identities = 200/343 (58%), Positives = 254/343 (74%), Gaps = 1/343 (0%)
Query: 5 PLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKT 64
P+ W +AEE T +I+++QP SE RR AV YV+RLI C+VF FGSVPLKT
Sbjct: 38 PIGVDYWRRAEEATQAIISQVQPTVVSERRRKAVIDYVQRLIRGRLRCEVFPFGSVPLKT 97
Query: 65 YLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLV 124
YLPD DIDL A +++ A V +L +E++N AEF VK+VQ I+AEVK++KCLV
Sbjct: 98 YLPDGDIDLTALGG-SNVEEALASDVCSVLNSEDQNGAAEFVVKDVQLIRAEVKLVKCLV 156
Query: 125 DNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLIS 184
N VVDI+FNQLGGLCTLCFL+++D I ++HLFKRSIILIKAWCYYESRILG HHGLIS
Sbjct: 157 QNIVVDISFNQLGGLCTLCFLEKIDRRIGKDHLFKRSIILIKAWCYYESRILGAHHGLIS 216
Query: 185 SYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAE 244
+YAL TLVLYIFH+F+ + GPL+VLY+FL++FSKFDWDN+C+SL GPV IS LP++ AE
Sbjct: 217 TYALETLVLYIFHLFHSALNGPLQVLYKFLDYFSKFDWDNYCISLNGPVRISSLPELVAE 276
Query: 245 PPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSK 304
P GG LLLS FL SC ++ G E + F KH N++DPL+ NNNLGRSVSK
Sbjct: 277 TPDNGGGDLLLSTDFLQSCLETFSVPARGYEANSRAFPIKHLNIVDPLKENNNLGRSVSK 336
Query: 305 GNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
GNF+RIR+AF++ A+ L +L P +++ +EV +FF NT DRH
Sbjct: 337 GNFYRIRSAFSYGARKLGFILSHPEDNVVDEVRKFFSNTLDRH 379
>gi|357463851|ref|XP_003602207.1| Poly(A) RNA polymerase cid14 [Medicago truncatula]
gi|355491255|gb|AES72458.1| Poly(A) RNA polymerase cid14 [Medicago truncatula]
Length = 768
Score = 396 bits (1018), Expect = e-108, Method: Compositional matrix adjust.
Identities = 191/346 (55%), Positives = 261/346 (75%), Gaps = 2/346 (0%)
Query: 2 VIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVP 61
V + L+ +W + E+ T EL+ ++P+P SE RN + +Y++ LII P +VF FGSVP
Sbjct: 18 VPKVLERSKWSQVEDRTIELLQFLEPNPKSETLRNNIVSYIKGLIISHVPVKVFEFGSVP 77
Query: 62 LKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIK 121
LKTYL D DIDL F +++ + + ++ +LE+E NE ++FRVKEVQ + AEVKIIK
Sbjct: 78 LKTYLRDGDIDLTIFGNNELFPEIFIPHIQQILESEMNNEFSKFRVKEVQLVNAEVKIIK 137
Query: 122 CLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHG 181
CLV+ FV+DI+FNQL GLC+LCFLDEVD+LI+ NH+FKRS+ILIKAWCY+ESR+LG G
Sbjct: 138 CLVEKFVIDISFNQLSGLCSLCFLDEVDYLISRNHIFKRSVILIKAWCYHESRLLGSKSG 197
Query: 182 LISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDV 241
L S+YAL LVLY+F+++N F GPLEVL+RFLEFFSKFDW N+C+SL GPVP+ LP++
Sbjct: 198 LFSTYALEILVLYLFNLYNNEFVGPLEVLFRFLEFFSKFDWGNYCISLSGPVPLDSLPNM 257
Query: 242 TAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRS 301
TA+ PRKD LLL++SFL + ++ Y Q+N+ + FVSKH N+IDPL+ NNNLG S
Sbjct: 258 TADCPRKDRQDLLLTESFLIASKFCYG--WRNQKNREKHFVSKHINIIDPLQENNNLGHS 315
Query: 302 VSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
+S+GNFFRI++A + A+ + R+LDC +E L +E + FF NT +RH
Sbjct: 316 ISRGNFFRIKSAIAYGAEQMMRILDCTDEYLISEFDHFFENTWNRH 361
>gi|359481238|ref|XP_002266958.2| PREDICTED: uncharacterized protein LOC100258499 [Vitis vinifera]
Length = 884
Score = 396 bits (1017), Expect = e-108, Method: Compositional matrix adjust.
Identities = 198/339 (58%), Positives = 246/339 (72%), Gaps = 1/339 (0%)
Query: 9 GRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPD 68
+W +AE E+I +QP SEERR V YV+ LI C+VF FGSVPLKTYLPD
Sbjct: 37 AQWARAENTVQEIICEVQPTEVSEERRKEVVDYVQGLIRVRVGCEVFPFGSVPLKTYLPD 96
Query: 69 RDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFV 128
DIDL AF ++DT A+ V +LE E++N AEF VK+VQ I AEVK++KCLV N V
Sbjct: 97 GDIDLTAFGG-PAVEDTLAYEVYSVLEAEDQNRAAEFVVKDVQLIHAEVKLVKCLVQNIV 155
Query: 129 VDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYAL 188
VDI+FNQLGGLCTLCFL+++D LI ++HLFKRSIILIKAWCYYESRILG HHGLIS+YAL
Sbjct: 156 VDISFNQLGGLCTLCFLEQIDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYAL 215
Query: 189 VTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRK 248
TLVLYIF +F+ GPL VLY+FL++FSKFDWDN+C+SL GPV IS LP++ AE P
Sbjct: 216 ETLVLYIFLLFHSLLNGPLAVLYKFLDYFSKFDWDNYCVSLNGPVRISSLPEMIAETPEN 275
Query: 249 DGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFF 308
G LL+ L C ++ G E + FV KHFN++DPL+ NNNLGRSVSKGNF+
Sbjct: 276 VGADPLLNNDILRDCLDRFSVPSRGLETNSRTFVQKHFNIVDPLKENNNLGRSVSKGNFY 335
Query: 309 RIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
RIR+AFT+ A+ L R+L P + + E+ +FF NT +RH
Sbjct: 336 RIRSAFTYGARKLGRILLQPEDKISEELCKFFTNTLERH 374
>gi|297735556|emb|CBI18050.3| unnamed protein product [Vitis vinifera]
Length = 824
Score = 395 bits (1016), Expect = e-107, Method: Compositional matrix adjust.
Identities = 198/339 (58%), Positives = 246/339 (72%), Gaps = 1/339 (0%)
Query: 9 GRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPD 68
+W +AE E+I +QP SEERR V YV+ LI C+VF FGSVPLKTYLPD
Sbjct: 37 AQWARAENTVQEIICEVQPTEVSEERRKEVVDYVQGLIRVRVGCEVFPFGSVPLKTYLPD 96
Query: 69 RDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFV 128
DIDL AF ++DT A+ V +LE E++N AEF VK+VQ I AEVK++KCLV N V
Sbjct: 97 GDIDLTAFGG-PAVEDTLAYEVYSVLEAEDQNRAAEFVVKDVQLIHAEVKLVKCLVQNIV 155
Query: 129 VDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYAL 188
VDI+FNQLGGLCTLCFL+++D LI ++HLFKRSIILIKAWCYYESRILG HHGLIS+YAL
Sbjct: 156 VDISFNQLGGLCTLCFLEQIDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYAL 215
Query: 189 VTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRK 248
TLVLYIF +F+ GPL VLY+FL++FSKFDWDN+C+SL GPV IS LP++ AE P
Sbjct: 216 ETLVLYIFLLFHSLLNGPLAVLYKFLDYFSKFDWDNYCVSLNGPVRISSLPEMIAETPEN 275
Query: 249 DGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFF 308
G LL+ L C ++ G E + FV KHFN++DPL+ NNNLGRSVSKGNF+
Sbjct: 276 VGADPLLNNDILRDCLDRFSVPSRGLETNSRTFVQKHFNIVDPLKENNNLGRSVSKGNFY 335
Query: 309 RIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
RIR+AFT+ A+ L R+L P + + E+ +FF NT +RH
Sbjct: 336 RIRSAFTYGARKLGRILLQPEDKISEELCKFFTNTLERH 374
>gi|222616508|gb|EEE52640.1| hypothetical protein OsJ_34991 [Oryza sativa Japonica Group]
Length = 801
Score = 385 bits (988), Expect = e-104, Method: Compositional matrix adjust.
Identities = 192/339 (56%), Positives = 240/339 (70%), Gaps = 6/339 (1%)
Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRD 70
W E ++ARIQP+P SE+RR AV AYV+ L+ CQVF FGSVPLKTYLPD D
Sbjct: 30 WDPLEAAAGAVVARIQPNPPSEDRRAAVIAYVQGLLRFNVGCQVFPFGSVPLKTYLPDGD 89
Query: 71 IDLGAF--SDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFV 128
IDL AF S D+ L A V+ +LE+EE + AEF VK+VQYI AEVK++KC+V N +
Sbjct: 90 IDLTAFGHSSDEIL----AKQVQAVLESEEARKDAEFEVKDVQYIHAEVKLVKCIVQNII 145
Query: 129 VDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYAL 188
VDI+FNQ GGLCTLCFL++VD +NHLFKRSI+LIKAWCYYESRILG HHGLIS+YAL
Sbjct: 146 VDISFNQFGGLCTLCFLEKVDQKFEKNHLFKRSIMLIKAWCYYESRILGAHHGLISTYAL 205
Query: 189 VTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRK 248
LVLYIFH+F+G+ GPL VLYRFL+++SKFDWDN +SL+GP+ +S LP++ + P
Sbjct: 206 EILVLYIFHLFHGTLDGPLAVLYRFLDYYSKFDWDNKGISLYGPISLSSLPELVTDSPDT 265
Query: 249 DGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFF 308
+ + FL C + P E Q F K FN++DPL+ +NNLGRSVSKGNF
Sbjct: 266 VNDDFTMREDFLKECAQWFTVLPRNSEKNTQVFPRKFFNIVDPLKQSNNLGRSVSKGNFL 325
Query: 309 RIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
RIR+AF F A+ L ++L P+ +EVNQFF NT RH
Sbjct: 326 RIRSAFDFGARKLGKILQVPDNFTVDEVNQFFRNTLKRH 364
>gi|77548394|gb|ABA91191.1| nucleotidyltransferase family protein, putative, expressed [Oryza
sativa Japonica Group]
Length = 783
Score = 384 bits (987), Expect = e-104, Method: Compositional matrix adjust.
Identities = 191/344 (55%), Positives = 241/344 (70%), Gaps = 6/344 (1%)
Query: 6 LDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTY 65
+ P W E ++ARIQP+P SE+RR AV AYV+ L+ CQVF FGSVPLKTY
Sbjct: 25 ISPEAWDPLEAAAGAVVARIQPNPPSEDRRAAVIAYVQHLLRCTVGCQVFPFGSVPLKTY 84
Query: 66 LPDRDIDLGAF--SDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL 123
LPD DIDL AF S D+ L A V+ +LE+EE + AEF VK+VQYI AEVK++KC+
Sbjct: 85 LPDGDIDLTAFGHSSDEIL----AKQVQAVLESEEARKDAEFEVKDVQYIHAEVKLVKCI 140
Query: 124 VDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLI 183
V N +VDI+FNQ GGLCTLCFL++VD + HLFKRSI+LIKAWCYYESRILG HHGLI
Sbjct: 141 VQNIIVDISFNQFGGLCTLCFLEKVDQKFEKYHLFKRSIMLIKAWCYYESRILGAHHGLI 200
Query: 184 SSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTA 243
S+YAL LVLYIFH+F+G+ GPL VLYRFL+++SKFDWDN +SL+GP+ +S LP++
Sbjct: 201 STYALEILVLYIFHLFHGTLDGPLAVLYRFLDYYSKFDWDNKGISLYGPISLSSLPELVT 260
Query: 244 EPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVS 303
+ P + + FL C + P E Q F K FN++DPL+ +NNLGRSVS
Sbjct: 261 DSPDTVNDDFTMREDFLKECAQWFTVLPRNSEKNTQVFPRKFFNIVDPLKQSNNLGRSVS 320
Query: 304 KGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
KGNF RIR+AF F A+ L +++ P+ +EVNQFF NT RH
Sbjct: 321 KGNFLRIRSAFDFGARKLGKIIQVPDNFTMDEVNQFFRNTLKRH 364
>gi|115483835|ref|NP_001065579.1| Os11g0114700 [Oryza sativa Japonica Group]
gi|77548393|gb|ABA91190.1| nucleotidyltransferase family protein, putative, expressed [Oryza
sativa Japonica Group]
gi|113644283|dbj|BAF27424.1| Os11g0114700 [Oryza sativa Japonica Group]
gi|215694848|dbj|BAG90039.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218185112|gb|EEC67539.1| hypothetical protein OsI_34858 [Oryza sativa Indica Group]
gi|222615390|gb|EEE51522.1| hypothetical protein OsJ_32709 [Oryza sativa Japonica Group]
Length = 801
Score = 384 bits (987), Expect = e-104, Method: Compositional matrix adjust.
Identities = 191/344 (55%), Positives = 241/344 (70%), Gaps = 6/344 (1%)
Query: 6 LDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTY 65
+ P W E ++ARIQP+P SE+RR AV AYV+ L+ CQVF FGSVPLKTY
Sbjct: 25 ISPEAWDPLEAAAGAVVARIQPNPPSEDRRAAVIAYVQHLLRCTVGCQVFPFGSVPLKTY 84
Query: 66 LPDRDIDLGAF--SDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL 123
LPD DIDL AF S D+ L A V+ +LE+EE + AEF VK+VQYI AEVK++KC+
Sbjct: 85 LPDGDIDLTAFGHSSDEIL----AKQVQAVLESEEARKDAEFEVKDVQYIHAEVKLVKCI 140
Query: 124 VDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLI 183
V N +VDI+FNQ GGLCTLCFL++VD + HLFKRSI+LIKAWCYYESRILG HHGLI
Sbjct: 141 VQNIIVDISFNQFGGLCTLCFLEKVDQKFEKYHLFKRSIMLIKAWCYYESRILGAHHGLI 200
Query: 184 SSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTA 243
S+YAL LVLYIFH+F+G+ GPL VLYRFL+++SKFDWDN +SL+GP+ +S LP++
Sbjct: 201 STYALEILVLYIFHLFHGTLDGPLAVLYRFLDYYSKFDWDNKGISLYGPISLSSLPELVT 260
Query: 244 EPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVS 303
+ P + + FL C + P E Q F K FN++DPL+ +NNLGRSVS
Sbjct: 261 DSPDTVNDDFTMREDFLKECAQWFTVLPRNSEKNTQVFPRKFFNIVDPLKQSNNLGRSVS 320
Query: 304 KGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
KGNF RIR+AF F A+ L +++ P+ +EVNQFF NT RH
Sbjct: 321 KGNFLRIRSAFDFGARKLGKIIQVPDNFTMDEVNQFFRNTLKRH 364
>gi|224124740|ref|XP_002319410.1| predicted protein [Populus trichocarpa]
gi|222857786|gb|EEE95333.1| predicted protein [Populus trichocarpa]
Length = 681
Score = 384 bits (985), Expect = e-104, Method: Compositional matrix adjust.
Identities = 191/337 (56%), Positives = 238/337 (70%), Gaps = 1/337 (0%)
Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRD 70
W +AEE+ E++ RI P S +R V YV+RLI +VF +GSVPLKTYLPD D
Sbjct: 58 WERAEEVATEIVYRIHPTVESSFKRKQVIDYVQRLIRYSLGFEVFPYGSVPLKTYLPDGD 117
Query: 71 IDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVD 130
IDL A S +++ V +L EE NE A + VK+V I AEVK+IKC+V N VVD
Sbjct: 118 IDLTAISS-PAIEEALVSDVYTVLRGEELNEDALYEVKDVHCIDAEVKLIKCIVQNTVVD 176
Query: 131 IAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVT 190
I+FNQLGGLCTLCFL+EVD L+ +NHLFKRSIILIKAWCYYESRILG HHGLIS+YAL T
Sbjct: 177 ISFNQLGGLCTLCFLEEVDRLVGKNHLFKRSIILIKAWCYYESRILGAHHGLISTYALET 236
Query: 191 LVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDG 250
L+LYIFH+F+ S GPL VLY+FL++FSKFDW+N+C+SL GPV S LP++ A+PP
Sbjct: 237 LILYIFHLFHSSLNGPLAVLYKFLDYFSKFDWENYCISLNGPVCKSSLPNIVAKPPENVS 296
Query: 251 GVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRI 310
G LLLS FL C + E +PF KH N++DPL+ NNNLGRSV++GNFFRI
Sbjct: 297 GELLLSDEFLKDCVDRFYVPSRKPEMNSRPFPQKHLNIVDPLKENNNLGRSVNRGNFFRI 356
Query: 311 RTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
R+AF + + L R+L P E + +E+ FF NT DRH
Sbjct: 357 RSAFKYGGRKLGRILLLPREKIADELKTFFANTLDRH 393
>gi|359478494|ref|XP_002276607.2| PREDICTED: uncharacterized protein LOC100253523 [Vitis vinifera]
Length = 854
Score = 383 bits (983), Expect = e-104, Method: Compositional matrix adjust.
Identities = 190/337 (56%), Positives = 243/337 (72%), Gaps = 1/337 (0%)
Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRD 70
W AE T E++A++QP S R V YV+RLI C C+VF +GSVPLKTYL D D
Sbjct: 40 WAAAERATQEIVAKMQPTLGSMRERQEVIDYVQRLIGCCLGCEVFPYGSVPLKTYLLDGD 99
Query: 71 IDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVD 130
IDL A +++ A V +L+ EE+NE+AEF VK++Q+I AEVK++KCLV + V+D
Sbjct: 100 IDLTALCS-SNVEEALASDVHAVLKGEEQNENAEFEVKDIQFITAEVKLVKCLVKDIVID 158
Query: 131 IAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVT 190
I+FNQLGGL TLCFL++VD LI ++HLFKRSIILIK+WCYYESRILG HHGLIS+YAL
Sbjct: 159 ISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKSWCYYESRILGAHHGLISTYALEI 218
Query: 191 LVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDG 250
LVLYIFH+F+ S GPL VLYRFL++FSKFDWDN+C+SL GPV S LPD+ AE P
Sbjct: 219 LVLYIFHLFHLSLDGPLAVLYRFLDYFSKFDWDNYCISLNGPVCKSSLPDIVAELPENGQ 278
Query: 251 GVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRI 310
LLLS+ FL +C ++ G E + F KH N+IDPLR NNNLGRSV+KGNF+RI
Sbjct: 279 DDLLLSEEFLRNCVDMFSVPFRGLETNSRTFPLKHLNIIDPLRENNNLGRSVNKGNFYRI 338
Query: 311 RTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
R+AF + + L ++L P E + +E+ FF +T +RH
Sbjct: 339 RSAFKYGSHKLGQILSLPREVIQDELKNFFASTLERH 375
>gi|224145449|ref|XP_002325647.1| predicted protein [Populus trichocarpa]
gi|222862522|gb|EEF00029.1| predicted protein [Populus trichocarpa]
Length = 533
Score = 382 bits (982), Expect = e-103, Method: Compositional matrix adjust.
Identities = 191/337 (56%), Positives = 239/337 (70%), Gaps = 1/337 (0%)
Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRD 70
W +AEE T E++ RI P S +R + YV+RLI +VF +GSVPLKTYLPD D
Sbjct: 58 WERAEEFTREIVYRIHPTVESNFKRKQIIGYVQRLIKSSLGFEVFPYGSVPLKTYLPDGD 117
Query: 71 IDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVD 130
IDL + S +++ + +L EE NE + F VK+V I AEVK+IKC+V N VVD
Sbjct: 118 IDLTSISS-PAIEEALVSDIHAVLRREELNEDSTFEVKDVHCIDAEVKLIKCIVQNTVVD 176
Query: 131 IAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVT 190
I+FNQLGGLCTLCFL+EVD L+ +NHLFKRSIILIKAWCYYESRILG HHGLIS+YAL T
Sbjct: 177 ISFNQLGGLCTLCFLEEVDRLVGKNHLFKRSIILIKAWCYYESRILGAHHGLISTYALET 236
Query: 191 LVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDG 250
L+LYIFH+F+ S GPL VLYRFLE+FSKFDW+N+C+SL GPV S LP++ AEP
Sbjct: 237 LILYIFHLFHCSLNGPLAVLYRFLEYFSKFDWENYCISLNGPVCKSSLPNIVAEPLENGQ 296
Query: 251 GVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRI 310
G LLLS FL C ++ E +PF KH N++DPL+ NNNLGRSV++GNFFRI
Sbjct: 297 GELLLSDEFLKDCADRFSVPSRKPEMNSRPFPQKHLNIVDPLKENNNLGRSVNRGNFFRI 356
Query: 311 RTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
R+AF + A+ L ++L P E + +E+ FF NT DRH
Sbjct: 357 RSAFKYGARKLGQILLLPKERIADELKIFFANTLDRH 393
>gi|79597803|ref|NP_850678.2| NT domain of poly(A) polymerase and terminal uridylyl
transferase-containing protein [Arabidopsis thaliana]
gi|332645293|gb|AEE78814.1| NT domain of poly(A) polymerase and terminal uridylyl
transferase-containing protein [Arabidopsis thaliana]
Length = 829
Score = 380 bits (976), Expect = e-103, Method: Compositional matrix adjust.
Identities = 191/340 (56%), Positives = 245/340 (72%), Gaps = 1/340 (0%)
Query: 8 PGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLP 67
P W++ EE T E+I ++ P SE+RR V YV++LI C+V +FGSVPLKTYLP
Sbjct: 31 PELWMRVEEATREIIEQVHPTLVSEDRRRDVILYVQKLIRMTLGCEVHSFGSVPLKTYLP 90
Query: 68 DRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNF 127
D DIDL AF ++ A V +LE EE N ++F VK+VQ I+AEVK++KCLV N
Sbjct: 91 DGDIDLTAFGG-LYHEEELAAKVFAVLEREEHNLSSQFVVKDVQLIRAEVKLVKCLVQNI 149
Query: 128 VVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYA 187
VVDI+FNQ+GG+CTLCFL+++DHLI ++HLFKRSIILIKAWCYYESRILG HGLIS+YA
Sbjct: 150 VVDISFNQIGGICTLCFLEKIDHLIGKDHLFKRSIILIKAWCYYESRILGAFHGLISTYA 209
Query: 188 LVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPR 247
L TLVLYIFH+F+ S GPL VLY+FL++FSKFDWD++C+SL GPV +S LPD+ E P
Sbjct: 210 LETLVLYIFHLFHSSLNGPLAVLYKFLDYFSKFDWDSYCISLNGPVCLSSLPDIVVETPE 269
Query: 248 KDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNF 307
G LLL+ FL C Y+ G E + F SKH N++DPL+ NNLGRSVSKGNF
Sbjct: 270 NGGEDLLLTSEFLKECLEMYSVPSRGFETNPRGFQSKHLNIVDPLKETNNLGRSVSKGNF 329
Query: 308 FRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
+RIR+AFT+ A+ L +L +E + +E+ +FF N RH
Sbjct: 330 YRIRSAFTYGARKLGQLFLQSDEAISSELRKFFSNMLLRH 369
>gi|297816424|ref|XP_002876095.1| hypothetical protein ARALYDRAFT_485514 [Arabidopsis lyrata subsp.
lyrata]
gi|297321933|gb|EFH52354.1| hypothetical protein ARALYDRAFT_485514 [Arabidopsis lyrata subsp.
lyrata]
Length = 829
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 189/340 (55%), Positives = 243/340 (71%), Gaps = 1/340 (0%)
Query: 8 PGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLP 67
P W++ EE T E+I ++ P SE+RR V YV++LI C+V +FGSVPLKTYLP
Sbjct: 31 PEFWMRVEEATREIIEQVHPTLVSEDRRRDVILYVQKLIRITLGCEVHSFGSVPLKTYLP 90
Query: 68 DRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNF 127
D DIDL AF ++ A V +LE EE N + F VK+VQ I+AEVK++KCLV N
Sbjct: 91 DGDIDLTAFGG-LYHEEELAAKVFSVLEREEHNVSSHFVVKDVQLIRAEVKLVKCLVQNI 149
Query: 128 VVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYA 187
VVDI+FNQ+GG+CTLCFL+++DHLI ++HLFKRSIILIKAWCYYESRILG HGLIS+YA
Sbjct: 150 VVDISFNQIGGICTLCFLEKIDHLIGKDHLFKRSIILIKAWCYYESRILGAFHGLISTYA 209
Query: 188 LVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPR 247
L TLVLYIFH+F+ S GPL VLY+FL++FSKFDWDN+C+SL GPV +S LP++ E P
Sbjct: 210 LETLVLYIFHLFHSSLNGPLAVLYKFLDYFSKFDWDNYCISLNGPVCLSSLPEIVVETPE 269
Query: 248 KDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNF 307
G LL+ FL C Y+ G E + F SKH N++DPL+ NNLGRSVSKGNF
Sbjct: 270 NGGEDFLLTSEFLKECMEMYSVPSRGFETNQRGFQSKHLNIVDPLKETNNLGRSVSKGNF 329
Query: 308 FRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
+RIR+AFT+ A+ L ++ +E + +E+ +FF N RH
Sbjct: 330 YRIRSAFTYGARKLGQIFLQSDEAIKSELRKFFSNMLLRH 369
>gi|255554485|ref|XP_002518281.1| nucleic acid binding protein, putative [Ricinus communis]
gi|223542501|gb|EEF44041.1| nucleic acid binding protein, putative [Ricinus communis]
Length = 821
Score = 374 bits (961), Expect = e-101, Method: Compositional matrix adjust.
Identities = 185/337 (54%), Positives = 237/337 (70%), Gaps = 1/337 (0%)
Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRD 70
W +AE+ T +++ RI P ++ R V YV+ LI QVF +GSVPLKTYLPD D
Sbjct: 51 WERAEQATLQIVYRIHPTVEADCNRKHVVEYVQSLIQSSLGFQVFPYGSVPLKTYLPDGD 110
Query: 71 IDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVD 130
IDL A + + D V +L EE+N A ++VK+V +I AEVK+IKC+V + VVD
Sbjct: 111 IDLTAIINPAGV-DASVSDVHAVLRREEQNRDAPYKVKDVHFIDAEVKLIKCIVHDIVVD 169
Query: 131 IAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVT 190
I+FNQLGGL TLCFL++VD LI ++HLFKRSIILIKAWCYYESRILG HHGLIS+YAL T
Sbjct: 170 ISFNQLGGLSTLCFLEQVDQLIGKSHLFKRSIILIKAWCYYESRILGAHHGLISTYALET 229
Query: 191 LVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDG 250
L+LYIFH+F+ S GPL VLYRFL++FSKFDWDN+C+SL GPV S LP + AEPP
Sbjct: 230 LILYIFHLFHSSLNGPLMVLYRFLDYFSKFDWDNYCISLNGPVCKSSLPKIVAEPPETGR 289
Query: 251 GVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRI 310
G LLL FL + + E +PF KH N++DPLR NNNLGRSV++GNF+RI
Sbjct: 290 GNLLLDDEFLRNSVKMLSVPSRSPEMNSRPFTQKHLNIVDPLRENNNLGRSVNRGNFYRI 349
Query: 311 RTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
R+AF + A+ L +L ++ + NE+++FF NT DRH
Sbjct: 350 RSAFKYGARKLGHILSLQSDRMINELDKFFANTLDRH 386
>gi|326531888|dbj|BAK01320.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 702
Score = 374 bits (959), Expect = e-101, Method: Compositional matrix adjust.
Identities = 186/342 (54%), Positives = 236/342 (69%), Gaps = 1/342 (0%)
Query: 6 LDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTY 65
+ P W ++ RIQP SE RR AV YV+RL+ C VF FGSVPLKTY
Sbjct: 27 ISPDAWAPFGAAALGVVGRIQPTVASEGRRAAVVDYVQRLVKCSVGCSVFPFGSVPLKTY 86
Query: 66 LPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVD 125
LPD DIDL AF + ++ A+ VR +LE+EE+ + AEF +K+VQYI AEVK++KC V
Sbjct: 87 LPDGDIDLAAFGSTCS-DESIANEVRAILESEERRKDAEFEIKDVQYINAEVKLVKCFVQ 145
Query: 126 NFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISS 185
N VVDI+FNQ+GGL TLCFL++VD +NHLFKRSI+LIKAWCYYESRILG HHGLIS+
Sbjct: 146 NIVVDISFNQIGGLYTLCFLEQVDQRFEKNHLFKRSIVLIKAWCYYESRILGAHHGLIST 205
Query: 186 YALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEP 245
YAL TLVLYIFH+F+ S GPL VLYRFL+++SKFDWDN +SL GP+ +S LPD+ +P
Sbjct: 206 YALETLVLYIFHLFHESLDGPLAVLYRFLDYYSKFDWDNRGISLHGPISLSSLPDLVTDP 265
Query: 246 PRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKG 305
P L + FL C + P E +PF K N++DPL+ +NNLGRSVSKG
Sbjct: 266 PGIHDDCFLEREEFLRECAQMFTVPPRHYERTTRPFPRKFLNIVDPLKPSNNLGRSVSKG 325
Query: 306 NFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
NF+RIR+AF A+ L ++L P + +EVNQFF +T R+
Sbjct: 326 NFYRIRSAFDLGARKLGKILQVPANSIVDEVNQFFRSTLKRN 367
>gi|356553166|ref|XP_003544929.1| PREDICTED: uncharacterized protein LOC100816328 [Glycine max]
Length = 779
Score = 374 bits (959), Expect = e-101, Method: Compositional matrix adjust.
Identities = 187/333 (56%), Positives = 247/333 (74%), Gaps = 2/333 (0%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDIDLG 74
E+ TAE+++RI+P ++ RR V YV+RLI C+VF +GSVPLKTYLPD DIDL
Sbjct: 45 EKTTAEILSRIRPTLAADRRRREVVDYVQRLIRYGARCEVFPYGSVPLKTYLPDGDIDLT 104
Query: 75 AFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFN 134
A S Q ++D VR +L EE NE +E+ VK+V++I AEVK++KC+V + VVDI+FN
Sbjct: 105 ALSC-QNIEDGLVSDVRAVLHGEEINEASEYEVKDVRFIDAEVKLVKCIVQDIVVDISFN 163
Query: 135 QLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLY 194
QLGGL TLCFL++VD L+ ++HLFKRSIILIKAWCYYESR+LG HHGLIS+YAL TLVLY
Sbjct: 164 QLGGLSTLCFLEKVDRLVAKDHLFKRSIILIKAWCYYESRVLGAHHGLISTYALETLVLY 223
Query: 195 IFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVLL 254
IFH F+ S GPL VLYRFL++FSKFDWDN+C+SL GPV S P++ AE P ++GG L
Sbjct: 224 IFHQFHVSLDGPLAVLYRFLDYFSKFDWDNYCVSLKGPVGKSSPPNIVAEVP-ENGGNTL 282
Query: 255 LSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAF 314
L++ F+ SC +++ G + + F KH N+IDPL+ NNNLGRSV+KGNF+RIR+AF
Sbjct: 283 LTEEFIRSCVESFSLPSRGADLNLRAFPQKHLNIIDPLKENNNLGRSVNKGNFYRIRSAF 342
Query: 315 TFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
+ A+ L +L P + + E+ +FF NT +RH
Sbjct: 343 KYGARKLGWILMLPEDRITEELIRFFTNTLERH 375
>gi|356500940|ref|XP_003519288.1| PREDICTED: uncharacterized protein LOC100814626 [Glycine max]
Length = 780
Score = 372 bits (956), Expect = e-100, Method: Compositional matrix adjust.
Identities = 187/333 (56%), Positives = 246/333 (73%), Gaps = 2/333 (0%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDIDLG 74
E TAE++ RI+P ++ RR V YV+RLI C+VF +GSVPLKTYLPD DIDL
Sbjct: 45 ERNTAEILRRIRPTLAADRRRREVVDYVQRLIRYGARCEVFPYGSVPLKTYLPDGDIDLT 104
Query: 75 AFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFN 134
A S + ++D VR +L EE NE AE+ VK+V++I AEVK++KC+V + VVDI+FN
Sbjct: 105 ALSC-ENIEDGLVSDVRAVLHGEEINEAAEYEVKDVRFIDAEVKLVKCIVQDIVVDISFN 163
Query: 135 QLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLY 194
QLGGL TLCFL++VD L+ ++HLFKRSIILIKAWCYYESR+LG HHGLIS+YAL TLVLY
Sbjct: 164 QLGGLSTLCFLEKVDRLVAKDHLFKRSIILIKAWCYYESRVLGAHHGLISTYALETLVLY 223
Query: 195 IFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVLL 254
IFH F+ S GPL VLYRFL++FSKFDWDN+C+SL GPV + LP++ AE P ++GG L
Sbjct: 224 IFHQFHVSLDGPLAVLYRFLDYFSKFDWDNYCVSLKGPVSKTSLPNIVAEVP-ENGGNTL 282
Query: 255 LSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAF 314
L++ F+ SC +++ G + + F KH N+IDPL+ NNNLGRSV+KGNF+RIR+AF
Sbjct: 283 LTEEFIRSCVESFSVPSRGADLNLRAFPQKHLNIIDPLKENNNLGRSVNKGNFYRIRSAF 342
Query: 315 TFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
+ A+ L +L P + + E+ +FF NT +RH
Sbjct: 343 KYGARKLGWILRLPEDRIAEELIRFFANTLERH 375
>gi|357153090|ref|XP_003576335.1| PREDICTED: uncharacterized protein LOC100826374, partial
[Brachypodium distachyon]
Length = 769
Score = 372 bits (954), Expect = e-100, Method: Compositional matrix adjust.
Identities = 191/341 (56%), Positives = 236/341 (69%), Gaps = 7/341 (2%)
Query: 10 RWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDR 69
RW EE ++ RIQP SE RR AV YV+RL+ C+VF FGSVPLKTYLPD
Sbjct: 10 RWRAFEEAALGVVGRIQPSAPSEGRRAAVVHYVQRLVRHAVGCEVFPFGSVPLKTYLPDG 69
Query: 70 DIDLGAF---SDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDN 126
DIDL AF S D+ L A+ VR +LE+EE + AEF VK+VQYI AEVK++KCLV N
Sbjct: 70 DIDLTAFGSISSDENL----ANEVRAVLESEELRKDAEFEVKDVQYIHAEVKLVKCLVQN 125
Query: 127 FVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSY 186
VVDI+FNQ+GGLCTLCFL++VD + HLFK+SI+LIKAWCYYESRILG HHGLIS+Y
Sbjct: 126 IVVDISFNQIGGLCTLCFLEQVDQRFGKEHLFKKSIMLIKAWCYYESRILGAHHGLISTY 185
Query: 187 ALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPP 246
AL LVL IFH+F+ S GPL VLYRFL+++SKFDWDN +SL+GPV +S LP++ ++ P
Sbjct: 186 ALEILVLCIFHLFHKSLDGPLAVLYRFLDYYSKFDWDNKGISLYGPVLLSSLPELVSDAP 245
Query: 247 RKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGN 306
G L + FL C + P E + F K N++DPL+ NNNLGRSVSKGN
Sbjct: 246 VTHDGDFLKREEFLRECAQTFTVPPRNSEKNTRLFSRKFLNIVDPLKQNNNLGRSVSKGN 305
Query: 307 FFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
FFRIR+AF A+ L ++L + EVNQFF NT R+
Sbjct: 306 FFRIRSAFDLGARKLGKILKEASSSAVPEVNQFFRNTLKRN 346
>gi|326492351|dbj|BAK01959.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 724
Score = 370 bits (950), Expect = e-100, Method: Compositional matrix adjust.
Identities = 187/349 (53%), Positives = 238/349 (68%), Gaps = 8/349 (2%)
Query: 6 LDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLI-------IQCFPCQVFTFG 58
+ P W E ++ RIQP SE RR AV YV+RL+ + P VF FG
Sbjct: 27 ISPDAWAPFEAAALGVVGRIQPTVASEGRRAAVVDYVQRLVKCSVGCSVPVTPFPVFPFG 86
Query: 59 SVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVK 118
SVPLKTYLPD DIDL AF + ++ A+ VR +LE+EE+ + AEF +K+VQYI AEVK
Sbjct: 87 SVPLKTYLPDGDIDLAAFGSTCS-DESIANEVRAILESEERRKDAEFEIKDVQYINAEVK 145
Query: 119 IIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGG 178
++KC V N VVDI+FNQ+GGL TLCFL++VD +NHLFKRSI+LIKAWCYYESRILG
Sbjct: 146 LVKCFVQNIVVDISFNQIGGLYTLCFLEQVDQRFEKNHLFKRSIVLIKAWCYYESRILGA 205
Query: 179 HHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLL 238
HHGLIS+YAL TLVLYIFH+F+ S GPL VLYRFL+++SKFDWDN +SL GP+ +S L
Sbjct: 206 HHGLISTYALETLVLYIFHLFHESLDGPLAVLYRFLDYYSKFDWDNRGISLHGPISLSSL 265
Query: 239 PDVTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNL 298
PD+ +PP L + FL C + P E +PF K N++DPL+ +NNL
Sbjct: 266 PDLVTDPPGIHDDCFLEREEFLRECAQMFTVPPRHYERTTRPFPRKFLNIVDPLKPSNNL 325
Query: 299 GRSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
GRSVSKGNF+RIR+AF A+ L ++L P + +EVNQFF +T R+
Sbjct: 326 GRSVSKGNFYRIRSAFDLGARKLGKILQVPANSIVDEVNQFFRSTLKRN 374
>gi|357155485|ref|XP_003577136.1| PREDICTED: uncharacterized protein LOC100840351 [Brachypodium
distachyon]
Length = 739
Score = 369 bits (948), Expect = e-100, Method: Compositional matrix adjust.
Identities = 191/342 (55%), Positives = 238/342 (69%), Gaps = 1/342 (0%)
Query: 6 LDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTY 65
+ P W E +I RIQP SE R +V Y++RL+ CQVF FGSVPLKTY
Sbjct: 25 VSPEVWEPLEAAALAVIGRIQPTIPSEGLRASVVDYIQRLVRCSVGCQVFPFGSVPLKTY 84
Query: 66 LPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVD 125
LPD DIDL AF + ++ A+ VR +LE EE+ E AEF VK+VQYI AEVK++KC V
Sbjct: 85 LPDGDIDLTAFGSTYS-DESLANEVRAILEAEERREDAEFEVKDVQYIHAEVKLVKCFVQ 143
Query: 126 NFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISS 185
N VVDI+FNQ+GGLCTLCFL++VD +NHLFKRSIILIKAWCYYESRILG HHGLIS+
Sbjct: 144 NIVVDISFNQMGGLCTLCFLEQVDQRFEKNHLFKRSIILIKAWCYYESRILGAHHGLIST 203
Query: 186 YALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEP 245
YAL TLVLYIFH+F+ S GPL VLYRFL+++SKFDWDN +SL+GPV +S LP++ EP
Sbjct: 204 YALETLVLYIFHLFHESLDGPLAVLYRFLDYYSKFDWDNKGISLYGPVSLSSLPELVTEP 263
Query: 246 PRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKG 305
L + FL C + P E +PF K+FN++DPL+ +NNLGRSVSKG
Sbjct: 264 TGTHDDSFLQREEFLKECAKMFTVPPRLNEKNTRPFYQKYFNIVDPLKQSNNLGRSVSKG 323
Query: 306 NFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
NF+RIR+AF A+ L ++L P +EVNQFF +T R+
Sbjct: 324 NFYRIRSAFDLGARKLGKILQMPANSTVDEVNQFFKSTLKRN 365
>gi|449526634|ref|XP_004170318.1| PREDICTED: uncharacterized LOC101207419 [Cucumis sativus]
Length = 816
Score = 369 bits (948), Expect = e-100, Method: Compositional matrix adjust.
Identities = 180/295 (61%), Positives = 226/295 (76%), Gaps = 1/295 (0%)
Query: 53 QVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQY 112
QVF FGSVPLKTYLPD DIDL A +++ A V +L +E++N AEF VK+VQ
Sbjct: 4 QVFPFGSVPLKTYLPDGDIDLTALGG-SNVEEALASDVCSVLNSEDQNGAAEFVVKDVQL 62
Query: 113 IQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYE 172
I+AEVK++KCLV N VVDI+FNQLGGLCTLCFL+++D I ++HLFKRSIILIKAWCYYE
Sbjct: 63 IRAEVKLVKCLVQNIVVDISFNQLGGLCTLCFLEKIDRRIGKDHLFKRSIILIKAWCYYE 122
Query: 173 SRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGP 232
SRILG HHGLIS+YAL TLVLYIFH+F+ + GPL+VLY+FL++FSKFDWDN+C+SL GP
Sbjct: 123 SRILGAHHGLISTYALETLVLYIFHLFHSALNGPLQVLYKFLDYFSKFDWDNYCISLNGP 182
Query: 233 VPISLLPDVTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPL 292
V IS LP++ AE P GG LLLS FL SC ++ G E + F KH N++DPL
Sbjct: 183 VRISSLPELVAETPDNGGGDLLLSTDFLQSCLETFSVPARGYEANSRAFPIKHLNIVDPL 242
Query: 293 RVNNNLGRSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
+ NNNLGRSVSKGNF+RIR+AF++ A+ L +L P +++ +EV +FF NT DRH
Sbjct: 243 KENNNLGRSVSKGNFYRIRSAFSYGARKLGFILSHPEDNVVDEVRKFFSNTLDRH 297
>gi|168037604|ref|XP_001771293.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162677382|gb|EDQ63853.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 2035
Score = 367 bits (942), Expect = 5e-99, Method: Compositional matrix adjust.
Identities = 193/373 (51%), Positives = 240/373 (64%), Gaps = 41/373 (10%)
Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQV---------------- 54
W +AE TAELI ++P SEERR AV +V RLI F C+V
Sbjct: 587 WTRAEGQTAELIDSLKPTRLSEERRTAVTGFVERLIRDRFECEVSALPHELNGFIVRSSA 646
Query: 55 --------FTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFR 106
FGSVPLKTYLPD DIDL F+ + LK+TWA V L+ E + AEFR
Sbjct: 647 GAVRYSAVIRFGSVPLKTYLPDGDIDLYIFARND-LKETWAQDVLKALKQAEDDADAEFR 705
Query: 107 VKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIK 166
VKEVQYIQAEVK+IKCLV+N VVDI+FNQ+GGL TLCFL+ VD + NHLFKRS+IL+K
Sbjct: 706 VKEVQYIQAEVKLIKCLVENIVVDISFNQIGGLSTLCFLERVDEEVGLNHLFKRSVILVK 765
Query: 167 AWCYYESRILGGHHGLISSYALVTLVLYIFHVFNG--SFAGPLEVLYRFLEFFSKFDWDN 224
AWCYYESRILG HHGLIS++AL TLVLYIFHVF+ S GPLEVLY FL +F FDWD
Sbjct: 766 AWCYYESRILGAHHGLISTFALETLVLYIFHVFHSMRSLHGPLEVLYLFLTYFCNFDWDQ 825
Query: 225 FCLSLWGPVPISLLPDVTAEPPRKD-------------GGVLLLSKSFLDSCRYAYADFP 271
+CLS+WGPVP+ +P ++E +KD GG L S+ F++ C Y+D
Sbjct: 826 YCLSIWGPVPLDHIPKNSSELSQKDGGWRTVARSPWEVGGKLYFSEEFIEECINRYSDVR 885
Query: 272 GGQE-NQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKGLARLLDCPNE 330
G E +QG+ F K+ NV+DP+R NNLGRSV+ G+F RIR+AF A+ L + +CP +
Sbjct: 886 AGSESSQGRIFNPKYLNVLDPIRHTNNLGRSVNVGSFKRIRSAFGLGARTLGEVFECPKD 945
Query: 331 DLYNEVNQFFMNT 343
+ + FF T
Sbjct: 946 QITEKFKSFFSCT 958
>gi|255564100|ref|XP_002523048.1| nucleic acid binding protein, putative [Ricinus communis]
gi|223537731|gb|EEF39352.1| nucleic acid binding protein, putative [Ricinus communis]
Length = 644
Score = 358 bits (918), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 178/342 (52%), Positives = 238/342 (69%), Gaps = 3/342 (0%)
Query: 6 LDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTY 65
+D WL AE+ T E++ +QP SE++R V Y++RLI + +VF FGSVPLKTY
Sbjct: 28 IDSELWLMAEKRTQEILWVLQPSSSSEQKRKEVIDYIQRLIKHHYATEVFPFGSVPLKTY 87
Query: 66 LPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVD 125
LPD DIDL A S Q +++ A V D+L E+N +E VK+V+YIQA+VK++KC V
Sbjct: 88 LPDGDIDLTALSH-QNMEEDLAREVCDILTYAEQNLESE--VKDVRYIQAQVKVVKCSVK 144
Query: 126 NFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISS 185
N VDI+FNQ+ GLC LCFL++VD LI ++HL K SIILIKAWC+YESRILG HHGL+S+
Sbjct: 145 NISVDISFNQMAGLCALCFLEQVDQLIGKDHLLKHSIILIKAWCFYESRILGAHHGLLST 204
Query: 186 YALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEP 245
YAL LVLYI +VF+ S GPL VLYRFLE++S FDWDN+C+++ GPV IS LP++ E
Sbjct: 205 YALEILVLYIVNVFHSSLPGPLAVLYRFLEYYSTFDWDNYCVTINGPVAISSLPEIMTEA 264
Query: 246 PRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKG 305
P + LLL+ FL C+ ++ EN G F KH N++DPL+ +NNLGRSVSKG
Sbjct: 265 PYSNRNELLLTPEFLKRCKERFSVPIKAVENGGHEFSIKHLNILDPLKDSNNLGRSVSKG 324
Query: 306 NFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
NF RI+ A ++ A+ L +L P E++ + FF+NT DR+
Sbjct: 325 NFHRIKCALSYGAQRLGEILMLPGENMGAGLENFFINTLDRN 366
>gi|255559667|ref|XP_002520853.1| nucleic acid binding protein, putative [Ricinus communis]
gi|223539984|gb|EEF41562.1| nucleic acid binding protein, putative [Ricinus communis]
Length = 655
Score = 355 bits (912), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 176/342 (51%), Positives = 235/342 (68%), Gaps = 3/342 (0%)
Query: 6 LDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTY 65
+D WL AE+ E++ +QP SE++R V Y++RLI F +V FGSVPLKTY
Sbjct: 28 IDSELWLMAEKRAQEILWILQPSLASEQKRKVVIDYIQRLIKHHFATEVLPFGSVPLKTY 87
Query: 66 LPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVD 125
LPD DIDL A S Q +++ + ++L EE+N +E VK+V+YIQA+VKI+KC V
Sbjct: 88 LPDGDIDLTALSH-QNMEEDLVREICNILTYEEQNSESE--VKDVRYIQAQVKIVKCSVK 144
Query: 126 NFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISS 185
N VDI+FNQ+ GLC LCFL++VD LI ++HL K SIILIKAWC+YESRILG HHGL+S+
Sbjct: 145 NISVDISFNQMAGLCALCFLEQVDQLIGKDHLLKCSIILIKAWCFYESRILGAHHGLLST 204
Query: 186 YALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEP 245
YAL LVLYI + F+ S GPL VLYRFLE++S FDWDN+C+++ GPV +S LP++ E
Sbjct: 205 YALEILVLYIINAFHSSLPGPLAVLYRFLEYYSTFDWDNYCVTINGPVAVSSLPEIMTES 264
Query: 246 PRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKG 305
P +G LLL FL C+ ++ EN G F KH N++DPL+ NNNLGRSVSKG
Sbjct: 265 PYNNGNELLLCPEFLKRCKEKFSVPIKAVENGGHEFSIKHLNILDPLKDNNNLGRSVSKG 324
Query: 306 NFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
NF RI+ A ++ A+ L +L P E++ + FF+NT DR+
Sbjct: 325 NFHRIKCALSYGAQRLGEILALPGENMGAGLEIFFINTLDRN 366
>gi|293332253|ref|NP_001168029.1| uncharacterized protein LOC100381756 [Zea mays]
gi|223945595|gb|ACN26881.1| unknown [Zea mays]
gi|413924674|gb|AFW64606.1| hypothetical protein ZEAMMB73_425366 [Zea mays]
Length = 833
Score = 355 bits (911), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 180/342 (52%), Positives = 234/342 (68%), Gaps = 1/342 (0%)
Query: 6 LDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTY 65
+ P W + E ++ +IQP SE R AV YV+RL QVF FGSVPLKTY
Sbjct: 23 VSPDAWRRFETAALAVVNKIQPTAASEHLRAAVVDYVQRLFWFQARYQVFPFGSVPLKTY 82
Query: 66 LPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVD 125
LPD DIDL F + A+ V +L++EE+ + +EF VK+VQY+ AEVK++KCLV
Sbjct: 83 LPDGDIDLTLFGP-AISDENLANEVCTILKSEERRKDSEFEVKDVQYVPAEVKLVKCLVQ 141
Query: 126 NFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISS 185
N VVDI+ NQ+GGLCTLCFL++VD ++HLFK+SIILIK WCYYESRILG HHGLIS+
Sbjct: 142 NIVVDISVNQIGGLCTLCFLEKVDQHFGKDHLFKKSIILIKDWCYYESRILGAHHGLIST 201
Query: 186 YALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEP 245
YAL TLVLYIFH+F+ S GPL VLYRFL+++SKFDWDN +SL+GPV +S LP++ +P
Sbjct: 202 YALETLVLYIFHIFHKSLDGPLAVLYRFLDYYSKFDWDNKGISLFGPVSLSSLPELVTDP 261
Query: 246 PRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKG 305
P L + FL C +++ P E + F + N++DPL+ +NNLGRSVSKG
Sbjct: 262 PDIQDDDFLQREEFLKECIESFSVLPRNSETNPRLFSRRFLNIVDPLKQSNNLGRSVSKG 321
Query: 306 NFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
NF+RIR+AF F A+ L ++L P+ EVNQFF NT R+
Sbjct: 322 NFYRIRSAFDFGARKLGKILQVPSCLTVGEVNQFFRNTLKRN 363
>gi|413924673|gb|AFW64605.1| hypothetical protein ZEAMMB73_425366 [Zea mays]
Length = 815
Score = 355 bits (910), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 180/342 (52%), Positives = 234/342 (68%), Gaps = 1/342 (0%)
Query: 6 LDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTY 65
+ P W + E ++ +IQP SE R AV YV+RL QVF FGSVPLKTY
Sbjct: 23 VSPDAWRRFETAALAVVNKIQPTAASEHLRAAVVDYVQRLFWFQARYQVFPFGSVPLKTY 82
Query: 66 LPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVD 125
LPD DIDL F + A+ V +L++EE+ + +EF VK+VQY+ AEVK++KCLV
Sbjct: 83 LPDGDIDLTLFGP-AISDENLANEVCTILKSEERRKDSEFEVKDVQYVPAEVKLVKCLVQ 141
Query: 126 NFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISS 185
N VVDI+ NQ+GGLCTLCFL++VD ++HLFK+SIILIK WCYYESRILG HHGLIS+
Sbjct: 142 NIVVDISVNQIGGLCTLCFLEKVDQHFGKDHLFKKSIILIKDWCYYESRILGAHHGLIST 201
Query: 186 YALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEP 245
YAL TLVLYIFH+F+ S GPL VLYRFL+++SKFDWDN +SL+GPV +S LP++ +P
Sbjct: 202 YALETLVLYIFHIFHKSLDGPLAVLYRFLDYYSKFDWDNKGISLFGPVSLSSLPELVTDP 261
Query: 246 PRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKG 305
P L + FL C +++ P E + F + N++DPL+ +NNLGRSVSKG
Sbjct: 262 PDIQDDDFLQREEFLKECIESFSVLPRNSETNPRLFSRRFLNIVDPLKQSNNLGRSVSKG 321
Query: 306 NFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
NF+RIR+AF F A+ L ++L P+ EVNQFF NT R+
Sbjct: 322 NFYRIRSAFDFGARKLGKILQVPSCLTVGEVNQFFRNTLKRN 363
>gi|108708029|gb|ABF95824.1| expressed protein [Oryza sativa Japonica Group]
Length = 1004
Score = 350 bits (897), Expect = 7e-94, Method: Compositional matrix adjust.
Identities = 179/337 (53%), Positives = 231/337 (68%), Gaps = 12/337 (3%)
Query: 13 KAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDID 72
+AEE E++ R++P SE RR AV Y RRL+ C+VF +GSVPLKTYLPD D+D
Sbjct: 35 RAEEAAGEVVRRVRPTEASERRRAAVVGYARRLVGTALGCEVFAYGSVPLKTYLPDGDVD 94
Query: 73 L---GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVV 129
L G S TL D H+ L++EE+N AEF VK++Q I AEV++IKC ++N VV
Sbjct: 95 LTVLGNTSYGSTLIDDIYHI----LQSEEQNCDAEFEVKDLQLINAEVRLIKCTIENIVV 150
Query: 130 DIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALV 189
DI+FNQ GG+C LCFL+ VD + +NHL K SIILIKAWCYYESR+LG HHGLIS+YAL
Sbjct: 151 DISFNQTGGICALCFLELVDRKVGKNHLVKNSIILIKAWCYYESRLLGAHHGLISTYALE 210
Query: 190 TLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKD 249
TL+LYIF++F+ S GPLEVLYRFLE+FSKFDWDN+C+SL GPV +S LP+ E
Sbjct: 211 TLILYIFNLFHKSLHGPLEVLYRFLEYFSKFDWDNYCISLNGPVALSSLPNQIVEATNTP 270
Query: 250 GGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFR 309
G LL K FL++ E F SK+ N+IDPL+ +NNLGRSV+K +F R
Sbjct: 271 GSDLLFDKEFLNNSVQKTDSNACNTE-----FRSKYLNIIDPLKEHNNLGRSVNKASFNR 325
Query: 310 IRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDR 346
IRTAF++ A+ L ++L E + +E+ FF NT +R
Sbjct: 326 IRTAFSYGAQKLGQVLLLQPELIPDEIYGFFKNTLNR 362
>gi|242069725|ref|XP_002450139.1| hypothetical protein SORBIDRAFT_05g001080 [Sorghum bicolor]
gi|241935982|gb|EES09127.1| hypothetical protein SORBIDRAFT_05g001080 [Sorghum bicolor]
Length = 835
Score = 350 bits (897), Expect = 7e-94, Method: Compositional matrix adjust.
Identities = 181/345 (52%), Positives = 237/345 (68%), Gaps = 7/345 (2%)
Query: 6 LDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTY 65
+ P W + E ++ +IQP SE+ R AV YV+RL QVF FGSVPLKTY
Sbjct: 23 VSPDAWRRFETAALAVVNKIQPTAASEQLRAAVIEYVQRLFWFQARYQVFPFGSVPLKTY 82
Query: 66 LPDRDIDLGAFS---DDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKC 122
LPD DIDL F D+ L A+ V +L++EE+ + +EF VK+V Y+ AEVK++KC
Sbjct: 83 LPDGDIDLTLFGPAISDENL----ANEVCAILKSEERRKDSEFEVKDVHYVPAEVKLVKC 138
Query: 123 LVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGL 182
LV N VVDI+ NQ+GGLCTLCFL++VD +NHLFKRSI+L+K WCYYESRILG HHGL
Sbjct: 139 LVQNIVVDISVNQIGGLCTLCFLEKVDQNFGKNHLFKRSIMLVKDWCYYESRILGAHHGL 198
Query: 183 ISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVT 242
IS+YAL TLVLYIFH+F+ S GPL VLYRFL+++SKFDWDN +SL+GPV +S LP++
Sbjct: 199 ISTYALETLVLYIFHIFHKSLDGPLAVLYRFLDYYSKFDWDNKGISLFGPVSLSSLPELV 258
Query: 243 AEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSV 302
+PP L + FL C +++ P E + F + N++DPL+ +NNLGRSV
Sbjct: 259 TDPPDTQDDDFLQREEFLKECTESFSVLPRNSETNPRVFSRRFLNIVDPLKQSNNLGRSV 318
Query: 303 SKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
SKGNF+RIR+AF F A+ L ++L P+ +EVNQFF NT R+
Sbjct: 319 SKGNFYRIRSAFDFGARKLGKILQVPSCLTVSEVNQFFRNTLKRN 363
>gi|115452887|ref|NP_001050044.1| Os03g0336700 [Oryza sativa Japonica Group]
gi|108708028|gb|ABF95823.1| expressed protein [Oryza sativa Japonica Group]
gi|113548515|dbj|BAF11958.1| Os03g0336700 [Oryza sativa Japonica Group]
Length = 1035
Score = 350 bits (897), Expect = 7e-94, Method: Compositional matrix adjust.
Identities = 179/337 (53%), Positives = 231/337 (68%), Gaps = 12/337 (3%)
Query: 13 KAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDID 72
+AEE E++ R++P SE RR AV Y RRL+ C+VF +GSVPLKTYLPD D+D
Sbjct: 35 RAEEAAGEVVRRVRPTEASERRRAAVVGYARRLVGTALGCEVFAYGSVPLKTYLPDGDVD 94
Query: 73 L---GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVV 129
L G S TL D H+ L++EE+N AEF VK++Q I AEV++IKC ++N VV
Sbjct: 95 LTVLGNTSYGSTLIDDIYHI----LQSEEQNCDAEFEVKDLQLINAEVRLIKCTIENIVV 150
Query: 130 DIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALV 189
DI+FNQ GG+C LCFL+ VD + +NHL K SIILIKAWCYYESR+LG HHGLIS+YAL
Sbjct: 151 DISFNQTGGICALCFLELVDRKVGKNHLVKNSIILIKAWCYYESRLLGAHHGLISTYALE 210
Query: 190 TLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKD 249
TL+LYIF++F+ S GPLEVLYRFLE+FSKFDWDN+C+SL GPV +S LP+ E
Sbjct: 211 TLILYIFNLFHKSLHGPLEVLYRFLEYFSKFDWDNYCISLNGPVALSSLPNQIVEATNTP 270
Query: 250 GGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFR 309
G LL K FL++ E F SK+ N+IDPL+ +NNLGRSV+K +F R
Sbjct: 271 GSDLLFDKEFLNNSVQKTDSNACNTE-----FRSKYLNIIDPLKEHNNLGRSVNKASFNR 325
Query: 310 IRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDR 346
IRTAF++ A+ L ++L E + +E+ FF NT +R
Sbjct: 326 IRTAFSYGAQKLGQVLLLQPELIPDEIYGFFKNTLNR 362
>gi|414882101|tpg|DAA59232.1| TPA: hypothetical protein ZEAMMB73_861907 [Zea mays]
Length = 906
Score = 348 bits (893), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 177/341 (51%), Positives = 231/341 (67%), Gaps = 9/341 (2%)
Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRD 70
W + E ++ IQP SE R A+ YV+RL+ QVF FGSVPLKTYLPD D
Sbjct: 29 WRRFESAALGILYTIQPSATSEHLRAAIIDYVQRLLASHSGVQVFPFGSVPLKTYLPDGD 88
Query: 71 IDLGAF----SDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDN 126
IDL F SD++ + A +L++EE + +EF VK+VQYI AEVK++KC+V N
Sbjct: 89 IDLTTFGPAISDEKLANEVCA-----ILKSEEHRKDSEFDVKDVQYIHAEVKLVKCVVQN 143
Query: 127 FVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSY 186
+VDI+ NQ+GGLCTLCFL++VD + HLFKRS++LIK WCYYE+RILG HHGLIS+Y
Sbjct: 144 IIVDISVNQIGGLCTLCFLEKVDENFGKKHLFKRSVMLIKDWCYYETRILGAHHGLISTY 203
Query: 187 ALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPP 246
AL LVLYIFH+F+ S GPL VLYRFL+++S+FDWD +SL+GPV +S LPD+ +PP
Sbjct: 204 ALEILVLYIFHIFHKSLNGPLAVLYRFLDYYSQFDWDAKGISLFGPVSLSSLPDLVTDPP 263
Query: 247 RKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGN 306
LL + FL C A++ P E Q F K N++DPL+ +NNLGRSVS+GN
Sbjct: 264 VIHDDGFLLREKFLRECADAFSVPPRNSEKDAQLFSRKFLNIVDPLKQSNNLGRSVSRGN 323
Query: 307 FFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
F+RIR+AF F A+ L ++L P +EVNQFF NT R+
Sbjct: 324 FYRIRSAFDFGARKLGKILQRPVCYTVDEVNQFFGNTLKRN 364
>gi|414882102|tpg|DAA59233.1| TPA: hypothetical protein ZEAMMB73_861907 [Zea mays]
Length = 875
Score = 348 bits (893), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 177/341 (51%), Positives = 231/341 (67%), Gaps = 9/341 (2%)
Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRD 70
W + E ++ IQP SE R A+ YV+RL+ QVF FGSVPLKTYLPD D
Sbjct: 29 WRRFESAALGILYTIQPSATSEHLRAAIIDYVQRLLASHSGVQVFPFGSVPLKTYLPDGD 88
Query: 71 IDLGAF----SDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDN 126
IDL F SD++ + A +L++EE + +EF VK+VQYI AEVK++KC+V N
Sbjct: 89 IDLTTFGPAISDEKLANEVCA-----ILKSEEHRKDSEFDVKDVQYIHAEVKLVKCVVQN 143
Query: 127 FVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSY 186
+VDI+ NQ+GGLCTLCFL++VD + HLFKRS++LIK WCYYE+RILG HHGLIS+Y
Sbjct: 144 IIVDISVNQIGGLCTLCFLEKVDENFGKKHLFKRSVMLIKDWCYYETRILGAHHGLISTY 203
Query: 187 ALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPP 246
AL LVLYIFH+F+ S GPL VLYRFL+++S+FDWD +SL+GPV +S LPD+ +PP
Sbjct: 204 ALEILVLYIFHIFHKSLNGPLAVLYRFLDYYSQFDWDAKGISLFGPVSLSSLPDLVTDPP 263
Query: 247 RKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGN 306
LL + FL C A++ P E Q F K N++DPL+ +NNLGRSVS+GN
Sbjct: 264 VIHDDGFLLREKFLRECADAFSVPPRNSEKDAQLFSRKFLNIVDPLKQSNNLGRSVSRGN 323
Query: 307 FFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
F+RIR+AF F A+ L ++L P +EVNQFF NT R+
Sbjct: 324 FYRIRSAFDFGARKLGKILQRPVCYTVDEVNQFFGNTLKRN 364
>gi|357112328|ref|XP_003557961.1| PREDICTED: uncharacterized protein LOC100823912 [Brachypodium
distachyon]
Length = 1051
Score = 348 bits (892), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 180/330 (54%), Positives = 227/330 (68%), Gaps = 7/330 (2%)
Query: 21 LIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDIDL---GAFS 77
++ R+QP SE RR V Y RR++ C+VF FGSVPLKTYLPD DIDL G S
Sbjct: 38 VVRRVQPTEASERRRAEVIDYARRIVGTALGCEVFAFGSVPLKTYLPDGDIDLTVLGNAS 97
Query: 78 DDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLG 137
D TL D V +L + E+N AEF VK++++I AEVK+IKC ++N +VDI+FNQ G
Sbjct: 98 CDSTLIDD----VYCILGSGEQNSDAEFEVKDLEHIDAEVKLIKCTIENIIVDISFNQTG 153
Query: 138 GLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFH 197
G+C LCFL+ VD I +NHLFKRSIILIKAWCYYESR+LG HHGLIS+YAL TL+LYIF+
Sbjct: 154 GICALCFLELVDRKIGKNHLFKRSIILIKAWCYYESRLLGAHHGLISTYALETLILYIFN 213
Query: 198 VFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVLLLSK 257
+F+ S GPLEVLYRFLE+FSKFDWDN+C+SL GPV +S LP++ E LL K
Sbjct: 214 LFHKSLHGPLEVLYRFLEYFSKFDWDNYCISLNGPVALSSLPNLIVEGTNIPVDDLLFDK 273
Query: 258 SFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFR 317
FL S + P + + F KH N+IDPL+ NNLGRSV+K NF RIRTAF++
Sbjct: 274 EFLHSSVEKASVPPRDSDARCTKFRVKHLNIIDPLKECNNLGRSVNKANFSRIRTAFSYG 333
Query: 318 AKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
A+ L + L P+E + E+ FF NT R+
Sbjct: 334 ARKLGQYLMLPSERISGEIFGFFKNTLKRN 363
>gi|242041009|ref|XP_002467899.1| hypothetical protein SORBIDRAFT_01g036080 [Sorghum bicolor]
gi|241921753|gb|EER94897.1| hypothetical protein SORBIDRAFT_01g036080 [Sorghum bicolor]
Length = 1046
Score = 343 bits (881), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 171/328 (52%), Positives = 225/328 (68%), Gaps = 1/328 (0%)
Query: 20 ELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSDD 79
E++ R++P SE RR V Y RRL+ C+VF FGSVPLKTYLPD DIDL +
Sbjct: 35 EVVRRVRPTEASERRRADVVDYARRLVGSALGCEVFAFGSVPLKTYLPDGDIDLTVLGN- 93
Query: 80 QTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGL 139
+ T + V +LE+EE+N AEF VK ++ I AEV++IKC + N ++DI+FNQ GG+
Sbjct: 94 TSYDSTLVNDVYCILESEEQNSDAEFIVKNLERIDAEVRLIKCTIGNIIIDISFNQTGGI 153
Query: 140 CTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVF 199
C LCFL+ VD + +NHLFKRSIILIKAWCYYESR+LG HHGLIS+YAL L+LYIF++F
Sbjct: 154 CALCFLELVDRKVGKNHLFKRSIILIKAWCYYESRLLGAHHGLISTYALEVLILYIFNLF 213
Query: 200 NGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVLLLSKSF 259
+ S PLEVLYRFLE+FSKFDWDN+C+SL GPV +S LP++T E LL K F
Sbjct: 214 HKSLHSPLEVLYRFLEYFSKFDWDNYCISLNGPVALSSLPNLTVEATITHTSDLLFDKEF 273
Query: 260 LDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRAK 319
L S P ++ F KH N++DPL+ +NNLGRSV++ +F RIRTAF + A+
Sbjct: 274 LKSSMDKATVPPKNSDSCYTRFRPKHLNIVDPLKEHNNLGRSVNRASFNRIRTAFLYGAR 333
Query: 320 GLARLLDCPNEDLYNEVNQFFMNTRDRH 347
L +L P+E + +E+ FF NT +R+
Sbjct: 334 KLGHILMLPSEVIPDEIYGFFKNTLERN 361
>gi|414888115|tpg|DAA64129.1| TPA: hypothetical protein ZEAMMB73_121752 [Zea mays]
Length = 942
Score = 343 bits (879), Expect = 9e-92, Method: Compositional matrix adjust.
Identities = 176/332 (53%), Positives = 231/332 (69%), Gaps = 11/332 (3%)
Query: 19 AELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSD 78
E++ R+ P +E RR V AY+RRLI C C+VF FGSVPL+TYLPD D+D+ +
Sbjct: 68 GEVVLRVHPTREAERRRQDVIAYLRRLIGSCLGCEVFAFGSVPLRTYLPDGDVDITVLGN 127
Query: 79 DQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGG 138
L T+ VR ML++E++N AEF++ +Q+I AEVK+IKC+++N +VD++FNQ+GG
Sbjct: 128 TW-LNSTFIDDVRSMLQSEQENCDAEFKLTGLQFINAEVKLIKCVIENIIVDVSFNQIGG 186
Query: 139 LCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHV 198
+ T CFL+ VD I +NHLFKRSI+LIKAWCY+ESRILG HHGLIS+YAL TLVLYIF++
Sbjct: 187 VSTFCFLELVDRQIGQNHLFKRSIMLIKAWCYHESRILGAHHGLISTYALETLVLYIFNM 246
Query: 199 FNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVLL---L 255
F+ S GPLE LYRFLE+FSKFDWD + +SL G V +S L T EP G LL L
Sbjct: 247 FHKSLHGPLEALYRFLEYFSKFDWDRYGISLNGQVDLSSL---TVEPTDVQGESLLGKEL 303
Query: 256 SKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFT 315
+ +LD +F G G F K N+IDPL+ NNNLGRSVSK NF+RIR+AF+
Sbjct: 304 QQGYLDRLVVIPNEFDGC----GTQFRQKFLNIIDPLKANNNLGRSVSKANFYRIRSAFS 359
Query: 316 FRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
F A+ L ++L P+E + +E+ FF NT RH
Sbjct: 360 FGAQKLGQILLLPSEYIRDEIYGFFANTLKRH 391
>gi|414591190|tpg|DAA41761.1| TPA: hypothetical protein ZEAMMB73_453733 [Zea mays]
Length = 918
Score = 341 bits (875), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 171/334 (51%), Positives = 227/334 (67%), Gaps = 2/334 (0%)
Query: 14 AEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDIDL 73
AE E++ R+ P +E RR V AY+ RLI C+VF FGSVPL+TYLPD D+D+
Sbjct: 75 AEAAAGEVLLRVHPTREAERRRQDVIAYLTRLIGSSLGCEVFAFGSVPLRTYLPDGDVDI 134
Query: 74 GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAF 133
+ L T VR ML++E++N AE ++ + +I AEVK+IKC+++N +VD++F
Sbjct: 135 TVLGNTW-LNSTLIDDVRSMLQSEQENCDAELKLTGLHFIDAEVKLIKCVIENIIVDVSF 193
Query: 134 NQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVL 193
NQ+GG+ T CFL+ VD + +NHLFKRSI+L KAWCY+ESRILG HHGLIS+YAL TLVL
Sbjct: 194 NQIGGVSTFCFLELVDRQVGKNHLFKRSIMLTKAWCYHESRILGAHHGLISTYALETLVL 253
Query: 194 YIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVL 253
YIF++F+ S GPLEVLY+FLE+FSKFDWD + +SL GPV +S LP +T EP G L
Sbjct: 254 YIFNMFHKSLHGPLEVLYKFLEYFSKFDWDRYGISLNGPVDLSSLPSLTVEPTEVQ-GEL 312
Query: 254 LLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTA 313
LL K F P + F K N++DPL+ NNNLGRSVSK NF+RIR+A
Sbjct: 313 LLGKDFHQGSLDRLVVIPNEFDGCDTQFRQKFLNIVDPLKANNNLGRSVSKANFYRIRSA 372
Query: 314 FTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
F+F A+ L ++L P+E + +E+ FF NT RH
Sbjct: 373 FSFGAQKLGQILLLPSEYICDEIYGFFSNTLKRH 406
>gi|224146203|ref|XP_002325920.1| predicted protein [Populus trichocarpa]
gi|222862795|gb|EEF00302.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 339 bits (869), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 175/343 (51%), Positives = 233/343 (67%), Gaps = 7/343 (2%)
Query: 6 LDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTY 65
+DP WL AE+ T E++ IQP SE +R V Y++ LI F +VF FGSVPLKTY
Sbjct: 28 IDPELWLMAEKRTQEILYTIQPTFASEHKRMEVINYIQSLIKYYFTVEVFAFGSVPLKTY 87
Query: 66 LPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVD 125
LPD DIDL S Q +++ A V +L+ EE + EF+V +VQYI A+VK++KC V
Sbjct: 88 LPDGDIDLMVLSH-QNMEEELARGVCTLLQREELD--PEFQVNDVQYIHAQVKLVKCSVK 144
Query: 126 NFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISS 185
N VDI+FNQ+ G LCFL++VD LI ++HLFKRSIILIKAWC+YESRILG HHGLIS+
Sbjct: 145 NISVDISFNQMAGPSALCFLEQVDQLIGQDHLFKRSIILIKAWCFYESRILGAHHGLIST 204
Query: 186 YALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEP 245
YAL LVL I +VF+ S PL VLY+FL+++S FDWDN+C+S+ GP+PIS P +
Sbjct: 205 YALQILVLNIINVFHSSLPDPLAVLYKFLDYYSAFDWDNYCVSINGPIPISSFPQTDST- 263
Query: 246 PRKDGGVLLLSKSFLDSCRYAYADFPGGQ-ENQGQPFVSKHFNVIDPLRVNNNLGRSVSK 304
+G L+S+ FL + R +A FP + EN F KH N++DPL+ +NNLGRSV+K
Sbjct: 264 -HNNGNESLISQEFLRNFREKFA-FPMKELENGAHEFPIKHLNIVDPLKSSNNLGRSVNK 321
Query: 305 GNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
GNF RIR A ++ A+ L ++ P E + + +FFMNT DR+
Sbjct: 322 GNFHRIRGALSYGAQRLGEIIALPGEAMGGRLEKFFMNTLDRN 364
>gi|42565972|ref|NP_191191.2| PAP/OAS1 substrate-binding domain-containing protein [Arabidopsis
thaliana]
gi|30725328|gb|AAP37686.1| At3g56320 [Arabidopsis thaliana]
gi|110736147|dbj|BAF00045.1| hypothetical protein [Arabidopsis thaliana]
gi|332645988|gb|AEE79509.1| PAP/OAS1 substrate-binding domain-containing protein [Arabidopsis
thaliana]
Length = 603
Score = 338 bits (866), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 168/343 (48%), Positives = 231/343 (67%), Gaps = 4/343 (1%)
Query: 5 PLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKT 64
P+D W+ AEE E++ IQP S+ RN + YVR LI+ +VF+FGSVPLKT
Sbjct: 34 PIDADSWMIAEERAHEILCTIQPALVSDRSRNEIIDYVRTLIMSHEGIEVFSFGSVPLKT 93
Query: 65 YLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLV 124
YLPD DIDL + D + L L+NEE+ +EF +VQ+I A+VK+IKC +
Sbjct: 94 YLPDGDIDLTVLTKQNMDDDFYGQLC-SRLQNEER--ESEFHATDVQFIPAQVKVIKCNI 150
Query: 125 DNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLIS 184
N VDI+FNQ GLC LCFL++VD L +HLFKRSIIL+KAWCYYESRILG + GLIS
Sbjct: 151 RNIAVDISFNQTAGLCALCFLEQVDQLFGRDHLFKRSIILVKAWCYYESRILGANTGLIS 210
Query: 185 SYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAE 244
+YAL LVLYI ++F+ S +GPL VLY+FL+++ FDW+N+C+S+ GPVPIS LP++TA
Sbjct: 211 TYALAVLVLYIINLFHSSLSGPLAVLYKFLDYYGSFDWNNYCISVNGPVPISSLPELTAA 270
Query: 245 PPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSK 304
P ++G LLL + FL +C Y+ ++ G F KH N++DPL+ +NNLG+SV++
Sbjct: 271 SP-ENGHELLLDEKFLRNCVELYSAPTKAVDSNGLEFPIKHLNIVDPLKYSNNLGKSVTQ 329
Query: 305 GNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
GN RIR AFT A+ L +L P + + + +FF N+ +R+
Sbjct: 330 GNVQRIRHAFTLGARKLRDVLSLPGDTMGWRLEKFFRNSLERN 372
>gi|326517667|dbj|BAK03752.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 334
Score = 337 bits (865), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 170/302 (56%), Positives = 213/302 (70%), Gaps = 1/302 (0%)
Query: 6 LDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTY 65
+ G W E+ A ++ RIQP SE+RR AV YV+RLI C+VF FGSVPLKTY
Sbjct: 30 ISAGAWRPFEDAAAAVVGRIQPSVSSEDRRAAVVHYVQRLIRCSVGCEVFPFGSVPLKTY 89
Query: 66 LPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVD 125
LPD DIDL AF + + A+ VR +LE+EE + AEF VK+VQYI AEVK++KCLV
Sbjct: 90 LPDGDIDLTAFGSASS-DENLANEVRAVLESEELRKDAEFEVKDVQYIHAEVKLVKCLVQ 148
Query: 126 NFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISS 185
N VVDI+FNQ+GGLCTLCFL++VD + HLFK+SI+LIKAWCYYESRILG HHGLIS+
Sbjct: 149 NIVVDISFNQIGGLCTLCFLEQVDERFGKKHLFKKSIMLIKAWCYYESRILGAHHGLIST 208
Query: 186 YALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEP 245
YAL LVLYIFH+F+ S GPL VLYRFL+++SKFDWDN +SL+GPVP+S LP++ ++
Sbjct: 209 YALEILVLYIFHLFHKSLDGPLAVLYRFLDYYSKFDWDNKGISLYGPVPLSSLPELVSDT 268
Query: 246 PRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKG 305
P L + FL + P E + F+ K N++DPL+ NNNLGRSVSKG
Sbjct: 269 PDTHDVDFLKREEFLKEFAQMFTVPPRSFERNNRLFLRKFLNIVDPLKQNNNLGRSVSKG 328
Query: 306 NF 307
F
Sbjct: 329 FF 330
>gi|414866687|tpg|DAA45244.1| TPA: hypothetical protein ZEAMMB73_273182 [Zea mays]
Length = 1050
Score = 335 bits (860), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 167/324 (51%), Positives = 221/324 (68%), Gaps = 1/324 (0%)
Query: 24 RIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
R++P SE RR V Y RRL+ C+VF FGSVPLKTYLPD DIDL + +
Sbjct: 37 RVRPTEASERRRAEVVDYARRLVGSALGCEVFAFGSVPLKTYLPDGDIDLTVLGN-TSYD 95
Query: 84 DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLC 143
T + V +LE+EE+N AEF VK+++ I AEV++IKC + N +VDI+FNQ GG+C LC
Sbjct: 96 STLVNDVFCILESEEQNSDAEFVVKDLERIDAEVRLIKCTIGNIIVDISFNQTGGICALC 155
Query: 144 FLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSF 203
FL+ VD + +NHLFKRSIILIKAWCYYESR+LG HHGLIS+YAL L+LY+F++F+ S
Sbjct: 156 FLELVDRKVGKNHLFKRSIILIKAWCYYESRLLGAHHGLISTYALEVLILYVFNLFHKSL 215
Query: 204 AGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVLLLSKSFLDSC 263
P+EVLYRFLE+FSKFDWDN+C+SL GPV +S LP++ E LL K FL S
Sbjct: 216 HSPVEVLYRFLEYFSKFDWDNYCISLNGPVALSSLPNLIVEATVTHTSDLLFDKEFLKSS 275
Query: 264 RYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKGLAR 323
P ++ F KH N++DPL+ NNLGRSV++ +F RIRTAF + A+ L
Sbjct: 276 MDKATVPPKNSDSCYPRFRPKHLNIVDPLKEYNNLGRSVNRASFNRIRTAFLYGARKLGH 335
Query: 324 LLDCPNEDLYNEVNQFFMNTRDRH 347
++ P+E + +E+ +FF NT R+
Sbjct: 336 IVTLPSEVIPDEIYEFFKNTLGRN 359
>gi|414866686|tpg|DAA45243.1| TPA: hypothetical protein ZEAMMB73_273182 [Zea mays]
Length = 1056
Score = 335 bits (860), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 167/324 (51%), Positives = 221/324 (68%), Gaps = 1/324 (0%)
Query: 24 RIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
R++P SE RR V Y RRL+ C+VF FGSVPLKTYLPD DIDL + +
Sbjct: 37 RVRPTEASERRRAEVVDYARRLVGSALGCEVFAFGSVPLKTYLPDGDIDLTVLGN-TSYD 95
Query: 84 DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLC 143
T + V +LE+EE+N AEF VK+++ I AEV++IKC + N +VDI+FNQ GG+C LC
Sbjct: 96 STLVNDVFCILESEEQNSDAEFVVKDLERIDAEVRLIKCTIGNIIVDISFNQTGGICALC 155
Query: 144 FLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSF 203
FL+ VD + +NHLFKRSIILIKAWCYYESR+LG HHGLIS+YAL L+LY+F++F+ S
Sbjct: 156 FLELVDRKVGKNHLFKRSIILIKAWCYYESRLLGAHHGLISTYALEVLILYVFNLFHKSL 215
Query: 204 AGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVLLLSKSFLDSC 263
P+EVLYRFLE+FSKFDWDN+C+SL GPV +S LP++ E LL K FL S
Sbjct: 216 HSPVEVLYRFLEYFSKFDWDNYCISLNGPVALSSLPNLIVEATVTHTSDLLFDKEFLKSS 275
Query: 264 RYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKGLAR 323
P ++ F KH N++DPL+ NNLGRSV++ +F RIRTAF + A+ L
Sbjct: 276 MDKATVPPKNSDSCYPRFRPKHLNIVDPLKEYNNLGRSVNRASFNRIRTAFLYGARKLGH 335
Query: 324 LLDCPNEDLYNEVNQFFMNTRDRH 347
++ P+E + +E+ +FF NT R+
Sbjct: 336 IVTLPSEVIPDEIYEFFKNTLGRN 359
>gi|168035287|ref|XP_001770142.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162678668|gb|EDQ65124.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1504
Score = 333 bits (853), Expect = 9e-89, Method: Compositional matrix adjust.
Identities = 167/296 (56%), Positives = 204/296 (68%), Gaps = 9/296 (3%)
Query: 56 TFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQA 115
TFGSVPLKTYLPD DIDL AF+ +K TW + L+ + N ++EFRVKEVQ I A
Sbjct: 143 TFGSVPLKTYLPDGDIDLSAFTPSPDVKRTWIQDTYNALQKAKDNPNSEFRVKEVQLIHA 202
Query: 116 EVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRI 175
EVKI+KC V+N +VD++F+QLGGL TLCFL EVD LI E+HLFKRSIIL+KAWCYYESRI
Sbjct: 203 EVKIVKCFVENILVDVSFDQLGGLGTLCFLVEVDKLIGEDHLFKRSIILVKAWCYYESRI 262
Query: 176 LGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPI 235
LG H GL+S+YA+ LVLYIF F+ S GPL+VLY FLEFFS FDWDN+C+SL P+P+
Sbjct: 263 LGAHCGLMSTYAVEALVLYIFDKFHASLRGPLQVLYLFLEFFSSFDWDNYCVSLSSPIPL 322
Query: 236 SLLPDVT---------AEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHF 286
L + A R+DGG L +K FL +C Y P Q + F K
Sbjct: 323 KSLSKDSEKLEDLQKLALSTRRDGGELFFTKEFLVACETEYGVVPVSQITKSNKFTVKCL 382
Query: 287 NVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMN 342
N+ DPLR +NNLGRSV++GNF RIR AF F A+ L R+L C ED+ E+ QFF N
Sbjct: 383 NISDPLRSSNNLGRSVNQGNFARIRRAFDFGARTLRRVLSCTEEDVPAELEQFFKN 438
Score = 42.0 bits (97), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 30/55 (54%)
Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTY 65
W KAE AELI +QP+ SE+RR V YVR L+ C Q ++ LK +
Sbjct: 36 WAKAELRAAELITSLQPNEASEQRRQDVIDYVRGLVKGCIYGQCLHSEALCLKHF 90
>gi|7572930|emb|CAB87431.1| putative protein [Arabidopsis thaliana]
Length = 614
Score = 331 bits (848), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 169/354 (47%), Positives = 231/354 (65%), Gaps = 15/354 (4%)
Query: 5 PLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKT 64
P+D W+ AEE E++ IQP S+ RN + YVR LI+ +VF+FGSVPLKT
Sbjct: 34 PIDADSWMIAEERAHEILCTIQPALVSDRSRNEIIDYVRTLIMSHEGIEVFSFGSVPLKT 93
Query: 65 YLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLV 124
YLPD DIDL + D + L L+NEE+ +EF +VQ+I A+VK+IKC +
Sbjct: 94 YLPDGDIDLTVLTKQNMDDDFYGQLC-SRLQNEER--ESEFHATDVQFIPAQVKVIKCNI 150
Query: 125 DNFVVDIAFNQLGGLCTLCFLD-----------EVDHLINENHLFKRSIILIKAWCYYES 173
N VDI+FNQ GLC LCFL+ EVD L +HLFKRSIIL+KAWCYYES
Sbjct: 151 RNIAVDISFNQTAGLCALCFLEQVLSAIQNQAPEVDQLFGRDHLFKRSIILVKAWCYYES 210
Query: 174 RILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPV 233
RILG + GLIS+YAL LVLYI ++F+ S +GPL VLY+FL+++ FDW+N+C+S+ GPV
Sbjct: 211 RILGANTGLISTYALAVLVLYIINLFHSSLSGPLAVLYKFLDYYGSFDWNNYCISVNGPV 270
Query: 234 PISLLPDVTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLR 293
PIS LP++TA P ++G LLL + FL +C Y+ ++ G F KH N++DPL+
Sbjct: 271 PISSLPELTAASP-ENGHELLLDEKFLRNCVELYSAPTKAVDSNGLEFPIKHLNIVDPLK 329
Query: 294 VNNNLGRSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
+NNLG+SV++GN RIR AFT A+ L +L P + + + +FF N+ +R+
Sbjct: 330 YSNNLGKSVTQGNVQRIRHAFTLGARKLRDVLSLPGDTMGWRLEKFFRNSLERN 383
>gi|297820390|ref|XP_002878078.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297323916|gb|EFH54337.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 602
Score = 330 bits (845), Expect = 7e-88, Method: Compositional matrix adjust.
Identities = 165/337 (48%), Positives = 228/337 (67%), Gaps = 4/337 (1%)
Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRD 70
W+ AEE E++ IQP S++ RN + YVR LI +VF+FGSVPLKTYLPD D
Sbjct: 40 WMIAEERAHEILCTIQPALVSDKSRNEIIDYVRTLIKSHDGIEVFSFGSVPLKTYLPDGD 99
Query: 71 IDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVD 130
IDL + D + L L+NEE+ +EF +VQ+I A+VK+IKC + N VD
Sbjct: 100 IDLTVLTKQNMDDDFYGQLC-SRLQNEER--ESEFHATDVQFIPAQVKVIKCNIRNIAVD 156
Query: 131 IAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVT 190
I+FNQ GLC LCFL++VD L +HLFKRSIIL+KAWCYYESRILG + GLIS+YAL
Sbjct: 157 ISFNQTAGLCALCFLEQVDQLFGRDHLFKRSIILVKAWCYYESRILGANTGLISTYALAV 216
Query: 191 LVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDG 250
LVLYI ++F+ S +GPL VLY+FL+++ FDW+N+C+S+ GPVPIS LP++TA P ++G
Sbjct: 217 LVLYIINLFHSSLSGPLAVLYKFLDYYGSFDWNNYCISVNGPVPISSLPELTAASP-ENG 275
Query: 251 GVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRI 310
LLL + FL +C ++ ++ G F KH N++DPL+ +NNLG+SV++GN RI
Sbjct: 276 HELLLDEKFLRNCVELFSAPTKAVDSNGLDFPIKHLNIVDPLKYSNNLGKSVTQGNVQRI 335
Query: 311 RTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
R AFT A+ L +L P + + + +FF N+ +R+
Sbjct: 336 RHAFTLGARKLRDVLSLPGDTMGWRLEKFFRNSLERN 372
>gi|326490774|dbj|BAJ90054.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1030
Score = 329 bits (844), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 168/325 (51%), Positives = 222/325 (68%), Gaps = 7/325 (2%)
Query: 26 QPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDIDL---GAFSDDQTL 82
QP S+ RR V + RR++ C+VF FGSVPLKTYLPD DIDL G S TL
Sbjct: 43 QPTQASDRRRAEVVDHARRIVGTALGCEVFVFGSVPLKTYLPDGDIDLTVIGNTSCGSTL 102
Query: 83 KDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTL 142
D H+ LE+ E+N AEF VK++++I AEV++IKC + N +VDI+FNQ GG+C +
Sbjct: 103 IDDVYHI----LESGEENGDAEFEVKDLEHIDAEVRLIKCTIGNIIVDISFNQTGGICAV 158
Query: 143 CFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGS 202
FL+ VD + +NHLFKRSIILIK WCYYESR+LG HHGLIS+YAL TL+LY+F++F+ S
Sbjct: 159 SFLELVDRKVGKNHLFKRSIILIKGWCYYESRLLGAHHGLISTYALETLILYVFNLFHKS 218
Query: 203 FAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVLLLSKSFLDS 262
GPLEVLYRFLE+FSKFDWD +C+SL GPV +S LP++ E G LL + FLD+
Sbjct: 219 LHGPLEVLYRFLEYFSKFDWDKYCISLNGPVALSSLPNLIVEGLNVPGDDLLFDREFLDN 278
Query: 263 CRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKGLA 322
+ P + + F K N+IDPL+ NNLGRSV++ NF RIRTAF+F A+ L
Sbjct: 279 SVEKASAPPRNSDARCSKFRVKCLNIIDPLKECNNLGRSVNRANFHRIRTAFSFGARKLG 338
Query: 323 RLLDCPNEDLYNEVNQFFMNTRDRH 347
++L P E + +++ FF NT +R+
Sbjct: 339 QILMLPPELIPDDIFAFFKNTLERN 363
>gi|218200261|gb|EEC82688.1| hypothetical protein OsI_27344 [Oryza sativa Indica Group]
Length = 1001
Score = 327 bits (837), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 163/328 (49%), Positives = 220/328 (67%), Gaps = 5/328 (1%)
Query: 20 ELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSDD 79
E++ R+QP +E R + Y++ L C+VF FGSVPLKTYLPD DID+ +
Sbjct: 54 EVVLRVQPTEEAERTRQGIIGYLKLLFGTALGCEVFAFGSVPLKTYLPDGDIDITILGNT 113
Query: 80 QTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGL 139
T+ VR +LE EE+ + A+ + +Q+I AEVK+IKC++DN VVDI+FNQ+GG+
Sbjct: 114 AP-DSTFISEVRGILELEEQEDGADVAITGLQFIDAEVKLIKCVIDNIVVDISFNQIGGV 172
Query: 140 CTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVF 199
TLC L+ VDH + +HLFKRSI+LIKAWCY+ES ILG H GLIS+YAL LVLYIF++F
Sbjct: 173 TTLCLLELVDHEVGNDHLFKRSIMLIKAWCYHESHILGAHRGLISTYALEVLVLYIFNIF 232
Query: 200 NGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVLLLSKSF 259
+ S PLEVLY+FLE+FSKFDWD +C+SL GPVP+S LP++T EP +L
Sbjct: 233 HKSLHSPLEVLYKFLEYFSKFDWDKYCISLNGPVPLSSLPNLTVEPSGIHDELLFGPNGS 292
Query: 260 LDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRAK 319
D D G N F K+ N+IDP++ +NNLGRSVSKG+F+RIR AF+F A+
Sbjct: 293 CDRLIVLKKDSDGSNMN----FRPKYLNIIDPIKSSNNLGRSVSKGSFYRIRGAFSFGAQ 348
Query: 320 GLARLLDCPNEDLYNEVNQFFMNTRDRH 347
L+++L P + + E+ FF+NT H
Sbjct: 349 NLSQILMLPTDLIPTEIFGFFVNTLKSH 376
>gi|222637691|gb|EEE67823.1| hypothetical protein OsJ_25591 [Oryza sativa Japonica Group]
Length = 1001
Score = 325 bits (833), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 162/328 (49%), Positives = 220/328 (67%), Gaps = 5/328 (1%)
Query: 20 ELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSDD 79
E++ R+QP ++ R + Y++ L C+VF FGSVPLKTYLPD DID+ +
Sbjct: 54 EVVLRVQPTEEADRTRQGIIGYLKLLFGTALGCEVFAFGSVPLKTYLPDGDIDITILGNT 113
Query: 80 QTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGL 139
T+ VR +LE EE+ + A+ + +Q+I AEVK+IKC++DN VVDI+FNQ+GG+
Sbjct: 114 AP-DSTFISEVRGILELEEQEDGADVAITGLQFIDAEVKLIKCVIDNIVVDISFNQIGGV 172
Query: 140 CTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVF 199
TLC L+ VDH + +HLFKRSI+LIKAWCY+ES ILG H GLIS+YAL LVLYIF++F
Sbjct: 173 TTLCLLELVDHEVGNDHLFKRSIMLIKAWCYHESHILGAHRGLISTYALEVLVLYIFNIF 232
Query: 200 NGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVLLLSKSF 259
+ S PLEVLY+FLE+FSKFDWD +C+SL GPVP+S LP++T EP +L
Sbjct: 233 HKSLHSPLEVLYKFLEYFSKFDWDKYCISLNGPVPLSSLPNLTVEPSGIHDELLFGPNGS 292
Query: 260 LDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRAK 319
D D G N F K+ N+IDP++ +NNLGRSVSKG+F+RIR AF+F A+
Sbjct: 293 CDRLIVLKKDSDGSNMN----FRPKYLNIIDPIKSSNNLGRSVSKGSFYRIRGAFSFGAQ 348
Query: 320 GLARLLDCPNEDLYNEVNQFFMNTRDRH 347
L+++L P + + E+ FF+NT H
Sbjct: 349 NLSQILMLPTDLIPTEIFGFFVNTLKSH 376
>gi|115488182|ref|NP_001066578.1| Os12g0283100 [Oryza sativa Japonica Group]
gi|77554657|gb|ABA97453.1| Nucleotidyltransferase domain containing protein, expressed [Oryza
sativa Japonica Group]
gi|113649085|dbj|BAF29597.1| Os12g0283100 [Oryza sativa Japonica Group]
gi|222616913|gb|EEE53045.1| hypothetical protein OsJ_35772 [Oryza sativa Japonica Group]
Length = 989
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 159/327 (48%), Positives = 215/327 (65%), Gaps = 12/327 (3%)
Query: 21 LIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSDDQ 80
++ R+ P +E RR V Y+RRL+ C+V FGSVPLK+YLPD D+D+ +
Sbjct: 56 VLLRVAPTEEAERRRQDVVGYLRRLLGTALGCEVIAFGSVPLKSYLPDGDVDITVLGN-T 114
Query: 81 TLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLC 140
L V +LE+EE++ AE +K + +I AEVK+IKC+++N VVDI+FNQ+GG+
Sbjct: 115 ALDGACISDVHSILESEEQDSGAELEIKGLHFIDAEVKLIKCVIENIVVDISFNQIGGVS 174
Query: 141 TLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFN 200
TLCFL+ D + +NHLFKRSI+LIKAWCY+ESRILG HHGL+S+YAL TLVLYIF++F+
Sbjct: 175 TLCFLELADRKVGKNHLFKRSIMLIKAWCYHESRILGAHHGLLSTYALETLVLYIFNIFH 234
Query: 201 GSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVLLLSKSFL 260
S GPLE LY+FLE+FSKFDWD +C+SL GPV +S LP EP S
Sbjct: 235 KSLHGPLEALYKFLEYFSKFDWDKYCISLNGPVLLSSLPSPAVEP-----------SSIQ 283
Query: 261 DSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKG 320
D + P + F KH N+IDPL+ +NNLGRSVS+G+F+RIR A +F A+
Sbjct: 284 DELLFGKKTLPEVSDGSNINFCLKHLNIIDPLKWSNNLGRSVSRGSFYRIRGALSFGAQK 343
Query: 321 LARLLDCPNEDLYNEVNQFFMNTRDRH 347
L ++L ++ + E+ FF NT RH
Sbjct: 344 LGQILMLHSDLIPTEIFGFFANTLKRH 370
>gi|218186672|gb|EEC69099.1| hypothetical protein OsI_37998 [Oryza sativa Indica Group]
Length = 989
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 159/327 (48%), Positives = 215/327 (65%), Gaps = 12/327 (3%)
Query: 21 LIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSDDQ 80
++ R+ P +E RR V Y+RRL+ C+V FGSVPLK+YLPD D+D+ +
Sbjct: 56 VLLRVAPTEEAERRRQDVVGYLRRLLGTALGCEVIAFGSVPLKSYLPDGDVDITVLGN-T 114
Query: 81 TLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLC 140
L V +LE+EE++ AE +K + +I AEVK+IKC+++N VVDI+FNQ+GG+
Sbjct: 115 ALDGACISDVHSILESEEQDSGAELEIKGLHFIDAEVKLIKCVIENIVVDISFNQIGGVS 174
Query: 141 TLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFN 200
TLCFL+ D + +NHLFKRSI+LIKAWCY+ESRILG HHGL+S+YAL TLVLYIF++F+
Sbjct: 175 TLCFLELADRKVGKNHLFKRSIMLIKAWCYHESRILGAHHGLLSTYALETLVLYIFNIFH 234
Query: 201 GSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVLLLSKSFL 260
S GPLE LY+FLE+FSKFDWD +C+SL GPV +S LP EP S
Sbjct: 235 KSLHGPLEALYKFLEYFSKFDWDKYCISLNGPVLLSSLPSPAVEP-----------SSIQ 283
Query: 261 DSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKG 320
D + P + F KH N+IDPL+ +NNLGRSVS+G+F+RIR A +F A+
Sbjct: 284 DELLFGKKTLPEVSDGSNINFCLKHLNIIDPLKWSNNLGRSVSRGSFYRIRGALSFGAQK 343
Query: 321 LARLLDCPNEDLYNEVNQFFMNTRDRH 347
L ++L ++ + E+ FF NT RH
Sbjct: 344 LGQILMLHSDLIPTEIFGFFANTLKRH 370
>gi|356561857|ref|XP_003549193.1| PREDICTED: uncharacterized protein LOC100787145 [Glycine max]
Length = 684
Score = 319 bits (817), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 166/348 (47%), Positives = 230/348 (66%), Gaps = 7/348 (2%)
Query: 1 SVIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSV 60
S + +D W AEE E++ I+P SE R V YV+RLI + +V FGSV
Sbjct: 14 SQLLSIDEELWRMAEERAQEILWTIEPIVLSEVNRKDVIDYVQRLIRGYYGAEVLPFGSV 73
Query: 61 PLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKII 120
PLKTYLPD DIDL A S + +D A V ++L++ + E++VK++QYI+A+V+++
Sbjct: 74 PLKTYLPDGDIDLTALSHEDAEED-LAQAVCNILQS---GDDPEYQVKDIQYIRAQVRLV 129
Query: 121 KCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHH 180
KC V N VDI+FNQ+ G+CTL FL++VD L+ +NH+FK SIILIKAWCYYESR+LGGHH
Sbjct: 130 KCTVKNIAVDISFNQMAGICTLRFLEQVDQLVGKNHIFKHSIILIKAWCYYESRLLGGHH 189
Query: 181 GLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPD 240
GL+S+YA+ LVLYI + F+ S GPLEVLY FL+++ FDWD+ +S+WGP P+S LP+
Sbjct: 190 GLLSTYAVEILVLYIINRFHSSVRGPLEVLYIFLDYYGSFDWDHNYVSIWGPKPLSSLPE 249
Query: 241 VTAEPPRKDGGVLLLSKSFLDSCRYAYADFPG-GQENQGQPFVSKHFNVIDPLRVNNNLG 299
+ AE P D G LL K FL + R FP E F K N++DPLR +NNLG
Sbjct: 250 I-AETPECDQGEFLLQKEFLRNYR-NMCSFPSRASETMTHEFPVKFMNILDPLRNDNNLG 307
Query: 300 RSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
RSV+ N R+R A ++ A+ L ++L P E++ + +FF +T DR+
Sbjct: 308 RSVNIANLHRVRFALSYGARRLKQILTLPGENMGAALEKFFFSTLDRN 355
>gi|356570171|ref|XP_003553264.1| PREDICTED: uncharacterized protein LOC100797780 [Glycine max]
Length = 644
Score = 308 bits (789), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 161/348 (46%), Positives = 225/348 (64%), Gaps = 7/348 (2%)
Query: 1 SVIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSV 60
S + +D W AEE E++ IQP+ SE R V YV+RLI + +V FGSV
Sbjct: 14 SQLLSIDKELWQMAEERAQEILWTIQPNVLSEVNRKDVIDYVQRLIRGYYGAEVLPFGSV 73
Query: 61 PLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKII 120
PLKTYLPD DIDL A S + +D L + + + + E++VK+++YI+A+V+++
Sbjct: 74 PLKTYLPDGDIDLTALSHEDAEED----LAQAVCYVLQSGDDPEYQVKDIKYIRAQVRLV 129
Query: 121 KCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHH 180
KC V N VDI+FNQ+ G+CTL FL++VD L+ +NH+FKRSIILIKAWCYYESR+LGGHH
Sbjct: 130 KCTVKNIAVDISFNQMAGICTLRFLEQVDQLVGKNHIFKRSIILIKAWCYYESRLLGGHH 189
Query: 181 GLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPD 240
GL+S+YA+ LVLYI + F+ S GPLEVLY FL+++ FDWD+ +S+WGP P+S P+
Sbjct: 190 GLLSTYAVEILVLYIINRFHSSVRGPLEVLYIFLDYYGSFDWDHNYVSIWGPKPLSSFPE 249
Query: 241 VTAEPPRKDGGVLLLSKSFLDSCRYAYADFPG-GQENQGQPFVSKHFNVIDPLRVNNNLG 299
+ AE D G LL K FL + R FP + F K N++DPLR +NNLG
Sbjct: 250 I-AETLECDHGEFLLQKEFLRNYR-NMCSFPSRATKTMTHEFPVKFMNILDPLRNDNNLG 307
Query: 300 RSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
RSV+ + R R A ++ A+ L ++L P E + + +FF +T DR+
Sbjct: 308 RSVNIASLHRFRFALSYGARRLKQILTLPGETMGAALEKFFFSTLDRN 355
>gi|413924678|gb|AFW64610.1| hypothetical protein ZEAMMB73_859338 [Zea mays]
Length = 474
Score = 308 bits (789), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 167/348 (47%), Positives = 224/348 (64%), Gaps = 14/348 (4%)
Query: 6 LDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLI-IQCFPCQVFTFGSVPLKT 64
+ P W + E T ++ +I P S+ R V YV+RL + QV +FGSVPLKT
Sbjct: 67 ISPDDWRRLEGATFSVMCKIHPTVSSQHLRARVIDYVQRLFRLHHDGYQVISFGSVPLKT 126
Query: 65 YLPDRDIDL----GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKII 120
YLPD DIDL A SD+ + A +L++EE+ + +EF VK+V+Y+ AEVK++
Sbjct: 127 YLPDGDIDLTLLCAAISDENLENEVCA-----ILKSEEQRKDSEFEVKDVKYVPAEVKLV 181
Query: 121 KCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHH 180
KC V N VDI+ NQ+GG + FL++VD + +N+L +RSI+LIK WCYYES ILG
Sbjct: 182 KCKVQNIAVDISVNQIGGPNKVYFLEKVDQNLGKNNLLRRSIMLIKHWCYYESCILGAQR 241
Query: 181 GLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPD 240
GL+S+YAL TLVLYIFHVF+ S GPL VLYRFL+++SKFDWDN +SL+GP+ +S LP+
Sbjct: 242 GLVSTYALETLVLYIFHVFHKSLDGPLAVLYRFLDYYSKFDWDNKGISLFGPISLSSLPE 301
Query: 241 VTAEPP--RKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNL 298
+ EPP R DG L ++FL C A++ P E Q F K N++DPL+ +NNL
Sbjct: 302 LVTEPPYTRDDG--FLSREAFLKDCAKAFSVPPINSEENPQVFSKKFVNIVDPLKQSNNL 359
Query: 299 GRSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDR 346
GRS+SKGN RIR F F A L ++L P NE+N+FF NT R
Sbjct: 360 GRSISKGNLGRIRKEFYFGACKLGKILQAPACFSANEINRFFRNTLSR 407
>gi|242082774|ref|XP_002441812.1| hypothetical protein SORBIDRAFT_08g002707 [Sorghum bicolor]
gi|241942505|gb|EES15650.1| hypothetical protein SORBIDRAFT_08g002707 [Sorghum bicolor]
Length = 546
Score = 307 bits (787), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 174/389 (44%), Positives = 230/389 (59%), Gaps = 48/389 (12%)
Query: 6 LDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLI------------------- 46
+ P W + E ++ +IQP SE R+AV Y++RL+
Sbjct: 29 IPPDAWRRFESAALGVVNKIQPTVASENFRSAVIDYLKRLLGSRAGVQSWLLPFLPFHFY 88
Query: 47 -------------------IQCFPCQ-------VFTFGSVPLKTYLPDRDIDLGAFSDDQ 80
I C VF FGSVPLKTYLPD DIDL AFS
Sbjct: 89 VFFGAKPVRDYEYKCVTVWIYFVGCALESLCDLVFPFGSVPLKTYLPDGDIDLTAFSPAI 148
Query: 81 TLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLC 140
+ + A+ V +L +E+ + +EF VK+VQYI AEVK++KCLV N VVDI+ NQ+GGL
Sbjct: 149 S-DENLANQVYAILSSEQHRKDSEFDVKDVQYIHAEVKLVKCLVQNIVVDISVNQIGGLS 207
Query: 141 TLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFN 200
TLCFL++VD + HL KRSI+LIK WCYYESRILG +GL+S+YAL LVLY+F +F+
Sbjct: 208 TLCFLEKVDENFGKKHLLKRSIVLIKDWCYYESRILGAQNGLLSTYALEVLVLYVFLIFH 267
Query: 201 GSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEP--PRKDGGVLLLSKS 258
S GPL VLYRFL+F+SKFDWD+ +SL+GPV +S LP++ +P P D + +
Sbjct: 268 RSLGGPLAVLYRFLDFYSKFDWDSKGISLFGPVSLSSLPNLVTDPHLPAIDDDFFVPREK 327
Query: 259 FLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRA 318
L ++ P E Q F K N++DPL+ +NNLGRSV+KGNF+RIR+AF F A
Sbjct: 328 ILRKYAEDFSAPPRNSERDAQVFSRKFLNIVDPLKQSNNLGRSVNKGNFYRIRSAFDFGA 387
Query: 319 KGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
+ L ++L P NEVNQFF NT R+
Sbjct: 388 RKLGKILQMPVCYTVNEVNQFFSNTLKRN 416
>gi|356507300|ref|XP_003522406.1| PREDICTED: uncharacterized protein LOC100813790 [Glycine max]
Length = 692
Score = 305 bits (781), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 155/347 (44%), Positives = 227/347 (65%), Gaps = 5/347 (1%)
Query: 1 SVIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSV 60
S + +D W AE+ E++ IQP+ SE R V YV+RLI + +V FGSV
Sbjct: 15 SQLLSIDEELWQMAEDRVQEILWTIQPNVLSEVNRKDVIDYVQRLIRDYYGAEVLPFGSV 74
Query: 61 PLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKII 120
PLKTYLPD D+DL + +D A + ++L++ + +E++VK++QYI+A+V+++
Sbjct: 75 PLKTYLPDGDVDLTTLIHEDA-EDDLAQAICNVLKS---GDDSEYQVKDIQYIRAQVRLV 130
Query: 121 KCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHH 180
KC V N VDI+FNQ+ G+ TL FL++VD L+ +NH+FKRSIILIK WCYY+SR+LGGHH
Sbjct: 131 KCTVKNIAVDISFNQMAGIYTLRFLEQVDQLVGKNHIFKRSIILIKGWCYYDSRLLGGHH 190
Query: 181 GLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPD 240
GL+S+YA+ LVLYI + F+ S GPLEVLY FL+++ FDWD+ +S+WGP +S LP+
Sbjct: 191 GLLSTYAVEILVLYIINRFHSSVRGPLEVLYIFLDYYGSFDWDHNYISIWGPKSLSSLPE 250
Query: 241 VTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGR 300
+ AE P D G LL K FL + + + G E F K N++DPLR +NNLGR
Sbjct: 251 I-AEAPECDQGEFLLQKEFLGNYKNMCSYPAGASETLTHEFPVKFMNILDPLRNDNNLGR 309
Query: 301 SVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
SVS + R+R AF++ + L ++ P E++ + +FF +T +R+
Sbjct: 310 SVSIASLHRLRFAFSYGVQKLKQIFTLPGENMGAALEKFFSSTLNRN 356
>gi|356570173|ref|XP_003553265.1| PREDICTED: uncharacterized protein LOC100798838 [Glycine max]
Length = 626
Score = 305 bits (780), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 158/343 (46%), Positives = 221/343 (64%), Gaps = 5/343 (1%)
Query: 1 SVIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSV 60
S + +D W EE E++ IQP+ SE R + YV+RLI + QVF FGS
Sbjct: 15 SQLLSIDEELWRMIEERAQEILWTIQPNVLSEVNRKNIIDYVQRLIGEYCGAQVFPFGSF 74
Query: 61 PLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKII 120
PLKTYLPD DIDL A S + +D LVR + + + +E++VK++++I+A+V+++
Sbjct: 75 PLKTYLPDGDIDLTALSHEDEEED----LVRAVCNILKSEDDSEYQVKDIEHIRAQVQVV 130
Query: 121 KCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHH 180
KC V N VDI+FNQ+ GL TL FL++VD L+ +NH+FKRS+ILIK+WCYYESRILG H
Sbjct: 131 KCTVKNIPVDISFNQMAGLYTLFFLEQVDQLVGKNHIFKRSVILIKSWCYYESRILGAHC 190
Query: 181 GLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPD 240
GL+S+YA LVLYI + F+ S GPL VLY FL+++S FDW++ +S+WGP +S LP+
Sbjct: 191 GLLSTYATEILVLYIINRFHSSVRGPLAVLYVFLDYYSSFDWEHNYISIWGPKVLSSLPE 250
Query: 241 VTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGR 300
+ + P D G LL K FL + R + E F KH N++DPLR NNNLGR
Sbjct: 251 I-VDTPEYDQGEFLLQKEFLKNYRDMCSSKAKASETMTNAFPVKHMNILDPLRNNNNLGR 309
Query: 301 SVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNT 343
SV+ GN RIR AF+ ++ L ++L E++ + +FF NT
Sbjct: 310 SVNIGNLSRIRLAFSLGSQRLKQILTLAGENMGAALEKFFFNT 352
>gi|326521958|dbj|BAK04107.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1031
Score = 304 bits (778), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 157/327 (48%), Positives = 216/327 (66%), Gaps = 2/327 (0%)
Query: 21 LIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSDDQ 80
++ R+ P +E RR+ + Y + LI F C+VF FGSVPLKTYLPD D+D+ ++
Sbjct: 157 VLLRLHPTEEAERRRHKIIDYAKNLIGTTFGCEVFAFGSVPLKTYLPDGDVDITILTN-V 215
Query: 81 TLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLC 140
L + + V +L E+ NE AEF +KE+Q I A+VKIIKC++DN V+DI+FNQ+GG+
Sbjct: 216 NLDNNFVQDVCCLLAAEQSNEAAEFALKEIQVINAKVKIIKCVIDNLVMDISFNQVGGVS 275
Query: 141 TLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFN 200
TLCFL+ + I ++HLFKRSIILIKAWCY+E I G +H L+S+YAL L+LYIF++F+
Sbjct: 276 TLCFLEMANKEIGKDHLFKRSIILIKAWCYHEGSIHGSNHWLMSTYALEVLILYIFNLFH 335
Query: 201 GSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVLLLSKSFL 260
GPL+ LY+FLE++SKFDWDN CL+L GPVP+S L + TA P LLLSK L
Sbjct: 336 TVLHGPLQALYKFLEYYSKFDWDNQCLTLNGPVPLSSLRNYTAG-PTGSNEELLLSKEPL 394
Query: 261 DSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKG 320
+ D P G + +G F K+ N+IDPL+ NNLG S+S+ N IR AF A+
Sbjct: 395 EPSLRRLFDLPAGSDGRGPEFRLKYLNIIDPLKGGNNLGTSISEANSRVIRDAFAAGAEK 454
Query: 321 LARLLDCPNEDLYNEVNQFFMNTRDRH 347
L ++L P E + +V FF +T +H
Sbjct: 455 LGQILKLPCELIAEQVYVFFTHTLGKH 481
>gi|359486339|ref|XP_002274554.2| PREDICTED: uncharacterized protein LOC100253615 [Vitis vinifera]
Length = 755
Score = 303 bits (776), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 167/338 (49%), Positives = 223/338 (65%), Gaps = 6/338 (1%)
Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRD 70
W + E++ IQP SE+RR + YV+RLI F +V FGS+PLKTYLPD D
Sbjct: 32 WSITKLTIQEILCAIQPTIVSEQRRKEIIDYVQRLIRDSFGNEVLPFGSMPLKTYLPDGD 91
Query: 71 IDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVD 130
IDL A + +D +A V +LE E + +EFRV+++ YI+A+VKI+KC+V + VD
Sbjct: 92 IDLTALCPENDEED-FARDVCTLLEGE-RQMGSEFRVEDISYIRAKVKIVKCMVQDISVD 149
Query: 131 IAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVT 190
I+FNQ GGL TLCFL+++D LI ++HLFKRS+ILIKAWCYYE RILG H GL+S+YAL
Sbjct: 150 ISFNQTGGLSTLCFLEQIDILIGKDHLFKRSVILIKAWCYYEGRILGSHCGLLSTYALEI 209
Query: 191 LVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDG 250
LVLY+ ++F S PL VLYRFL+++S FDW+ F +S+ GPV IS L +T P D
Sbjct: 210 LVLYVINLFYSSLYCPLAVLYRFLDYYSTFDWEKFGVSVLGPVSISSL--LTGAPETAD- 266
Query: 251 GVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRI 310
LL+++ FL SC+ A+A E QPF+ KH N+ DPLR NNLGRS+S GN +R
Sbjct: 267 KPLLINEEFLWSCKEAFAVSIRASECTKQPFLVKHINIQDPLRDYNNLGRSISLGNSYRF 326
Query: 311 RTAFTFRAKGLARLLDCPNEDLYNE-VNQFFMNTRDRH 347
R A + A+ L +L E NE + +FF NT DR+
Sbjct: 327 RYAISVGAQRLKEILLMLPEGRMNEGLKEFFNNTLDRN 364
>gi|302835555|ref|XP_002949339.1| hypothetical protein VOLCADRAFT_117152 [Volvox carteri f.
nagariensis]
gi|300265641|gb|EFJ49832.1| hypothetical protein VOLCADRAFT_117152 [Volvox carteri f.
nagariensis]
Length = 3433
Score = 298 bits (764), Expect = 2e-78, Method: Composition-based stats.
Identities = 156/322 (48%), Positives = 204/322 (63%), Gaps = 18/322 (5%)
Query: 18 TAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFT---FGSVPLKTYLPDRDIDLG 74
T LI+RI+P S +RR + +V ++ +CF T FGSVPLKTYLPD DIDL
Sbjct: 34 TDTLISRIRPTGLSLQRRWVITEHVTSIVKRCFAPHDVTAIPFGSVPLKTYLPDGDIDLS 93
Query: 75 AFSDD-------QTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNF 127
+S+ + L+DTWA ++ LE E N A FRV VQ I AEVK++KCLVDN
Sbjct: 94 IYSESPRAQALKEALRDTWATQLQVCLEEEANNPTAVFRVANVQVIHAEVKLLKCLVDNI 153
Query: 128 VVDIAFNQLGGLCTLCFLDEVDHLINE-----NHLFKRSIILIKAWCYYESRILGGHHGL 182
VVDI+F Q+GGL T FL++VD +++ HLFK SIIL+K WCYYESR+LG HHGL
Sbjct: 154 VVDISFFQVGGLNTYNFLEDVDRFVDQCIPVRKHLFKDSIILVKGWCYYESRVLGAHHGL 213
Query: 183 ISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVT 242
IS+YAL TLVLY+ ++++ PL+VLY+FL S FDW+N+CLSL GP+P+S P
Sbjct: 214 ISTYALETLVLYVINLYHRELTNPLQVLYKFLVECSCFDWENYCLSLEGPIPLSSFPKPV 273
Query: 243 AEPPRKDGGVLLLSKSFLDSCRYAYADFPG--GQENQGQPFVSKHFNVIDPLRVNNNLGR 300
E P LL+K F+ + Y + P Q + +PF K NV+DP+ NNLGR
Sbjct: 274 VETPEALQRDALLTKDFMARAYFKYTE-PQLRAQGGEPKPFAIKQLNVMDPILPGNNLGR 332
Query: 301 SVSKGNFFRIRTAFTFRAKGLA 322
SVSK ++ RIR AF A+ LA
Sbjct: 333 SVSKASYLRIRRAFEHGARMLA 354
>gi|356518940|ref|XP_003528133.1| PREDICTED: uncharacterized protein LOC100815787 [Glycine max]
Length = 680
Score = 298 bits (762), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 153/347 (44%), Positives = 226/347 (65%), Gaps = 7/347 (2%)
Query: 1 SVIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSV 60
S + +D W AE+ E++ I+P+ SE R V YV+RLI + +V FGSV
Sbjct: 15 SQLVSIDEELWRMAEDRVQEILWTIEPNVLSEVNRKDVIDYVQRLIKGYYGAKVLPFGSV 74
Query: 61 PLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKII 120
PLKTYLPD D+DL + +D A + ++L++ + +E++VK++QYI+A+V+++
Sbjct: 75 PLKTYLPDGDVDLTTLIHEDAEED-LAQAICNILKS---GDDSEYQVKDIQYIRAQVRLV 130
Query: 121 KCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHH 180
KC V N VDI+FNQ+ G+ TL FL++VD L+ +NH+FKRSIILIKAWCYY+SR+LGGH+
Sbjct: 131 KCTVKNIAVDISFNQMAGIYTLRFLEQVDQLVGKNHIFKRSIILIKAWCYYDSRLLGGHY 190
Query: 181 GLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPD 240
GL+S+YA+ LVLYI + F+ GPLEVLY FL+++S FDWD+ +S+WGP +S LP+
Sbjct: 191 GLLSTYAVEILVLYIINRFHSVVRGPLEVLYIFLDYYSSFDWDHNYVSIWGPKSLSSLPE 250
Query: 241 VTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGR 300
+T P D G LL K FL + + + E F K N++DPLR +NNLGR
Sbjct: 251 IT---PECDQGEFLLQKEFLTNYKNMCSYPTRASETLTHEFPVKFMNILDPLRNDNNLGR 307
Query: 301 SVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
SVS + R+R AF + A+ L ++ P E++ + +FF +T +R+
Sbjct: 308 SVSIASLHRLRFAFAYSAQKLKQIFTLPGENMGAALEKFFFSTLERN 354
>gi|357491471|ref|XP_003616023.1| Poly(A) RNA polymerase cid14 [Medicago truncatula]
gi|355517358|gb|AES98981.1| Poly(A) RNA polymerase cid14 [Medicago truncatula]
Length = 387
Score = 293 bits (750), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 168/342 (49%), Positives = 210/342 (61%), Gaps = 61/342 (17%)
Query: 14 AEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQ-------------------- 53
AE+ TAE++ RIQP ++ RR V YV+RLI C+
Sbjct: 44 AEQTTAEILRRIQPTLAADRRRREVVDYVQRLIRYGARCEKLLPNVWRKLDFEVRIFRIG 103
Query: 54 -VFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQY 112
VF +GSVPLKTYLPD DIDL A S Q ++D V +L EE N+ AE+ VK+V++
Sbjct: 104 KVFPYGSVPLKTYLPDGDIDLTALSP-QNIEDGLVSDVHAVLRGEENNDAAEYEVKDVRF 162
Query: 113 IQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYE 172
I AE N VVDI+FNQLGGL TLCFL++VD L+ ++H+FKRSIILIKAWCYYE
Sbjct: 163 IDAE---------NIVVDISFNQLGGLSTLCFLEKVDRLVAKDHIFKRSIILIKAWCYYE 213
Query: 173 SRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPL------------------------- 207
SRILG HHGLIS+YAL TLVLYIFH F+ S GPL
Sbjct: 214 SRILGAHHGLISTYALETLVLYIFHRFHVSLDGPLAEKERKRNLNHIMLVMHPFNKHFMH 273
Query: 208 ----EVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVLLLSKSFLDSC 263
+VLYRFL++FSKFDWDN+C+SL GPV S PDV AE ++GG LL+ F+ SC
Sbjct: 274 PALFQVLYRFLDYFSKFDWDNYCVSLKGPVAKSSPPDVVAE-ALENGGNTLLTDEFIRSC 332
Query: 264 RYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKG 305
+++ P G + + F KH N+IDPL+ NNNLGRSV+KG
Sbjct: 333 VESFSVPPRGLDLNLRAFPHKHLNIIDPLKENNNLGRSVNKG 374
>gi|297823987|ref|XP_002879876.1| hypothetical protein ARALYDRAFT_903345 [Arabidopsis lyrata subsp.
lyrata]
gi|297325715|gb|EFH56135.1| hypothetical protein ARALYDRAFT_903345 [Arabidopsis lyrata subsp.
lyrata]
Length = 516
Score = 291 bits (746), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 152/337 (45%), Positives = 218/337 (64%), Gaps = 7/337 (2%)
Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRD 70
WL AEE E++ IQP SE RN + +++ L+ + +VF FGSVPLKTYLPD D
Sbjct: 33 WLIAEERAQEILFAIQPMYLSERSRNEIINHLQTLMRERLGIEVFLFGSVPLKTYLPDGD 92
Query: 71 IDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVD 130
IDL + +++ A +R++LE E ++F+V +VQYI A+VK+IKC + N +D
Sbjct: 93 IDLTVLTP-YGMEENCAKALRNILEAERG--ESDFQVTDVQYIHAQVKVIKCTIRNVALD 149
Query: 131 IAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVT 190
I+FNQ+ GL LCFL++VD +HLFKRSIILIKAWC+YESRILG ++GLIS+YAL
Sbjct: 150 ISFNQMAGLSALCFLEQVDRAFGRDHLFKRSIILIKAWCFYESRILGANNGLISTYALAI 209
Query: 191 LVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDG 250
LVL I ++ S +GPL VLY+F++F+ FDW+N+C+++ G VPIS PD+T +
Sbjct: 210 LVLNIVNMSYSSVSGPLAVLYKFMDFYGSFDWENYCITVTGLVPISSFPDITETRNHE-- 267
Query: 251 GVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRI 310
+ L + F C +Y+ E + F KH+N++DPL+ +NNLGRSVS+GN R+
Sbjct: 268 --VFLDEKFFRECIESYSGPANVVEANRKYFPVKHYNILDPLKHSNNLGRSVSEGNAIRL 325
Query: 311 RTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
R F A+ L +L P E + ++ FF N+ DR+
Sbjct: 326 RHCFRRGAQKLRDVLTFPGETVGWKLEDFFGNSLDRN 362
>gi|159471748|ref|XP_001694018.1| predicted protein [Chlamydomonas reinhardtii]
gi|158277185|gb|EDP02954.1| predicted protein [Chlamydomonas reinhardtii]
Length = 633
Score = 290 bits (742), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 156/330 (47%), Positives = 209/330 (63%), Gaps = 18/330 (5%)
Query: 18 TAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFT---FGSVPLKTYLPDRDIDLG 74
T LI+RI+P S +RR + +V +L+ +CF T FGSVPLKTYLPD DIDL
Sbjct: 31 TDTLISRIRPTTLSLQRRFVITEHVTQLVKRCFAPHDVTAVPFGSVPLKTYLPDGDIDLS 90
Query: 75 AFS--------DDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDN 126
+S DQ L+DTWA ++ LE+E N HA F+V VQ I AEVK++KCLVDN
Sbjct: 91 IYSYSSRAQSLKDQ-LRDTWATTLQLCLEDEANNPHAAFKVANVQVIHAEVKLLKCLVDN 149
Query: 127 FVVDIAFNQLGGLCTLCFLDEVDHLINE-----NHLFKRSIILIKAWCYYESRILGGHHG 181
VVDI+F Q+GGL T FL++VD +++ HLFK SIIL+K WCYYESR+LG HHG
Sbjct: 150 IVVDISFFQIGGLNTYNFLEDVDAFVDKAITARKHLFKDSIILVKGWCYYESRVLGAHHG 209
Query: 182 LISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDV 241
LIS+YAL TLVLY+ ++++ + PL+VLY+FL S FDW+ +CL+L GP+P++ P+
Sbjct: 210 LISTYALETLVLYVINLYHRELSNPLQVLYKFLVECSGFDWERYCLTLQGPIPLASFPNP 269
Query: 242 TAEPPRKDGGVLLLSKSFLDSCRYAY-ADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGR 300
E P LL++ F+ Y A + +PF K NV+DP+ NNNLGR
Sbjct: 270 VVETPEPLQREPLLTEHFMTRAYNKYTAPQVAAMGGEVKPFAIKQLNVMDPILPNNNLGR 329
Query: 301 SVSKGNFFRIRTAFTFRAKGLARLLDCPNE 330
SVSK ++ RIR AF A+ LA + + E
Sbjct: 330 SVSKASYLRIRRAFEHGARMLAAIAEQTKE 359
>gi|384253068|gb|EIE26543.1| hypothetical protein COCSUDRAFT_39611 [Coccomyxa subellipsoidea
C-169]
Length = 1155
Score = 288 bits (737), Expect = 3e-75, Method: Composition-based stats.
Identities = 144/292 (49%), Positives = 187/292 (64%), Gaps = 4/292 (1%)
Query: 53 QVFTFGSVPLKTYLPDRDIDLGAFSDD-QTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQ 111
+ + FGSVPLKTYLPD DIDL F L+D W + + +LE E +N RVK+VQ
Sbjct: 5 EAYMFGSVPLKTYLPDGDIDLAVFQGKGPRLRDVWTYELSALLEAEGRNALNPHRVKDVQ 64
Query: 112 YIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYY 171
I AEVK++KCLVDN VVDI+F+ LGGLCT+ FL+ +D I + HLFKRS+IL+KAWCYY
Sbjct: 65 IINAEVKLLKCLVDNIVVDISFDTLGGLCTVAFLESIDRHIGKQHLFKRSVILVKAWCYY 124
Query: 172 ESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWG 231
ESR+LG HHGL+S+YAL T+VLYIF++++ PL+VL +FL FSKFDWD LSL G
Sbjct: 125 ESRLLGAHHGLLSTYALETMVLYIFNMYHHELQSPLKVLRKFLVVFSKFDWDGHALSLQG 184
Query: 232 PVPISLLPDVTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDP 291
P+P+S PD EP G LL L + Y+ Q+ G+ F K+ N++DP
Sbjct: 185 PIPLSSFPDPQVEPVAGAEGGALLRGDVLKTMLEMYSPV---QQGPGKAFTIKNMNIMDP 241
Query: 292 LRVNNNLGRSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNT 343
L NNLGRSV+K + RIR A L + D ++ V+ FF NT
Sbjct: 242 LLPTNNLGRSVNKASKARIRKALAHGCHMLDSIFDKVGQEATEAVDGFFRNT 293
>gi|356518706|ref|XP_003528019.1| PREDICTED: uncharacterized protein LOC100788864 [Glycine max]
Length = 721
Score = 285 bits (730), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 146/342 (42%), Positives = 213/342 (62%), Gaps = 5/342 (1%)
Query: 6 LDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTY 65
+D W EE E++ IQP+ SE R V YV++LI + +VF FGS PLKTY
Sbjct: 20 IDEELWRMTEERIQEILWTIQPNVLSEMNRKNVLNYVQKLIGDYYDTKVFPFGSFPLKTY 79
Query: 66 LPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVD 125
LPD DIDL + + D +L +++ E ++VK++++I+A+V+++KC V
Sbjct: 80 LPDGDIDLTVINHE----DEEENLAKEICTILECANDLIYQVKDIEHIRAQVQVVKCTVK 135
Query: 126 NFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISS 185
N +DI FNQ+ GLCTLCFL++VD L +NH+FKRSIILIKAWC Y+SR+LG HGL+S+
Sbjct: 136 NIPIDITFNQMTGLCTLCFLEQVDQLAGKNHIFKRSIILIKAWCCYDSRLLGSQHGLLST 195
Query: 186 YALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEP 245
YA LVLYI + F+ S PLEVLY F +++ FDW++ +S+WGP +S LP++ +
Sbjct: 196 YATEVLVLYIINRFHASVRDPLEVLYIFFDYYGTFDWEHNYMSIWGPKALSSLPEI-VDR 254
Query: 246 PRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKG 305
P D LL K FL + R ++ E F KH N++DPLR +NNLGRSV++
Sbjct: 255 PECDQDEFLLHKEFLINYRDIFSSKAKSSETTTNTFPVKHINILDPLRNDNNLGRSVNEA 314
Query: 306 NFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
+F RIR A ++ AK ++ E++ + +FF +T R+
Sbjct: 315 SFHRIRFALSYGAKKFKQIFTLAGENMGEALEKFFFDTLQRN 356
>gi|30688308|ref|NP_850331.1| nucleotidyltransferase protein [Arabidopsis thaliana]
gi|145330711|ref|NP_001078031.1| nucleotidyltransferase protein [Arabidopsis thaliana]
gi|186506897|ref|NP_001118485.1| nucleotidyltransferase protein [Arabidopsis thaliana]
gi|186506900|ref|NP_001118486.1| nucleotidyltransferase protein [Arabidopsis thaliana]
gi|60547743|gb|AAX23835.1| hypothetical protein At2g40520 [Arabidopsis thaliana]
gi|330254746|gb|AEC09840.1| nucleotidyltransferase protein [Arabidopsis thaliana]
gi|330254747|gb|AEC09841.1| nucleotidyltransferase protein [Arabidopsis thaliana]
gi|330254748|gb|AEC09842.1| nucleotidyltransferase protein [Arabidopsis thaliana]
gi|330254749|gb|AEC09843.1| nucleotidyltransferase protein [Arabidopsis thaliana]
Length = 502
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 146/343 (42%), Positives = 217/343 (63%), Gaps = 7/343 (2%)
Query: 5 PLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKT 64
P++ WL AE E++ IQP+ +E RN + + ++ L+ + +V+ FGS+PLKT
Sbjct: 27 PIEAEVWLIAEARAQEILCAIQPNYLAERSRNKIISNLQTLLWERLGIEVYLFGSMPLKT 86
Query: 65 YLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLV 124
YLPD DIDL + + +D A V +LE E N ++ +V VQY+QA+VK+IKC +
Sbjct: 87 YLPDGDIDLTVLTHHASEEDC-ARAVCCVLEAEMGN--SDLQVTGVQYVQAKVKVIKCSI 143
Query: 125 DNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLIS 184
+ DI+FNQL GL LCFL++VD +HLFK+SIIL+KAWC+YESRILG + GLIS
Sbjct: 144 RDVAFDISFNQLAGLGALCFLEQVDKAFGRDHLFKKSIILVKAWCFYESRILGANSGLIS 203
Query: 185 SYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAE 244
+YAL LVL I ++ S +GPL VLY+F+ ++ FDW N+C+++ GPVPIS LPD+T
Sbjct: 204 TYALAILVLNIVNMSYSSLSGPLAVLYKFINYYGSFDWKNYCVTVTGPVPISSLPDITET 263
Query: 245 PPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSK 304
+ + L + F C Y+ G E + F K++N++DPL+ +NNLGRSV+K
Sbjct: 264 GNHE----VFLDEKFFRECMELYSGETGVVEASRKYFPVKYYNILDPLKHSNNLGRSVTK 319
Query: 305 GNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
GN R+R F + L +L P E++ ++ +FF + +R+
Sbjct: 320 GNMVRLRNCFMLGVQKLRDVLTLPGENVGWKLEKFFNVSLERN 362
>gi|21805733|gb|AAM76764.1| hypothetical protein [Arabidopsis thaliana]
Length = 502
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 145/343 (42%), Positives = 217/343 (63%), Gaps = 7/343 (2%)
Query: 5 PLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKT 64
P++ WL AE E++ +QP+ +E RN + + ++ L+ + +V+ FGS+PLKT
Sbjct: 27 PIEAEVWLIAEARAQEILCAVQPNYLAERSRNKIISNLQTLLWERLGIEVYLFGSMPLKT 86
Query: 65 YLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLV 124
YLPD DIDL + + +D A V +LE E N ++ +V VQY+QA+VK+IKC +
Sbjct: 87 YLPDGDIDLTVLTHHASEEDC-ARAVCCVLEAEMGN--SDLQVTGVQYVQAKVKVIKCSI 143
Query: 125 DNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLIS 184
+ DI+FNQL GL LCFL++VD +HLFK+SIIL+KAWC+YESRILG + GLIS
Sbjct: 144 RDVAFDISFNQLAGLGALCFLEQVDKAFGRDHLFKKSIILVKAWCFYESRILGANSGLIS 203
Query: 185 SYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAE 244
+YAL LVL I ++ S +GPL VLY+F+ ++ FDW N+C+++ GPVPIS LPD+T
Sbjct: 204 TYALAILVLNIVNMSYSSLSGPLAVLYKFINYYGSFDWKNYCVTVTGPVPISSLPDITET 263
Query: 245 PPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSK 304
+ + L + F C Y+ G E + F K++N++DPL+ +NNLGRSV+K
Sbjct: 264 GNHE----VFLDEKFFRECMELYSGETGVVEASRKYFPVKYYNILDPLKHSNNLGRSVTK 319
Query: 305 GNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
GN R+R F + L +L P E++ ++ +FF + +R+
Sbjct: 320 GNMVRLRNCFMLGVQKLRDVLTLPGENVGWKLEKFFNVSLERN 362
>gi|297736507|emb|CBI25378.3| unnamed protein product [Vitis vinifera]
Length = 893
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 150/295 (50%), Positives = 198/295 (67%), Gaps = 3/295 (1%)
Query: 54 VFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI 113
V FGS+PLKTYLPD DIDL A + +D +A V +LE E + +EFRV+++ YI
Sbjct: 210 VLPFGSMPLKTYLPDGDIDLTALCPENDEED-FARDVCTLLEGE-RQMGSEFRVEDISYI 267
Query: 114 QAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYES 173
+A+VKI+KC+V + VDI+FNQ GGL TLCFL+++D LI ++HLFKRS+ILIKAWCYYE
Sbjct: 268 RAKVKIVKCMVQDISVDISFNQTGGLSTLCFLEQIDILIGKDHLFKRSVILIKAWCYYEG 327
Query: 174 RILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPV 233
RILG H GL+S+YAL LVLY+ ++F S PL VLYRFL+++S FDW+ F +S+ GPV
Sbjct: 328 RILGSHCGLLSTYALEILVLYVINLFYSSLYCPLAVLYRFLDYYSTFDWEKFGVSVLGPV 387
Query: 234 PISLLPDVTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLR 293
IS L E LL+++ FL SC+ A+A E QPF+ KH N+ DPLR
Sbjct: 388 SISSLLTGAPEAAETADKPLLINEEFLWSCKEAFAVSIRASECTKQPFLVKHINIQDPLR 447
Query: 294 VNNNLGRSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNE-VNQFFMNTRDRH 347
NNLGRS+S GN +R R A + A+ L +L E NE + +FF NT DR+
Sbjct: 448 DYNNLGRSISLGNSYRFRYAISVGAQRLKEILLMLPEGRMNEGLKEFFNNTLDRN 502
>gi|357116041|ref|XP_003559793.1| PREDICTED: uncharacterized protein LOC100830879 [Brachypodium
distachyon]
Length = 899
Score = 271 bits (692), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 143/296 (48%), Positives = 190/296 (64%), Gaps = 2/296 (0%)
Query: 8 PGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLP 67
P + AE A ++ + P +E RR V + RRLI F CQV T+GSVPLKTYLP
Sbjct: 33 PEQMRVAEAAAAGVLRCLLPTEEAERRRRQVTDHARRLIGTNFGCQVLTYGSVPLKTYLP 92
Query: 68 DRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNF 127
D DID+ + + L T VR++L EEKN AEF ++ +Y+ A+VK+ KC + N
Sbjct: 93 DGDIDVTILTH-KPLDSTIIDDVRNLLNAEEKNTDAEFVLESRRYVDAQVKVFKCNIANI 151
Query: 128 VVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYA 187
VDI+FNQ+GG+ TLCFL+ VD + ++HLFKRSIILIKAWCY E+RI G L+S+YA
Sbjct: 152 DVDISFNQIGGVSTLCFLELVDTEVGKDHLFKRSIILIKAWCYNEARIQGSDQWLLSTYA 211
Query: 188 LVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPR 247
L L+LYIF++F+ S GP E LY FLE++SKFDW +C++L GPVP+S L + TAEP
Sbjct: 212 LEILILYIFNMFHNSLHGPFEALYMFLEYYSKFDWGKYCVTLDGPVPLSSLANFTAEPAV 271
Query: 248 KDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVS 303
+ LLL K L + P G + F K N+IDPL+ +NNLGRS+S
Sbjct: 272 AN-DELLLGKESLSASSDRLLVLPKGSDRHDPEFRPKILNIIDPLKGDNNLGRSIS 326
>gi|147780178|emb|CAN75522.1| hypothetical protein VITISV_043595 [Vitis vinifera]
Length = 733
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 153/330 (46%), Positives = 202/330 (61%), Gaps = 37/330 (11%)
Query: 53 QVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQY 112
+V FGS+PLKTYLPD DIDL A + +D +A V +LE E + +EFRV+++ Y
Sbjct: 11 EVLPFGSMPLKTYLPDGDIDLTALCPENDEED-FARDVCTLLEGE-RQMGSEFRVEDISY 68
Query: 113 IQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYE 172
I+A+VKI+KC+V + VDI+FNQ GGL TLCFL+++D LI ++HLFKRS+ILIKAWCYYE
Sbjct: 69 IRAKVKIVKCMVQDISVDISFNQTGGLSTLCFLEQIDILIGKDHLFKRSVILIKAWCYYE 128
Query: 173 SRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGP 232
RILG H GL+S+YAL LVLY+ ++F S PL VLYRFL+++S FDW+ F +S+ GP
Sbjct: 129 GRILGSHCGLLSTYALEILVLYVINLFYSSLYCPLAVLYRFLDYYSTFDWEKFGVSVLGP 188
Query: 233 VPISLL-------------------------------PDVT---AEPPRKDGGVLLLSKS 258
V IS L PD AE LL+++
Sbjct: 189 VSISSLLTGARESCLIMWLCLMVCFFRLIGLPFYLIFPDFVLFVAEAAETADKPLLINEE 248
Query: 259 FLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRA 318
FL SC+ A+A E QPF+ KH N+ DPLR NNLGRS+S GN +R R A + A
Sbjct: 249 FLWSCKEAFAVSIRASECTKQPFLVKHINIQDPLRDYNNLGRSISLGNSYRFRYAISVGA 308
Query: 319 KGLARLLDCPNEDLYNE-VNQFFMNTRDRH 347
+ L +L E NE + +FF NT DR+
Sbjct: 309 QRLKEILLMLPEGRMNEGLKEFFNNTLDRN 338
>gi|297745772|emb|CBI15828.3| unnamed protein product [Vitis vinifera]
Length = 929
Score = 256 bits (655), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 157/412 (38%), Positives = 222/412 (53%), Gaps = 76/412 (18%)
Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRD 70
W AE T E++A++QP S R V YV+RLI C C+VF +GSVPLKTYL D D
Sbjct: 40 WAAAERATQEIVAKMQPTLGSMRERQEVIDYVQRLIGCCLGCEVFPYGSVPLKTYLLDGD 99
Query: 71 IDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVD 130
IDL A +++ A V +L+ EE+NE+AEF VK++Q+I AEVK++KCLV + V+D
Sbjct: 100 IDLTALCS-SNVEEALASDVHAVLKGEEQNENAEFEVKDIQFITAEVKLVKCLVKDIVID 158
Query: 131 IAFNQLGGLCTLCFLDEVDHLIN-------ENHLFKRSIILI--------KAWCYYES-- 173
I+FNQLGGL TLCFL++ L+ + ++ + S++++ +C Y S
Sbjct: 159 ISFNQLGGLSTLCFLEQWFILLTSYGETQMKENIIEASLLVLWFLYWHIWSLYCIYPSFT 218
Query: 174 ----------------------------------RILGGHHGLISSYALV-------TLV 192
R++G H S L+ + +
Sbjct: 219 SVQNHKRENPWFHMYGVQFLCNYSFKPLLSVIVDRLIGKDHLFKRSIILIKSWCYYESRI 278
Query: 193 LYIFHVFNGSFAGPLEVLY-----------------RFLEFFSKFDWDNFCLSLWGPVPI 235
L H ++A + VLY RFL++FSKFDWDN+C+SL GPV
Sbjct: 279 LGAHHGLISTYALEILVLYIFHLFHLSLDGPLAVLYRFLDYFSKFDWDNYCISLNGPVCK 338
Query: 236 SLLPDVTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVN 295
S LPD+ AE P LLLS+ FL +C ++ G E + F KH N+IDPLR N
Sbjct: 339 SSLPDIVAELPENGQDDLLLSEEFLRNCVDMFSVPFRGLETNSRTFPLKHLNIIDPLREN 398
Query: 296 NNLGRSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
NNLGRSV+KGNF+RIR+AF + + L ++L P E + +E+ FF +T +RH
Sbjct: 399 NNLGRSVNKGNFYRIRSAFKYGSHKLGQILSLPREVIQDELKNFFASTLERH 450
>gi|255083767|ref|XP_002508458.1| predicted protein [Micromonas sp. RCC299]
gi|226523735|gb|ACO69716.1| predicted protein [Micromonas sp. RCC299]
Length = 1269
Score = 251 bits (641), Expect = 3e-64, Method: Composition-based stats.
Identities = 150/355 (42%), Positives = 201/355 (56%), Gaps = 28/355 (7%)
Query: 20 ELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQ---VFTFGSVPLKTYLPDRDIDLGAF 76
ELI ++P S+ RR V ++ L+ CF + V FGSVPL+TYLPD DID+
Sbjct: 30 ELIDVLRPTEQSDRRRRGVFRHIASLVDGCFAGENVLVTAFGSVPLRTYLPDGDIDVCLL 89
Query: 77 SDDQTL-KDTWAHLVRDMLEN----------EEKNEHAEFRVKEVQYIQAEVKIIKCLVD 125
+ L +D W +R +E E + AEF V E+ I AEVK++K + D
Sbjct: 90 GPHELLSRDDWTVRLRAHVERAEAAAAEASIELGSPVAEFAVSEIHIIHAEVKLMKLICD 149
Query: 126 NFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISS 185
VVD++ NQ GGL L FL+EV+ I + +FKRSI+LIKAW +YE R+LG HH LIS+
Sbjct: 150 GVVVDVSANQFGGLAALGFLEEVNAFIGKGEIFKRSIVLIKAWGFYEGRLLGAHHALIST 209
Query: 186 YALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVT--- 242
YAL TLVLYI + F+ + PLEVL++FL FF+ FDWD F +S+ GPVP+ L VT
Sbjct: 210 YALETLVLYILNRFHKELSTPLEVLHKFLVFFADFDWDKFAVSVHGPVPLEDLHKVTGPI 269
Query: 243 AEPPRKDGGVLLLSKSFLDSCRYAY------ADFPGGQENQGQPFVSKHFNVIDPLRVNN 296
+ P LL+ F+ Y A GG ++ +P K+ NV+DPL +N
Sbjct: 270 GKRPEVHAEGALLTPDFMWRMMDKYGNESVSAKLGGGADSTPRPMARKYLNVVDPLLSSN 329
Query: 297 NLGRSVSKGNFFRIRTAFTFRAKGLARLLDCPN-EDLYNEV---NQFFMNTRDRH 347
NLGRSVS+GN RIR A A+ L L + + + V QFF NT RH
Sbjct: 330 NLGRSVSQGNAKRIRKALALGAQRLTALRESSTGGECFGAVRMLEQFFGNTM-RH 383
>gi|145341816|ref|XP_001415999.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576222|gb|ABO94291.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 904
Score = 247 bits (630), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 135/319 (42%), Positives = 191/319 (59%), Gaps = 14/319 (4%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQ---VFTFGSVPLKTYLPDRDI 71
E +T EL+A ++P SE RR AV ++++L +CF V +GSVPL+ YLPD DI
Sbjct: 36 ETLTNELVASLRPTEMSEIRRRAVFEHIKQLAQECFGTAHTLVSAYGSVPLRAYLPDGDI 95
Query: 72 DLGAFSDDQTL-KDTWAHLVRDMLENEEKNEHA--EFRVKEVQYIQAEVKIIKCLVDNFV 128
D+ D + + K W R +E E EF V EV I AEV+++KC+VD +
Sbjct: 96 DVCLLGDHRVIDKAQWTTKFRKHIEKAEAEADPPHEFAVSEVSVINAEVRLMKCIVDGMM 155
Query: 129 VDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYAL 188
VD++ NQ GGL +L FL+E++ I + LF RSIIL+KAW +YE RILG HH LIS+YAL
Sbjct: 156 VDVSANQFGGLASLGFLEEMNAFIGRDDLFVRSIILVKAWGFYEGRILGAHHALISTYAL 215
Query: 189 VTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRK 248
TLVLYI + ++ PL VL++ L F++FDW+ + L++ GPV I + A PP +
Sbjct: 216 ETLVLYIINKYHADLTCPLSVLHKLLSVFAEFDWEGYALTIHGPVAIEGI----ATPPDE 271
Query: 249 --DGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGN 306
+GG L+++ F+ + Y+ P K+ N+IDPL NNNLGRSVS GN
Sbjct: 272 CLEGG--LITEEFMRTMLSTYSCEFMRAAASSAPVTVKYMNIIDPLLPNNNLGRSVSCGN 329
Query: 307 FFRIRTAFTFRAKGLARLL 325
+ R+R A A+ L L+
Sbjct: 330 YRRVRAALKLGAQRLDALM 348
>gi|308799699|ref|XP_003074630.1| DNA polymerase sigma (ISS) [Ostreococcus tauri]
gi|116000801|emb|CAL50481.1| DNA polymerase sigma (ISS) [Ostreococcus tauri]
Length = 875
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 135/317 (42%), Positives = 187/317 (58%), Gaps = 13/317 (4%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQ---VFTFGSVPLKTYLPDRDI 71
E +T EL+ ++P SE RR AV +++ L CF V +GSVPL+ YLPD DI
Sbjct: 32 ETLTNELVESLRPTAKSEMRRRAVFEHIKELAQGCFGTAHTLVSVYGSVPLRAYLPDGDI 91
Query: 72 DLGAFSDDQTL-KDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVD 130
D+ D + + K +W + +E E EF V EV I AEV+++KC+VD +VD
Sbjct: 92 DVCLLGDHRVIDKASWTTKFQKHIEKVEAESDFEFAVSEVSVINAEVRLMKCIVDGMMVD 151
Query: 131 IAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVT 190
++ NQ GGL +L FL+E + I + LF RSIIL+KAW +YE RILG HH LI++YAL T
Sbjct: 152 VSANQFGGLASLGFLEETNAFIGRDDLFVRSIILVKAWGFYEGRILGAHHALIATYALET 211
Query: 191 LVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPR-KD 249
LVLYI + + PL VL++ L F FDW+ + L++ GPV L D PP +
Sbjct: 212 LVLYIINKYYAELTCPLSVLHKLLRVFGDFDWEGYVLTIHGPV---ALEDANNIPPGCLE 268
Query: 250 GGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFR 309
GG LL++ F+ S Y + + P V K+ N+IDPL NNNLGRSVS GN+ R
Sbjct: 269 GG--LLTEEFMQSMLCQYGQI---ETSNSAPVVVKYMNIIDPLVPNNNLGRSVSCGNYRR 323
Query: 310 IRTAFTFRAKGLARLLD 326
+R A A+ L +L++
Sbjct: 324 VRAALRLGARHLDKLME 340
>gi|147817122|emb|CAN62161.1| hypothetical protein VITISV_017634 [Vitis vinifera]
Length = 1147
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 119/201 (59%), Positives = 149/201 (74%)
Query: 147 EVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGP 206
++D LI ++HLFKRSIILIKAWCYYESRILG HHGLIS+YAL TLVLYIFH+F+ GP
Sbjct: 405 KIDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSLLNGP 464
Query: 207 LEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVLLLSKSFLDSCRYA 266
L VLY+FL++FSKFDWDN+C+SL GPV IS LP++ AE P G LL L C
Sbjct: 465 LAVLYKFLDYFSKFDWDNYCVSLNGPVRISSLPEMIAETPENVGADPLLGNDXLRDCLDR 524
Query: 267 YADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKGLARLLD 326
++ G E + FV KHFN++DPL+ NNNLGRSVSKGNF+RIR+AFT+ A+ L R+L
Sbjct: 525 FSVPSRGLETNSRTFVQKHFNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGRILL 584
Query: 327 CPNEDLYNEVNQFFMNTRDRH 347
P + + E+ +FF NT +RH
Sbjct: 585 QPEDKISEELCKFFTNTLERH 605
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 66/125 (52%), Positives = 85/125 (68%), Gaps = 3/125 (2%)
Query: 54 VFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI 113
VF FGSVPLKTYLPD DIDL AF ++DT A+ V +LE E++N AEF VK+VQ I
Sbjct: 186 VFPFGSVPLKTYLPDGDIDLTAFGG-PAVEDTLAYEVYSVLEAEDQNRAAEFVVKDVQLI 244
Query: 114 QAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLIN--ENHLFKRSIILIKAWCYY 171
AEVK++KCLV N VVDI+FNQLGGLCTLCFL++ + + E KR + + +
Sbjct: 245 HAEVKLVKCLVQNIVVDISFNQLGGLCTLCFLEQQKAIWDGVEERFLKRLSLWKRQYISK 304
Query: 172 ESRIL 176
R++
Sbjct: 305 GXRLM 309
Score = 38.9 bits (89), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 18/44 (40%), Positives = 24/44 (54%)
Query: 10 RWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQ 53
+W +AE E+I +QP SEERR V YV+ LI C+
Sbjct: 38 QWARAENTVQEIICEVQPTEVSEERRKEVVDYVQGLIRVRVGCE 81
>gi|302125450|emb|CBI35537.3| unnamed protein product [Vitis vinifera]
Length = 398
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 125/207 (60%), Positives = 152/207 (73%), Gaps = 2/207 (0%)
Query: 45 LIIQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAE 104
LI C C+VF +GSVPLK YL D DIDL +++ A V +L+ E +NE+AE
Sbjct: 55 LIRCCLGCEVFPYGSVPLKIYLLDGDIDLTVLCSS-NVEEALASDVHAVLKGERQNENAE 113
Query: 105 FRVKEVQY-IQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSII 163
F VK VQ+ I EVK +KCLV + V+DI+FNQLGGL TLCFL +VD LI ++HLFKRSII
Sbjct: 114 FEVKNVQFNIIVEVKPVKCLVKDIVIDISFNQLGGLSTLCFLKQVDRLIGKDHLFKRSII 173
Query: 164 LIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWD 223
LIK+ CYYESRILG +HGLIS+YAL LVLYIFH+F+ S GPL V YRFL++FSKFDWD
Sbjct: 174 LIKSRCYYESRILGAYHGLISTYALEILVLYIFHLFHSSLDGPLAVGYRFLDYFSKFDWD 233
Query: 224 NFCLSLWGPVPISLLPDVTAEPPRKDG 250
N+C+SL G V S LPD+ AE P G
Sbjct: 234 NYCISLNGSVCKSSLPDIVAELPENGG 260
>gi|6572083|emb|CAB63026.1| putative protein [Arabidopsis thaliana]
Length = 764
Score = 240 bits (613), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 142/340 (41%), Positives = 190/340 (55%), Gaps = 62/340 (18%)
Query: 8 PGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLP 67
P W++ EE T E+I ++ P SE+RR V YV++LI C+V +FGSVPLKTYLP
Sbjct: 31 PELWMRVEEATREIIEQVHPTLVSEDRRRDVILYVQKLIRMTLGCEVHSFGSVPLKTYLP 90
Query: 68 DRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNF 127
D DIDL AF ++ A V +LE EE N ++F VK+VQ I+AEVK++KCLV N
Sbjct: 91 DGDIDLTAFGG-LYHEEELAAKVFAVLEREEHNLSSQFVVKDVQLIRAEVKLVKCLVQNI 149
Query: 128 VVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYA 187
VVDI+FNQ+GG +C L C+ E
Sbjct: 150 VVDISFNQIGG---ICTL-----------------------CFLE--------------- 168
Query: 188 LVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPR 247
+VLY+FL++FSKFDWD++C+SL GPV +S LPD+ E P
Sbjct: 169 --------------------KVLYKFLDYFSKFDWDSYCISLNGPVCLSSLPDIVVETPE 208
Query: 248 KDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNF 307
G LLL+ FL C Y+ G E + F SKH N++DPL+ NNLGRSVSKGNF
Sbjct: 209 NGGEDLLLTSEFLKECLEMYSVPSRGFETNPRGFQSKHLNIVDPLKETNNLGRSVSKGNF 268
Query: 308 FRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
+RIR+AFT+ A+ L +L +E + +E+ +FF N RH
Sbjct: 269 YRIRSAFTYGARKLGQLFLQSDEAISSELRKFFSNMLLRH 308
>gi|30693508|ref|NP_190730.2| NT domain of poly(A) polymerase and terminal uridylyl
transferase-containing protein [Arabidopsis thaliana]
gi|332645292|gb|AEE78813.1| NT domain of poly(A) polymerase and terminal uridylyl
transferase-containing protein [Arabidopsis thaliana]
Length = 755
Score = 240 bits (613), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 142/340 (41%), Positives = 190/340 (55%), Gaps = 62/340 (18%)
Query: 8 PGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLP 67
P W++ EE T E+I ++ P SE+RR V YV++LI C+V +FGSVPLKTYLP
Sbjct: 31 PELWMRVEEATREIIEQVHPTLVSEDRRRDVILYVQKLIRMTLGCEVHSFGSVPLKTYLP 90
Query: 68 DRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNF 127
D DIDL AF ++ A V +LE EE N ++F VK+VQ I+AEVK++KCLV N
Sbjct: 91 DGDIDLTAFGG-LYHEEELAAKVFAVLEREEHNLSSQFVVKDVQLIRAEVKLVKCLVQNI 149
Query: 128 VVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYA 187
VVDI+FNQ+GG +C L C+ E
Sbjct: 150 VVDISFNQIGG---ICTL-----------------------CFLE--------------- 168
Query: 188 LVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPR 247
+VLY+FL++FSKFDWD++C+SL GPV +S LPD+ E P
Sbjct: 169 --------------------KVLYKFLDYFSKFDWDSYCISLNGPVCLSSLPDIVVETPE 208
Query: 248 KDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNF 307
G LLL+ FL C Y+ G E + F SKH N++DPL+ NNLGRSVSKGNF
Sbjct: 209 NGGEDLLLTSEFLKECLEMYSVPSRGFETNPRGFQSKHLNIVDPLKETNNLGRSVSKGNF 268
Query: 308 FRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
+RIR+AFT+ A+ L +L +E + +E+ +FF N RH
Sbjct: 269 YRIRSAFTYGARKLGQLFLQSDEAISSELRKFFSNMLLRH 308
>gi|168035607|ref|XP_001770301.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162678518|gb|EDQ64976.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1631
Score = 239 bits (609), Expect = 2e-60, Method: Composition-based stats.
Identities = 113/215 (52%), Positives = 149/215 (69%), Gaps = 16/215 (7%)
Query: 145 LDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGS-- 202
+D D + +NHLFKRS+IL+KAWCYYESRILG HHGLIS+YAL TLVLYIFHVF+
Sbjct: 255 IDRNDFELKQNHLFKRSVILVKAWCYYESRILGAHHGLISTYALETLVLYIFHVFHPKRR 314
Query: 203 FAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVT-------------AEPPRKD 249
GPLEVLY FL +F FDWD +C+++WGPVP++ + +++ AE PRKD
Sbjct: 315 LRGPLEVLYLFLVYFCNFDWDKYCVTMWGPVPLARITEISSGSARKTFRISDFAEAPRKD 374
Query: 250 GGVLLLSKSFLDSCRYAYADFPGGQE-NQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFF 308
G LLLSK FL+ C +Y+D GGQE +Q + F++K NV+DP+R NNLGRSV+ G+F
Sbjct: 375 RGKLLLSKEFLERCIDSYSDAKGGQESSQRRNFITKFLNVLDPIRDTNNLGRSVNVGSFK 434
Query: 309 RIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNT 343
RIR+AF A+ L +L+CP + + + FF T
Sbjct: 435 RIRSAFGLGARTLGEVLECPTDQINEKFKSFFSCT 469
Score = 174 bits (442), Expect = 5e-41, Method: Composition-based stats.
Identities = 88/142 (61%), Positives = 105/142 (73%), Gaps = 1/142 (0%)
Query: 6 LDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTY 65
L+ G W + E TAELI I+P SEERR AV A+V+RLI F C+V FGSVPLKTY
Sbjct: 31 LEDGWWSRVEGHTAELIDSIKPTRSSEERRTAVTAFVQRLIRDRFDCKVVKFGSVPLKTY 90
Query: 66 LPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVD 125
LPD DIDL F+ + LK+TWA V L+ E++ +AEFRVKEVQYIQAEVK+IKCLV+
Sbjct: 91 LPDGDIDLTIFARND-LKETWAQDVVKALKQAEEDTNAEFRVKEVQYIQAEVKLIKCLVE 149
Query: 126 NFVVDIAFNQLGGLCTLCFLDE 147
N VVDI+FNQ GGL T CFL+E
Sbjct: 150 NIVVDISFNQTGGLSTFCFLEE 171
>gi|290976573|ref|XP_002671014.1| predicted protein [Naegleria gruberi]
gi|284084579|gb|EFC38270.1| predicted protein [Naegleria gruberi]
Length = 763
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 135/376 (35%), Positives = 200/376 (53%), Gaps = 66/376 (17%)
Query: 4 RPLDPGR--WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVP 61
P+D + + + +L+ RIQP SE+ R V + +++ + + +GSV
Sbjct: 153 EPIDESTSCFRRCNSLIQQLLYRIQPSSESEKHRKEVFDIIA-AVLELANLKTYLYGSVA 211
Query: 62 LKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEE---------------KNEHAEFR 106
KTYLPD DIDL F ++ + + V ++L ++ KN H +
Sbjct: 212 FKTYLPDGDIDLSVFVSNEEYLELSSQNVNNLLSHQPQVNDSTISYVHNVLLKNMHIGLK 271
Query: 107 ----------------------------VKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGG 138
++++ +I AEVK+IKC V+N +D++ Q+GG
Sbjct: 272 QQLADPSIPWYNKARSLFSEIQRNNLAYIEDMTFINAEVKLIKCTVNNIPIDMSSGQIGG 331
Query: 139 LCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHV 198
L TLCFL EVD I +NHLFKRSIIL+K+W YYESRILG HHGL+S+Y L L++Y+F +
Sbjct: 332 LSTLCFLHEVDDKIADNHLFKRSIILMKSWSYYESRILGSHHGLVSTYGLTVLLMYMFRL 391
Query: 199 FNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTA---------EPPRKD 249
+ PL+ LYRFL ++S FDW NF +S++GP+P+ + D + P R D
Sbjct: 392 Y--KIETPLQALYRFLNYYSTFDWTNFGISIYGPIPLGAINDHKSIEDFYYENLPPERHD 449
Query: 250 GGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFR 309
L+ SFL SC+ Y G + + F K+ N++DPLR NNLGRSV+ NF R
Sbjct: 450 S----LTSSFLQSCKSKY-----GTVDSSKTFTIKNLNIVDPLRDFNNLGRSVNYNNFLR 500
Query: 310 IRTAFTFRAKGLARLL 325
IR A +K + +L
Sbjct: 501 IRRAIKKGSKTITDIL 516
>gi|110738268|dbj|BAF01063.1| hypothetical protein [Arabidopsis thaliana]
Length = 660
Score = 237 bits (605), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 117/200 (58%), Positives = 147/200 (73%)
Query: 148 VDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPL 207
+DHLI ++HLFKRSIILIKAWCYYESRILG HGLIS+YAL TLVLYIFH+F+ S GPL
Sbjct: 1 IDHLIGKDHLFKRSIILIKAWCYYESRILGAFHGLISTYALETLVLYIFHLFHSSLNGPL 60
Query: 208 EVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVLLLSKSFLDSCRYAY 267
VLY+FL++FSKFDWD++C+SL GPV +S LPD+ E P G LLL+ FL C Y
Sbjct: 61 AVLYKFLDYFSKFDWDSYCISLNGPVCLSSLPDIVVETPENGGEDLLLTSEFLKECLEMY 120
Query: 268 ADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKGLARLLDC 327
+ G E + F SKH N++DPL+ NNLGRSVSKGNF+RIR+AFT+ A+ L +L
Sbjct: 121 SVPSRGFETNPRGFQSKHLNIVDPLKETNNLGRSVSKGNFYRIRSAFTYGARKLGQLFLQ 180
Query: 328 PNEDLYNEVNQFFMNTRDRH 347
+E + +E+ +FF N RH
Sbjct: 181 SDEAISSELRKFFSNMLLRH 200
>gi|307104056|gb|EFN52312.1| hypothetical protein CHLNCDRAFT_58914 [Chlorella variabilis]
Length = 740
Score = 227 bits (578), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 136/329 (41%), Positives = 187/329 (56%), Gaps = 40/329 (12%)
Query: 23 ARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSDDQT- 81
A ++ + E+R+ AVA L+ +C + F FGSVPL+ LPD DID+ F+ T
Sbjct: 367 ASLEVEQLLEQRQAAVA-----LVQECLQVEAFMFGSVPLRAVLPDGDIDISFFATAATT 421
Query: 82 -----------------------LKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVK 118
L+DTWA + LE E A F++++VQ IQAEVK
Sbjct: 422 PSSPSGNGGEQPGHRAGASPPGDLRDTWASQLLRALEREAVRPDAPFKIRDVQIIQAEVK 481
Query: 119 IIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGG 178
++KC+V + VVD++F+ +GGLCT+ FL+ D I HLFKRSI+L+KAWCYYESR+LG
Sbjct: 482 LVKCVVHDVVVDVSFDTVGGLCTVAFLEAADRRIGRQHLFKRSILLLKAWCYYESRLLGA 541
Query: 179 HHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLL 238
HHGLISSYAL LVLYIF++ + PL+VL RFL FDW+ +CL+L GP+PI+ L
Sbjct: 542 HHGLISSYALEVLVLYIFNLHHAELHTPLDVLRRFLAVLGSFDWERYCLALQGPLPIADL 601
Query: 239 PDVTAEPPR--KDGGVLLLSKSFLDSCRYAYA----DFPGGQENQGQPFVS-----KHFN 287
+ + G LL F+ Y+ QE G V+ KH N
Sbjct: 602 HKLHVDRTALVSSGTEPLLDADFMRGVLQHYSVQHLSQQQQQEAAGMQLVAPRFPLKHLN 661
Query: 288 VIDPLRVNNNLGRSVSKGNFFRIRTAFTF 316
++DPL +NNLGRSVSK ++ R++ A
Sbjct: 662 IVDPLLPSNNLGRSVSKASYARVKKALAL 690
>gi|428171015|gb|EKX39935.1| hypothetical protein GUITHDRAFT_113927 [Guillardia theta CCMP2712]
Length = 632
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 129/337 (38%), Positives = 191/337 (56%), Gaps = 31/337 (9%)
Query: 16 EITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQ------VFTFGSVPLKTYLPDR 69
E E++ ++QP +E R V YV++LI + V FGSVPLKTYLP
Sbjct: 17 EQADEIVRQLQPHRRAERHRLTVFEYVKKLIKHVADEENKTEIYVHRFGSVPLKTYLPHG 76
Query: 70 DIDLGAFSDDQTLKDTWAHLVRDMLENEEKNE--------HA---EFRVKEVQYIQAE-- 116
D+D+ AF+ + D W ++ LE+E K H+ + R + + + +
Sbjct: 77 DLDVTAFAAN----DLWLERLKAKLEDEAKKNDMYVVSGVHSVPRDLRAQSREELGKKDQ 132
Query: 117 -----VKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYY 171
VK++KC V+ VDI N LGG+C LCFL++VD ++ +HLFKR+ IL+K+WCY+
Sbjct: 133 GPVEIVKVVKCQVNGISVDITANALGGMCNLCFLEKVDTMLKRDHLFKRATILVKSWCYF 192
Query: 172 ESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWG 231
ES IL +GL+S+YAL TLVL I ++F+ PL+VL RFLE+++ FDW N CL++ G
Sbjct: 193 ESHILSSQNGLLSTYALETLVLCIVNIFHEELQTPLDVLKRFLEYYANFDWRNHCLTMRG 252
Query: 232 PVPISLLPDVTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQ-ENQGQPFVSKHFNVID 290
PV S +P P + LL+ + L + G Q +N+G F K+ N+ D
Sbjct: 253 PVNRSNIPPGGEVPHLDNEPSYLLNDAILQEDSHLQFLMSGLQDDNRG--FQWKYMNICD 310
Query: 291 PLRVNNNLGRSVSKGNFFRIRTAFTFRAKGLARLLDC 327
PL NN+GRSVS+ + +RI +AF + L+ LL C
Sbjct: 311 PLSTRNNIGRSVSRSSAYRIASAFRHGWQSLSGLLYC 347
>gi|414866688|tpg|DAA45245.1| TPA: hypothetical protein ZEAMMB73_273182, partial [Zea mays]
Length = 260
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 106/194 (54%), Positives = 138/194 (71%), Gaps = 1/194 (0%)
Query: 24 RIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
R++P SE RR V Y RRL+ C+VF FGSVPLKTYLPD DIDL + +
Sbjct: 37 RVRPTEASERRRAEVVDYARRLVGSALGCEVFAFGSVPLKTYLPDGDIDLTVLGN-TSYD 95
Query: 84 DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLC 143
T + V +LE+EE+N AEF VK+++ I AEV++IKC + N +VDI+FNQ GG+C LC
Sbjct: 96 STLVNDVFCILESEEQNSDAEFVVKDLERIDAEVRLIKCTIGNIIVDISFNQTGGICALC 155
Query: 144 FLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSF 203
FL+ VD + +NHLFKRSIILIKAWCYYESR+LG HHGLIS+YAL L+LY+F++F+ S
Sbjct: 156 FLELVDRKVGKNHLFKRSIILIKAWCYYESRLLGAHHGLISTYALEVLILYVFNLFHKSL 215
Query: 204 AGPLEVLYRFLEFF 217
P+EV + +F
Sbjct: 216 HSPVEVCLKRFTYF 229
>gi|297612542|ref|NP_001065982.2| Os12g0114200 [Oryza sativa Japonica Group]
gi|255669984|dbj|BAF29001.2| Os12g0114200, partial [Oryza sativa Japonica Group]
Length = 178
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 111/168 (66%), Positives = 130/168 (77%), Gaps = 6/168 (3%)
Query: 53 QVFTFGSVPLKTYLPDRDIDLGAF--SDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEV 110
QVF FGSVPLKTYLPD DIDL AF S D+ L A V+ +LE+EE + AEF VK+V
Sbjct: 1 QVFPFGSVPLKTYLPDGDIDLTAFGHSSDEIL----AKQVQAVLESEEARKDAEFEVKDV 56
Query: 111 QYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCY 170
QYI AEVK++KC+V N +VDI+FNQ GGLCTLCFL++VD +NHLFKRSI+LIKAWCY
Sbjct: 57 QYIHAEVKLVKCIVQNIIVDISFNQFGGLCTLCFLEKVDQKFEKNHLFKRSIMLIKAWCY 116
Query: 171 YESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFS 218
YESRILG HHGLIS+YAL LVLYIFH+F+G+ GPL V F F S
Sbjct: 117 YESRILGAHHGLISTYALEILVLYIFHLFHGTLDGPLAVSSDFQLFCS 164
>gi|77553482|gb|ABA96278.1| nucleotidyltransferase family protein, putative, expressed [Oryza
sativa Japonica Group]
gi|215769169|dbj|BAH01398.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 622
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 101/185 (54%), Positives = 129/185 (69%)
Query: 163 ILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDW 222
+LIKAWCYYESRILG HHGLIS+YAL LVLYIFH+F+G+ GPL VLYRFL+++SKFDW
Sbjct: 1 MLIKAWCYYESRILGAHHGLISTYALEILVLYIFHLFHGTLDGPLAVLYRFLDYYSKFDW 60
Query: 223 DNFCLSLWGPVPISLLPDVTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFV 282
DN +SL+GP+ +S LP++ + P + + FL C + P E Q F
Sbjct: 61 DNKGISLYGPISLSSLPELVTDSPDTVNDDFTMREDFLKECAQWFTVLPRNSEKNTQVFP 120
Query: 283 SKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMN 342
K FN++DPL+ +NNLGRSVSKGNF RIR+AF F A+ L ++L P+ +EVNQFF N
Sbjct: 121 RKFFNIVDPLKQSNNLGRSVSKGNFLRIRSAFDFGARKLGKILQVPDNFTVDEVNQFFRN 180
Query: 343 TRDRH 347
T RH
Sbjct: 181 TLKRH 185
>gi|226506494|ref|NP_001141604.1| uncharacterized protein LOC100273722 [Zea mays]
gi|194705246|gb|ACF86707.1| unknown [Zea mays]
gi|413924676|gb|AFW64608.1| hypothetical protein ZEAMMB73_859338 [Zea mays]
Length = 251
Score = 191 bits (486), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 99/186 (53%), Positives = 125/186 (67%), Gaps = 4/186 (2%)
Query: 163 ILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDW 222
+LIK WCYYES ILG GL+S+YAL TLVLYIFHVF+ S GPL VLYRFL+++SKFDW
Sbjct: 1 MLIKHWCYYESCILGAQRGLVSTYALETLVLYIFHVFHKSLDGPLAVLYRFLDYYSKFDW 60
Query: 223 DNFCLSLWGPVPISLLPDVTAEPP--RKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQP 280
DN +SL+GP+ +S LP++ EPP R DG L ++FL C A++ P E Q
Sbjct: 61 DNKGISLFGPISLSSLPELVTEPPYTRDDG--FLSREAFLKDCAKAFSVPPINSEENPQV 118
Query: 281 FVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFF 340
F K N++DPL+ +NNLGRS+SKGN RIR F F A L ++L P NE+N+FF
Sbjct: 119 FSKKFVNIVDPLKQSNNLGRSISKGNLGRIRKEFYFGACKLGKILQAPACFSANEINRFF 178
Query: 341 MNTRDR 346
NT R
Sbjct: 179 RNTLSR 184
>gi|218192781|gb|EEC75208.1| hypothetical protein OsI_11468 [Oryza sativa Indica Group]
Length = 860
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 103/193 (53%), Positives = 124/193 (64%), Gaps = 38/193 (19%)
Query: 54 VFTFGSVPLKTYLPDRDIDL---GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEV 110
VF +GSVPLKTYLPD D+DL G S TL D H+ L++EE+N AEF VK++
Sbjct: 20 VFAYGSVPLKTYLPDGDVDLTVLGNTSYGSTLIDDIYHI----LQSEEQNCDAEFEVKDL 75
Query: 111 QYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCY 170
Q I AEV D + +NHL K SIILIKAWCY
Sbjct: 76 QLINAEV-------------------------------DRKVGKNHLVKNSIILIKAWCY 104
Query: 171 YESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLW 230
YESR+LG HHGLIS+YAL TL+LYIF++F+ S GPLEVLYRFLE+FSKFDWDN+C+SL
Sbjct: 105 YESRLLGAHHGLISTYALETLILYIFNLFHKSLHGPLEVLYRFLEYFSKFDWDNYCISLN 164
Query: 231 GPVPISLLPDVTA 243
GPV +S LP+ A
Sbjct: 165 GPVALSSLPNQIA 177
>gi|303287038|ref|XP_003062808.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226455444|gb|EEH52747.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 781
Score = 177 bits (448), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 99/229 (43%), Positives = 135/229 (58%), Gaps = 13/229 (5%)
Query: 120 IKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGH 179
+KC+ D VVDI+ NQ GGL TL FL+EVD I + +FKRSIILIKAW +YE R+LG H
Sbjct: 1 MKCIADGVVVDISANQFGGLATLGFLEEVDAFIARDGIFKRSIILIKAWGFYEGRVLGAH 60
Query: 180 HGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLP 239
H LIS+YAL TLVLY+ + ++ + PLEVL++FL +F+ F+WD + +S+ GPV + L
Sbjct: 61 HALISTYALETLVLYVLNAYHEELSTPLEVLHKFLTYFADFEWDAYAVSIHGPVRLDALE 120
Query: 240 DVTAEPPRKDGGVLL---LSKSFLDSCRYAYADFPGGQENQG--------QPFVSKHFNV 288
+ G LL +K LD +Y ++ Q + KH NV
Sbjct: 121 KGVRDADAPARGPLLTPAFTKRVLD--KYGNDAIINAEKGQAGPGGGGNRRAMQPKHLNV 178
Query: 289 IDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVN 337
IDPL +NNLGRSVS+GN RI+ A A L L + + + E+N
Sbjct: 179 IDPLLPSNNLGRSVSQGNAKRIQKALRLGAAKLTSLRNAMRDGVSCELN 227
>gi|401410712|ref|XP_003884804.1| hypothetical protein NCLIV_052020 [Neospora caninum Liverpool]
gi|325119222|emb|CBZ54776.1| hypothetical protein NCLIV_052020 [Neospora caninum Liverpool]
Length = 3449
Score = 172 bits (437), Expect = 2e-40, Method: Composition-based stats.
Identities = 77/206 (37%), Positives = 125/206 (60%), Gaps = 9/206 (4%)
Query: 54 VFTFGSVPLKTYLPDRDIDLGAFS--------DDQTLKDTWAHLVRDMLENEEKNEHAEF 105
V+ +GS PL+T+LPD D+D+G S + + D ++ D + E+ H F
Sbjct: 358 VYRYGSFPLRTFLPDGDLDIGIISYNRRTGVVEGEEESDALLAVLLDKFQREDVKTHKTF 417
Query: 106 RVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILI 165
++E + AEV+I+KC+V VD++ N++GG C+L FL+ D I +HLFKRS++LI
Sbjct: 418 PLREASLVDAEVRILKCIVSGIAVDVSVNKVGGCCSLVFLELADRRIGRHHLFKRSVLLI 477
Query: 166 KAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGS-FAGPLEVLYRFLEFFSKFDWDN 224
K+W YES +LG GL+++Y + LVL++FHV S PL +LY F ++S F WD
Sbjct: 478 KSWFAYESHLLGSRSGLLATYCVEALVLHLFHVLPASLLPTPLHLLYHFFSYYSSFHWDR 537
Query: 225 FCLSLWGPVPISLLPDVTAEPPRKDG 250
+ ++ GP+P++ + ++ P R+ G
Sbjct: 538 YAVTACGPLPLTFITRASSVPDRRGG 563
Score = 41.2 bits (95), Expect = 0.67, Method: Composition-based stats.
Identities = 19/46 (41%), Positives = 27/46 (58%)
Query: 280 PFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKGLARLL 325
PF+ + NV+DPL NNL RSVS+ F+R+ A + L +L
Sbjct: 889 PFLFRSMNVVDPLHNGNNLARSVSETAFYRLLHAMKKGLQALTHIL 934
>gi|221502484|gb|EEE28211.1| conserved hypothetical protein [Toxoplasma gondii VEG]
Length = 3297
Score = 169 bits (427), Expect = 2e-39, Method: Composition-based stats.
Identities = 77/213 (36%), Positives = 127/213 (59%), Gaps = 9/213 (4%)
Query: 54 VFTFGSVPLKTYLPDRDIDLGAFS--------DDQTLKDTWAHLVRDMLENEEKNEHAEF 105
V+ +GS PL+T+LPD D+D+G S + + D ++ + + E H F
Sbjct: 224 VYRYGSFPLRTFLPDGDLDIGVISFNRRTGVLEGEEESDALLAVLLEKFQRAEVKSHKTF 283
Query: 106 RVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILI 165
++E + AEV+I+KC+V VD++ N++GG C+L FL+ D I NHLFKRS++LI
Sbjct: 284 PLREASLVDAEVRILKCIVSGIAVDVSVNKVGGCCSLVFLELADRRIGRNHLFKRSVLLI 343
Query: 166 KAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGS-FAGPLEVLYRFLEFFSKFDWDN 224
K+W YES +LG GL+++Y + LVL++FHVF + PL +LY+F ++S F WD
Sbjct: 344 KSWFAYESHLLGSRSGLLATYCVEALVLHLFHVFPAALLPTPLHLLYQFFSYYSSFHWDR 403
Query: 225 FCLSLWGPVPISLLPDVTAEPPRKDGGVLLLSK 257
+ ++ G +P++ + ++ R+ G L S+
Sbjct: 404 YAVTACGALPLTFITRTSSVQDRRGGSAPLPSR 436
Score = 41.6 bits (96), Expect = 0.57, Method: Composition-based stats.
Identities = 19/46 (41%), Positives = 28/46 (60%)
Query: 280 PFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKGLARLL 325
PF+ + NV+DPL NNL RSVS+ F+R+ A + L ++L
Sbjct: 738 PFLFRSMNVVDPLHNGNNLARSVSETAFYRLLHAMKKGLQALTQVL 783
>gi|221482136|gb|EEE20497.1| conserved hypothetical protein [Toxoplasma gondii GT1]
Length = 3441
Score = 169 bits (427), Expect = 2e-39, Method: Composition-based stats.
Identities = 77/213 (36%), Positives = 127/213 (59%), Gaps = 9/213 (4%)
Query: 54 VFTFGSVPLKTYLPDRDIDLGAFS--------DDQTLKDTWAHLVRDMLENEEKNEHAEF 105
V+ +GS PL+T+LPD D+D+G S + + D ++ + + E H F
Sbjct: 369 VYRYGSFPLRTFLPDGDLDIGVISFNRRTGVLEGEEESDALLAVLLEKFQRAEVKSHKTF 428
Query: 106 RVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILI 165
++E + AEV+I+KC+V VD++ N++GG C+L FL+ D I NHLFKRS++LI
Sbjct: 429 PLREASLVDAEVRILKCIVSGIAVDVSVNKVGGCCSLVFLELADRRIGRNHLFKRSVLLI 488
Query: 166 KAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGS-FAGPLEVLYRFLEFFSKFDWDN 224
K+W YES +LG GL+++Y + LVL++FHVF + PL +LY+F ++S F WD
Sbjct: 489 KSWFAYESHLLGSRSGLLATYCVEALVLHLFHVFPAALLPTPLHLLYQFFSYYSSFHWDR 548
Query: 225 FCLSLWGPVPISLLPDVTAEPPRKDGGVLLLSK 257
+ ++ G +P++ + ++ R+ G L S+
Sbjct: 549 YAVTACGALPLTFITRTSSVQDRRGGSAPLPSR 581
Score = 41.6 bits (96), Expect = 0.57, Method: Composition-based stats.
Identities = 19/46 (41%), Positives = 28/46 (60%)
Query: 280 PFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKGLARLL 325
PF+ + NV+DPL NNL RSVS+ F+R+ A + L ++L
Sbjct: 883 PFLFRSMNVVDPLHNGNNLARSVSETAFYRLLHAMKKGLQALTQVL 928
>gi|237843045|ref|XP_002370820.1| hypothetical protein TGME49_014990 [Toxoplasma gondii ME49]
gi|211968484|gb|EEB03680.1| hypothetical protein TGME49_014990 [Toxoplasma gondii ME49]
Length = 3436
Score = 169 bits (427), Expect = 2e-39, Method: Composition-based stats.
Identities = 77/213 (36%), Positives = 127/213 (59%), Gaps = 9/213 (4%)
Query: 54 VFTFGSVPLKTYLPDRDIDLGAFS--------DDQTLKDTWAHLVRDMLENEEKNEHAEF 105
V+ +GS PL+T+LPD D+D+G S + + D ++ + + E H F
Sbjct: 363 VYRYGSFPLRTFLPDGDLDIGVISFNRRTGVLEGEEESDALLAVLLEKFQRAEVKSHKTF 422
Query: 106 RVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILI 165
++E + AEV+I+KC+V VD++ N++GG C+L FL+ D I NHLFKRS++LI
Sbjct: 423 PLREASLVDAEVRILKCIVSGIAVDVSVNKVGGCCSLVFLELADRRIGRNHLFKRSVLLI 482
Query: 166 KAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGS-FAGPLEVLYRFLEFFSKFDWDN 224
K+W YES +LG GL+++Y + LVL++FHVF + PL +LY+F ++S F WD
Sbjct: 483 KSWFAYESHLLGSRSGLLATYCVEALVLHLFHVFPAALLPTPLHLLYQFFSYYSSFHWDR 542
Query: 225 FCLSLWGPVPISLLPDVTAEPPRKDGGVLLLSK 257
+ ++ G +P++ + ++ R+ G L S+
Sbjct: 543 YAVTACGALPLTFITRTSSVQDRRGGSAPLPSR 575
Score = 41.6 bits (96), Expect = 0.57, Method: Composition-based stats.
Identities = 19/46 (41%), Positives = 28/46 (60%)
Query: 280 PFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKGLARLL 325
PF+ + NV+DPL NNL RSVS+ F+R+ A + L ++L
Sbjct: 877 PFLFRSMNVVDPLHNGNNLARSVSETAFYRLLHAMKKGLQALTQVL 922
>gi|222624434|gb|EEE58566.1| hypothetical protein OsJ_09878 [Oryza sativa Japonica Group]
Length = 1064
Score = 167 bits (423), Expect = 7e-39, Method: Composition-based stats.
Identities = 81/106 (76%), Positives = 86/106 (81%), Gaps = 1/106 (0%)
Query: 243 AEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQ-PFVSKHFNVIDPLRVNNNLGRS 301
AEPPR D LLLSKSFLD C YAYA P QE+QGQ PFVSKHFNVIDPLR NNNLGRS
Sbjct: 3 AEPPRMDAAELLLSKSFLDKCSYAYAVTPRIQESQGQQPFVSKHFNVIDPLRTNNNLGRS 62
Query: 302 VSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
VSKGNFFRIR+AF+F AK LA+LL+CP EDL EVNQFF NT RH
Sbjct: 63 VSKGNFFRIRSAFSFGAKRLAKLLECPKEDLIAEVNQFFTNTWIRH 108
>gi|412992209|emb|CCO19922.1| predicted protein [Bathycoccus prasinos]
Length = 1318
Score = 165 bits (418), Expect = 3e-38, Method: Composition-based stats.
Identities = 101/286 (35%), Positives = 145/286 (50%), Gaps = 50/286 (17%)
Query: 105 FRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIIL 164
VK++ I A+V+++KC+VD VVD++ NQ GGL TL FL EV+ I +N LFKRS+IL
Sbjct: 289 LEVKDIVVIHADVRLLKCVVDGIVVDVSANQFGGLATLAFLKEVNSKIGKNDLFKRSVIL 348
Query: 165 IKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNG-------------SFAGPLEVLY 211
+KAW +YESRILG + L+S+YAL TL++ FN A PL+VL
Sbjct: 349 VKAWAFYESRILGAPYALLSTYALKTLIICALRRFNKKESKSDATKTKKREIATPLDVLR 408
Query: 212 RFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEP-----------------------PRK 248
F E+ S F W+ ++++G VP+ L V+ +
Sbjct: 409 IFFEYVSDFPWETHAVTIFGDVPVEKLDKVSVREFSSSSKSEKNKNKNNDDEREEKDDEE 468
Query: 249 DGGVLLLSKSFLDSCRYAYADFPGGQEN--------QGQPFV-----SKHFNVIDPLRVN 295
LL +F+D+ +Y N + PF +KH +++DPL
Sbjct: 469 AEEDPLLDDTFVDTILKSYGPDSRPDANVLLNIGNGKKAPFRRRAIGAKHLHILDPLSET 528
Query: 296 NNLGRSVSKGNFFRIRTAFTFRAKGLARL-LDCPNEDLYNEVNQFF 340
NNLGRSVS GNF R+R AF A+ L RL ++ E++ FF
Sbjct: 529 NNLGRSVSLGNFARVRAAFRLGAERLKRLEMESEPENITRGFEYFF 574
Score = 55.1 bits (131), Expect = 5e-05, Method: Composition-based stats.
Identities = 33/87 (37%), Positives = 43/87 (49%), Gaps = 15/87 (17%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQ--------------VFTFGSV 60
E +T ELIA ++P SE+RR V + LI +CF + V FGSV
Sbjct: 129 EALTEELIASLRPSKQSEKRRRMVFRKMESLIRECFEKEFEGEGVNEKKNTIVVSAFGSV 188
Query: 61 PLKTYLPDRDIDLGAFSDDQTL-KDTW 86
P TYLPD DID+ D + L +W
Sbjct: 189 PFGTYLPDGDIDVCILGDHEVLDSQSW 215
>gi|224064218|ref|XP_002301405.1| predicted protein [Populus trichocarpa]
gi|222843131|gb|EEE80678.1| predicted protein [Populus trichocarpa]
Length = 141
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 94/116 (81%), Positives = 99/116 (85%)
Query: 1 SVIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSV 60
SVIR LD RW KAEE TAELIA IQP+ SEE RNAVA YV+RLI +CFPCQVFTFGSV
Sbjct: 26 SVIRVLDSERWSKAEERTAELIACIQPNQPSEELRNAVADYVQRLIAKCFPCQVFTFGSV 85
Query: 61 PLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAE 116
PLKTYLPD DIDL AFS + LKDTWAH VRDMLENEEKNE+AEFRVKEVQYIQAE
Sbjct: 86 PLKTYLPDGDIDLTAFSKNPNLKDTWAHQVRDMLENEEKNENAEFRVKEVQYIQAE 141
>gi|297600524|ref|NP_001049344.2| Os03g0210800 [Oryza sativa Japonica Group]
gi|255674303|dbj|BAF11258.2| Os03g0210800 [Oryza sativa Japonica Group]
Length = 871
Score = 161 bits (407), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 82/108 (75%), Positives = 88/108 (81%), Gaps = 1/108 (0%)
Query: 241 VTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQ-PFVSKHFNVIDPLRVNNNLG 299
+TAEPPR D LLLSKSFLD C YAYA P QE+QGQ PFVSKHFNVIDPLR NNNLG
Sbjct: 1 MTAEPPRMDAAELLLSKSFLDKCSYAYAVTPRIQESQGQQPFVSKHFNVIDPLRTNNNLG 60
Query: 300 RSVSKGNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
RSVSKGNFFRIR+AF+F AK LA+LL+CP EDL EVNQFF NT RH
Sbjct: 61 RSVSKGNFFRIRSAFSFGAKRLAKLLECPKEDLIAEVNQFFTNTWIRH 108
>gi|224127915|ref|XP_002320195.1| predicted protein [Populus trichocarpa]
gi|222860968|gb|EEE98510.1| predicted protein [Populus trichocarpa]
Length = 145
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 93/117 (79%), Positives = 99/117 (84%)
Query: 1 SVIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSV 60
SVIR LD RW KAEE TAELI IQP+ SEE RNAVA YV+RLI++CFPCQVFTFGSV
Sbjct: 26 SVIRVLDLDRWSKAEERTAELIDCIQPNQPSEELRNAVADYVQRLILKCFPCQVFTFGSV 85
Query: 61 PLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEV 117
PLKTYLPD DIDL AFS + LKDTWAH VRDMLENEEKNE+AEFRVKEVQYIQAE
Sbjct: 86 PLKTYLPDGDIDLTAFSKNPNLKDTWAHQVRDMLENEEKNENAEFRVKEVQYIQAEA 142
>gi|301093296|ref|XP_002997496.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110638|gb|EEY68690.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 782
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 112/375 (29%), Positives = 169/375 (45%), Gaps = 71/375 (18%)
Query: 21 LIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQV----FTFGSVPLKTYLPDRDIDLGAF 76
LI + P ++ R V ++V+R+I FP F GS P+KTYLP D+D+
Sbjct: 251 LIEWMGPSDVADRVRQQVLSFVQRVITAHFPLAAAPLFFATGSYPMKTYLPGSDLDICLL 310
Query: 77 SDDQTLKDTWAHLVRDMLENEEKNEHAEF------------------------------- 105
Q L+ +W ++V L + A
Sbjct: 311 VP-QELESSWYYIVTQALCVAGGSGGAGTVLDLGNSASSDVSGSSSPSGPAAASGGGPLL 369
Query: 106 ---RVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSI 162
V+ V +I A+V+++KC VDN VD N++G L + LD + + HLFK+S+
Sbjct: 370 LTNTVRNVTFINADVRVVKCTVDNIPVDFTANRVGALGAVRLLDAMAARVGRQHLFKKSL 429
Query: 163 ILIKAWCYYESR---------------------ILGGHHGLISSYALVTLVLYIFHVFNG 201
ILIKAWC +ESR ++G HG +S+YA+ T+V+ +F+
Sbjct: 430 ILIKAWCTHESRPFMQRASNEAGGSVPGSTPASVMGASHGALSTYAVNTIVMALFNQHGD 489
Query: 202 SFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLL---PDVTAEPPRKDGGVLLLSKS 258
+ PL+ LY FL+ ++F W L+L GPVP+S L P R L S
Sbjct: 490 ALTHPLQALYLFLDRLAEFPWHECALTLHGPVPLSRLASTPLNGTTSYRSKLKTAKLDAS 549
Query: 259 FLDSCRYAYADFPGG-----QENQGQP---FVSKHFNVIDPLRVNNNLGRSVSKGNFFRI 310
+++ R AD G + ++G P F + N++DPL NNL RSVS F +
Sbjct: 550 DVEAIRDTLADQFGAFDAALKSSKGTPTGLFPIRACNIVDPLDDKNNLARSVSAEGFPVM 609
Query: 311 RTAFTFRAKGLARLL 325
+ AF LA +L
Sbjct: 610 KRAFRLARDQLAAML 624
>gi|403357215|gb|EJY78230.1| hypothetical protein OXYTRI_24618 [Oxytricha trifallax]
Length = 831
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 101/336 (30%), Positives = 164/336 (48%), Gaps = 33/336 (9%)
Query: 22 IARIQPDPFSEERRNAVAAYVRRLIIQCF----PCQVFTFGSVPLKTYLPDRDIDLGAFS 77
+ +I P SE +R + V+ LI + V +GS PLKTYLPD DID+
Sbjct: 29 LNKIGPTQESERKRVKIFEQVKFLIEKALGGKSQVMVIRYGSDPLKTYLPDSDIDITVIR 88
Query: 78 DD--------QTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV 128
D Q T L++ +E + ++ + VK + I QA+V+IIK N
Sbjct: 89 RDYLQGNQTNQLTALTQLKLIKKEIEIFGETQNGKNFVKSMVLIDQADVEIIKLNFQNTF 148
Query: 129 VDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYAL 188
VDI+ Q+GG+CTL F++ + I + L K+SIIL+KAW Y++ ILG +++YAL
Sbjct: 149 VDISIKQVGGICTLYFMNYMAKRIGKQQLLKKSIILLKAWFTYDASILGSQAACMATYAL 208
Query: 189 VTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRK 248
+VL+I + F P++V+ F + +S FDW+N ++++GP+ S + E
Sbjct: 209 YVMVLFILNNFYDELNSPMDVIMMFFKVWSHFDWENNIVTIFGPIKSSGFYERLKECQFD 268
Query: 249 DGGVLLLSKSFLDSCRY--------------------AYADFPGGQENQGQPFVSKHFNV 288
+ +L +S +Y +D + F +K+FN+
Sbjct: 269 IDRLTMLDRSLHQEYQYRKLLVTPDELSFLNLQFSGVRLSDVSSYNLANKKSFNTKYFNI 328
Query: 289 IDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKGLARL 324
IDP NNLG+S+SK N RI+ + + ++
Sbjct: 329 IDPTFSKNNLGKSISKLNSSRIKQVLRLQNMKMRQI 364
>gi|325189429|emb|CCA23919.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 1193
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 103/334 (30%), Positives = 167/334 (50%), Gaps = 46/334 (13%)
Query: 13 KAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQV----FTFGSVPLKTYLPD 68
+ E +LI + P +++ R + AY+R L+ FP F GS P KTYLPD
Sbjct: 721 RVETSVKKLIHALSPTHEADQARCNILAYLRHLLELQFPRSSSILFFPTGSFPCKTYLPD 780
Query: 69 RDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNE-HAEFR--------------------- 106
D+D+ ++++ TW V ML N+ HAE +
Sbjct: 781 ADLDVCLLVP-RSMEPTWFFSVVQMLCFAATNDVHAEPKHSLESVQAPSWMNSTSSTGNT 839
Query: 107 VKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIK 166
V+ V +I A+V+++KC +DN VDI N++G L L LD D + +HLFK+S++LIK
Sbjct: 840 VRNVTFINADVRVVKCTIDNVAVDITVNRVGALGALVLLDTFDLRVGRHHLFKQSLVLIK 899
Query: 167 AWCYYE-------SRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSK 219
AWC + +LG +G S+YA+ T+V+ +F+ + PLE L+ FL+ ++
Sbjct: 900 AWCALDCLEGGQGCGVLGSKNGAFSTYAVNTMVMTLFNRWGYRIQHPLEALHLFLDIMTQ 959
Query: 220 FDWDNFCLSLWGPVPIS-LLPDVTAE--PP--RKDGGVLLLSKSFLDSCRYAYADFPG-- 272
F W +++GPV + L ++++ PP L+++ ++ R ++ G
Sbjct: 960 FPWQECAWTIFGPVLFTQLYQNLSSRIVPPGWETASANCLITREDIEQIRVCLNEYFGSF 1019
Query: 273 ----GQENQGQPFVSKHFNVIDPLRVNNNLGRSV 302
G E F + FN+IDPL++ NNL RSV
Sbjct: 1020 DVSLGTETNA-VFPLRSFNMIDPLQLGNNLARSV 1052
>gi|348683529|gb|EGZ23344.1| hypothetical protein PHYSODRAFT_485178 [Phytophthora sojae]
Length = 793
Score = 140 bits (354), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 107/381 (28%), Positives = 165/381 (43%), Gaps = 77/381 (20%)
Query: 21 LIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQV----FTFGSVPLKTYLPDRDIDLGAF 76
LI + P ++ R V ++V+++I FP F GS P+KTYLP D+D+
Sbjct: 255 LIEWMGPSDAADRVRQQVLSFVQQVITAHFPLAAAPLFFATGSYPMKTYLPGSDLDICLL 314
Query: 77 SDDQTLKDTWAHLVRDMLENEEKNEHAEF------------------------------- 105
Q L+ +W +V L + A
Sbjct: 315 VP-QELESSWYFIVTQALCIAGGSGGAGTVLDVGNPGGSVDGSGSSSPSGPAVGSGSSGA 373
Query: 106 -----RVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKR 160
V+ V +I A+V+++KC VDN VD N++G L + LD + + HLFK+
Sbjct: 374 LLLTNTVRNVTFINADVRVVKCTVDNIPVDFTANRVGALGAVRLLDAMAVRVGRQHLFKK 433
Query: 161 SIILIKAWCYYES-------------------------RILGGHHGLISSYALVTLVLYI 195
S+ILIKAWC +ES ++G HG +S+YA+ T+V+ +
Sbjct: 434 SLILIKAWCTHESSPFMQAASVECGGLGPSVVPGSTPTSVMGASHGALSTYAVNTIVMAL 493
Query: 196 FHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLL---PDVTAEPPRKDGGV 252
F+ + PL+ LY FL+ ++F W L+L G VP+S L P P +
Sbjct: 494 FNQHGDALTHPLQALYLFLDRLAEFPWHEAALTLHGAVPLSRLATTPLNGTTPSKSKLKA 553
Query: 253 LLLSKSFLDSCRYAYAD----FPGG-QENQGQP---FVSKHFNVIDPLRVNNNLGRSVSK 304
L +++ R +D F G + + P F + N++DPL NNL RSVS
Sbjct: 554 AKLDAGDVEAIRDTLSDQFGAFDAGLRSGKSAPTGLFPIRACNIVDPLDDKNNLARSVSA 613
Query: 305 GNFFRIRTAFTFRAKGLARLL 325
F ++ AF LA +L
Sbjct: 614 EGFPVMKRAFRLARDQLAAML 634
>gi|253744327|gb|EET00549.1| Topoisomerase I-related protein [Giardia intestinalis ATCC 50581]
Length = 511
Score = 130 bits (328), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 70/224 (31%), Positives = 119/224 (53%), Gaps = 5/224 (2%)
Query: 25 IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
+ P S R + Y+R + FP Q+ +GS + +LPD D+DL +
Sbjct: 45 LAPSEDSISCRYQIIKYIRDELHSIFPELQLIPYGSFVTRIFLPDGDVDLSIIVAEDDAN 104
Query: 84 DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLC 143
D ++ + E EHA F++ + IQAE+ II+ +++ +DI+ + GL T
Sbjct: 105 DVFSQFYTHLKEIASSQEHATFKITNLSKIQAEMSIIRLVINGIFIDISAARPTGLVTSL 164
Query: 144 FLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSF 203
++ ++ I N+L KRS+IL++AW YE+ ILG H +++SYAL + +I +
Sbjct: 165 YIQLLNDSIGRNNLLKRSVILVQAWSLYEAHILGSHSQMLNSYALRVMTAFIL-TNSPEL 223
Query: 204 AGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPR 247
PL+VL++F F+S FD+ N ++ +G +P S +V PR
Sbjct: 224 VHPLQVLFKFFAFYSTFDFTNNTITAFGVIPNS---EVDGSDPR 264
>gi|308159127|gb|EFO61675.1| Topoisomerase I-related protein [Giardia lamblia P15]
Length = 512
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 71/213 (33%), Positives = 113/213 (53%), Gaps = 2/213 (0%)
Query: 25 IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
+ P S R + Y+R + FP Q+ +GS + +LPD DIDL +
Sbjct: 45 LAPTEDSITYRYQIIKYIRDKLHDLFPELQLIPYGSFVTRIFLPDGDIDLAIIVGEDDAA 104
Query: 84 DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLC 143
D + + E FRV + IQAEV II+ +++ +DI+ + GL T
Sbjct: 105 DVLTQFYIHLKDIVASQEDTPFRVTNLSKIQAEVPIIRLVINGIFIDISSARPVGLVTSL 164
Query: 144 FLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSF 203
+L ++ I N+L KRS+ILI+AWC YE+ ILG H +++SYAL + ++I +
Sbjct: 165 YLQLLNDAIGRNNLLKRSVILIQAWCLYEAHILGSHSQMLNSYALRVMTIFIL-TNSPEL 223
Query: 204 AGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPIS 236
PL+VL++F F+S FD+ N ++ +G +P S
Sbjct: 224 VHPLQVLFKFFAFYSAFDFTNNTITAFGVIPNS 256
>gi|159108047|ref|XP_001704297.1| Topoisomerase I-related protein [Giardia lamblia ATCC 50803]
gi|157432356|gb|EDO76623.1| Topoisomerase I-related protein [Giardia lamblia ATCC 50803]
Length = 512
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 70/203 (34%), Positives = 111/203 (54%), Gaps = 2/203 (0%)
Query: 35 RNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDM 93
R + Y+R + FP Q+ +GS + +LPD DIDL + D A +
Sbjct: 55 RYQIIKYIRDKLHSLFPELQLIPYGSFVTRIFLPDGDIDLAIIVGEDDAADVLAQFYIYL 114
Query: 94 LENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLIN 153
E +E F++ + IQAEV II+ +++ +DI+ + GL T +L ++ I
Sbjct: 115 KEVAASHEDTPFKLTNLSKIQAEVPIIRLVINGVFIDISSARPVGLVTSLYLQLLNDAIG 174
Query: 154 ENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRF 213
N+L KRS+ILI+AWC YE+ ILG H +++SYAL + +I + PL+VL++F
Sbjct: 175 RNNLLKRSVILIQAWCLYEAHILGSHSQMLNSYALRVMTTFIL-TNSPELVHPLQVLFKF 233
Query: 214 LEFFSKFDWDNFCLSLWGPVPIS 236
F+S FD+ N ++ +G VP S
Sbjct: 234 FAFYSAFDFTNNTITAFGVVPNS 256
>gi|242051292|ref|XP_002463390.1| hypothetical protein SORBIDRAFT_02g042970 [Sorghum bicolor]
gi|241926767|gb|EER99911.1| hypothetical protein SORBIDRAFT_02g042970 [Sorghum bicolor]
Length = 208
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 59/126 (46%), Positives = 88/126 (69%), Gaps = 1/126 (0%)
Query: 24 RIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
R+ P +E RR V +Y+RRLI C+VF FGSVPL+TYLPD D+D+ + L
Sbjct: 83 RVHPTQEAERRRQDVISYLRRLIGSSLGCEVFAFGSVPLRTYLPDGDVDITVLGNTW-LN 141
Query: 84 DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLC 143
T+ VR MLE+E++N AEF++ + +I AEVK+IKC+++N +VD++FNQ+GG+ T C
Sbjct: 142 STFIDDVRSMLESEQENCDAEFKLTGLHFINAEVKLIKCIIENIIVDVSFNQIGGVSTFC 201
Query: 144 FLDEVD 149
FL+ ++
Sbjct: 202 FLELIN 207
>gi|253742434|gb|EES99267.1| Hypothetical protein GL50581_3482 [Giardia intestinalis ATCC 50581]
Length = 711
Score = 124 bits (310), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 74/239 (30%), Positives = 121/239 (50%), Gaps = 23/239 (9%)
Query: 18 TAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAF 76
T +I+ + PD SEE R + ++ ++I P + +GS K YLP D+D+ +
Sbjct: 50 TDYIISLVSPDRASEEFRLKIFTFISKVIDVVLPNTLIVPYGSFISKIYLPSSDLDICCY 109
Query: 77 S-------------------DDQTLKDTWAHL--VRDMLENEEKNEHAEFRVKEVQYIQA 115
+ D L+ T + V L N + ++ +++I A
Sbjct: 110 NHSIDEIPLLQKILEALMVFSDPNLQSTGTRVSPVVSQLINSHISADERLELENIEFIMA 169
Query: 116 EVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRI 175
+V +IKC V VDI+ Q G L T ++++ I N+L KRS +LI++WC YE+RI
Sbjct: 170 KVSLIKCTVCGLGVDISAAQPGSLVTSLLIEKLSQSIGRNNLLKRSFLLIQSWCLYEARI 229
Query: 176 LGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVP 234
+GGH ++SSYAL +++ I + P +VLY FL ++S FD+D + GP+P
Sbjct: 230 VGGHSQMLSSYALRVMIINIL-INCKDIYTPFQVLYVFLAYYSNFDYDRNIIHPSGPLP 287
Score = 47.4 bits (111), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 26/71 (36%), Positives = 38/71 (53%), Gaps = 6/71 (8%)
Query: 244 EPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVS 303
EP D VL L K + + + + Q F+ + +++DPL+V NNLGRSVS
Sbjct: 546 EPVLNDHDVLFLLKRYFSTSTFTDV------LSDSQVFLPSYISIVDPLQVTNNLGRSVS 599
Query: 304 KGNFFRIRTAF 314
+ NF RI +F
Sbjct: 600 EPNFMRITRSF 610
>gi|159115240|ref|XP_001707843.1| Hypothetical protein GL50803_17166 [Giardia lamblia ATCC 50803]
gi|157435951|gb|EDO80169.1| hypothetical protein GL50803_17166 [Giardia lamblia ATCC 50803]
Length = 731
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 74/241 (30%), Positives = 123/241 (51%), Gaps = 33/241 (13%)
Query: 21 LIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFS-- 77
+++ + PD SEE R + ++ ++I P + +GS K YLP D+D+ F+
Sbjct: 68 IVSLVSPDKASEEFRLKIFTFISKVIEAVLPNTLIVPYGSFISKIYLPSSDLDICCFNHG 127
Query: 78 -----------------DDQTLKDTW-------AHLVRDMLENEEKNEHAEFRVKEVQYI 113
D +L+ T + L+ + EE+ E ++ +++I
Sbjct: 128 LDEIPLLQKILEALTVFSDPSLRPTGVRVPPAVSQLINSRIPTEERLE-----LENIEFI 182
Query: 114 QAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYES 173
A+V +IKC V VDI+ Q G L T ++++ I N+L KRS +LI++WC YE+
Sbjct: 183 MAKVSLIKCTVCGLGVDISAAQPGSLVTSLLIEKLSQSIGRNNLLKRSFLLIQSWCLYEA 242
Query: 174 RILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPV 233
RI+GGH ++SSYAL +V+ I + P + LY FL ++S FD+D + GP+
Sbjct: 243 RIVGGHSQMLSSYALRVMVINILLNCRDIYT-PFQALYVFLAYYSSFDYDRDIVHPSGPL 301
Query: 234 P 234
P
Sbjct: 302 P 302
Score = 45.1 bits (105), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 26/71 (36%), Positives = 40/71 (56%), Gaps = 6/71 (8%)
Query: 244 EPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVS 303
EP D +LLL K + ++ P + + F+ + +++DPL+V NNLGRSVS
Sbjct: 568 EPNLNDHDILLLFKRY-----FSMGTLPNVLSD-SRAFLPSYISIVDPLQVINNLGRSVS 621
Query: 304 KGNFFRIRTAF 314
+ NF RI +F
Sbjct: 622 EPNFMRITRSF 632
>gi|308163112|gb|EFO65472.1| Hypothetical protein GLP15_5146 [Giardia lamblia P15]
Length = 719
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 72/236 (30%), Positives = 118/236 (50%), Gaps = 23/236 (9%)
Query: 21 LIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFS-- 77
+++ + PD SEE R + ++ R+I P + +GS K YLP D+D+ ++
Sbjct: 53 IVSLVSPDKASEEFRLKIFTFISRVIEAVLPNTLIVPYGSFISKIYLPSSDLDICCYNHG 112
Query: 78 -----------------DDQTLKDTWAHL--VRDMLENEEKNEHAEFRVKEVQYIQAEVK 118
D +L+ T + L N + ++ +++I A+V
Sbjct: 113 LDEIPLLQKILEALTIFSDPSLRPTGVRVSPAVSQLINSRISAEERLELENIEFIMAKVS 172
Query: 119 IIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGG 178
+IKC V VDI+ Q G L T ++++ I N+L KRS +LI++WC YE+RI+GG
Sbjct: 173 LIKCTVCGLGVDISAAQPGSLVTSLLIEKLSQSIGRNNLLKRSFLLIQSWCLYEARIVGG 232
Query: 179 HHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVP 234
H ++SSYAL +V+ I + P + LY FL ++S FD+D + GP P
Sbjct: 233 HSQMLSSYALRVMVINILLNCKDIYT-PFQALYVFLAYYSTFDYDKNIVHPSGPFP 287
Score = 46.2 bits (108), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 27/71 (38%), Positives = 41/71 (57%), Gaps = 6/71 (8%)
Query: 244 EPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVS 303
EP D +LLL K + ++ FP + + F+ + +++DPL+V NNLGRSVS
Sbjct: 556 EPVINDHDILLLLKRY-----FSMGTFPDVLSD-SRVFLPSYISIVDPLQVINNLGRSVS 609
Query: 304 KGNFFRIRTAF 314
+ NF RI +F
Sbjct: 610 EPNFMRITRSF 620
>gi|2651305|gb|AAB87585.1| hypothetical protein [Arabidopsis thaliana]
Length = 384
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 83/228 (36%), Positives = 116/228 (50%), Gaps = 56/228 (24%)
Query: 5 PLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQ---------------- 48
P++ WL AE E++ IQP+ +E RN + + ++ L+ +
Sbjct: 27 PIEAEVWLIAEARAQEILCAIQPNYLAERSRNKIISNLQTLLWERLGIEVRTFLLLLDEL 86
Query: 49 CFPCQ------VFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEH 102
F Q V+ FGS+PLKTYLPD DIDL + + +D A V +LE E N
Sbjct: 87 SFSLQRIRNAKVYLFGSMPLKTYLPDGDIDLTVLTHHASEEDC-ARAVCCVLEAEMGN-- 143
Query: 103 AEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSI 162
++ +V VQY+QA+ VD AF + +HLFK+SI
Sbjct: 144 SDLQVTGVQYVQAK------------VDKAFGR-------------------DHLFKKSI 172
Query: 163 ILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVL 210
IL+KAWC+YESRILG + GLIS+YAL LVL I ++ S +GPL L
Sbjct: 173 ILVKAWCFYESRILGANSGLISTYALAILVLNIVNMSYSSLSGPLAKL 220
>gi|224135259|ref|XP_002322023.1| predicted protein [Populus trichocarpa]
gi|222869019|gb|EEF06150.1| predicted protein [Populus trichocarpa]
Length = 85
Score = 113 bits (283), Expect = 1e-22, Method: Composition-based stats.
Identities = 52/63 (82%), Positives = 54/63 (85%)
Query: 154 ENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRF 213
+NHLFKRSIILIKAWCYYESRILG HHGLIS+YAL TLVLYIFHVFN FAGPLEV F
Sbjct: 1 QNHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHVFNNKFAGPLEVSTAF 60
Query: 214 LEF 216
F
Sbjct: 61 WNF 63
>gi|224135265|ref|XP_002322024.1| predicted protein [Populus trichocarpa]
gi|222869020|gb|EEF06151.1| predicted protein [Populus trichocarpa]
Length = 122
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 63/117 (53%), Positives = 73/117 (62%), Gaps = 30/117 (25%)
Query: 1 SVIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSV 60
SV + L+P RW AEE TAELIA IQP+ SEERRNAV YV+RLI+ CFPCQ
Sbjct: 25 SVTQALEPERWATAEERTAELIACIQPNQPSEERRNAVLCYVQRLIMNCFPCQ------- 77
Query: 61 PLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEV 117
+TWA+ VRD+LE+EEKNE+AEF VKEVQYIQAEV
Sbjct: 78 -----------------------ETWANEVRDILEHEEKNENAEFHVKEVQYIQAEV 111
>gi|261333426|emb|CBH16421.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 1120
Score = 107 bits (266), Expect = 1e-20, Method: Composition-based stats.
Identities = 54/155 (34%), Positives = 87/155 (56%), Gaps = 26/155 (16%)
Query: 113 IQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYE 172
+ AEV+++K ++D DI QLGG+ + FL E+D I NHL KR+++L+KAWC YE
Sbjct: 483 VVAEVRVLKLVMDGSSYDITVGQLGGVSCIRFLHEMDMKIGCNHLLKRTLLLMKAWCCYE 542
Query: 173 SRILGGHHGLISSYALVTLVLYIFHVFN------------------------GSFA--GP 206
+ +L G G ISSYA +++ + + G + P
Sbjct: 543 AHVLSGQGGYISSYAATVMIISMINTVEFLEDVEREERGGEGDGKHLDERQRGEYQHISP 602
Query: 207 LEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDV 241
L++ RFL++FS FD++++CL+L+GPVP + +V
Sbjct: 603 LQLFARFLKYFSYFDFESYCLTLFGPVPCDKINNV 637
>gi|71748824|ref|XP_823467.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|70833135|gb|EAN78639.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 1120
Score = 107 bits (266), Expect = 1e-20, Method: Composition-based stats.
Identities = 54/155 (34%), Positives = 87/155 (56%), Gaps = 26/155 (16%)
Query: 113 IQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYE 172
+ AEV+++K ++D DI QLGG+ + FL E+D I NHL KR+++L+KAWC YE
Sbjct: 483 VVAEVRVLKLVMDGSSYDITVGQLGGVSCIRFLHEMDMKIGCNHLLKRTLLLMKAWCCYE 542
Query: 173 SRILGGHHGLISSYALVTLVLYIFHVFN------------------------GSFA--GP 206
+ +L G G ISSYA +++ + + G + P
Sbjct: 543 AHVLSGQGGYISSYAATVMIISMINTVEFLEDVEREERGGEGDGKHLEERQRGEYQHISP 602
Query: 207 LEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDV 241
L++ RFL++FS FD++++CL+L+GPVP + +V
Sbjct: 603 LQLFARFLKYFSYFDFESYCLTLFGPVPCDKINNV 637
>gi|298707565|emb|CBJ30149.1| nucleotidyltransferase family protein [Ectocarpus siliculosus]
Length = 1301
Score = 106 bits (265), Expect = 1e-20, Method: Composition-based stats.
Identities = 56/140 (40%), Positives = 83/140 (59%), Gaps = 4/140 (2%)
Query: 97 EEKNEHAEFRVKEVQYIQ-AEVKIIKCLVDNFV-VDIAFNQLGGLCTLCFLDEVDHLINE 154
+E+ R+ V +I V+ IKC+VDN V VDI NQ+G + T+ L+E D L+ +
Sbjct: 745 KEEGSSYRHRLSNVNFINMGRVQKIKCVVDNQVAVDIGANQVGDIATVALLEETDQLLGK 804
Query: 155 NHLFKRSIILIKAWCYYESRILGGHHGL--ISSYALVTLVLYIFHVFNGSFAGPLEVLYR 212
+HLFKRS++LIK+W YESR G + L I+ AL T+VL + + + PL+V+
Sbjct: 805 DHLFKRSLLLIKSWWVYESRAYTGSNMLSRITESALATMVLAVVNQHHARLHTPLQVMAL 864
Query: 213 FLEFFSKFDWDNFCLSLWGP 232
F + S FDW +C + GP
Sbjct: 865 FFQMHSHFDWSRYCWCIEGP 884
Score = 39.7 bits (91), Expect = 1.9, Method: Composition-based stats.
Identities = 17/57 (29%), Positives = 31/57 (54%)
Query: 20 ELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDIDLGAF 76
+L+ ++P P +E R +V +V R + + Q F G ++ YLPD ++ + AF
Sbjct: 621 DLLRLLRPAPRAEGYRRSVFRFVTRQVKRALGAQCFPVGGYAIQAYLPDEEVGISAF 677
>gi|298710234|emb|CBJ26309.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 1317
Score = 105 bits (263), Expect = 2e-20, Method: Composition-based stats.
Identities = 60/143 (41%), Positives = 87/143 (60%), Gaps = 8/143 (5%)
Query: 101 EHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKR 160
E A+ ++ V I A I+ +V N VVD+ NQ G + L+E D+LI NHLFKR
Sbjct: 111 ETAKPEIRNVSLINARTPIVTMVVGNVVVDLTENQGGSVAASALLEEADNLIQRNHLFKR 170
Query: 161 SIILIKAWCYYES------RILGGHHGLISSYALVTLVLYIFHVFNGSFA--GPLEVLYR 212
S++L+KAW + E+ R+LG G ++SY L +VL++F + A PL+VL R
Sbjct: 171 SLLLLKAWAWCETPRLVGNRVLGARKGGLTSYGLSVMVLHLFAASASADALVHPLDVLIR 230
Query: 213 FLEFFSKFDWDNFCLSLWGPVPI 235
F E +S+FDW +CL+L GPVP+
Sbjct: 231 FFEVYSEFDWARYCLTLDGPVPL 253
>gi|224114896|ref|XP_002316887.1| predicted protein [Populus trichocarpa]
gi|222859952|gb|EEE97499.1| predicted protein [Populus trichocarpa]
Length = 199
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 60/78 (76%), Positives = 68/78 (87%)
Query: 70 DIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVV 129
DIDL AFS++ LKDTWA V DMLENEE NE+AEF VKEV+YIQAEVKIIKCLV+N VV
Sbjct: 19 DIDLTAFSENPNLKDTWAPQVCDMLENEENNENAEFGVKEVEYIQAEVKIIKCLVENIVV 78
Query: 130 DIAFNQLGGLCTLCFLDE 147
DI+FNQLGGL TLCFL++
Sbjct: 79 DISFNQLGGLFTLCFLEK 96
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 36/50 (72%), Positives = 42/50 (84%)
Query: 255 LSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSK 304
LSK FL++C YA P GQ+NQGQPF+SKHFNVIDPLR+NNNLG SV+K
Sbjct: 106 LSKLFLEACSAIYAVLPAGQDNQGQPFLSKHFNVIDPLRINNNLGHSVNK 155
>gi|342184813|emb|CCC94295.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
Length = 1108
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 50/148 (33%), Positives = 82/148 (55%), Gaps = 26/148 (17%)
Query: 113 IQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYE 172
+ AEV+++K ++D D+ QLGG+ + FL E+D + HL KR+++L+KAWC YE
Sbjct: 472 VVAEVRVLKLVMDGSSYDVTVGQLGGVSCIRFLHEMDMRVGCEHLLKRTLLLMKAWCCYE 531
Query: 173 SRILGGHHGLISSYALVTLVLYIFHVF--------NGSFA------------------GP 206
+ +L G G +SSYA +++ + + GS P
Sbjct: 532 AHVLSGQGGYMSSYAATVMLITMINTVEFLEDVEAEGSDGKTCSNCPEGHKSEGHVQISP 591
Query: 207 LEVLYRFLEFFSKFDWDNFCLSLWGPVP 234
L++ RFL+++S FD+D +CL+L+GPVP
Sbjct: 592 LQLFARFLKYYSYFDFDRYCLTLFGPVP 619
>gi|452823525|gb|EME30535.1| nucleotidyltransferase [Galdieria sulphuraria]
Length = 1412
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 69/263 (26%), Positives = 112/263 (42%), Gaps = 57/263 (21%)
Query: 27 PDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDIDLGAF--SDDQTLKD 84
P FSE RR AV V +I + Q F +GS KTY D +++GAF + T +
Sbjct: 710 PTSFSELRREAVFRVVASIIKRSIGAQAFCYGSFATKTYHADSILEIGAFLVGKNDTAAE 769
Query: 85 TWAHLVRDMLEN-----EEKNEHAEFR--------------VKEVQYIQAEVKIIKC--- 122
A L+ + E+ + + EF V+ + Y + + C
Sbjct: 770 WSAKLMAALCEDATLASDHSSSSLEFSYLSLIQQKHPVPLPVRNISYFRPKPTPSGCQPP 829
Query: 123 -----------------------------LVDNFVVDIAFNQLGGLCTLCFLDEVDHLIN 153
+ N V + N + G+ T C L+E DH +
Sbjct: 830 PAVTFTVNWPIEDPRSGLVALDTNSTERDIAPNVRVSVTLNHVAGIHTACVLEEFDHAMG 889
Query: 154 ENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRF 213
NHLFKRS++L++ W Y ++ ++ S A+ LV+++ + F+ S P ++LYRF
Sbjct: 890 RNHLFKRSLLLVRTWVDYGVKLT----DILPSRAVEVLVVFVANCFHSSIETPFDLLYRF 945
Query: 214 LEFFSKFDWDNFCLSLWGPVPIS 236
L +F FDW F L G + ++
Sbjct: 946 LTYFVHFDWRKFGLCETGIIDLA 968
>gi|340057832|emb|CCC52183.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 1145
Score = 94.0 bits (232), Expect = 9e-17, Method: Composition-based stats.
Identities = 48/148 (32%), Positives = 76/148 (51%), Gaps = 28/148 (18%)
Query: 121 KCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHH 180
K ++D D+ QLGG+ + FL +VD I HL KR+++L+KAWC YE+ +L G
Sbjct: 516 KLIMDGNSYDVTVGQLGGVSCIRFLHQVDTKIGCGHLLKRTLLLMKAWCCYEAHVLSGQG 575
Query: 181 GLISSYALVTLVLYIF-----------------------HVFNGSFAG-----PLEVLYR 212
G +SSYA +++ + H G PL++ R
Sbjct: 576 GYMSSYAATVMLIAMINTIEFLEDAESEACTELEEPARTHALEGRLGALNGVSPLQLFAR 635
Query: 213 FLEFFSKFDWDNFCLSLWGPVPISLLPD 240
FL++FS FD++ +C++L+GPVP + D
Sbjct: 636 FLKYFSCFDFERYCVTLFGPVPCEKIND 663
>gi|224064842|ref|XP_002301578.1| predicted protein [Populus trichocarpa]
gi|222843304|gb|EEE80851.1| predicted protein [Populus trichocarpa]
Length = 60
Score = 79.7 bits (195), Expect = 2e-12, Method: Composition-based stats.
Identities = 37/47 (78%), Positives = 42/47 (89%)
Query: 105 FRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHL 151
FRVK+V+YIQAEVKIIKCLV N VVDI+FNQLGGL TLCFL++V L
Sbjct: 13 FRVKKVEYIQAEVKIIKCLVKNIVVDISFNQLGGLFTLCFLEKVSAL 59
>gi|389601018|ref|XP_001564070.2| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|322504611|emb|CAM38122.2| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 2016
Score = 77.0 bits (188), Expect = 1e-11, Method: Composition-based stats.
Identities = 33/85 (38%), Positives = 55/85 (64%)
Query: 113 IQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYE 172
+ AEV+++K ++ DI Q GG+ + FL E+D +I + H+ KR+++L+KAWC YE
Sbjct: 1055 VMAEVRVLKLAMEGCNYDITIGQFGGVNCVRFLHEMDAVIGDQHVLKRTLLLLKAWCCYE 1114
Query: 173 SRILGGHHGLISSYALVTLVLYIFH 197
+ ILGG G I SYA +++ + +
Sbjct: 1115 AHILGGQAGYIGSYAATVMLISMLN 1139
>gi|398013931|ref|XP_003860157.1| hypothetical protein, conserved [Leishmania donovani]
gi|322498376|emb|CBZ33450.1| hypothetical protein, conserved [Leishmania donovani]
Length = 2047
Score = 77.0 bits (188), Expect = 1e-11, Method: Composition-based stats.
Identities = 33/85 (38%), Positives = 55/85 (64%)
Query: 113 IQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYE 172
+ AEV+++K ++ DI Q GG+ + FL E+D +I + H+ KR+++L+KAWC YE
Sbjct: 1058 VMAEVRVLKLAMEGCNYDITIGQFGGVNCVRFLHEMDAVIGDQHVLKRTLLLLKAWCCYE 1117
Query: 173 SRILGGHHGLISSYALVTLVLYIFH 197
+ ILGG G I SYA +++ + +
Sbjct: 1118 AHILGGQAGYIGSYAATVMLISMLN 1142
Score = 42.4 bits (98), Expect = 0.37, Method: Composition-based stats.
Identities = 16/40 (40%), Positives = 28/40 (70%), Gaps = 3/40 (7%)
Query: 206 PLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEP 245
PL + RFL+F++ FD+D +C++ +GP+P L +T+ P
Sbjct: 1269 PLTLFARFLKFYAYFDFDRYCVTAFGPLP---LHKITSTP 1305
>gi|401419332|ref|XP_003874156.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322490390|emb|CBZ25650.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 2020
Score = 77.0 bits (188), Expect = 1e-11, Method: Composition-based stats.
Identities = 33/85 (38%), Positives = 55/85 (64%)
Query: 113 IQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYE 172
+ AEV+++K ++ DI Q GG+ + FL E+D +I + H+ KR+++L+KAWC YE
Sbjct: 1051 VMAEVRVLKLAMEGCNYDITIGQFGGVNCVRFLHEMDAVIGDQHVLKRTLLLLKAWCCYE 1110
Query: 173 SRILGGHHGLISSYALVTLVLYIFH 197
+ ILGG G I SYA +++ + +
Sbjct: 1111 AHILGGQAGYIGSYAATVMLISMLN 1135
Score = 42.4 bits (98), Expect = 0.37, Method: Composition-based stats.
Identities = 16/40 (40%), Positives = 28/40 (70%), Gaps = 3/40 (7%)
Query: 206 PLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEP 245
PL + RFL+F++ FD+D +C++ +GP+P L +T+ P
Sbjct: 1265 PLTLFARFLKFYAYFDFDRYCVTAFGPLP---LHKITSTP 1301
>gi|339897903|ref|XP_001464956.2| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|321399300|emb|CAM67197.2| conserved hypothetical protein [Leishmania infantum JPCM5]
Length = 2047
Score = 77.0 bits (188), Expect = 1e-11, Method: Composition-based stats.
Identities = 33/85 (38%), Positives = 55/85 (64%)
Query: 113 IQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYE 172
+ AEV+++K ++ DI Q GG+ + FL E+D +I + H+ KR+++L+KAWC YE
Sbjct: 1058 VMAEVRVLKLAMEGCNYDITIGQFGGVNCVRFLHEMDAVIGDQHVLKRTLLLLKAWCCYE 1117
Query: 173 SRILGGHHGLISSYALVTLVLYIFH 197
+ ILGG G I SYA +++ + +
Sbjct: 1118 AHILGGQAGYIGSYAATVMLISMLN 1142
Score = 42.4 bits (98), Expect = 0.37, Method: Composition-based stats.
Identities = 16/40 (40%), Positives = 28/40 (70%), Gaps = 3/40 (7%)
Query: 206 PLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEP 245
PL + RFL+F++ FD+D +C++ +GP+P L +T+ P
Sbjct: 1269 PLTLFARFLKFYAYFDFDRYCVTAFGPLP---LHKITSTP 1305
>gi|157868001|ref|XP_001682554.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68126008|emb|CAJ04245.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 1964
Score = 77.0 bits (188), Expect = 1e-11, Method: Composition-based stats.
Identities = 33/85 (38%), Positives = 55/85 (64%)
Query: 113 IQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYE 172
+ AEV+++K ++ DI Q GG+ + FL E+D +I + H+ KR+++L+KAWC YE
Sbjct: 971 VMAEVRVLKLAMEGCNYDITIGQFGGVNCVRFLHEMDAVIGDQHVLKRTLLLLKAWCCYE 1030
Query: 173 SRILGGHHGLISSYALVTLVLYIFH 197
+ ILGG G I SYA +++ + +
Sbjct: 1031 AHILGGQAGYIGSYAATVMLISMLN 1055
Score = 42.4 bits (98), Expect = 0.37, Method: Composition-based stats.
Identities = 16/40 (40%), Positives = 28/40 (70%), Gaps = 3/40 (7%)
Query: 206 PLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEP 245
PL + RFL+F++ FD+D +C++ +GP+P L +T+ P
Sbjct: 1184 PLTLFARFLKFYAYFDFDRYCVTAFGPLP---LHKITSTP 1220
>gi|71652853|ref|XP_815075.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70880102|gb|EAN93224.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 1276
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 52/152 (34%), Positives = 86/152 (56%), Gaps = 26/152 (17%)
Query: 113 IQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYE 172
+ AEV+++K +++ DI QLGG+ + FL E+D LI HL KR+++L+KAWC YE
Sbjct: 565 VVAEVRVLKLVMEGSCFDITVGQLGGVVCVRFLQEMDMLIGCQHLLKRTLLLLKAWCCYE 624
Query: 173 SRILGGHHGLISSYALVTLVLYIFHVFN-----GSFA---------------------GP 206
+ IL G G +SSYA +++ + + GS P
Sbjct: 625 AHILSGQGGYLSSYAATIMLISMMNTVEFLEDLGSVEEREEDGEAHLGCEPHESLKNISP 684
Query: 207 LEVLYRFLEFFSKFDWDNFCLSLWGPVPISLL 238
L++ RFL+FFS FD++++C++++GP+P + L
Sbjct: 685 LQLFARFLKFFSFFDFEHYCVTVFGPLPCACL 716
>gi|407407321|gb|EKF31173.1| hypothetical protein MOQ_004991 [Trypanosoma cruzi marinkellei]
Length = 1349
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 52/150 (34%), Positives = 85/150 (56%), Gaps = 26/150 (17%)
Query: 115 AEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESR 174
AEV+++K +++ DI QLGG+ + FL E+D LI HL KR+++L+KAWC YE+
Sbjct: 596 AEVRVLKLVMEGSCFDITVGQLGGVECVRFLQEMDMLIGCQHLLKRTLLLLKAWCCYEAH 655
Query: 175 ILGGHHGLISSYALVTLVLYIFHVF------------------------NGSFA--GPLE 208
IL G G +SSYA +++ + + SF PL+
Sbjct: 656 ILSGQGGYLSSYAATIMLIAMMNTVEFLEDVGSVEERDEDGEGRLGCEPQASFKNISPLQ 715
Query: 209 VLYRFLEFFSKFDWDNFCLSLWGPVPISLL 238
+ RFL+FFS FD++++C++++GP+P + L
Sbjct: 716 LFARFLKFFSFFDFEHYCVTIFGPLPCACL 745
>gi|71408844|ref|XP_806800.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70870651|gb|EAN84949.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 1239
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 52/152 (34%), Positives = 85/152 (55%), Gaps = 26/152 (17%)
Query: 113 IQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYE 172
+ AEV+++K +++ DI QLGG+ + FL E+D LI HL KR+++L+KAWC YE
Sbjct: 568 VVAEVRVLKLVMEGGCFDITVGQLGGVVCVRFLQEMDMLIGCQHLLKRTLLLLKAWCCYE 627
Query: 173 SRILGGHHGLISSYALVTLVLYIFHVFN-----GSFA---------------------GP 206
+ IL G G +SSYA +++ + + GS P
Sbjct: 628 AHILSGQGGYLSSYAATIMLIAMMNTVEFVEDVGSVEEREEDGEGHLGCEPQEFFKNISP 687
Query: 207 LEVLYRFLEFFSKFDWDNFCLSLWGPVPISLL 238
L++ RFL+FFS FD++++C++++GP+P L
Sbjct: 688 LQLFARFLKFFSFFDFEHYCVTIFGPLPCDCL 719
>gi|407846652|gb|EKG02680.1| hypothetical protein TCSYLVIO_006286 [Trypanosoma cruzi]
Length = 893
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 52/152 (34%), Positives = 86/152 (56%), Gaps = 26/152 (17%)
Query: 113 IQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYE 172
+ AEV+++K +++ DI QLGG+ + FL E+D LI HL KR+++L+KAWC YE
Sbjct: 222 VVAEVRVLKLVMEGSCFDITVGQLGGVVCVRFLQEMDMLIGCQHLLKRTLLLLKAWCCYE 281
Query: 173 SRILGGHHGLISSYALVTLVLYIFHVFN-----GSFA---------------------GP 206
+ IL G G +SSYA +++ + + GS P
Sbjct: 282 AHILSGQGGYLSSYAATIMLISMMNTVEFLEDLGSVEEREEDGEAHLGCEPNESLKNISP 341
Query: 207 LEVLYRFLEFFSKFDWDNFCLSLWGPVPISLL 238
L++ RFL+FFS FD++++C++++GP+P + L
Sbjct: 342 LQLFARFLKFFSFFDFEHYCVTVFGPLPCACL 373
>gi|449533401|ref|XP_004173664.1| PREDICTED: uncharacterized LOC101209112 [Cucumis sativus]
Length = 831
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 29/43 (67%), Positives = 35/43 (81%)
Query: 305 GNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
GNFFRIR+AF F AK LARL +CP ED+ E+NQFF+NT +RH
Sbjct: 5 GNFFRIRSAFAFGAKRLARLFECPREDILAELNQFFLNTWERH 47
>gi|302691928|ref|XP_003035643.1| hypothetical protein SCHCODRAFT_104957 [Schizophyllum commune H4-8]
gi|300109339|gb|EFJ00741.1| hypothetical protein SCHCODRAFT_104957, partial [Schizophyllum
commune H4-8]
Length = 671
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 60/220 (27%), Positives = 100/220 (45%), Gaps = 22/220 (10%)
Query: 25 IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSD--DQT 81
I P P +E R+ + + R+I FP +V FGS K YLP DIDL S+ +Q
Sbjct: 170 ISPTPAEDEVRSMIVLLIARIIQDKFPDAEVRPFGSYGTKLYLPHGDIDLVVQSNTLEQN 229
Query: 82 LKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLV----DNFVVDIAFNQLG 137
K T + D++ + A +VQ I A V IIK + F +DI+ NQ
Sbjct: 230 NKKTVLQRLADLIRS------ARLSSGKVQVIGARVPIIKFITAAEYGRFQIDISVNQFS 283
Query: 138 GLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFH 197
GL + ++ + + + RS++LI + + + G + SY++V LVL
Sbjct: 284 GLVSSDIINGFQRGM-QCPIAIRSLVLILKLYLSQRGMNEVYTGGLGSYSIVCLVLSFLQ 342
Query: 198 VFNGSFAGPLE-------VLYRFLEFFSKF-DWDNFCLSL 229
+ G ++ +L F E + K+ +++ +SL
Sbjct: 343 MHPKIRNGEIDPERNLGVLLLEFFELYGKYHNYEEVGVSL 382
>gi|147787660|emb|CAN69576.1| hypothetical protein VITISV_028613 [Vitis vinifera]
Length = 192
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 25/40 (62%), Positives = 28/40 (70%)
Query: 220 FDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVLLLSKSF 259
DWD+FC+SLWGPVPIS LPD T EPPR+ LLL
Sbjct: 147 IDWDSFCVSLWGPVPISSLPDATTEPPRQGSRELLLDSGI 186
>gi|403419742|emb|CCM06442.1| predicted protein [Fibroporia radiculosa]
Length = 1487
Score = 57.8 bits (138), Expect = 8e-06, Method: Composition-based stats.
Identities = 52/172 (30%), Positives = 82/172 (47%), Gaps = 11/172 (6%)
Query: 25 IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
I P P E R+ V A + R + Q FP +V FGS K YLP DIDL S Q++
Sbjct: 167 ISPTPEENEVRSLVVALITRAVTQAFPDAEVHPFGSYDTKLYLPVGDIDLVVHS--QSMA 224
Query: 84 DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIK--CLVDNFVVDIAFNQLGGLCT 141
+ V + N K RV+ + +A+V I+K L N VDI+ NQ G+
Sbjct: 225 YSKKEAVLHSIANTMKRAGITDRVRIIS--KAKVPIVKFVTLHGNIPVDISINQGNGVTA 282
Query: 142 LCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVL 193
+ H + E + ++++K++ S + + G + SY++V LV+
Sbjct: 283 GTM---IKHFLAELPALRSLVLIVKSFLSQRS-MNEVYTGGLGSYSIVCLVI 330
>gi|390597612|gb|EIN07011.1| Nucleotidyltransferase [Punctularia strigosozonata HHB-11173 SS5]
Length = 464
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 50/183 (27%), Positives = 80/183 (43%), Gaps = 23/183 (12%)
Query: 25 IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL------GAFS 77
I P P +E R + + R + Q FP QV FGS K YLP DIDL A+S
Sbjct: 152 ISPTPAEDEIRGLIVQLISRAVTQAFPDAQVLPFGSYETKLYLPLGDIDLVIQSPSMAYS 211
Query: 78 DDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDN--FVVDIAFNQ 135
D T+ A+ +R A + +A+V IIK + + F VDI+ NQ
Sbjct: 212 DKVTVLHALANTMR----------RAGITDRVTIVAKAKVPIIKFITTHGRFAVDISLNQ 261
Query: 136 LGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYI 195
G+ ++ + E + +++ KA+ S + + G + SY++V L +
Sbjct: 262 TNGVAAGKM---INRYLRELPALRGLVMITKAFLSQRS-MNEVYTGGLGSYSIVCLAISF 317
Query: 196 FHV 198
+
Sbjct: 318 LQM 320
>gi|392567029|gb|EIW60204.1| hypothetical protein TRAVEDRAFT_164816 [Trametes versicolor
FP-101664 SS1]
Length = 660
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 84/316 (26%), Positives = 127/316 (40%), Gaps = 64/316 (20%)
Query: 19 AELIARIQ---------PDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPD 68
AE+ ARI+ P P +E R+ V A V R + + + QV FGS K YLP
Sbjct: 164 AEMYARIEVEAFVKYISPTPIEDEVRSLVVALVSRAVTRTYTDAQVLPFGSYETKLYLPL 223
Query: 69 RDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDN-- 126
DIDL +S D + L L N K RV + +A+V IIK + +
Sbjct: 224 GDIDLVIYSQSMARMDRVSVL--HSLANIVKRAGITDRVTII--AKAKVPIIKFVTTHGR 279
Query: 127 FVVDIAFNQLGGLCT----LCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGL 182
F VDI+ NQ G+ FL+E+ L RS++LI + + G
Sbjct: 280 FSVDISINQGNGVTAGKMVKQFLEELPAL--------RSLVLIIKSFLSQRSMNEVFTGG 331
Query: 183 ISSYALVTLVLYIFH----VFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLL 238
+ SY++V L + V G + +EFF + C +G V ISL
Sbjct: 332 LGSYSIVCLAISFLQMHPKVRRGEIDPSKNMGVLVMEFFELYG----CYFNYGEVGISL- 386
Query: 239 PDVTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNL 298
+DGG S+ + + + D+ GQ+ + + DP N++
Sbjct: 387 ---------RDGG------SYFNKTQRGWMDY--GQQ--------RLLCIEDPGDPTNDI 421
Query: 299 GRSVSKGNFFRIRTAF 314
R N ++RT
Sbjct: 422 SRGSY--NIAKVRTTL 435
>gi|145533334|ref|XP_001452417.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124420105|emb|CAK85020.1| unnamed protein product [Paramecium tetraurelia]
Length = 361
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 60/213 (28%), Positives = 99/213 (46%), Gaps = 32/213 (15%)
Query: 29 PFSEERRNAVAAYVR-RLIIQCFPCQV--FTFGSVPLKTYLPDRDIDLGAFSDDQTLKDT 85
P SEE R A +R I+ F +V FGS K YLP+ DID+ + K+
Sbjct: 77 PTSEEHRRREQAIMRVETFIKEFASEVDIQAFGSFKTKLYLPNADIDVVMIDKSMSAKEL 136
Query: 86 WAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKC--LVDNFVVDIAFNQLGGLCTL 142
+ + + +++++ + + V I A+V IIK + + DI+FNQ+ GL
Sbjct: 137 YKKVAQSLMKSD--------KFENVNLIANAKVPIIKFFEVESQYQFDISFNQMDGLKQ- 187
Query: 143 CFLDEVDHLINENHLFKRSIILIKAWCYYESRILG-GHHGLISSYALVTLVL-YIFHVFN 200
+DE+ FK I+++K C + R L + G I S+ L ++L ++ +
Sbjct: 188 --IDEIRKAFTIYPEFKYLIMILK--CMLKQRELNETYSGGIGSFLLFQMILAFLREIRK 243
Query: 201 GSFAGPL----------EVLYRFLEFF-SKFDW 222
+FA E + RFLEF+ KFD+
Sbjct: 244 EAFANKKQEQLKNITLGEYILRFLEFYGQKFDY 276
>gi|147825319|emb|CAN73261.1| hypothetical protein VITISV_003724 [Vitis vinifera]
Length = 106
Score = 55.5 bits (132), Expect = 4e-05, Method: Composition-based stats.
Identities = 21/29 (72%), Positives = 24/29 (82%)
Query: 221 DWDNFCLSLWGPVPISLLPDVTAEPPRKD 249
DWD FC+SL GPVPIS LPD T EPPR++
Sbjct: 47 DWDGFCVSLGGPVPISSLPDATTEPPRQE 75
>gi|67989518|ref|NP_001018181.1| poly(A) polymerase Cid14 [Schizosaccharomyces pombe 972h-]
gi|81175166|sp|Q9UTN3.2|CID14_SCHPO RecName: Full=Poly(A) RNA polymerase cid14; Short=PAP; AltName:
Full=Caffeine-induced death protein 14; AltName:
Full=Polynucleotide adenylyltransferase cid14
gi|62554069|emb|CAI79317.1| poly(A) polymerase Cid14 [Schizosaccharomyces pombe]
Length = 684
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 65/219 (29%), Positives = 97/219 (44%), Gaps = 40/219 (18%)
Query: 22 IARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQ 80
I I P P R + + + + ++Q +P ++ FGS K YLP D+DL S +
Sbjct: 249 IDYITPTPEEHAVRKTLVSRINQAVLQKWPDVSLYVFGSFETKLYLPTSDLDLVIISPEH 308
Query: 81 ----TLKDTW--AHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---VD 130
T KD + AH ++ + EVQ I A V IIK VD VD
Sbjct: 309 HYRGTKKDMFVLAHHLKKLK-----------LASEVQVITTANVPIIK-FVDPLTKVHVD 356
Query: 131 IAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESR---ILGGHHGLISSYA 187
I+FNQ GGL T C + V+ + + + +I+IK + + LGG +SSYA
Sbjct: 357 ISFNQPGGLKT-CLV--VNGFMKKYPALRPLVIIIKHFLNMRALNEVFLGG----LSSYA 409
Query: 188 LVTLVLYIFHVFNGSFAGPLE-------VLYRFLEFFSK 219
+V LV+ + G + +L FLE + K
Sbjct: 410 IVCLVVSFLQLHPRLSTGSMREEDNFGVLLLEFLELYGK 448
>gi|147799779|emb|CAN72745.1| hypothetical protein VITISV_018734 [Vitis vinifera]
Length = 258
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 21/29 (72%), Positives = 25/29 (86%)
Query: 220 FDWDNFCLSLWGPVPISLLPDVTAEPPRK 248
DWD+FC+SLWGPVPIS LPD T +PPR+
Sbjct: 114 IDWDSFCVSLWGPVPISSLPDATTKPPRQ 142
>gi|336367333|gb|EGN95678.1| hypothetical protein SERLA73DRAFT_60289 [Serpula lacrymans var.
lacrymans S7.3]
Length = 538
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 51/202 (25%), Positives = 88/202 (43%), Gaps = 15/202 (7%)
Query: 25 IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTL- 82
+ P P +E R V + V + + FP QV FGS K YLPD DIDL S+
Sbjct: 196 MSPSPVEDEIRGLVISLVTKAVSSAFPDAQVLPFGSYETKLYLPDGDIDLVIQSESMAYS 255
Query: 83 -KDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDN--FVVDIAFNQLGGL 139
K T H + + L + A+ K +A+V I+K + ++ VDI+ NQ G+
Sbjct: 256 NKVTVLHALANTL------KRAKITSKVTIIAKAKVPIVKFVTNHGRLNVDISINQGNGV 309
Query: 140 CTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVF 199
++ ++ RS+++I + + + G + SY++V L + +
Sbjct: 310 IAGKIVNGFLKDMHGCGFALRSLVMITKAFLNQRGMNEVYTGGLGSYSIVCLAISFLQMH 369
Query: 200 NGSFAGPLEVLYRF----LEFF 217
+G ++ +EFF
Sbjct: 370 PKIRSGEIDAEKNLGVLVMEFF 391
>gi|449547164|gb|EMD38132.1| hypothetical protein CERSUDRAFT_49354 [Ceriporiopsis subvermispora
B]
Length = 547
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 52/175 (29%), Positives = 79/175 (45%), Gaps = 17/175 (9%)
Query: 25 IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
I P P +E R+ V +RR I + FP QV FGS K YLP DIDL S+
Sbjct: 183 ISPTPQEDEVRSLVVELIRRAITRQFPDAQVLPFGSYETKLYLPLGDIDLVIHSNTMAYS 242
Query: 84 DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIK--CLVDNFVVDIAFNQLGGLCT 141
D V L N + VK + +A+V I+K + F VDI+ NQ G+
Sbjct: 243 DK--ENVLRALANTLRRAGITDNVKII--AKAKVPIVKFVTIHGRFSVDISINQGNGVAA 298
Query: 142 LCFLDEVDHLINENHLFKRSIILIKAWCYYESR---ILGGHHGLISSYALVTLVL 193
++H ++E + + ++K++ S GG + SY++V L +
Sbjct: 299 GKM---INHFLSELPALRALVFVVKSFLSQRSMNEVFTGG----LGSYSIVCLAI 346
>gi|213403316|ref|XP_002172430.1| Poly(A) RNA polymerase cid14 [Schizosaccharomyces japonicus yFS275]
gi|212000477|gb|EEB06137.1| Poly(A) RNA polymerase cid14 [Schizosaccharomyces japonicus yFS275]
Length = 667
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 58/185 (31%), Positives = 85/185 (45%), Gaps = 21/185 (11%)
Query: 22 IARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQ 80
I ++P P R ++ + R I +P V+ FGS + YLP DID+ S D
Sbjct: 240 INYLEPTPQEHAVRKSLITKLDRAIRAKWPEVTVYVFGSFETRLYLPTSDIDMVVMSSDT 299
Query: 81 TLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---VDIAFNQL 136
+ T H+ L KN E+Q I A V IIK VD F VD++FNQ
Sbjct: 300 VHRGTKKHMYS--LARHLKNCKL---ATEIQVITTANVPIIK-FVDPFTRIHVDVSFNQP 353
Query: 137 GGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESR---ILGGHHGLISSYALVTLVL 193
GGL T C + V+ + + + +L+K + + LGG +SSYA+V LV+
Sbjct: 354 GGLKT-CLV--VNGFLKKFPAVRPLTMLVKHFLNMRALNEVFLGG----LSSYAIVCLVV 406
Query: 194 YIFHV 198
+
Sbjct: 407 SFLQM 411
>gi|336380050|gb|EGO21204.1| hypothetical protein SERLADRAFT_476100 [Serpula lacrymans var.
lacrymans S7.9]
Length = 592
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 51/202 (25%), Positives = 88/202 (43%), Gaps = 15/202 (7%)
Query: 25 IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTL- 82
+ P P +E R V + V + + FP QV FGS K YLPD DIDL S+
Sbjct: 196 MSPSPVEDEIRGLVISLVTKAVSSAFPDAQVLPFGSYETKLYLPDGDIDLVIQSESMAYS 255
Query: 83 -KDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDN--FVVDIAFNQLGGL 139
K T H + + L + A+ K +A+V I+K + ++ VDI+ NQ G+
Sbjct: 256 NKVTVLHALANTL------KRAKITSKVTIIAKAKVPIVKFVTNHGRLNVDISINQGNGV 309
Query: 140 CTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVF 199
++ ++ RS+++I + + + G + SY++V L + +
Sbjct: 310 IAGKIVNGFLKDMHGCGFALRSLVMITKAFLNQRGMNEVYTGGLGSYSIVCLAISFLQMH 369
Query: 200 NGSFAGPLEVLYRF----LEFF 217
+G ++ +EFF
Sbjct: 370 PKIRSGEIDAEKNLGVLVMEFF 391
>gi|406604992|emb|CCH43591.1| Poly(A) RNA polymerase protein 1 [Wickerhamomyces ciferrii]
Length = 624
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 63/237 (26%), Positives = 99/237 (41%), Gaps = 40/237 (16%)
Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
WL E + IA I P E RN +R I++ +P C+V FGS YLP
Sbjct: 211 WLTLE--IKDFIAYISPSKEEIELRNNTVRKLREAIMELWPDCEVHVFGSYATDLYLPGS 268
Query: 70 DIDL------GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKC 122
DID+ G + +L + L R L K V+ I +A+V IIK
Sbjct: 269 DIDMVIVSEHGGYESRNSLYSLSSFLKRKNL------------AKNVEVIAKAKVPIIKF 316
Query: 123 L--VDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGG-H 179
N +D++F + G+ + I E + ++++K + SR L H
Sbjct: 317 TESTSNIHIDVSFERTNGIDA---AKTIRSWITETPGLREIVLIVKQ--FLSSRKLNNVH 371
Query: 180 HGLISSYALVTLVLYIFHVFNGSFA----GPLE----VLYRFLEFFSK-FDWDNFCL 227
G + Y+++ LV Y F + + + P E +L F E + K F +DN +
Sbjct: 372 VGGLGGYSIICLV-YSFLILHPRLSTGNISPYENLGVLLIEFFELYGKNFGYDNVAI 427
>gi|170109615|ref|XP_001886014.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164638944|gb|EDR03218.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 397
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 61/238 (25%), Positives = 99/238 (41%), Gaps = 28/238 (11%)
Query: 14 AEEITAELIA---RIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
AE + AE+ A I P P +E R + + + FP +V FGS K YLP
Sbjct: 99 AEMLHAEVKAFVHWISPSPVEDEVRGLIVTQISNTVKASFPDARVLPFGSYETKLYLPLG 158
Query: 70 DIDLGAFSDDQTL--KDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDN- 126
DIDL SD K H + + L+ H K A+V I+K + +
Sbjct: 159 DIDLVILSDSMAYSNKVNVLHALANTLKRSGVTSHVTIIAK------AKVPIVKFVTTHG 212
Query: 127 -FVVDIAFNQLGGLCTL----CFLDEV--DHLINENHLFKRSIILIKAWCYYESRILGGH 179
F VDI+ NQ GL + FL ++ + + + RS++++ + + +
Sbjct: 213 RFHVDISLNQSNGLLSGKIINGFLKDMHGNGAEGKGSMALRSLVMVTKAFLTQRSMNEVY 272
Query: 180 HGLISSYALVTLVLYIFHVF----NGSFAGPLEVLYRFLEFFS----KFDWDNFCLSL 229
G + SY++V L + + NG + +EFF F++D +SL
Sbjct: 273 TGGLGSYSIVCLAVSFLQMHPKIRNGEIDPEKNLGVLAMEFFELYGCYFNYDEVGISL 330
>gi|451844986|gb|EMD58301.1| hypothetical protein COCSADRAFT_165704 [Cochliobolus sativus
ND90Pr]
Length = 642
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 70/266 (26%), Positives = 105/266 (39%), Gaps = 37/266 (13%)
Query: 7 DPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQC-FPCQ---VFTFGSVPL 62
+P +WL E + + + P PF E+RN + V + Q FP Q V FGS P
Sbjct: 318 EPEKWLHNEIL--DFYGFVAPKPFEHEQRNRLVNRVNNALGQRRFPQQNGRVLCFGSFPA 375
Query: 63 KTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNE---HAEFRVKEV-------QY 112
YLP D+DL SD V DM + A R+K + Q
Sbjct: 376 GLYLPTADMDLVYVSDQYY---NGGPPVVDMSQRGANKSLLYKASNRLKSMGMDADGCQV 432
Query: 113 IQAEVKIIKCL--VDNFVVDIAFNQLGGLCTLC----FLDEVDHLINENHLFKRSIILIK 166
I A+V IIK + VDI+F L G+ + E +I L K+ +++
Sbjct: 433 IHAKVPIIKFQDRLTQLQVDISFENLSGVQAQATFAQWKQEYPDMIYMVALLKQFLVM-- 490
Query: 167 AWCYYESRILGGHHGLISSYALVTLVL-YIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNF 225
+ H G I ++++ L++ YI H G E FL ++ FD
Sbjct: 491 ------HGLNEVHTGGIGGFSIICLIVSYIQHSDKHENLG--ECFLGFLRYYGDFDLSRK 542
Query: 226 CLSLWGPVPISLLP-DVTAEPPRKDG 250
+ ++ P I + P R DG
Sbjct: 543 RIQMYPPAIIEKTAHGIDGRPERYDG 568
>gi|391346299|ref|XP_003747415.1| PREDICTED: PAP-associated domain-containing protein 5-like
[Metaseiulus occidentalis]
Length = 491
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 79/316 (25%), Positives = 127/316 (40%), Gaps = 61/316 (19%)
Query: 26 QPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKD 84
QP+ + RR V VR I + +P C V FGS YLP DID+ ++
Sbjct: 98 QPNAADQSRREQVIEKVRAAIREKWPDCVVEVFGSYKTGLYLPTGDIDM-------VIQG 150
Query: 85 TWA------HLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL-VDNFV-VDIAFNQL 136
W L R ++E ++ E F+V + +A V +IK D + VD++FNQ
Sbjct: 151 NWEIIPPLFDLERQLIE-KKVGEKNTFKVLD----KASVPLIKFKDADTEIRVDLSFNQA 205
Query: 137 GGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIF 196
F+ + + I ++K + + HG ISSY+L ++L
Sbjct: 206 NCTEAAAFVKQCCRTFPP---LAKLIFVLKQYLSLHG-LNEVFHGGISSYSLTLMILSFL 261
Query: 197 H------VFNGSFAGPLEVLYRFLEFFS-KFDWDNFCLSLWGPVPISLLPDVTAEPPRKD 249
+ ++L FLEF+ +F++D + + +D
Sbjct: 262 QLHPEQEMVRSDKPETGKLLVEFLEFYGDRFEYDKMGIRI------------------RD 303
Query: 250 GGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFR 309
GG L+ K+ L C A GG + G S + DPL N++ RS R
Sbjct: 304 GGA-LVDKNQLRECLIAA----GGPPSSG----SNLLCIEDPLTPGNDVARSSYA--MSR 352
Query: 310 IRTAFTFRAKGLARLL 325
+R AF L++L+
Sbjct: 353 VRDAFKSAFTCLSKLV 368
>gi|403331574|gb|EJY64740.1| Poly(A) RNA polymerase putative [Oxytricha trifallax]
Length = 316
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 55/226 (24%), Positives = 105/226 (46%), Gaps = 23/226 (10%)
Query: 14 AEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDID 72
E T + + + P +E RN VA + +I FP C VF FGS LP+ DID
Sbjct: 15 TETSTHDFVNFVTPSKEDKEIRNKVATSIEEVIKGVFPDCHVFVFGSCATGLNLPNSDID 74
Query: 73 LGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQ-AEVKIIKCLVDNFV--V 129
L + D + + V D + ++K K + ++ +V +IK F V
Sbjct: 75 LIVYQPDVS-ESRMITKVADAIVRQKK-------CKTIDVLKNTKVPLIKITDSEFGVNV 126
Query: 130 DIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGG-HHGLISSYAL 188
DI+FN+ G+ + + ++ + E K ++++K C+ +SR L + G + S+ L
Sbjct: 127 DISFNRTNGVYCVKLVKQLLQMFPE---LKPLMMVLK--CFLKSRQLNEPYSGGVGSFLL 181
Query: 189 VTLVL-YIFHVFNGSFAGPLEVLYRFLEFF----SKFDWDNFCLSL 229
+V ++ + L++ + L+FF ++F++ + +S+
Sbjct: 182 TMMVTSFLQRQYKLGNTNNLDLGKQLLDFFKLYGTEFNYQHVGISI 227
>gi|145525609|ref|XP_001448621.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124416176|emb|CAK81224.1| unnamed protein product [Paramecium tetraurelia]
Length = 364
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 55/211 (26%), Positives = 88/211 (41%), Gaps = 29/211 (13%)
Query: 29 PFSEERRNAVAAYVR--RLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDT 85
P +E + V AY+R + + P Q+ +FGS + YLP+ DID+ T K
Sbjct: 77 PSDQEHKRRVTAYLRVEKYLQDIAPEAQIESFGSFKTRMYLPNADIDIVMIETSCTQKQL 136
Query: 86 WAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKC--LVDNFVVDIAFNQLGGLCTLC 143
+ + M++ K E+ A+V IIK + + D++FNQL GL +
Sbjct: 137 FKKVAARMMKQTNKFENVNL------IANAKVPIIKFVEVESQYHFDLSFNQLDGLKQIE 190
Query: 144 FLDEVDHLINENHLFKRSIILIKAWCYYESRILG-GHHGLISSYALVTLVLYIFHVFNGS 202
L++ L E +L+ C R L + G + S+ L ++L F
Sbjct: 191 ELEKAFELYPE-----LKFLLMTLKCVLRQRDLNETYSGGVGSFLLFQMILAFLREFRKD 245
Query: 203 F-----------AGPLEVLYRFLEFFS-KFD 221
F E + +FLEF+ KFD
Sbjct: 246 FFQHNKEDQIKNVTLGEYMIKFLEFYGIKFD 276
>gi|212645230|ref|NP_492446.3| Protein GLD-4 [Caenorhabditis elegans]
gi|403399397|sp|G5EFL0.1|GLD4_CAEEL RecName: Full=Poly(A) RNA polymerase gld-4; AltName: Full=Defective
in germ line development protein 4; AltName:
Full=Germline development defective-4
gi|194686198|emb|CAB02138.3| Protein GLD-4 [Caenorhabditis elegans]
gi|226972859|gb|ACO95123.1| germline defective-4 [Caenorhabditis elegans]
Length = 845
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 80/320 (25%), Positives = 132/320 (41%), Gaps = 73/320 (22%)
Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRD 70
W+K EI + L ++ F + R + + + ++ I ++ FGS+ +LP D
Sbjct: 90 WIKPNEIESRLRTKV----FEKVRDSVLRRWKQKTI------KISMFGSLRTNLFLPTSD 139
Query: 71 IDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQ-YIQAEVKIIKCLVD---N 126
ID+ DD W D L + A+ + V Y A V I+K +VD
Sbjct: 140 IDVLVECDD------WVGTPGDWLAETARGLEADNIAESVMVYGGAFVPIVK-MVDRDTR 192
Query: 127 FVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSY 186
+DI+FN + G+ ++ +V E L + ++L+K + +Y + + G +SSY
Sbjct: 193 LSIDISFNTVQGVRAASYIAKVKE---EFPLIEPLVLLLKQFLHYRN-LNQTFTGGLSSY 248
Query: 187 ALVTLVLYIFHVF-----------NGSFAGPLEVLYRFLEFFS-KFDWDNFCLSLWGPVP 234
LV L++ F ++ G G L L RFLE +S +F+++ +S
Sbjct: 249 GLVLLLVNFFQLYALNMRSRTIYDRGVNLGHL--LLRFLELYSLEFNFEEMGIS------ 300
Query: 235 ISLLPDVTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRV 294
G + KS RY + Q QP + + DPL
Sbjct: 301 --------------PGQCCYIPKS-ASGARYGHK--------QAQP---GNLALEDPLLT 334
Query: 295 NNNLGRSVSKGNFFRIRTAF 314
N++GRS NF I AF
Sbjct: 335 ANDVGRSTY--NFSSIANAF 352
>gi|365758533|gb|EHN00370.1| Pap2p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 514
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 54/240 (22%), Positives = 99/240 (41%), Gaps = 30/240 (12%)
Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
WL E + +A I P E RN + +R + Q +P + FGS YLP
Sbjct: 180 WLTFE--IKDFVAYISPSREEIEIRNQTISTIREALKQLWPDADLHVFGSYSTDLYLPGS 237
Query: 70 DID------LGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL 123
DID LG L +HL ++ L E + A+ RV +++++ +I
Sbjct: 238 DIDCVVNSELGGKESRNNLYSLASHLKKNNLATEIE-VVAKARVPIIKFVEPHSRI---- 292
Query: 124 VDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLI 183
+D++F + GL + E +N+ + ++++K + + R+ H G +
Sbjct: 293 ----HIDVSFERTNGLEAAKLIRE---WLNDTPGLRELVLIVKQFLHAR-RLNNVHTGGL 344
Query: 184 SSYALVTLVLYIFHVFNGSFAGPLE-------VLYRFLEFFSK-FDWDNFCLSLWGPVPI 235
++++ LV H+ ++ +L F E + K F +D+ L PI
Sbjct: 345 GGFSIICLVFSFLHMHPRIITKEIDSKDNLGVLLIEFFELYGKNFGYDDVALGSSDGYPI 404
>gi|426199822|gb|EKV49746.1| hypothetical protein AGABI2DRAFT_63272 [Agaricus bisporus var.
bisporus H97]
Length = 481
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 53/218 (24%), Positives = 86/218 (39%), Gaps = 19/218 (8%)
Query: 25 IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTL- 82
+ P P +E R + R I F +VF FGS K YLP DIDL SD
Sbjct: 154 MAPTPIEDEIRELTVQMISRAITTAFSGSKVFPFGSYETKLYLPSGDIDLVIVSDSMAYS 213
Query: 83 -KDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLV--DNFVVDIAFNQLGGL 139
K + H + +L A +A+V I+K + F VDI+ NQ G+
Sbjct: 214 NKSSVLHSLASVL------RRAGIASNVTVIAKAKVPIVKFVTIHGRFNVDISINQTNGI 267
Query: 140 CTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVF 199
+ + L RS++LI + + G + SY++V L + +
Sbjct: 268 VGGQVIKGFLQNLVTGGLALRSLVLITKLFLSQRSMNEVFTGGLGSYSIVCLAISFLQMH 327
Query: 200 NGSFAGPLE-------VLYRFLEFFS-KFDWDNFCLSL 229
G ++ ++ F E + F++D +S+
Sbjct: 328 PKIRRGEIDPEKNLGVLVMEFFELYGCHFNYDEVGISV 365
>gi|71005312|ref|XP_757322.1| hypothetical protein UM01175.1 [Ustilago maydis 521]
gi|46096726|gb|EAK81959.1| hypothetical protein UM01175.1 [Ustilago maydis 521]
Length = 730
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 65/240 (27%), Positives = 101/240 (42%), Gaps = 38/240 (15%)
Query: 14 AEEITAELIA---RIQPDPFSEERRNAVAAYVRRLIIQCF-PCQVFTFGSVPLKTYLPDR 69
AE + ELIA + P E R V + R I F +V+ FGS K YLP
Sbjct: 96 AEALHRELIAFDYWMTPTAAEHETRCMVIELISRAIKSQFRDAEVYPFGSQETKLYLPQG 155
Query: 70 DIDLGAFSDD-------QTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIK 121
D+DL S+ L+ A L R L +VQ I +A+V IIK
Sbjct: 156 DLDLVVVSNSMANLRVQSALRTMAACLRRHNL------------ATDVQVIAKAKVPIIK 203
Query: 122 CLVD--NFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGH 179
+ VDI+ N GL T +++ L H+ R +IL+ + + +
Sbjct: 204 FVTTYARLKVDISLNHTNGLTTASYVNS--WLRKWPHI--RPLILVVKYLLMQRGMSEVF 259
Query: 180 HGLISSYALVTLVLYIFHVFNGSFAGPLE-------VLYRFLEFFSK-FDWDNFCLSLWG 231
G + SY+++ +V+ + G ++ +L FLE + K F +DN +S+ G
Sbjct: 260 SGGLGSYSVIIMVISFLQLHPKVQRGEIDADRSLGVLLLEFLELYGKNFGYDNCGISIRG 319
>gi|401623740|gb|EJS41828.1| trf4p [Saccharomyces arboricola H-6]
Length = 573
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 58/242 (23%), Positives = 97/242 (40%), Gaps = 34/242 (14%)
Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
WL E + +A I P E RN + +R + Q +P + FGS YLP
Sbjct: 170 WLTFE--IKDFVAYISPSREEIEVRNQTISMIREAVKQLWPDADLHVFGSYSTDLYLPGS 227
Query: 70 DID------LGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL 123
DID LG L +HL ++KN E V +A V IIK +
Sbjct: 228 DIDCVITSELGGKESRNNLFSLASHL-------KKKNLATEIEV----VAKARVPIIKFV 276
Query: 124 VDN--FVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHG 181
N +D++F + GL + E +N+ + ++++K + + R+ H G
Sbjct: 277 EPNSGIHIDVSFERTNGLEAAKLIRE---WLNDTPGLRELVLIVKQFLH-SRRLNNVHTG 332
Query: 182 LISSYALVTLVLYIFHVFNGSFAGPLE-------VLYRFLEFFSK-FDWDNFCLSLWGPV 233
+ ++++ LV H+ +E +L F E + K F +D+ L
Sbjct: 333 GLGGFSIICLVFSFLHMHPRIITKEIEAKDNLGVLLIEFFELYGKNFGYDDVALGSSDGY 392
Query: 234 PI 235
P+
Sbjct: 393 PV 394
>gi|409081996|gb|EKM82354.1| hypothetical protein AGABI1DRAFT_52475, partial [Agaricus bisporus
var. burnettii JB137-S8]
Length = 559
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 53/218 (24%), Positives = 86/218 (39%), Gaps = 19/218 (8%)
Query: 25 IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTL- 82
+ P P +E R + R I F +VF FGS K YLP DIDL SD
Sbjct: 153 MAPTPIEDEIRELTVQMISRAITTAFSGSKVFPFGSYETKLYLPSGDIDLVIVSDSMAYS 212
Query: 83 -KDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIK--CLVDNFVVDIAFNQLGGL 139
K + H + +L A +A+V I+K + F VDI+ NQ G+
Sbjct: 213 NKSSVLHSLASVL------RRAGIASNVTVIAKAKVPIVKFVTIHGRFNVDISINQTNGI 266
Query: 140 CTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVF 199
+ + L RS++LI + + G + SY++V L + +
Sbjct: 267 VGGQVIKGFLQNLVTGGLALRSLVLITKLFLSQRSMNEVFTGGLGSYSIVCLAISFLQMH 326
Query: 200 NGSFAGPLE-------VLYRFLEFFS-KFDWDNFCLSL 229
G ++ ++ F E + F++D +S+
Sbjct: 327 PKIRRGEIDPEKNLGVLVMEFFELYGCHFNYDEVGISV 364
>gi|401837753|gb|EJT41641.1| PAP2-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 592
Score = 50.8 bits (120), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 54/240 (22%), Positives = 99/240 (41%), Gaps = 30/240 (12%)
Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
WL E + +A I P E RN + +R + Q +P + FGS YLP
Sbjct: 180 WLTFE--IKDFVAYISPSREEIEIRNQTISTIREALKQLWPDADLHVFGSYSTDLYLPGS 237
Query: 70 DID------LGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL 123
DID LG L +HL ++ L E + A+ RV +++++ +I
Sbjct: 238 DIDCVVNSELGGKESRNNLYSLASHLKKNNLATEIEVV-AKARVPIIKFVEPHSRI---- 292
Query: 124 VDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLI 183
+D++F + GL + E +N+ + ++++K + + R+ H G +
Sbjct: 293 ----HIDVSFERTNGLEAAKLIRE---WLNDTPGLRELVLIVKQFLHAR-RLNNVHTGGL 344
Query: 184 SSYALVTLVLYIFHVFNGSFAGPLE-------VLYRFLEFFSK-FDWDNFCLSLWGPVPI 235
++++ LV H+ ++ +L F E + K F +D+ L PI
Sbjct: 345 GGFSIICLVFSFLHMHPRIITKEIDSKDNLGVLLIEFFELYGKNFGYDDVALGSSDGYPI 404
>gi|403213331|emb|CCK67833.1| hypothetical protein KNAG_0A01440 [Kazachstania naganishii CBS
8797]
Length = 526
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 57/244 (23%), Positives = 103/244 (42%), Gaps = 31/244 (12%)
Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
WL E + I+ I P+ ++RN +R + + +P + FGS YLP
Sbjct: 139 WLTLE--VKDFISYISPNRVEIKQRNTTIGKIRAAVSELWPDADLHVFGSYATDLYLPGS 196
Query: 70 DIDL------GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL 123
DID G + +L HL ++ L E + A+ RV +++++ E +I
Sbjct: 197 DIDCVVNSKGGDKENQSSLYKLATHLKKNGLATEIEI-IAKARVPIIKFVEPESRI---- 251
Query: 124 VDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLI 183
+D++F ++ GL + E E+ R ++LI + R+ H G +
Sbjct: 252 ----HIDVSFERINGLEAAKLIRE----WLESTPGLRELVLIIKQFLHSRRLNNVHTGGL 303
Query: 184 SSYALVTLVLYIFHVFNGSFAGPLE-------VLYRFLEFFSK-FDWDNFCLSLWGPVPI 235
++++ LV + ++ +L F E + K F +D+ +SL VP
Sbjct: 304 GGFSIICLVYSFLSMHPRVITNEIDPIDNLGVLLIDFFELYGKNFGYDDVAISLSNGVP- 362
Query: 236 SLLP 239
S LP
Sbjct: 363 SYLP 366
>gi|449017212|dbj|BAM80614.1| hypothetical protein CYME_CMK272C [Cyanidioschyzon merolae strain
10D]
Length = 1647
Score = 50.4 bits (119), Expect = 0.001, Method: Composition-based stats.
Identities = 43/160 (26%), Positives = 66/160 (41%), Gaps = 12/160 (7%)
Query: 119 IIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGG 178
+++C + N LC CFL E D LI HL R +IL+K W + S
Sbjct: 1160 VVRCRTNGLTTQFLLNPAVALCRSCFLVECDELIGRRHLLIRCLILLKVW-WRHSLATAQ 1218
Query: 179 HHGLIS--SYALVTLVLYIFHVFNGSFAG-----PLEVLYRFLEFFS-KFDWDNFCLSLW 230
L+S S +LV+ L + + + G P VL F+ DW +S++
Sbjct: 1219 ARALLSPLSGSLVSPFLALLLLSYLNCRGLPGDEPAHVLQGLFSFYGFDMDWSRCGMSIY 1278
Query: 231 GPVPI---SLLPDVTAEPPRKDGGVLLLSKSFLDSCRYAY 267
GP I +L+ +T P +L + +CR Y
Sbjct: 1279 GPFDIQSGALMTHLTTRQPLIPDAMLRAHQLEYATCRLRY 1318
>gi|145546801|ref|XP_001459083.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124426906|emb|CAK91686.1| unnamed protein product [Paramecium tetraurelia]
Length = 364
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 52/213 (24%), Positives = 91/213 (42%), Gaps = 33/213 (15%)
Query: 29 PFSEERRNAVAAYVR--RLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDT 85
P +E + V AY+R + + P Q+ +FGS + YLP+ DID+ T K
Sbjct: 77 PSDQEHKRRVTAYMRVEKYLQDIAPEAQIESFGSFKTRMYLPNADIDMVMIETSCTQKQL 136
Query: 86 WAHLVRDMLENEEKNEH----AEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCT 141
+ + M++ K E+ A +V +++I+ E + + D++FNQL GL
Sbjct: 137 FKKVAAKMMKQTNKFENVNLIANAKVPIIKFIEVESQ--------YHFDLSFNQLDGLKQ 188
Query: 142 LCFLDEVDHLINENHLFKRSIILIKAWCYYESRILG-GHHGLISSYALVTLVLYIFHVFN 200
+ L++ + E +L+ C R L + G + S+ L ++L +
Sbjct: 189 IEELEKAFEIYPE-----LKFLLMTLKCVLRQRDLNETYSGGVGSFLLFQMILAFLREYR 243
Query: 201 GSF-----------AGPLEVLYRFLEFFS-KFD 221
F E + +FLEF+ KFD
Sbjct: 244 KDFFQHNKQDQIKNVTLGEYMIKFLEFYGIKFD 276
>gi|50302781|ref|XP_451327.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49640458|emb|CAH02915.1| KLLA0A07359p [Kluyveromyces lactis]
Length = 684
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 63/260 (24%), Positives = 109/260 (41%), Gaps = 42/260 (16%)
Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFT-FGSVPLKTYLPDR 69
WL E + ++ I P+ E+RN A ++ +++ +P FGS YLP
Sbjct: 190 WLTLE--IKDFVSYISPNRQEIEQRNQAIAKLKEAVVELWPDSSLNCFGSYATDLYLPGS 247
Query: 70 DIDL------GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL 123
DID G + L + L R L + + A+ RV +++++ E KI
Sbjct: 248 DIDCVVRSASGDKENRNALYSLASFLKRKQLATQVE-VIAKARVPIIKFVEPESKI---- 302
Query: 124 VDNFVVDIAFNQLGGL----CTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGH 179
+D++F + GL +L+E L R ++LI + R+ H
Sbjct: 303 ----HIDVSFERTNGLEAARVIRGWLEEQPGL--------RELVLIVKQFLHARRLNNVH 350
Query: 180 HGLISSYALVTLVLYIFHVFNGSFAG---PLE----VLYRFLEFFSK-FDWDNFCLSLWG 231
G + Y+++ LV + G PLE +L F E + K F +D+ +S+
Sbjct: 351 TGGLGGYSIICLVYTFLKLHPRVLTGDIDPLENLGVLLIDFFELYGKNFGYDDVGISVSE 410
Query: 232 P----VPISLLPDVTAEPPR 247
+P + PD++A PR
Sbjct: 411 HEARYIPKNEHPDLSAGRPR 430
>gi|443895250|dbj|GAC72596.1| DNA polymerase sigma [Pseudozyma antarctica T-34]
Length = 689
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 67/240 (27%), Positives = 102/240 (42%), Gaps = 28/240 (11%)
Query: 11 WLK----AEEITAELIA---RIQPDPFSEERRNAVAAYVRRLIIQCF-PCQVFTFGSVPL 62
W K AE + EL+A + P E R V + R I F +V FGS
Sbjct: 88 WAKCQNGAEALHRELMAFDHWMAPTAAEHETRCMVIELISRAIKSQFRDAEVHPFGSQET 147
Query: 63 KTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIK 121
K YLP D+DL S T + L R M ++ A +VQ I +A+V IIK
Sbjct: 148 KLYLPQGDLDLVVVSRSMANLRTQSAL-RTMAACLRRHNLA----TDVQVIAKAKVPIIK 202
Query: 122 CLVD--NFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGH 179
+ VDI+ N GL T F++ L H+ R +I++ + +
Sbjct: 203 FVTTYARLKVDISLNHTNGLTTASFVNS--WLRKWPHI--RPLIIVVKHLLMQRGMSEVF 258
Query: 180 HGLISSYALVTLVLYIFHVFNGSFAGPLE-------VLYRFLEFFSK-FDWDNFCLSLWG 231
G + SY+++ +V+ + G +E +L FLE + K F +DN +S+ G
Sbjct: 259 SGGLGSYSIIIMVISFLQLHPKVQRGEIEPGRSLGVLLLEFLELYGKNFGYDNCGISIRG 318
>gi|268566431|ref|XP_002639720.1| Hypothetical protein CBG12446 [Caenorhabditis briggsae]
Length = 897
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 83/321 (25%), Positives = 133/321 (41%), Gaps = 75/321 (23%)
Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRD 70
W+K EI L ++ E+ R++V+ Q P ++ FGS+ +LP D
Sbjct: 92 WIKPNEIEVRLRTKVY-----EKVRDSVSQR-----WQHKPIKISMFGSLRTNLFLPTSD 141
Query: 71 IDLGAFSDD--QTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVD--- 125
ID+ DD T D W LEN+ E + A V I+K +VD
Sbjct: 142 IDVLVECDDWVGTPGD-WLGETARGLENDNIAESVTV------FGGAFVPIVK-MVDRDT 193
Query: 126 NFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISS 185
+DI+FN + G+ ++ +V E L + ++L+K + +Y + + G +SS
Sbjct: 194 RLSIDISFNTVQGVRAASYIAKVKE---EFPLIEPLVLLLKQFLHYRN-LNQTFTGGLSS 249
Query: 186 YALVTLVLYIFHVF-----------NGSFAGPLEVLYRFLEFFS-KFDWDNFCLSLWGPV 233
Y LV L++ F ++ +G G L L RFLE +S +F+++ +S
Sbjct: 250 YGLVLLLVNFFQLYALNMRHRTIYDSGVNLGHL--LLRFLEVYSMEFNYEEIGIS----- 302
Query: 234 PISLLPDVTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLR 293
G +SKS RY + + QP + + DPL
Sbjct: 303 ---------------PGQCCYISKSAA-GARYGH--------KRAQP---GNLALEDPLL 335
Query: 294 VNNNLGRSVSKGNFFRIRTAF 314
N++GRS NF I AF
Sbjct: 336 TANDVGRSTY--NFSSIANAF 354
>gi|388580693|gb|EIM21006.1| Nucleotidyltransferase, partial [Wallemia sebi CBS 633.66]
Length = 360
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 58/222 (26%), Positives = 98/222 (44%), Gaps = 27/222 (12%)
Query: 25 IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAF--SDDQT 81
I P + R +RR I + +VF FGS + YLPD DIDL S +Q
Sbjct: 86 ISPSLTEHKTREYTIECIRRCITSRWADAEVFAFGSFETRLYLPDGDIDLVVMRKSVNQY 145
Query: 82 LKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIK--CLVDNFVVDIAFNQLGG 138
K + H + ML + +Q I +A V IIK + +DI+ NQ G
Sbjct: 146 NKQSMLHTMASMLRQAN-------LAQSIQVISKARVPIIKFTSSFGGYPIDISLNQTNG 198
Query: 139 LCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGG-HHGLISSYALVTLVLYIFH 197
+ ++E+ ++ + +L+K C+ R + + G +SSY+++ LV+
Sbjct: 199 VDAGRMVNEI---LDRYPAARPLSMLLK--CFLSQRSMNEVYTGGVSSYSVICLVVSFLQ 253
Query: 198 VFNGSFAG---PLE----VLYRFLEFFSK-FDWDNFCLSLWG 231
+ G PL+ +L LE + + F++D +S+ G
Sbjct: 254 MHPKVRRGDINPLDNLGVLLVDLLELYGRNFNYDVTGISIEG 295
>gi|395333834|gb|EJF66211.1| hypothetical protein DICSQDRAFT_152192 [Dichomitus squalens
LYAD-421 SS1]
Length = 647
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 60/220 (27%), Positives = 94/220 (42%), Gaps = 27/220 (12%)
Query: 25 IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
+ P P +E R+ + R I + +P +V FGS K YLP DIDL +S
Sbjct: 169 MSPTPIEDEVRSLSVQLIARAISKSYPDAKVLPFGSYETKLYLPSGDIDLVIYSHSMMRM 228
Query: 84 DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDN--FVVDIAFNQLGGLCT 141
D + L L N K RV + +A+V IIK + + F VDI+ NQ G+ T
Sbjct: 229 DKVSVL--HSLANIMKRAGITDRVTII--AKAKVPIIKFVTAHGRFSVDISVNQGNGVDT 284
Query: 142 ----LCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFH 197
FL E+ L RS++LI + + G + SY++V L +
Sbjct: 285 GKMVKQFLRELPAL--------RSLVLIIKNFLSQRSMNEVFTGGLGSYSIVCLAISFLQ 336
Query: 198 VFNGSFAGPLE-------VLYRFLEFF-SKFDWDNFCLSL 229
+ G ++ ++ F E + S F++ +SL
Sbjct: 337 MHPKIRRGEIDPSKNLGVLVMEFFELYGSYFNYQEVGISL 376
>gi|393216777|gb|EJD02267.1| hypothetical protein FOMMEDRAFT_141374 [Fomitiporia mediterranea
MF3/22]
Length = 732
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 63/217 (29%), Positives = 95/217 (43%), Gaps = 21/217 (9%)
Query: 25 IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
+ P P E R V + I + + +V FGS K YLP DIDL S +TL
Sbjct: 160 VSPTPVEHEVRWMVVQLISSSIKRVYSDSEVLPFGSFGTKLYLPQGDIDLVVQS--RTLA 217
Query: 84 DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIK--CLVDNFVVDIAFNQLGGLCT 141
L N K +V + QA V IIK L F VDI+ NQ G+ T
Sbjct: 218 SFEKVTALKSLANIVKRTGLADKVTIIS--QARVPIIKFTTLYGRFAVDISMNQSNGVKT 275
Query: 142 LCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGG-HHGLISSYALVTLVLYIFHVFN 200
D ++ +NE + ++++K+ + + R L + G + SYA+V L + +
Sbjct: 276 ---GDMINRFLNEFPALRAIVLIVKS--FLKQRNLNEVYSGGLGSYAIVCLAVSHLQMHP 330
Query: 201 G------SFAGPLEVLY-RFLEFFSK-FDWDNFCLSL 229
+ A L VL F E + K F+++N +SL
Sbjct: 331 KVRRAEINSAKNLGVLTLEFFELYGKYFNYNNTGISL 367
>gi|388851758|emb|CCF54564.1| related to TRF4-topoisomerase I-related protein [Ustilago hordei]
Length = 701
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 67/233 (28%), Positives = 100/233 (42%), Gaps = 24/233 (10%)
Query: 14 AEEITAELIARIQ---PDPFSEERRNAVAAYVRRLIIQCF-PCQVFTFGSVPLKTYLPDR 69
AE + ELIA Q P E R V + R I F +V FGS K YLP
Sbjct: 96 AEALHRELIAFDQWMAPTGAEHETRCMVIELIARAIKSQFRDAEVRPFGSQETKLYLPQG 155
Query: 70 DIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVD--N 126
D+DL S T + L R M ++ A +VQ I +A+V IIK +
Sbjct: 156 DLDLVVVSRSMANLRTQSAL-RTMAACLRRHNLA----TDVQVIAKAKVPIIKFVTTYAR 210
Query: 127 FVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSY 186
VDI+ N GL T +++ L H+ R +IL+ + + G + SY
Sbjct: 211 LKVDISLNHTNGLTTASYVN--GWLRKWPHI--RPLILVIKHLLMQRGMSEVFSGGLGSY 266
Query: 187 ALVTLVLYIFHVFNGSFAGPLE-------VLYRFLEFFSK-FDWDNFCLSLWG 231
+++ +V+ + G +E +L FLE + K F +DN +S+ G
Sbjct: 267 SVIIMVISFLQLHPKLQRGEIEPGRSLGVLLLEFLELYGKNFGYDNCGISIRG 319
>gi|451992975|gb|EMD85451.1| hypothetical protein COCHEDRAFT_1148848 [Cochliobolus
heterostrophus C5]
Length = 624
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 69/266 (25%), Positives = 105/266 (39%), Gaps = 37/266 (13%)
Query: 7 DPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQC-FPCQ---VFTFGSVPL 62
+P +WL E + + + P PF E+RN + V + Q FP Q V FGS P
Sbjct: 300 EPEKWLHNEIL--DFYDFVAPKPFEHEQRNRLVNRVNNALGQRRFPQQNGRVLCFGSFPA 357
Query: 63 KTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNE---HAEFRVKEV-------QY 112
YLP D+DL SD V DM + A R+K + Q
Sbjct: 358 GLYLPTADMDLVYVSDQYY---NGGPPVVDMSQRGANKSLLYKASNRLKSMGMDADGCQV 414
Query: 113 IQAEVKIIK--CLVDNFVVDIAFNQLGGLCTLC----FLDEVDHLINENHLFKRSIILIK 166
I A+V IIK + VDI+F L G+ + + +I L K+ +++
Sbjct: 415 IHAKVPIIKFQDRLTQLQVDISFENLSGVQAQATFAQWKQDYPDMIYMVALLKQFLVM-- 472
Query: 167 AWCYYESRILGGHHGLISSYALVTLVL-YIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNF 225
+ H G I ++++ L++ YI H G E FL+++ FD
Sbjct: 473 ------HGLNEVHTGGIGGFSIICLIVSYIQHSDKHENLG--ECFLGFLKYYGDFDLSRK 524
Query: 226 CLSLWGPVPISLLP-DVTAEPPRKDG 250
+ + P I + P R DG
Sbjct: 525 RIQMHPPAIIEKTAHGIDGRPERYDG 550
>gi|357491469|ref|XP_003616022.1| hypothetical protein MTR_5g075260 [Medicago truncatula]
gi|355517357|gb|AES98980.1| hypothetical protein MTR_5g075260 [Medicago truncatula]
Length = 490
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 31/43 (72%)
Query: 305 GNFFRIRTAFTFRAKGLARLLDCPNEDLYNEVNQFFMNTRDRH 347
GNF+RIR+AF + A+ L +L P + + +E+N+FF NT DRH
Sbjct: 11 GNFYRIRSAFKYGARKLGWILMLPEDRIADELNRFFANTLDRH 53
>gi|341895116|gb|EGT51051.1| hypothetical protein CAEBREN_16945 [Caenorhabditis brenneri]
Length = 901
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 59/223 (26%), Positives = 101/223 (45%), Gaps = 32/223 (14%)
Query: 25 IQPDPFSEERRNAVAAYVRRLIIQCF---PCQVFTFGSVPLKTYLPDRDIDLGAFSDDQT 81
I+P+ R V VR +++ + P +V FGS+ +LP DID+ D+
Sbjct: 99 IKPNEIESRLRYKVYEKVRLSLLERWKHKPIKVSMFGSLRTTLFLPTSDIDVLVECDE-- 156
Query: 82 LKDTWAHLVRDMLENEEKNEHAEFRVKEVQ-YIQAEVKIIKCLVD---NFVVDIAFNQLG 137
W D L + + + V Y A V I+K +VD +DI+FN +
Sbjct: 157 ----WIGTPGDWLTETARGLEIDNIAESVSVYGGAFVPIVK-MVDRDTRLSIDISFNTVQ 211
Query: 138 GLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFH 197
G+ ++D+V E L + ++L+K + +Y + + G +SSY LV L++ F
Sbjct: 212 GVRAASYIDKVKE---EFPLIEPLVLLLKQFLHYRN-LNQTFTGGLSSYGLVLLLVNFFQ 267
Query: 198 VF-----------NGSFAGPLEVLYRFLEFFS-KFDWDNFCLS 228
++ G G L L RFLE +S +F+++ +S
Sbjct: 268 LYALNMRHRTIYDRGVNLGHL--LLRFLEVYSLEFNYEEIGIS 308
>gi|341883718|gb|EGT39653.1| hypothetical protein CAEBREN_22894 [Caenorhabditis brenneri]
Length = 901
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 59/223 (26%), Positives = 101/223 (45%), Gaps = 32/223 (14%)
Query: 25 IQPDPFSEERRNAVAAYVRRLIIQCF---PCQVFTFGSVPLKTYLPDRDIDLGAFSDDQT 81
I+P+ R V VR +++ + P +V FGS+ +LP DID+ D+
Sbjct: 99 IKPNEIESRLRYKVYEKVRLSLLERWKHKPIKVSMFGSLRTTLFLPTSDIDVLVECDE-- 156
Query: 82 LKDTWAHLVRDMLENEEKNEHAEFRVKEVQ-YIQAEVKIIKCLVD---NFVVDIAFNQLG 137
W D L + + + V Y A V I+K +VD +DI+FN +
Sbjct: 157 ----WIGTPGDWLTETARGLEIDNIAESVSVYGGAFVPIVK-MVDRDTRLSIDISFNTVQ 211
Query: 138 GLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFH 197
G+ ++D+V E L + ++L+K + +Y + + G +SSY LV L++ F
Sbjct: 212 GVRAASYIDKVKE---EFPLIEPLVLLLKQFLHYRN-LNQTFTGGLSSYGLVLLLVNFFQ 267
Query: 198 VF-----------NGSFAGPLEVLYRFLEFFS-KFDWDNFCLS 228
++ G G L L RFLE +S +F+++ +S
Sbjct: 268 LYALNMRHRTIYDRGVNLGHL--LLRFLEVYSLEFNYEEIGIS 308
>gi|409045762|gb|EKM55242.1| hypothetical protein PHACADRAFT_93478 [Phanerochaete carnosa
HHB-10118-sp]
Length = 478
Score = 47.8 bits (112), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 71/304 (23%), Positives = 122/304 (40%), Gaps = 59/304 (19%)
Query: 25 IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
I P +E R+ + + R + + FP +V FGS K YLP DIDL SD
Sbjct: 164 ISPTQEEDEIRSLIVESISRAVTKAFPDARVLPFGSYETKLYLPLGDIDLVIESDSM--- 220
Query: 84 DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDN--FVVDIAFNQLGGLCT 141
+ + V + + A K +A+V IIK + + F VDI+ NQ+ G+
Sbjct: 221 -AYNNKVNVLQALATTMKRAGITDKVTIIAKAKVPIIKFVTRHGRFSVDISLNQMNGVKA 279
Query: 142 LC----FLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFH 197
FLD + L ++++LI + + G + SY++V L +
Sbjct: 280 GTMIKRFLDHIPAL--------QALVLITKSFLSQRSMNEVFTGGLGSYSIVCLAISFLQ 331
Query: 198 ----VFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVL 253
+ G + +EFF + C + V IS+ +DGG
Sbjct: 332 MHPKIRRGEIDSSKNLGVLVMEFFELYG----CYFNYREVGISV----------RDGG-- 375
Query: 254 LLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNF--FRIR 311
S+ + + +AD+ PF+ ++ DP +N+ +S+G+F ++R
Sbjct: 376 ----SYYNKAQRGWADYK-------SPFL---LSIEDPGDPSND----ISRGSFGIVKVR 417
Query: 312 TAFT 315
T
Sbjct: 418 TTLA 421
>gi|389748468|gb|EIM89645.1| Nucleotidyltransferase [Stereum hirsutum FP-91666 SS1]
Length = 479
Score = 47.4 bits (111), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 39/120 (32%), Positives = 52/120 (43%), Gaps = 11/120 (9%)
Query: 25 IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
I P P +E R+ V ++R I FP +V +FGS K YLP DIDL S
Sbjct: 114 ISPTPVEDEIRSLVVLQIQRCISSKFPDAKVRSFGSYETKLYLPLGDIDLVIISKSMAYS 173
Query: 84 D--TWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLV--DNFVVDIAFNQLGGL 139
D T H V + L + K A+V I+K + F VDI+ N G+
Sbjct: 174 DRVTVLHAVANTLRTAGITDRVSVIAK------AKVPIVKFVTTFGRFAVDISINMSNGV 227
>gi|256271045|gb|EEU06149.1| Pap2p [Saccharomyces cerevisiae JAY291]
Length = 584
Score = 47.0 bits (110), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 55/244 (22%), Positives = 95/244 (38%), Gaps = 38/244 (15%)
Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
WL E + +A I P E RN + +R + Q +P + FGS YLP
Sbjct: 178 WLTFE--IKDFVAYISPSREEIEIRNKTISTIREAVKQLWPDADLHVFGSYSTDLYLPGS 235
Query: 70 DID------LGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL 123
DID LG L +HL + L E + A+ RV +++++ I
Sbjct: 236 DIDCVVTSKLGGKESRNNLYSLASHLKKKKLATEVEVV-AKARVPIIKFVEPHSGI---- 290
Query: 124 VDNFVVDIAFNQLGGLCTLC----FLDEVDHLINENHLFKRSIILIKAWCYYESRILGGH 179
+D++F + G+ +LD+ L R ++LI + R+ H
Sbjct: 291 ----HIDVSFERTNGIEAAKLIREWLDDTPGL--------RELVLIVKQFLHARRLNNVH 338
Query: 180 HGLISSYALVTLVLYIFHVFNGSFAGPLE-------VLYRFLEFFSK-FDWDNFCLSLWG 231
G + ++++ LV H+ ++ +L F E + K F +D+ L
Sbjct: 339 TGGLGGFSIICLVFSFLHMHPRIITNEIDPKDNLGVLLIEFFELYGKNFGYDDVALGSSD 398
Query: 232 PVPI 235
P+
Sbjct: 399 GYPV 402
>gi|396490001|ref|XP_003843230.1| hypothetical protein LEMA_P073400.1 [Leptosphaeria maculans JN3]
gi|312219809|emb|CBX99751.1| hypothetical protein LEMA_P073400.1 [Leptosphaeria maculans JN3]
Length = 717
Score = 47.0 bits (110), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 61/237 (25%), Positives = 104/237 (43%), Gaps = 36/237 (15%)
Query: 7 DPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLI-IQCFP---CQVFTFGSVPL 62
DP +WL E + + + P P+ E+RN + V+ ++ FP ++ FGS P
Sbjct: 348 DPEKWLHNEIL--DFYDFVAPKPYEHEQRNLLVQRVQSVLGYHRFPQDNGRILCFGSFPA 405
Query: 63 KTYLPDRDIDLGAFSD----------DQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQY 112
YLP D+DL SD D T ++ A L++ + N + + F Y
Sbjct: 406 GLYLPTADMDLVYTSDRHFNGGPPVMDVTARNATAPLLKG-VRNVLQRRNMAFGAISCIY 464
Query: 113 IQAEVKIIKCL--VDNFVVDIAFNQLGGLCTLC----FLDEVDHLINENHLFKRSIILIK 166
A+V ++K V VDI+F L G+ + D+ +I L K+ +++
Sbjct: 465 -GAKVPLVKFTDSVTRLQVDISFENLSGMQAQATFAQWKDKYPDMIYMVALLKQFLVM-- 521
Query: 167 AWCYYESRILGG-HHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFF-SKFD 221
R L H G I +A++ L+++ H G E+ FL+++ +KFD
Sbjct: 522 -------RGLNEVHTGGIGGFAIICLIVHYIHQA-GKAENLAELFKGFLDYYGNKFD 570
>gi|124481633|gb|AAI33102.1| LOC568678 protein [Danio rerio]
Length = 535
Score = 47.0 bits (110), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 60/119 (50%), Gaps = 12/119 (10%)
Query: 25 IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
I P P E+ R+ V A ++R+I +P +V FGS YLP DIDL F + +TL
Sbjct: 61 ISPRPEEEQMRHEVVARIQRVIKDLWPNAEVCVFGSFSTGLYLPTSDIDLVVFGNWETLP 120
Query: 84 DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFV---VDIAFNQLGGL 139
W + + L + + +V + +A V IIK L+D+ VDI+FN G+
Sbjct: 121 -LWT--LEEALRKRKVADENSIKVLD----KATVPIIK-LMDSHTEVKVDISFNVQSGV 171
>gi|164656242|ref|XP_001729249.1| hypothetical protein MGL_3716 [Malassezia globosa CBS 7966]
gi|159103139|gb|EDP42035.1| hypothetical protein MGL_3716 [Malassezia globosa CBS 7966]
Length = 527
Score = 47.0 bits (110), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 53/212 (25%), Positives = 93/212 (43%), Gaps = 33/212 (15%)
Query: 38 VAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLEN 96
V + ++R + +P +V++FGS + YLP DIDL S+ ++ DM
Sbjct: 2 VISLLQRALCSKWPDARVYSFGSQDTQLYLPQGDIDLVVLSN----------VMNDMPRE 51
Query: 97 EEKNEHA------EFRVKEVQYIQAEVKIIK--CLVDNFVVDIAFNQLGGLCTLCFLDEV 148
+E A + + +A+V IIK C F VDI+ NQ GL F V
Sbjct: 52 ITLSEMAACLRSYQLAIHVQVLARAKVPIIKFVCPYGQFNVDISINQANGLQASKF---V 108
Query: 149 DHLINENHLFKRSIILIKAWCYYESRILGG-HHGLISSYALVTLVLYIFHVFNGSFAGPL 207
+ + + + +++IK + + R L + G + SY++ +VL + G +
Sbjct: 109 NGWLKKQPAIRPLVMVIKQ--FLQQRALSEVYTGGLGSYSVTLMVLSFLQLHPKLQRGEM 166
Query: 208 E-------VLYRFLEFFSK-FDWDNFCLSLWG 231
+L FLE + K + +D +S+ G
Sbjct: 167 SADKNLGTLLMEFLELYGKNYGYDECAISVRG 198
>gi|151945519|gb|EDN63760.1| DNA polymerase sigma [Saccharomyces cerevisiae YJM789]
Length = 584
Score = 47.0 bits (110), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 55/244 (22%), Positives = 95/244 (38%), Gaps = 38/244 (15%)
Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
WL E + +A I P E RN + +R + Q +P + FGS YLP
Sbjct: 178 WLTFE--IKDFVAYISPSREEIEIRNKTISTIREAVKQLWPDADLHVFGSYSTDLYLPGS 235
Query: 70 DID------LGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL 123
DID LG L +HL + L E + A+ RV +++++ I
Sbjct: 236 DIDCVVTSKLGGKESRNNLYSLASHLKKKNLATEVEVV-AKARVPIIKFVEPHSGI---- 290
Query: 124 VDNFVVDIAFNQLGGLCTLC----FLDEVDHLINENHLFKRSIILIKAWCYYESRILGGH 179
+D++F + G+ +LD+ L R ++LI + R+ H
Sbjct: 291 ----HIDVSFERTNGIEAAKLIREWLDDTPGL--------RELVLIVKQFLHARRLNNVH 338
Query: 180 HGLISSYALVTLVLYIFHVFNGSFAGPLE-------VLYRFLEFFSK-FDWDNFCLSLWG 231
G + ++++ LV H+ ++ +L F E + K F +D+ L
Sbjct: 339 TGGLGGFSIICLVFSFLHMHPRIITNEIDPKDNLGVLLIEFFELYGKNFGYDDVALGSSD 398
Query: 232 PVPI 235
P+
Sbjct: 399 GYPV 402
>gi|395335008|gb|EJF67384.1| hypothetical protein DICSQDRAFT_77074 [Dichomitus squalens LYAD-421
SS1]
Length = 592
Score = 46.6 bits (109), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 60/240 (25%), Positives = 97/240 (40%), Gaps = 34/240 (14%)
Query: 13 KAEEITAELIA---RIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPD 68
K + + E++A I P P R V A V L+ + FP V TFGSV YLPD
Sbjct: 101 KEQRLHDEIVAFFQYISPTPEEAHARAMVIAKVSSLVTRRFPQGAVDTFGSVAQNLYLPD 160
Query: 69 RDIDL-----GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL 123
D D+ + D +T K T L M +N V+ + + V + +
Sbjct: 161 GDTDMVVTMPPQYDDPETKKRTLFQLAALM-----RNNRVTPHVQVIHRARVPVISFQTV 215
Query: 124 VD--NFVVDIAFNQLGGLCTLCFLDEV-DHLINENHLFKRSIILIKAWCYYESRILGGHH 180
D + +D++ N GL + L D + HL ++ +KA +
Sbjct: 216 PDLGSLKIDVSLNATDGLKAVPILRSYFDRMPALRHL----VLCLKALLSRHG-LNSASF 270
Query: 181 GLISSYALVTLVLYIFHVFNGS-----FAGPLE------VLYRFLEFFS-KFDWDNFCLS 228
G +SSYAL+ L + + P+E +L FLE++ K+ ++ +S
Sbjct: 271 GGLSSYALICLAISFLQLNPMGRPKELIDAPVENESLGVLLMDFLEYYGHKYKYETGVVS 330
>gi|68363844|ref|XP_697115.1| PREDICTED: PAP-associated domain-containing protein 5 [Danio rerio]
Length = 653
Score = 46.6 bits (109), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 60/119 (50%), Gaps = 12/119 (10%)
Query: 25 IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
I P P E+ R+ V A ++R+I +P +V FGS YLP DIDL F + +TL
Sbjct: 179 ISPRPEEEQMRHEVVARIQRVIKDLWPNAEVCVFGSFSTGLYLPTSDIDLVVFGNWETLP 238
Query: 84 DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFV---VDIAFNQLGGL 139
W + + L + + +V + +A V IIK L+D+ VDI+FN G+
Sbjct: 239 -LWT--LEEALRKRKVADENSIKVLD----KATVPIIK-LMDSHTEVKVDISFNVQSGV 289
>gi|190407236|gb|EDV10503.1| DNA polymerase sigma [Saccharomyces cerevisiae RM11-1a]
gi|259149371|emb|CAY86175.1| Pap2p [Saccharomyces cerevisiae EC1118]
Length = 584
Score = 46.6 bits (109), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 55/244 (22%), Positives = 95/244 (38%), Gaps = 38/244 (15%)
Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
WL E + +A I P E RN + +R + Q +P + FGS YLP
Sbjct: 178 WLTFE--IKDFVAYISPSREEIEIRNKTISTIREAVKQLWPDADLHVFGSYSTDLYLPGS 235
Query: 70 DID------LGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL 123
DID LG L +HL + L E + A+ RV +++++ I
Sbjct: 236 DIDCVVTSKLGGKESRNNLYSLASHLKKKNLATEVEVV-AKARVPIIKFVEPHSGI---- 290
Query: 124 VDNFVVDIAFNQLGGLCTLC----FLDEVDHLINENHLFKRSIILIKAWCYYESRILGGH 179
+D++F + G+ +LD+ L R ++LI + R+ H
Sbjct: 291 ----HIDVSFERTNGIEAAKLIREWLDDTPGL--------RELVLIVKQFLHARRLNNVH 338
Query: 180 HGLISSYALVTLVLYIFHVFNGSFAGPLE-------VLYRFLEFFSK-FDWDNFCLSLWG 231
G + ++++ LV H+ ++ +L F E + K F +D+ L
Sbjct: 339 TGGLGGFSIICLVFSFLHMHPRIITNEIDPKDNLGVLLIEFFELYGKNFGYDDVALGSSD 398
Query: 232 PVPI 235
P+
Sbjct: 399 GYPV 402
>gi|6324457|ref|NP_014526.1| non-canonical poly(A) polymerase PAP2 [Saccharomyces cerevisiae
S288c]
gi|1717744|sp|P53632.1|PAP2_YEAST RecName: Full=Poly(A) RNA polymerase protein 2; AltName: Full=DNA
polymerase kappa; AltName: Full=DNA polymerase sigma;
AltName: Full=Topoisomerase 1-related protein TRF4
gi|663237|emb|CAA88145.1| ORF [Saccharomyces cerevisiae]
gi|950226|gb|AAC49091.1| Trf4p [Saccharomyces cerevisiae]
gi|1419987|emb|CAA99134.1| TRF4 [Saccharomyces cerevisiae]
gi|51830518|gb|AAU09782.1| YOL115W [Saccharomyces cerevisiae]
gi|285814775|tpg|DAA10668.1| TPA: non-canonical poly(A) polymerase PAP2 [Saccharomyces
cerevisiae S288c]
gi|392296670|gb|EIW07772.1| Pap2p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 584
Score = 46.6 bits (109), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 57/247 (23%), Positives = 94/247 (38%), Gaps = 44/247 (17%)
Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
WL E + +A I P E RN + +R + Q +P + FGS YLP
Sbjct: 178 WLTFE--IKDFVAYISPSREEIEIRNQTISTIREAVKQLWPDADLHVFGSYSTDLYLPGS 235
Query: 70 DID------LGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKC 122
DID LG L +HL + L EV+ + +A V IIK
Sbjct: 236 DIDCVVTSELGGKESRNNLYSLASHLKKKNL------------ATEVEVVAKARVPIIKF 283
Query: 123 LV--DNFVVDIAFNQLGGLCTLC----FLDEVDHLINENHLFKRSIILIKAWCYYESRIL 176
+ +D++F + G+ +LD+ L R ++LI + R+
Sbjct: 284 VEPHSGIHIDVSFERTNGIEAAKLIREWLDDTPGL--------RELVLIVKQFLHARRLN 335
Query: 177 GGHHGLISSYALVTLVLYIFHVFNGSFAGPLE-------VLYRFLEFFSK-FDWDNFCLS 228
H G + ++++ LV H+ ++ +L F E + K F +D+ L
Sbjct: 336 NVHTGGLGGFSIICLVFSFLHMHPRIITNEIDPKDNLGVLLIEFFELYGKNFGYDDVALG 395
Query: 229 LWGPVPI 235
P+
Sbjct: 396 SSDGYPV 402
>gi|349581056|dbj|GAA26214.1| K7_Pap2p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 584
Score = 46.6 bits (109), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 57/247 (23%), Positives = 94/247 (38%), Gaps = 44/247 (17%)
Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
WL E + +A I P E RN + +R + Q +P + FGS YLP
Sbjct: 178 WLTFE--IKDFVAYISPSREEIEIRNQTISTIREAVKQLWPDADLHVFGSYSTDLYLPGS 235
Query: 70 DID------LGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKC 122
DID LG L +HL + L EV+ + +A V IIK
Sbjct: 236 DIDCVVTSELGGKESRNNLYSLASHLKKKNL------------ATEVEVVAKARVPIIKF 283
Query: 123 LV--DNFVVDIAFNQLGGLCTLC----FLDEVDHLINENHLFKRSIILIKAWCYYESRIL 176
+ +D++F + G+ +LD+ L R ++LI + R+
Sbjct: 284 VEPHSGIHIDVSFERTNGIEAAKLIREWLDDTPGL--------RELVLIVKQFLHARRLN 335
Query: 177 GGHHGLISSYALVTLVLYIFHVFNGSFAGPLE-------VLYRFLEFFSK-FDWDNFCLS 228
H G + ++++ LV H+ ++ +L F E + K F +D+ L
Sbjct: 336 NVHTGGLGGFSIICLVFSFLHMHPRIITNEIDPKDNLGVLLIEFFELYGKNFGYDDVALG 395
Query: 229 LWGPVPI 235
P+
Sbjct: 396 SSDGYPV 402
>gi|366992111|ref|XP_003675821.1| hypothetical protein NCAS_0C04670 [Naumovozyma castellii CBS 4309]
gi|342301686|emb|CCC69457.1| hypothetical protein NCAS_0C04670 [Naumovozyma castellii CBS 4309]
Length = 586
Score = 46.2 bits (108), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 57/246 (23%), Positives = 103/246 (41%), Gaps = 35/246 (14%)
Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
WL E A+ ++ I P E RN + VR + Q +P + FGS YLP
Sbjct: 176 WLTLE--IADFVSYISPSREEIESRNQTISKVRNAVKQLWPDADLHVFGSYATDLYLPGS 233
Query: 70 DID--LGAFSDDQTLKDTWAHLVRDMLENEEKNE---HAEFRVKEVQYIQAEVKIIKCLV 124
DID + + + D+ +++ L + + + A+ RV +++++ E
Sbjct: 234 DIDCVINSKAGDKENRNSLYSLASFLKQQGLATQIEVIAKTRVPIIKFVEPE-------- 285
Query: 125 DNFVVDIAFNQLGGLCTLCFLDEVDHLINE---NHLFKRSIILIKAWCYYESRILGGHHG 181
N +D++F + GL E LI E + R ++LI + R+ H G
Sbjct: 286 SNIHIDVSFERTNGL-------EAAKLIREWLQDTPGLRELVLIIKQFLHSRRLNNVHTG 338
Query: 182 LISSYALVTLVLYIFHVFNGSFAG---PLE----VLYRFLEFFSK-FDWDNFCLSLWGPV 233
+ ++++ +V + P+E +L F E + K F +D+ +S+
Sbjct: 339 GLGGFSIICIVFSFLQMHPRIITNEIDPMENLGVLLIEFFELYGKNFGYDDVAISVTDGY 398
Query: 234 PISLLP 239
P S LP
Sbjct: 399 P-SYLP 403
>gi|50286703|ref|XP_445781.1| hypothetical protein [Candida glabrata CBS 138]
gi|49525087|emb|CAG58700.1| unnamed protein product [Candida glabrata]
Length = 485
Score = 45.8 bits (107), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 55/241 (22%), Positives = 100/241 (41%), Gaps = 28/241 (11%)
Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
WL E + + +A I P E RN +R + + +P + FGS YLP
Sbjct: 102 WLNYEIL--DFVAYISPSKEEIETRNRTIGSIRSAVKELWPDADLHVFGSYATDLYLPGS 159
Query: 70 DID--LGAFSDDQTLKDTWAHLVRDMLENEEKNE---HAEFRVKEVQYIQAEVKIIKCLV 124
DID + + D+ ++ L + + E E A+ RV +++++ E +
Sbjct: 160 DIDCVVNSKQGDKQSRNNLYKLANFLKKKEIATEIEVVAKARVPIIKFVEVESRT----- 214
Query: 125 DNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLIS 184
+DI+F +L GL + D L + L R ++L+ + R+ H G +
Sbjct: 215 ---HMDISFERLNGLEAAKLI--RDWLASTPGL--RELVLVVKQFLHSRRLNNVHSGGLG 267
Query: 185 SYALVTLVLYIFHVFNGSFAG---PLE----VLYRFLEFFSK-FDWDNFCLSLWGPVPIS 236
++++ LV + PLE +L F E + K F +D+ + + PI
Sbjct: 268 GFSIICLVYSFLRMHPRIITAEIDPLENLGVLLIEFFELYGKNFGYDDVAIGVQDGSPIY 327
Query: 237 L 237
+
Sbjct: 328 M 328
>gi|146184040|ref|XP_001027646.2| Chitinase class I family protein [Tetrahymena thermophila]
gi|146143378|gb|EAS07404.2| Chitinase class I family protein [Tetrahymena thermophila SB210]
Length = 463
Score = 45.8 bits (107), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 53/222 (23%), Positives = 94/222 (42%), Gaps = 30/222 (13%)
Query: 20 ELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSD 78
EL + P E R ++++++ P C+V TFGS + YLP+ DID+ D
Sbjct: 166 ELTDYLSPTKQEHEIRLKSMERLKKILLDAVPGCEVKTFGSFSTELYLPNSDIDMVIVKD 225
Query: 79 DQTLKDTWAHLVRDMLENEEKNEHAEF----RVKEVQYIQAEVKIIKCLVDNFVVDIAFN 134
D K + + ++ ++ E+ +V +++++ E +I NF DI+FN
Sbjct: 226 DIQNKSLYKKVADKIMNCDDIYENINLVTNAKVPIIKFVEKETQI------NF--DISFN 277
Query: 135 QLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRIL-GGHHGLISSYALVTLVL 193
+ G+ L + + L E K I+++K C R L + G I S+ L ++L
Sbjct: 278 KEDGVKQLSEVKKGLELYPE---MKYLIMVMK--CILRQRDLHETYSGGIGSFLLFCMIL 332
Query: 194 YIFHVFNGSFAGPL-----------EVLYRFLEFFSKFDWDN 224
+ E L + +F+ FD DN
Sbjct: 333 AFLRDLRRQYEKENRVQEIQNITLGEYLLKMFKFYGFFDVDN 374
>gi|328860813|gb|EGG09918.1| hypothetical protein MELLADRAFT_115680 [Melampsora larici-populina
98AG31]
Length = 987
Score = 45.4 bits (106), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 44/139 (31%), Positives = 69/139 (49%), Gaps = 12/139 (8%)
Query: 17 ITAEL---IARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDID 72
+TAE+ +A I+P +E R + +R+ + +P V FGS K YLP DID
Sbjct: 234 LTAEIGSFVAYIRPTREEDELRLMIIEMIRKAVTMQWPDADVVPFGSFGTKLYLPGGDID 293
Query: 73 LGAFSDDQTLKDTWAHLVRDMLE-NEEKNEHAEFRVKEVQYIQAEVKII--KCLVDNFVV 129
L S + +KD + ++ + E+N + V +A+V II K + NF V
Sbjct: 294 LVILS-TRMMKDAKSKILYRLAPLLREQNIGQDV----VVIAKAKVPIIKFKTIFGNFQV 348
Query: 130 DIAFNQLGGLCTLCFLDEV 148
DI+ NQ GL L ++E+
Sbjct: 349 DISINQSNGLVALEKVNEL 367
>gi|242212981|ref|XP_002472321.1| predicted protein [Postia placenta Mad-698-R]
gi|220728598|gb|EED82489.1| predicted protein [Postia placenta Mad-698-R]
Length = 1512
Score = 45.4 bits (106), Expect = 0.038, Method: Composition-based stats.
Identities = 70/287 (24%), Positives = 114/287 (39%), Gaps = 59/287 (20%)
Query: 21 LIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDD 79
+ I P P +E R+ V + R + + FP QV FGS K YLP +
Sbjct: 167 FVKYISPTPEEDEVRSLVVTLISRAVTRAFPDAQVLPFGSYETKLYLPIGN--------- 217
Query: 80 QTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVD--NFVVDIAFNQLG 137
K++ H L N K RVK + +A+V I+K + +F VDI+ NQ
Sbjct: 218 ---KESVLH----ALANTVKRAGITDRVKIIA--KAKVPIVKFVTTHGHFSVDISVNQGN 268
Query: 138 GLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFH 197
G+ + H + E + I++IK++ S + + G + SY++V L +
Sbjct: 269 GVTA---GKMIKHYLAELPALRSLILVIKSFLSQRS-MNEVYTGGLGSYSIVCLAISFLQ 324
Query: 198 ----VFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVL 253
+ G + +EFF + C + V ISLL DGG
Sbjct: 325 MHPKIRRGEIDPSRNLGVLVMEFFELYG----CYFNYHEVGISLL----------DGG-- 368
Query: 254 LLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGR 300
++ + + D+ GQP K ++ DP N++ R
Sbjct: 369 ----TYFNKAERGWLDY-------GQP---KLLSIEDPGDPTNDISR 401
>gi|313241181|emb|CBY33472.1| unnamed protein product [Oikopleura dioica]
Length = 422
Score = 45.4 bits (106), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 54/193 (27%), Positives = 85/193 (44%), Gaps = 37/193 (19%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
EEI + I +QP + R+ V +R+++ + +P ++ TFGS YLPD DID+
Sbjct: 90 EEI-EDFIKFMQPTESEQAMRDDVVWRIRQVVKELWPSAKLETFGSYNTGLYLPDGDIDM 148
Query: 74 GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI----QAEVKIIKCLVDNFV- 128
++ W L L +N+ E R+ + I +A V IIK + N +
Sbjct: 149 -------VIQGQWEQLPMWQL----RNKLVERRIAREENITVIEKAVVPIIKLIESNTLV 197
Query: 129 -VDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGL----- 182
VDI+FN G V + E K+ ++L+K + H GL
Sbjct: 198 HVDISFNTSNGREAAAL---VKKYMAEYPNLKQLVVLLK--------YILNHRGLNEVWK 246
Query: 183 --ISSYALVTLVL 193
+ SYAL LV+
Sbjct: 247 GGLGSYALTLLVV 259
>gi|145475559|ref|XP_001423802.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124390863|emb|CAK56404.1| unnamed protein product [Paramecium tetraurelia]
Length = 354
Score = 45.1 bits (105), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 61/245 (24%), Positives = 109/245 (44%), Gaps = 38/245 (15%)
Query: 29 PFSEERRNAVAAYVRRLIIQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAH 88
P EE R A++R ++ F V F + LP+ DID+ + + K+ +
Sbjct: 77 PTIEEHRKREQAFMR---VETFIKGV-CFRILRQNFNLPNADIDVVMIDKNMSAKELYKK 132
Query: 89 LVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKC--LVDNFVVDIAFNQLGGLCTLCFLD 146
+ ++++++ +K E+ K A+V IIK + ++ DI+FNQ+ G+ +D
Sbjct: 133 VAQNLMKS-DKFENVNLIAK------AKVPIIKFFEIESSYQFDISFNQMDGIRQ---ID 182
Query: 147 EVDHLINENHLFKRSIILIKAWCYYESRILG-GHHGLISSYALVTLVL-YIFHVFNGSFA 204
E+ FK I+++K C + R L + G I S+ L ++L ++ V +FA
Sbjct: 183 EIQKAFTIYPEFKYLIMILK--CILKQRDLNETYSGGIGSFLLFQMILAFLREVRKEAFA 240
Query: 205 GPL----------EVLYRFLEFF-SKFDWDNFCLSLWG-------PVPISLLPDVTAEPP 246
E + RFLEF+ SKFD+ + + P P ++ + P
Sbjct: 241 NKKQEQLKNITLGEYILRFLEFYGSKFDYQKKRILMVNGGSIVNKPTPDDKFSLISPQDP 300
Query: 247 RKDGG 251
D G
Sbjct: 301 DHDIG 305
>gi|50294195|ref|XP_449509.1| hypothetical protein [Candida glabrata CBS 138]
gi|49528823|emb|CAG62485.1| unnamed protein product [Candida glabrata]
Length = 626
Score = 44.7 bits (104), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 55/230 (23%), Positives = 99/230 (43%), Gaps = 26/230 (11%)
Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
WL AE + +A I P E RN A +RR + + + + FGS YLP
Sbjct: 186 WLTAE--IRDFVAYISPSREEIETRNKTIAKIRRSVKRLWTDADLQVFGSYATDMYLPGS 243
Query: 70 DID--LGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL--VD 125
DID + + S D+ + L R + KN+ RV+ + ++ V IIK +
Sbjct: 244 DIDCVVNSKSGDKENRQYLYELARHL-----KNDGLATRVEVI--AKSRVPIIKFVEPES 296
Query: 126 NFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISS 185
+ +D++F + GL + E I + + +++K + + R+ H G +
Sbjct: 297 DIHIDVSFERSNGLEAAKLIRE---WIGDTPGLRELTLVVKQFLHAR-RLNDVHTGGLGG 352
Query: 186 YALVTLVLYIFHVFNGSFAG---PLE----VLYRFLEFFSK-FDWDNFCL 227
++++ LV + G PL+ +L F E + K F +D+ +
Sbjct: 353 FSIICLVFSFLRLHPRIITGDIDPLDNLGVLLIEFFELYGKNFAYDDVAI 402
>gi|440296452|gb|ELP89279.1| PAP-associated domain containing protein, putative [Entamoeba
invadens IP1]
Length = 344
Score = 44.7 bits (104), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 54/218 (24%), Positives = 92/218 (42%), Gaps = 12/218 (5%)
Query: 3 IRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVP 61
+R + P L E I P +E R V +L+ +P C+V +GS
Sbjct: 3 LRSVCPTDKLTLTEEIKLFTRYISLTPNEQELRQISYQKVSQLLTNRYPGCEVTIYGSYV 62
Query: 62 LKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIK 121
LP DIDL ++ K+ L+ + ++ RV++V A+V IIK
Sbjct: 63 SGFSLPSSDIDLVLSFSEEVSKNQVKKLLFKISTICRSSKF--LRVEDV-ITNAKVPIIK 119
Query: 122 CL-VDNFV-VDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGH 179
L +D + +D++ N GG+ + + H + + F + I L + +++ + +
Sbjct: 120 LLDLDTTISIDLSINCEGGIDS----SALTHSLLTSSQFTQEIALFVKYLVFQNNLNEPY 175
Query: 180 HGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFF 217
HG I SYA+V L + G L FL F+
Sbjct: 176 HGGIGSYAIVLLTATFLKFYPQHSLG--RALVEFLNFY 211
>gi|330805693|ref|XP_003290813.1| hypothetical protein DICPUDRAFT_81531 [Dictyostelium purpureum]
gi|325079023|gb|EGC32644.1| hypothetical protein DICPUDRAFT_81531 [Dictyostelium purpureum]
Length = 892
Score = 44.3 bits (103), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 53/224 (23%), Positives = 100/224 (44%), Gaps = 25/224 (11%)
Query: 5 PLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPC-QVFTFGSVPLK 63
P ++ E+ AE ++ + S +R+N + + FP +++ +GS +
Sbjct: 619 PESKSEFINYLELKAE---TLKENSNSLQRKNNSFNTLENFLKNEFPTGKLYKYGSFVTR 675
Query: 64 TYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIK-C 122
PD DID+ Q ++V L+N+ + ++ E R A+V II+ C
Sbjct: 676 LSSPDSDIDVTLIDSSQPY-----NMVLQKLKNKPRYDNFETRP------DAKVPIIRFC 724
Query: 123 LVDNFV-VDIAFNQLG--GLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGH 179
N V D++FN +G + F+ E + + K I+L+K + ++ I
Sbjct: 725 DKINLVKFDLSFN-IGEPNQNSNFFISE----LKDKKYLKELILLVKHYT-EKANIKDAS 778
Query: 180 HGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWD 223
G SS+AL + +Y + S ++L+ F F+ KFD++
Sbjct: 779 QGYFSSHALTIMAIYFYKTLVRSNLNIHKLLHSFFLFYIKFDYN 822
>gi|452845518|gb|EME47451.1| hypothetical protein DOTSEDRAFT_69399 [Dothistroma septosporum
NZE10]
Length = 610
Score = 44.3 bits (103), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 49/215 (22%), Positives = 81/215 (37%), Gaps = 32/215 (14%)
Query: 46 IIQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEF 105
I+Q ++FT+GS LK + P DID + ++ + + DM+ E
Sbjct: 77 ILQQAGGKIFTYGSYRLKVFGPGSDIDALMIAPRHVTREDFFKYMPDMIRQSTPTEQL-- 134
Query: 106 RVKEVQYIQAEVKIIKCLVDNFVVDIAFN-----------QLGGLCTLCFLDEVD----- 149
+ V A V IIK +D VD+ F+ QL L L E D
Sbjct: 135 -TELVPVEAANVPIIKTEIDGVAVDLIFSTLHMASVPKDLQLKDSNLLRGLSETDLRCVN 193
Query: 150 ---------HLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFN 200
L+ E F+ ++ IK W + G +G +V+ + ++
Sbjct: 194 GTRVTDRLLQLVPETKTFRLALRAIKLWASRRG-VYGNVYGFPGGVGYAMMVVRMCQLYP 252
Query: 201 GSFAGPLEVLYRFLEFFSKFDW-DNFCLSLWGPVP 234
+ A P+ ++ +F K+ W D L P P
Sbjct: 253 RA-AAPV-IVNKFFMVMGKWRWPDPVTLCKREPAP 285
>gi|384485719|gb|EIE77899.1| hypothetical protein RO3G_02603 [Rhizopus delemar RA 99-880]
Length = 494
Score = 43.9 bits (102), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 56/211 (26%), Positives = 92/211 (43%), Gaps = 44/211 (20%)
Query: 43 RRLIIQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEH 102
+R I+CF + FGS L Y+ D DIDL Q L+ + +L+ +
Sbjct: 57 KRGDIECF---LSPFGSYALGGYIRDADIDLVLVCPIQVLRKYFFKFFPQLLKQQT---- 109
Query: 103 AEFRVKEVQYIQ-AEVKIIKCLVDNFVVDIAFNQLG---GLCTLCFLD-----EVDHL-- 151
V V+ IQ A V IIKC +DN +DI+F +L + FLD ++D
Sbjct: 110 ---LVSNVESIQKANVPIIKCTIDNISIDISFVRLKVERVAQNINFLDDSLLKDIDETCL 166
Query: 152 -------INE---NHLFKRSIIL-------IKAWCYYE---SRILGGHHGLISSYALVTL 191
+N+ N ++++ + L IK W S+ +G +G SS+ L+ +
Sbjct: 167 ASMDGPRVNQFCKNQIYRQHVRLFQVCLQCIKHWATQRGIYSKPIGYLNG--SSWTLLLV 224
Query: 192 VLYIFHVFNGSFAGPLEVLYRFLEFFSKFDW 222
Y+ + N +L RF +S++ W
Sbjct: 225 KAYM-SIKNKELLSVTMILSRFFSMWSQWPW 254
>gi|392595411|gb|EIW84734.1| hypothetical protein CONPUDRAFT_47123 [Coniophora puteana
RWD-64-598 SS2]
Length = 663
Score = 43.9 bits (102), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 47/218 (21%), Positives = 91/218 (41%), Gaps = 19/218 (8%)
Query: 25 IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTL- 82
+ P +E R V V + + FP +V FGS K YLP DIDL SD
Sbjct: 162 MSPTSIEDEIRGLVVKLVGKAVTSAFPDAKVLPFGSYGTKLYLPSGDIDLVIESDSMQYV 221
Query: 83 -KDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLV--DNFVVDIAFNQLGGL 139
K++ H + ++L + A K +A+V I+K + VDI+ NQ GL
Sbjct: 222 PKNSVLHSLANVL------KRAGIADKVTIIAKAKVPIVKFITRHGRLNVDISINQSNGL 275
Query: 140 CTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVF 199
++ + R+++++ + + + G + SY++V + + +
Sbjct: 276 VAGQIVNGFLADMRGCGRALRALVMVAKAFLGQRGMNEVYTGGLGSYSIVCMAISFLQMH 335
Query: 200 NGSFAGPLE-------VLYRFLEFFSK-FDWDNFCLSL 229
G ++ ++ F E + + F+++ +S+
Sbjct: 336 PKIRRGEIDAERNLGVLVMEFFELYGRYFNYEQVGISI 373
>gi|348500306|ref|XP_003437714.1| PREDICTED: PAP-associated domain-containing protein 5-like
[Oreochromis niloticus]
Length = 672
Score = 43.9 bits (102), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 42/129 (32%), Positives = 58/129 (44%), Gaps = 16/129 (12%)
Query: 25 IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
I P P E+ R V ++ +I +P +V FGS YLP DIDL F
Sbjct: 191 ISPRPEEEKMRLEVVDRIKEVIHDLWPSAEVEVFGSFSTGLYLPTSDIDLVVFG------ 244
Query: 84 DTWAHLVRDMLEN--EEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFV---VDIAFNQLGG 138
W L LE +KN E +K + +A V IIK L D++ VDI+FN + G
Sbjct: 245 -KWESLPLWTLEEALRKKNVADENSIKVLD--KATVPIIK-LTDSYTEVKVDISFNVMSG 300
Query: 139 LCTLCFLDE 147
+ + E
Sbjct: 301 VKAARLIKE 309
>gi|302148910|pdb|3NYB|A Chain A, Structure And Function Of The Polymerase Core Of Tramp, A
Rna Surveillance Complex
Length = 323
Score = 43.9 bits (102), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 55/243 (22%), Positives = 94/243 (38%), Gaps = 36/243 (14%)
Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
WL E + +A I P E RN + +R + Q +P + FGS YLP
Sbjct: 20 WLTFE--IKDFVAYISPSREEIEIRNQTISTIREAVKQLWPDADLHVFGSYSTDLYLPGS 77
Query: 70 DID------LGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL 123
DID LG L +HL + L E + A+ RV +++++ I
Sbjct: 78 DIDCVVTSELGGKESRNNLYSLASHLKKKNLATEVEVV-AKARVPIIKFVEPHSGI---- 132
Query: 124 VDNFVVDIAFNQLGGLCTLCFLDEVDHLINE---NHLFKRSIILIKAWCYYESRILGGHH 180
+ ++F + G+ E LI E + R ++LI + R+ H
Sbjct: 133 ----HIAVSFERTNGI-------EAAKLIREWLDDTPGLRELVLIVKQFLHARRLNNVHT 181
Query: 181 GLISSYALVTLVLYIFHVFNGSFAGPLE-------VLYRFLEFFSK-FDWDNFCLSLWGP 232
G + ++++ LV H+ ++ +L F E + K F +D+ L
Sbjct: 182 GGLGGFSIICLVFSFLHMHPRIITNEIDPKDNLGVLLIEFFELYGKNFGYDDVALGSSDG 241
Query: 233 VPI 235
P+
Sbjct: 242 YPV 244
>gi|299752783|ref|XP_002911796.1| Trf5 [Coprinopsis cinerea okayama7#130]
gi|298409998|gb|EFI28302.1| Trf5 [Coprinopsis cinerea okayama7#130]
Length = 816
Score = 43.5 bits (101), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 49/183 (26%), Positives = 76/183 (41%), Gaps = 22/183 (12%)
Query: 25 IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
I P P +E R + + + FP V FGS K YLP DIDL S+
Sbjct: 289 ISPTPVEDEIRGLIVKQIAVTVQSKFPDASVLPFGSYETKLYLPMGDIDLVILSESMAYS 348
Query: 84 DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDN--FVVDIAFNQLGGLCT 141
+ + V L N K RV + +A V I+K + + F VDI+ NQ GL +
Sbjct: 349 NKVS--VLHTLANTLKRAGITSRVTVI--AKARVPIVKFVTTHGRFNVDISINQENGLVS 404
Query: 142 LCFLDE-VDHLIN--------------ENHLFKRSIILIKAWCYYESRILGGHHGLISSY 186
++ + HL N + L RS++LI + + + G + SY
Sbjct: 405 GNIINGFLRHLHNPTSNTPEFDANGNPKTSLALRSLVLITKAFLAQRSMNEVYTGGLGSY 464
Query: 187 ALV 189
+++
Sbjct: 465 SIM 467
>gi|196004468|ref|XP_002112101.1| hypothetical protein TRIADDRAFT_23436 [Trichoplax adhaerens]
gi|190586000|gb|EDV26068.1| hypothetical protein TRIADDRAFT_23436 [Trichoplax adhaerens]
Length = 289
Score = 43.5 bits (101), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 55/214 (25%), Positives = 92/214 (42%), Gaps = 20/214 (9%)
Query: 25 IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
I P P + R V V+ +I+ +P QV FGS YLP DIDL F D K
Sbjct: 21 ISPRPEEKNMRETVVEGVKEVILTLWPHVQVEVFGSFRTGLYLPTSDIDLVIFGIDG--K 78
Query: 84 DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVD--NFVVDIAFNQLGGLCT 141
+ L + ++++E + +K + A V IIK N+ +DI FN + +
Sbjct: 79 GAFEDLEKALMQHEVCDRD---NIKCIH--NAMVPIIKLTEKTCNYKMDIEFNIENSVKS 133
Query: 142 LCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNG 201
D + I + K ++++K + ++ + G +SSY LV +V+ +
Sbjct: 134 ---ADIIQTYIRKYEPLKYLVLVLKQFL-FQRELNEVFSGGVSSYTLVMMVVNFLQLHPR 189
Query: 202 SFAGPLEVLYRFL--EFFS----KFDWDNFCLSL 229
+ E Y L EFF F++ C+ +
Sbjct: 190 RYTDHPEANYGVLLIEFFELYGRHFNYHTTCIRV 223
>gi|156837261|ref|XP_001642660.1| hypothetical protein Kpol_1076p8 [Vanderwaltozyma polyspora DSM
70294]
gi|156113216|gb|EDO14802.1| hypothetical protein Kpol_1076p8 [Vanderwaltozyma polyspora DSM
70294]
Length = 524
Score = 43.1 bits (100), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 59/238 (24%), Positives = 98/238 (41%), Gaps = 38/238 (15%)
Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
WL E + ++ I P+ E RN +R + +P + FGS YLP
Sbjct: 114 WLTLE--IRDFVSYISPNRKEIELRNQTIGKLRDAVQHHWPDANLHVFGSYATDLYLPGS 171
Query: 70 DIDL---GAFSDDQT---LKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL 123
DID D Q+ L +HL ++ L E+ A+ RV +++++ KI
Sbjct: 172 DIDCVVNSKAGDKQSRNCLYSLASHLKKEGLA-EDIEIIAKARVPIIKFVEPLSKI---- 226
Query: 124 VDNFVVDIAFNQLGGLCTL----CFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGH 179
VD++F + GL +LD + L R ++LI R+ H
Sbjct: 227 ----HVDVSFERTNGLEAAKLIRGWLDSTNGL--------RELVLIVKQFLQARRLNKVH 274
Query: 180 HGLISSYALVTLVLYIFHVFNGSFA---GPLE----VLYRFLEFFSK-FDWDNFCLSL 229
G + ++++ LV H+ A P+E +L F E + K F +D+ LS+
Sbjct: 275 TGGLGGFSIICLVYSFLHLHPRILANEINPIENLGVLLIDFFELYGKNFGYDHVALSV 332
>gi|313232447|emb|CBY24115.1| unnamed protein product [Oikopleura dioica]
Length = 887
Score = 43.1 bits (100), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 77/335 (22%), Positives = 130/335 (38%), Gaps = 51/335 (15%)
Query: 2 VIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSV 60
+ PL G EEI + I+ P R+ V V I Q FP QV FGS
Sbjct: 133 ITSPLSKGMEGLHEEII-DFHNWIRSTPEEYTMRHDVVLRVEEAIKQEFPGAQVEVFGSF 191
Query: 61 PLKTYLPDRDIDLGAFSDD-----QTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQA 115
YLP DID+ + ++ + ++D L + E +V + A
Sbjct: 192 QTGLYLPTSDIDMVVLGEKIEPRYGNPQNGPHYRLQDRLLKQGIAERYSIKVID----SA 247
Query: 116 EVKIIKC--LVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYES 173
V IIK ++ + VDI+FN G+ + V I + + ++++K + +
Sbjct: 248 AVPIIKMRDMITDIKVDISFNMKTGVTAIGL---VKGYIRQFPALRYLVLVLKQFL-LQR 303
Query: 174 RILGGHHGLISSYALVTLVLYIFHVFNGSFAGP---LEVLY-RFLEFFS-KFDWDNFCLS 228
+ G ISSY L+ +V+ G L VL +FL F+ +F++ C+
Sbjct: 304 DMNEVWTGGISSYGLILMVVSFLQHQGADNTGDDVNLGVLLIKFLRFYGMEFEYSKCCIR 363
Query: 229 LWGPVPISLLPDVTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNV 288
+ K+GG + + + A G +V ++
Sbjct: 364 V------------------KNGGQFIKKEEMATQMKEAPT---------GPKYVPNFLSI 396
Query: 289 IDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKGLAR 323
DPL +N++GR+ ++ AF F + L R
Sbjct: 397 EDPLTPSNDVGRASHGAE--NVKDAFLFAYRVLDR 429
>gi|320164013|gb|EFW40912.1| PAP associated domain containing 5 [Capsaspora owczarzaki ATCC
30864]
Length = 558
Score = 42.7 bits (99), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 49/185 (26%), Positives = 82/185 (44%), Gaps = 22/185 (11%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
E+ + + I+P P + R + +R +I + +V FGS YLP DID+
Sbjct: 213 EQEMYDFVEFIKPTPLEHQMREEIVQRIREVITGAWKHARVEVFGSFATGLYLPMSDIDI 272
Query: 74 GAFSD-DQTLKDTWAHLVRDMLENEEKNEHAEFRV-KEVQYI-QAEVKIIKC--LVDNFV 128
F + DQ T L+ E R+ K V+ I + V IIK +
Sbjct: 273 VVFGNWDQIPLFTLGKLLE------------ESRIAKNVKVIDKTSVPIIKLADALSGVF 320
Query: 129 VDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYAL 188
VDI+FN GL T+ F + ++E + +IK + + ++ + G + SY++
Sbjct: 321 VDISFNLESGLRTVEF---IRACVDEYRMLYHLTFVIKQFL-AQRQLNEPYSGGLGSYSV 376
Query: 189 VTLVL 193
V LV+
Sbjct: 377 VLLVV 381
>gi|449707156|gb|EMD46861.1| PAPassociated domain containing protein [Entamoeba histolytica
KU27]
Length = 400
Score = 42.4 bits (98), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 53/224 (23%), Positives = 102/224 (45%), Gaps = 36/224 (16%)
Query: 10 RWLKAEEITAELIARIQ-----PDPFSEE---RRNAVAAYVRRLIIQCFPCQVFTFGSVP 61
+WLK+ E +L +Q +P E R + Y + I++ V FGS
Sbjct: 5 QWLKSFEGELDLNQEVQLFIKFIEPNKNEYKIREELLTKYSK--ILEKEGYNVMAFGSTQ 62
Query: 62 LKTYLPDRDIDLGAFSDD----QTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEV 117
K +LP DID +++ + L + L +LE++++N +A +
Sbjct: 63 SKLFLPTSDIDFSVLTNEYNTRKVLNSVSSILSSYVLEDQKRN------------FKASI 110
Query: 118 KIIKCLVDN---FVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKA-WCYYES 173
++K L D V+DI+ N G T+ F++EV I ++ ++ ++LIK+ C Y+
Sbjct: 111 PVLK-LTDKKTLIVLDISHNNTSGTKTVNFIEEV---IKKDDRIRKLVLLIKSILCCYDF 166
Query: 174 RILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFF 217
+G + +Y++ +V + N + E+L FL+++
Sbjct: 167 H--QPANGGLGTYSVFVMVYCYINNNNITTHDYGELLKGFLKYY 208
>gi|241955483|ref|XP_002420462.1| poly(A) polymerase, putative; polynucleotide adenylyltransferase,
putative [Candida dubliniensis CD36]
gi|223643804|emb|CAX41541.1| poly(A) polymerase, putative [Candida dubliniensis CD36]
Length = 558
Score = 42.4 bits (98), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 41/189 (21%), Positives = 75/189 (39%), Gaps = 22/189 (11%)
Query: 53 QVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEH---------- 102
+VFTFGS L Y P DID +D + + D++ + E
Sbjct: 82 KVFTFGSYRLGVYGPGSDIDTLVVVPKHVTRDDFFSVFADIIRKRPELEEIACVPDAYVP 141
Query: 103 ---AEFRVKEVQYIQAEVKIIKCLVD------NFVVDIAFNQLGGLCTLCFLDEVDHLIN 153
EF + I A++ I + +D N + ++ L L DE+ L+
Sbjct: 142 IIKIEFDGISIDLIMAKLNIPRVPLDLTLDDKNLLKNLDEKDLRSLNGTRVTDEILQLVP 201
Query: 154 ENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRF 213
+ +FK ++ IK W + + G G A LV I ++ + + ++ +F
Sbjct: 202 KPTVFKHALRCIKMWAQQRA-VYGNIFGFPGGVAWAMLVARICQLYPNAVSSV--IVEKF 258
Query: 214 LEFFSKFDW 222
++K++W
Sbjct: 259 FNIYTKWNW 267
>gi|68482706|ref|XP_714750.1| hypothetical protein CaO19.10713 [Candida albicans SC5314]
gi|3334283|sp|O42617.1|PAP_CANAL RecName: Full=Poly(A) polymerase PAPalpha; AltName:
Full=Polynucleotide adenylyltransferase alpha
gi|2696030|dbj|BAA23802.1| poly A polymerase [Candida albicans]
gi|5771514|gb|AAD51412.1| unknown [Candida albicans]
gi|46436342|gb|EAK95706.1| hypothetical protein CaO19.10713 [Candida albicans SC5314]
Length = 558
Score = 42.4 bits (98), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 41/189 (21%), Positives = 74/189 (39%), Gaps = 22/189 (11%)
Query: 53 QVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHA--------- 103
+VFTFGS L Y P DID +D + + D++ + E
Sbjct: 82 KVFTFGSYRLGVYGPGSDIDTLVVVPKHVTRDDFFSVFADIIRKRPELEEIACVPDAYVP 141
Query: 104 ----EFRVKEVQYIQAEVKIIKCLVD------NFVVDIAFNQLGGLCTLCFLDEVDHLIN 153
EF + I A + I + +D N + ++ L L DE+ L+
Sbjct: 142 IIKLEFDGISIDLIMARLNIPRVPLDLTLDDKNLLKNLDEKDLRSLNGTRVTDEILQLVP 201
Query: 154 ENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRF 213
+ +FK ++ IK W + + G G A LV I ++ + + ++ +F
Sbjct: 202 KPTVFKHALRCIKLWAQQRA-VYGNIFGFPGGVAWAMLVARICQLYPNAVSSA--IVEKF 258
Query: 214 LEFFSKFDW 222
++K++W
Sbjct: 259 FNIYTKWNW 267
>gi|403373923|gb|EJY86891.1| Poly(A) RNA polymerase putative [Oxytricha trifallax]
Length = 403
Score = 42.4 bits (98), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 44/205 (21%), Positives = 94/205 (45%), Gaps = 17/205 (8%)
Query: 32 EERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLV 90
++ R V + + +++ +CF +V FGS LP+ D+DL + DQ ++ L
Sbjct: 125 QQARRKVVSRIHKIVKECFSQAKVMIFGSCATGLDLPNSDVDLLVYYPDQREQNMINRLA 184
Query: 91 RDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDH 150
++++ + +V I+ + K C +DI+FN+ G+ + V
Sbjct: 185 GSLMKSGICKSIEAIKHAKVPIIKLQDKETSC-----NIDISFNRTNGIYCVKL---VKT 236
Query: 151 LINENHLFKRSIILIKAWCYYESRILG-GHHGLISSYALVTLVL-YIFHVFNGSFAGPLE 208
L+ + + +I++KA + + R L + G ISS+ L L Y+ + ++
Sbjct: 237 LMIKYPELRPLMIVLKA--FLKCRGLNETYSGGISSFLLTMLATSYLQMAYKSGKTDKMD 294
Query: 209 VLYRFLEFF----SKFDWDNFCLSL 229
+ ++FF +KF+++ +S+
Sbjct: 295 LGKHLIDFFELYGTKFNYEQIGISI 319
>gi|238882575|gb|EEQ46213.1| Poly(A) polymerase [Candida albicans WO-1]
Length = 558
Score = 42.4 bits (98), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 41/189 (21%), Positives = 74/189 (39%), Gaps = 22/189 (11%)
Query: 53 QVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHA--------- 103
+VFTFGS L Y P DID +D + + D++ + E
Sbjct: 82 KVFTFGSYRLGVYGPGSDIDTLVVVPKHVTRDDFFSVFADIIRKRPELEEIACVPDAYVP 141
Query: 104 ----EFRVKEVQYIQAEVKIIKCLVD------NFVVDIAFNQLGGLCTLCFLDEVDHLIN 153
EF + I A + I + +D N + ++ L L DE+ L+
Sbjct: 142 IIKLEFDGISIDLIMARLNIPRVPLDLTLDDKNLLKNLDEKDLRSLNGTRVTDEILQLVP 201
Query: 154 ENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRF 213
+ +FK ++ IK W + + G G A LV I ++ + + ++ +F
Sbjct: 202 KPTVFKHALRCIKLWAQQRA-VYGNIFGFPGGVAWAMLVARICQLYPNAVSSA--IVEKF 258
Query: 214 LEFFSKFDW 222
++K++W
Sbjct: 259 FNIYTKWNW 267
>gi|407039791|gb|EKE39813.1| topoisomerase, putative [Entamoeba nuttalli P19]
Length = 400
Score = 42.4 bits (98), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 49/197 (24%), Positives = 88/197 (44%), Gaps = 35/197 (17%)
Query: 10 RWLKAEEITAELIARIQ-----PDPFSEE---RRNAVAAYVRRLIIQCFPCQVFTFGSVP 61
+WLK+ E +L +Q +P E R + Y + I++ V FGS
Sbjct: 5 QWLKSFEGELDLNQEVQLFIKFIEPNKNEYKIREELLTKYSK--ILEKEGYNVMAFGSTQ 62
Query: 62 LKTYLPDRDIDLGAFSDD----QTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEV 117
K +LP DID +++ + L + L +LE++++N +A +
Sbjct: 63 SKLFLPTSDIDFSVITNEYNTRKVLNSVSSILSSYVLEDQKRN------------FKASI 110
Query: 118 KIIKCLVDN---FVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAW--CYYE 172
++K L D V+DI+ N G T+ F++EV I ++ ++ ++LIK+ CY
Sbjct: 111 PVLK-LTDKKTLIVLDISHNNTNGTKTVNFIEEV---IKKDDRIRKLVLLIKSLLCCYDF 166
Query: 173 SRILGGHHGLISSYALV 189
+ G G S + +V
Sbjct: 167 HQPANGGLGTYSVFVMV 183
>gi|260948920|ref|XP_002618757.1| hypothetical protein CLUG_02216 [Clavispora lusitaniae ATCC 42720]
gi|238848629|gb|EEQ38093.1| hypothetical protein CLUG_02216 [Clavispora lusitaniae ATCC 42720]
Length = 567
Score = 42.0 bits (97), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 61/236 (25%), Positives = 91/236 (38%), Gaps = 34/236 (14%)
Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
WL E + + I P RN V ++R I + +P Q FGS YLP
Sbjct: 157 WLTME--IKDFVNYISPSKEEIVVRNTVIRRLKRRIAEFWPQTQAHVFGSCATDLYLPGS 214
Query: 70 DIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVD--- 125
DID+ S T + R L K ++ I A+V IIK VD
Sbjct: 215 DIDMVVIS------TTGDYEQRGKLYQLSSFLRTNKLAKNIEVIATAKVPIIK-FVDPQY 267
Query: 126 NFVVDIAFNQLGGLCTL----CFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHG 181
N VDI+F + GL +LD + L R ++LI R+ H G
Sbjct: 268 NIHVDISFERTNGLDAARRIRKWLDSMPGL--------RELVLIVKQFLRSRRLNNVHVG 319
Query: 182 LISSYALVTLVLYIFHVFNGSFAGPLEV-------LYRFLEFFSK-FDWDNFCLSL 229
+ YA + L+ + + G + V L F E + + F +D+ ++L
Sbjct: 320 GLGGYATIILMYHFLRLHPRVSTGNISVMENLGTLLIEFFELYGRNFSYDHLIVAL 375
>gi|387196341|gb|AFJ68755.1| DNA polymerase sigma subunit, partial [Nannochloropsis gaditana
CCMP526]
Length = 419
Score = 42.0 bits (97), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 52/211 (24%), Positives = 85/211 (40%), Gaps = 20/211 (9%)
Query: 33 ERRNAVAAYVRRLIIQCFPC-QVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVR 91
E R V + + +P V FGS K +LPD DID+ D H +R
Sbjct: 86 EARQKVTRISADTVKKLWPSFDVHVFGSEATKVFLPDSDIDMVVLPP----TDLPLHQIR 141
Query: 92 DMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFVVDIAFNQLGGLCTLCFLDEVDH 150
L + E V ++ I QA V I+K N VDI+F+ GL + ++ E
Sbjct: 142 KNLFTLAEAFKQEESVSGMEIISQARVPIVKLRFQNLQVDISFSSDSGLKSARYMLEK-- 199
Query: 151 LINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVL-YIFHVFNGSFAGPLEV 209
E R +IL+ + + + + G S+ L +V+ Y+ H +
Sbjct: 200 --MEAMPPLRPLILVLKYFLAQRELNQTYMGGCGSFLLQLMVIAYLQHAQKEADKASRSE 257
Query: 210 LYR--------FLEFFS-KFDWDNFCLSLWG 231
R FL F+ +F+++ +S+ G
Sbjct: 258 RTRNLGSLFLGFLRFYGHQFNYEEVGISVLG 288
>gi|67465021|ref|XP_648697.1| topoisomerase [Entamoeba histolytica HM-1:IMSS]
gi|56464936|gb|EAL43308.1| topoisomerase, putative [Entamoeba histolytica HM-1:IMSS]
Length = 400
Score = 42.0 bits (97), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 49/197 (24%), Positives = 88/197 (44%), Gaps = 35/197 (17%)
Query: 10 RWLKAEEITAELIARIQ-----PDPFSEE---RRNAVAAYVRRLIIQCFPCQVFTFGSVP 61
+WLK+ E +L +Q +P E R + Y + I++ V FGS
Sbjct: 5 QWLKSFEGELDLNQEVQLFIKFIEPNKNEYKIREELLTKYSK--ILEKEGYNVMAFGSTQ 62
Query: 62 LKTYLPDRDIDLGAFSDD----QTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEV 117
K +LP DID +++ + L + L +LE++++N +A +
Sbjct: 63 SKLFLPTSDIDFSVLTNEYNTRKVLNSVSSILSSYVLEDQKRN------------FKASI 110
Query: 118 KIIKCLVDN---FVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKA--WCYYE 172
++K L D V+DI+ N G T+ F++EV I ++ ++ ++LIK+ CY
Sbjct: 111 PVLK-LTDKKTLIVLDISHNNTSGTKTVNFIEEV---IKKDDRIRKLVLLIKSILCCYDF 166
Query: 173 SRILGGHHGLISSYALV 189
+ G G S + +V
Sbjct: 167 HQPANGGLGTYSVFVMV 183
>gi|256078812|ref|XP_002575688.1| hypothetical protein [Schistosoma mansoni]
gi|360044186|emb|CCD81733.1| hypothetical protein Smp_145600 [Schistosoma mansoni]
Length = 672
Score = 42.0 bits (97), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 53/121 (43%), Gaps = 18/121 (14%)
Query: 25 IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
I P P + R V A V+ ++ +P CQV FGS YLP DID+ F
Sbjct: 67 ISPSPAEQFAREVVVAKVKDIVYSLWPNCQVDVFGSFKTGLYLPTSDIDMVIFGK----- 121
Query: 84 DTWAHLVRDMLENE--EKNEHAEFRVKEVQYIQAEVKIIKCLVDN---FVVDIAFNQLGG 138
W L LE + +E +V + +A V I+K + D VDI+FN +
Sbjct: 122 --WDALPLHTLEQALFKSGISSEIKVLD----KATVPIVK-MTDKETELRVDISFNMINS 174
Query: 139 L 139
+
Sbjct: 175 V 175
>gi|334311788|ref|XP_003339660.1| PREDICTED: PAP-associated domain-containing protein 5-like
[Monodelphis domestica]
Length = 809
Score = 42.0 bits (97), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 46/137 (33%), Positives = 63/137 (45%), Gaps = 16/137 (11%)
Query: 8 PGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYL 66
PG +L EEI+ + + P P E+ R V + +I + +P V FGS YL
Sbjct: 353 PGTYLH-EEIS-DFYEYMSPRPEEEKMRMEVVNRIENVIKELWPSADVQIFGSFKTGLYL 410
Query: 67 PDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVD 125
P DIDL F W +L LE E +H V+ + +A V IIK L D
Sbjct: 411 PTSDIDLVVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTD 461
Query: 126 NFV---VDIAFNQLGGL 139
+F VDI+FN G+
Sbjct: 462 SFTEVKVDISFNVQNGV 478
>gi|367014043|ref|XP_003681521.1| hypothetical protein TDEL_0E00670 [Torulaspora delbrueckii]
gi|359749182|emb|CCE92310.1| hypothetical protein TDEL_0E00670 [Torulaspora delbrueckii]
Length = 663
Score = 42.0 bits (97), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 51/232 (21%), Positives = 100/232 (43%), Gaps = 26/232 (11%)
Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
WL E + +A I P+ E RN + +R + + +P + FGS YLP
Sbjct: 206 WLTME--IKDFVAYISPNRQEIEIRNKTISKIRAAVRELWPDADLQVFGSYATDLYLPGS 263
Query: 70 DID--LGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL--VD 125
DID + + D+ +++ L L+++E E K A V IIK +
Sbjct: 264 DIDCVVNSKGRDKENRNSLYSLAS-FLKSKELATRVEVIAK------ARVPIIKFVEPQS 316
Query: 126 NFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISS 185
+D++F ++ GL + E + E + ++++K + + R+ H G +
Sbjct: 317 QIHIDVSFERINGLEAARLIRE---WLEETPGLRELVLIVKQFLHSR-RLNNVHTGGLGG 372
Query: 186 YALVTLVLYIFHVFNGSFAGPLE-------VLYRFLEFFSK-FDWDNFCLSL 229
++++ LV H+ + ++ +L F E + K F +D+ L++
Sbjct: 373 FSIICLVYSFLHLHPRVVSDEIDPLDNLGVLLIDFFELYGKNFGYDDVGLTV 424
>gi|313242854|emb|CBY39607.1| unnamed protein product [Oikopleura dioica]
Length = 833
Score = 42.0 bits (97), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 75/335 (22%), Positives = 134/335 (40%), Gaps = 51/335 (15%)
Query: 2 VIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSV 60
+ PL G EEI + I+ P R+ V V I Q FP QV FGS
Sbjct: 79 ITSPLSKGMEGLHEEII-DFHNWIRSTPEEYTMRHDVVLRVEEAIKQEFPGAQVEVFGSF 137
Query: 61 PLKTYLPDRDIDLGAFSDD-----QTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQA 115
YLP DID+ + ++ + ++D L + E +V + A
Sbjct: 138 QTGLYLPTSDIDMVVLGEKIEPRYGNPQNGPHYRLQDRLLKQGIAERYSIKVID----SA 193
Query: 116 EVKIIKC--LVDNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYES 173
V IIK ++ + VDI+FN G+ + V I + + ++++K + +
Sbjct: 194 AVPIIKMRDMITDIKVDISFNMKTGVTAIGL---VKGYIRQFPALRYLVLVLKQFL-LQR 249
Query: 174 RILGGHHGLISSYALVTLVL-YIFHVFNGSFAGPLE---VLYRFLEFFS-KFDWDNFCLS 228
+ G ISSY L+ +V+ ++ H + A + +L +FL F+ +F++ C+
Sbjct: 250 DMNEVWTGGISSYGLILMVVSFLQHQGADNTADDVNLGVLLIKFLRFYGMEFEYSKCCIR 309
Query: 229 LWGPVPISLLPDVTAEPPRKDGGVLLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNV 288
+ K+GG + + + + G +V ++
Sbjct: 310 V------------------KNGGQFIKKEEMATQMKESPT---------GPKYVPNFLSI 342
Query: 289 IDPLRVNNNLGRSVSKGNFFRIRTAFTFRAKGLAR 323
DPL +N++GR+ ++ AF F + L R
Sbjct: 343 EDPLTPSNDVGRASHGAE--NVKDAFLFAYRVLDR 375
>gi|149237693|ref|XP_001524723.1| Poly(A) polymerase PAPa [Lodderomyces elongisporus NRRL YB-4239]
gi|146451320|gb|EDK45576.1| Poly(A) polymerase PAPa [Lodderomyces elongisporus NRRL YB-4239]
Length = 587
Score = 42.0 bits (97), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 48/189 (25%), Positives = 80/189 (42%), Gaps = 22/189 (11%)
Query: 53 QVFTFGSVPLKTYLPDRDID-LGAFSDDQTLKD---TWAHLVRDMLENEEKNE------- 101
++FTFGS L Y P DID L F T +D + L+R+ E EE N
Sbjct: 84 KLFTFGSYRLGVYGPSSDIDALVVFPRYITREDFFTEFEKLLRERPELEEINSVREAFVP 143
Query: 102 --HAEFRVKEVQYIQAEVKIIKCLVD------NFVVDIAFNQLGGLCTLCFLDEVDHLIN 153
EF + I A++ I + D N + +I L L DE+ +L+
Sbjct: 144 IIKLEFDGISIDLIFAKLDIPRIPKDLTLTDKNLLKNIDEKDLRALNGTRVTDEILNLVP 203
Query: 154 ENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRF 213
+ +FK ++ IK W + I +G A LV I ++ + + ++ +F
Sbjct: 204 KPTVFKHALRFIKMWA-QQRAIYANVYGFPGGVAWAMLVARICQLYPNAVSS--YIVEKF 260
Query: 214 LEFFSKFDW 222
+ +S++ W
Sbjct: 261 FQIYSQWSW 269
>gi|255732153|ref|XP_002551000.1| Poly(A) polymerase PAPalpha [Candida tropicalis MYA-3404]
gi|240131286|gb|EER30846.1| Poly(A) polymerase PAPalpha [Candida tropicalis MYA-3404]
Length = 558
Score = 41.6 bits (96), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 39/189 (20%), Positives = 75/189 (39%), Gaps = 22/189 (11%)
Query: 53 QVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHA--------- 103
+VFTFGS L Y P DID +D + + D++ + E
Sbjct: 83 KVFTFGSYRLGVYGPGSDIDTLVVVPKHVTRDDFFSVFPDIIRKRPELEEIACVPDAFVP 142
Query: 104 ----EFRVKEVQYIQAEVKIIKCLVD------NFVVDIAFNQLGGLCTLCFLDEVDHLIN 153
EF + I A + + + ++ N + ++ L L DE+ L+
Sbjct: 143 IIKLEFDGISIDLIMARLNVPRVPLEMTLDDKNLLKNLDEKDLRSLNGTRVTDEILQLVP 202
Query: 154 ENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRF 213
+ +FK ++ IK W + + G +G A LV I ++ + + ++ +F
Sbjct: 203 KPTVFKHALRCIKLWAQQRA-VYGNVYGFPGGVAWAMLVARICQLYPNAVSA--VIVEKF 259
Query: 214 LEFFSKFDW 222
++K++W
Sbjct: 260 FSIYTKWNW 268
>gi|198427134|ref|XP_002121817.1| PREDICTED: similar to PAP-associated domain-containing protein 5
(Topoisomerase-related function protein 4-2) (TRF4-2)
[Ciona intestinalis]
Length = 391
Score = 41.6 bits (96), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 70/309 (22%), Positives = 128/309 (41%), Gaps = 65/309 (21%)
Query: 29 PFSEER--RNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDT 85
P EER R V V ++++ +P C++ FGS YLP DID+ F +
Sbjct: 92 PTEEERQMREYVIKSVEEVVLELWPTCKLDVFGSFRTDLYLPTSDIDIVLFGE------- 144
Query: 86 WAHLVRDMLENE--EKNEHAEFRVKEVQYIQAEVKIIKCLVDNFV--VDIAFNQLGGLCT 141
W HL L+ K+ AE VK + +A V +IK + VDI+FN G+ +
Sbjct: 145 WEHLPLWSLQKALVSKDIVAEGSVKVLD--RAAVPLIKFQHKETLVKVDISFNIQSGVQS 202
Query: 142 LCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNG 201
+ + + + + + I ++K + + G +SSY+L+ + +
Sbjct: 203 VELIKD---FMKKYPALPKLIFVLKQFLLVRE-LNEVWTGGLSSYSLILMAISFLQTHPR 258
Query: 202 SFAGPLE-----VLYRFLEFFSK-FDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGVLLL 255
S + + +L FLE + + F++ + C+ + K+ G +
Sbjct: 259 SDSRDITNNLGVMLLEFLELYGRHFNYQSLCICV------------------KNKGYI-- 298
Query: 256 SKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNF--FRIRTA 313
+F +N QP + ++ DPL + N+LGR G++ +++ A
Sbjct: 299 ----------TKEEFRKQMDNGCQPSL---LSIEDPLTLGNDLGR----GSYAVMQVKQA 341
Query: 314 FTFRAKGLA 322
F F + L
Sbjct: 342 FEFSFRTLT 350
>gi|195115910|ref|XP_002002499.1| GI12386 [Drosophila mojavensis]
gi|193913074|gb|EDW11941.1| GI12386 [Drosophila mojavensis]
Length = 348
Score = 41.6 bits (96), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 67/259 (25%), Positives = 105/259 (40%), Gaps = 42/259 (16%)
Query: 27 PDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDT 85
P P R + + V R+I +P V FGS L LP+ DIDL +
Sbjct: 39 PTPTEHAARIELLSRVERVIQGLWPEALVEIFGSFRLGINLPNSDIDL-------VVLGC 91
Query: 86 WAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL--VDNFVVDIAFNQLGGLCTLC 143
W HL LE+E ++ A V II+ + VDI+FN G+ +
Sbjct: 92 WEHLPLRSLESELRSSGIVLPGTLQVVDTAAVPIIRFTDCETHLKVDISFNMPNGIDSSE 151
Query: 144 FLDEVDHLINENHLFKRSIILIKAWCYYESRILGGH-HGLISSYALVTLVLYIFHVF--- 199
+ + H E+ + + ++++K + E R L +G ISSY L+ + + +
Sbjct: 152 LIKKFLH---EHPVLGKLVLVLKQ--FLEQRNLNSTLNGGISSYNLIIMCINFLQMHPRQ 206
Query: 200 NGSFAGPLEVLYRFLEFFS----KFDWDNFCLSLW-----------GPVPISLLPDVTAE 244
+ L VL LEFF F++ +S+W G SL D
Sbjct: 207 RSPESTNLGVL--LLEFFELYGLSFNYAQIGISIWNGYVRKENILVGSRTPSLYIDDPLL 264
Query: 245 PPRKDGGVLLLSKSFLDSC 263
P R++ S+SF+ SC
Sbjct: 265 PGRQN------SRSFIASC 277
>gi|147905450|ref|NP_001089116.1| poly(A) polymerase gamma [Xenopus laevis]
gi|73671771|gb|AAZ80291.1| SRP 3'-adenylating protein [Xenopus laevis]
Length = 751
Score = 41.6 bits (96), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 45/201 (22%), Positives = 80/201 (39%), Gaps = 22/201 (10%)
Query: 46 IIQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEE--KN--- 100
II ++FTFGS L + DID + + + + L+ ++ KN
Sbjct: 87 IISAAGGKIFTFGSYRLGVHTKGADIDSLCVAPRHVERSDFFQTFSEKLKQQDGIKNLRA 146
Query: 101 -EHAEFRVKEVQYIQAEVKII------KCLVDNFVV--DIAFNQLGGLCTLCF-----LD 146
E A V + +++ E+ ++ +C+ DN + D L C D
Sbjct: 147 VEDAFVPVIKFEFMNTEIDLVFARLPLQCIPDNLDLRDDSRLRNLDIRCIRSLNGCRVTD 206
Query: 147 EVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGP 206
E+ HL+ F+ ++ IK W I G + + LV ++ + A
Sbjct: 207 EILHLVPNKENFRLTLRAIKLWAKRRG-IYSNMLGFLGGVSWAMLVARTCQLYPNAIAST 265
Query: 207 LEVLYRFLEFFSKFDWDNFCL 227
L +++F FSK++W N L
Sbjct: 266 L--VHKFFLVFSKWEWPNPVL 284
>gi|213625101|gb|AAI69821.1| LOC733387 protein [Xenopus laevis]
Length = 750
Score = 41.6 bits (96), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 45/201 (22%), Positives = 80/201 (39%), Gaps = 22/201 (10%)
Query: 46 IIQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEE--KN--- 100
II ++FTFGS L + DID + + + + L+ ++ KN
Sbjct: 86 IISAAGGKIFTFGSYRLGVHTKGADIDSLCVAPRHVERSDFFQTFSEKLKQQDGIKNLRA 145
Query: 101 -EHAEFRVKEVQYIQAEVKII------KCLVDNFVV--DIAFNQLGGLCTLCF-----LD 146
E A V + +++ E+ ++ +C+ DN + D L C D
Sbjct: 146 VEDAFVPVIKFEFMNTEIDLVFARLPLQCIPDNLDLRDDSRLRNLDIRCIRSLNGCRVTD 205
Query: 147 EVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGP 206
E+ HL+ F+ ++ IK W I G + + LV ++ + A
Sbjct: 206 EILHLVPNKENFRLTLRAIKLWAKRRG-IYSNMLGFLGGVSWAMLVARTCQLYPNAIAST 264
Query: 207 LEVLYRFLEFFSKFDWDNFCL 227
L +++F FSK++W N L
Sbjct: 265 L--VHKFFLVFSKWEWPNPVL 283
>gi|54020874|ref|NP_001005684.1| poly(A) polymerase gamma [Xenopus (Silurana) tropicalis]
gi|49522894|gb|AAH75107.1| poly(A) polymerase beta (testis specific) [Xenopus (Silurana)
tropicalis]
Length = 752
Score = 41.2 bits (95), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 45/201 (22%), Positives = 80/201 (39%), Gaps = 22/201 (10%)
Query: 46 IIQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEE--KN--- 100
II ++FTFGS L + DID + + + + L+ ++ KN
Sbjct: 87 IISAVGGKIFTFGSYRLGVHTKGADIDSLCVAPRHVERSDFFQSFSEKLKQQDGIKNLRA 146
Query: 101 -EHAEFRVKEVQYIQAEVKII------KCLVDNFVV--DIAFNQLGGLCTLCF-----LD 146
E A V + +++ E+ ++ +C+ DN + D L C D
Sbjct: 147 VEDAFVPVIKFEFMNTEIDLVFARLPLQCIPDNLDLRDDSRLRNLDIRCIRSLNGCRVTD 206
Query: 147 EVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGP 206
E+ HL+ F+ ++ IK W I G + + LV ++ + A
Sbjct: 207 EILHLVPNKENFRLTLRAIKLWAKRRG-IYSNMLGFLGGVSWAMLVARTCQLYPNAIAST 265
Query: 207 LEVLYRFLEFFSKFDWDNFCL 227
L +++F FSK++W N L
Sbjct: 266 L--VHKFFLVFSKWEWPNPVL 284
>gi|365982357|ref|XP_003668012.1| hypothetical protein NDAI_0A06140 [Naumovozyma dairenensis CBS 421]
gi|343766778|emb|CCD22769.1| hypothetical protein NDAI_0A06140 [Naumovozyma dairenensis CBS 421]
Length = 684
Score = 41.2 bits (95), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 66/265 (24%), Positives = 108/265 (40%), Gaps = 42/265 (15%)
Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
WL E + ++ I P E RN + +R+ + + + Q+ FGS YLP
Sbjct: 204 WLTLE--IKDFVSYISPSREEIELRNKTISKLRKAVKELWSDSQLHIFGSYATDLYLPGS 261
Query: 70 DID------LGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL 123
DID +G Q L D HL + L ++ + A+ RV +++++ +I
Sbjct: 262 DIDCVVNSKMGDKEQRQYLYDLARHLKQKGLTSQVE-VIAKARVPIIKFVEKSSQI---- 316
Query: 124 VDNFVVDIAFNQLGGLCTLCFLDEVDHLINE---NHLFKRSIILIKAWCYYESRILGGHH 180
+D++F + G+ E LI E R +ILI R+ H
Sbjct: 317 ----HIDVSFERTNGV-------EAAKLIREWLSATPGLRELILIVKQFLSARRLNDVHT 365
Query: 181 GLISSYALVTLV---LYIFHVFNGSFAGPLE----VLYRFLEFFSKFDWDNFCLSLWGPV 233
G + + ++ LV L + + PLE +L F E + K NF L V
Sbjct: 366 GGLGGFTIICLVYSFLSMHPRIKTNDIDPLENLGVLLIEFFELYGK----NFAYDL---V 418
Query: 234 PISLLPDVTAEPPRKDGGVLLLSKS 258
ISLL + P+ + LL ++S
Sbjct: 419 AISLLDGYPSYIPKSEWRSLLPTRS 443
>gi|351712688|gb|EHB15607.1| PAP-associated domain-containing protein 5, partial [Heterocephalus
glaber]
Length = 599
Score = 41.2 bits (95), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 60/130 (46%), Gaps = 15/130 (11%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
EEI ++ + P P E+ R V + + +I + +P V FGS YLP DIDL
Sbjct: 102 EEI-SDFYEYMSPRPEEEKMRMEVVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 160
Query: 74 GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
F W +L LE E +H V+ + +A V IIK L D+F V
Sbjct: 161 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 211
Query: 130 DIAFNQLGGL 139
DI+FN G+
Sbjct: 212 DISFNVQNGV 221
>gi|218186296|gb|EEC68723.1| hypothetical protein OsI_37216 [Oryza sativa Indica Group]
Length = 112
Score = 41.2 bits (95), Expect = 0.81, Method: Composition-based stats.
Identities = 20/44 (45%), Positives = 27/44 (61%)
Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFPCQV 54
W E ++ARIQP+P SE+RR AV AYV+ L+ CQ+
Sbjct: 30 WDPLEAAAGAVVARIQPNPPSEDRRAAVIAYVQGLLRFNVGCQM 73
>gi|238609344|ref|XP_002397464.1| hypothetical protein MPER_02102 [Moniliophthora perniciosa FA553]
gi|215471952|gb|EEB98394.1| hypothetical protein MPER_02102 [Moniliophthora perniciosa FA553]
Length = 174
Score = 41.2 bits (95), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 34/116 (29%), Positives = 51/116 (43%), Gaps = 17/116 (14%)
Query: 25 IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
I P P +E R+ + + I +P +V FGS K YLP DID+ S T+
Sbjct: 27 ISPSPVEDEIRSLLVQLISSAIKTRYPDAEVHPFGSYATKLYLPTGDIDIVVLSRTHTI- 85
Query: 84 DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGL 139
+ V L A+ RV V+++ + + VDI+FNQ GG+
Sbjct: 86 -AFRCFVTAKL--------AKARVPIVKFVT------RVELGGIPVDISFNQPGGV 126
>gi|431914108|gb|ELK15367.1| PAP-associated domain-containing protein 5 [Pteropus alecto]
Length = 530
Score = 40.8 bits (94), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 46/143 (32%), Positives = 65/143 (45%), Gaps = 17/143 (11%)
Query: 2 VIRPLDPGRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSV 60
++R GR EEI+ + + P P E+ R V + + +I + +P V FGS
Sbjct: 22 LVRSAQTGRL--HEEIS-DFYEYMSPRPEEEKMRMEVVSRIESVIKELWPSADVQIFGSF 78
Query: 61 PLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKI 119
YLP DIDL F W +L LE E +H V+ + +A V I
Sbjct: 79 KTGLYLPTSDIDLVVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPI 130
Query: 120 IKCLVDNFV---VDIAFNQLGGL 139
IK L D+F VDI+FN G+
Sbjct: 131 IK-LTDSFTEVKVDISFNVQNGV 152
>gi|254573058|ref|XP_002493638.1| Catalytic subunit of TRAMP (Trf4/Pap2p-Mtr4p-Air1p/2p)
[Komagataella pastoris GS115]
gi|238033437|emb|CAY71459.1| Catalytic subunit of TRAMP (Trf4/Pap2p-Mtr4p-Air1p/2p)
[Komagataella pastoris GS115]
gi|328354535|emb|CCA40932.1| DNA polymerase sigma subunit [Komagataella pastoris CBS 7435]
Length = 601
Score = 40.8 bits (94), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 56/238 (23%), Positives = 99/238 (41%), Gaps = 25/238 (10%)
Query: 4 RPLDPGRWLKAEEITAELIARIQPDPFS-EERRNAVAAYVRRLIIQCFP-CQVFTFGSVP 61
+ L+ WL E + I I P E R NAV + + +P C V FGS
Sbjct: 127 KQLELSDWLTLE--IKDFINYISPSIAEIEARNNAVKRLRKEITTNLWPDCYVNVFGSFA 184
Query: 62 LKTYLPDRDIDLGAFSDD-QTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKII 120
YLP DID+ SD + ++ + + L ++ + E +A+V II
Sbjct: 185 TDLYLPGSDIDMVITSDSGKYCAKSYLYQLSSFLRSKNLGVNIE------TIARAKVPII 238
Query: 121 KCLV--DNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGG 178
K + +D++F + GL + + + E + ++++K + R+
Sbjct: 239 KFIEPRSKIHIDVSFEKTNGLRA---AERIQGWLRETPGLRELVLIVKQFLAVR-RMNNV 294
Query: 179 HHGLISSYALVTLV---LYIFHVFNGSFAGPLE----VLYRFLEFFS-KFDWDNFCLS 228
HHG + ++++ LV L + + PL+ +L F E + F +DN LS
Sbjct: 295 HHGGLGGFSIICLVHSFLSLHPRLITNSIDPLDNLGVLLIEFFELYGYNFGYDNVILS 352
>gi|403159818|ref|XP_003320384.2| hypothetical protein PGTG_01296 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375168256|gb|EFP75965.2| hypothetical protein PGTG_01296 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 876
Score = 40.8 bits (94), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 65/138 (47%), Gaps = 10/138 (7%)
Query: 17 ITAEL---IARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDID 72
+TAE+ +A IQP + R + +R+ + +P V FGS K YLP DID
Sbjct: 72 LTAEIGSFVAYIQPTHEEHQLRQMIIQMIRKTVHSRWPDADVEPFGSFGTKLYLPAGDID 131
Query: 73 LGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKII--KCLVDNFVVD 130
L S Q + + + ++ + +N + V +A+V II K + N VD
Sbjct: 132 LVIIS-TQMMNEQKSRILYKLAPLIRENNIGQ---DVVVIAKAKVPIIKFKTIFGNINVD 187
Query: 131 IAFNQLGGLCTLCFLDEV 148
I+ NQ G+ + ++E+
Sbjct: 188 ISINQTNGIVAMKKVNEL 205
>gi|427795543|gb|JAA63223.1| Putative pap-associated domain-containing protein 5, partial
[Rhipicephalus pulchellus]
Length = 627
Score = 40.8 bits (94), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 54/201 (26%), Positives = 84/201 (41%), Gaps = 23/201 (11%)
Query: 25 IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
+QP P + R V ++ +I+ +P +V FGS YLP DID+ +TL
Sbjct: 153 MQPTPAEHQMRLGVIQRIKDVILGLWPQAEVEIFGSFRTGLYLPTSDIDVVVLGKWETLP 212
Query: 84 DTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVD---NFVVDIAFNQLGGL 139
W L + +L H + ++ + +A V I+K L D VDI+FN G+
Sbjct: 213 -MWT-LEKALL------SHGIAEPQSIKVLDKASVPIVK-LTDAKTTVKVDISFNMNNGV 263
Query: 140 CTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALV--TLVLYIFH 197
+ C + E ++L+ + + G ISSY+L+ T+ H
Sbjct: 264 KSACLIQS----FKEKFPALPKLVLVLKQFLLQRDLNEVFTGGISSYSLILMTVSFLQLH 319
Query: 198 VFNGSFAGP-LEVLYRFLEFF 217
G P L L LEFF
Sbjct: 320 PRGGDAPSPNLGTL--LLEFF 338
>gi|256818784|ref|NP_001157969.1| PAP-associated domain-containing protein 5 isoform a [Mus musculus]
gi|256818786|ref|NP_001157970.1| PAP-associated domain-containing protein 5 isoform a [Mus musculus]
Length = 680
Score = 40.8 bits (94), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 60/130 (46%), Gaps = 15/130 (11%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
EEI+ + + P P E+ R V + + +I + +P V FGS YLP DIDL
Sbjct: 183 EEIS-DFYEYMSPRPEEEKMRMEVVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 241
Query: 74 GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
F W +L LE E +H V+ + +A V IIK L D+F V
Sbjct: 242 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 292
Query: 130 DIAFNQLGGL 139
DI+FN G+
Sbjct: 293 DISFNVQNGV 302
>gi|395839409|ref|XP_003792582.1| PREDICTED: PAP-associated domain-containing protein 5 [Otolemur
garnettii]
Length = 629
Score = 40.8 bits (94), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 60/130 (46%), Gaps = 15/130 (11%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
EEI+ + + P P E+ R V + + +I + +P V FGS YLP DIDL
Sbjct: 132 EEIS-DFYEYMSPRPEEEKMRMEVVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 190
Query: 74 GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
F W +L LE E +H V+ + +A V IIK L D+F V
Sbjct: 191 VVFG-------KWENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 241
Query: 130 DIAFNQLGGL 139
DI+FN G+
Sbjct: 242 DISFNVQNGV 251
>gi|354474676|ref|XP_003499556.1| PREDICTED: PAP-associated domain-containing protein 5-like
[Cricetulus griseus]
Length = 464
Score = 40.8 bits (94), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 60/130 (46%), Gaps = 15/130 (11%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
EEI ++ + P P E+ R V + + +I + +P V FGS YLP DIDL
Sbjct: 14 EEI-SDFYEYMSPRPEEEKMRMEVVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 72
Query: 74 GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
F W +L LE E +H V+ + +A V IIK L D+F V
Sbjct: 73 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 123
Query: 130 DIAFNQLGGL 139
DI+FN G+
Sbjct: 124 DISFNVQNGV 133
>gi|60392891|sp|Q68ED3.2|PAPD5_MOUSE RecName: Full=PAP-associated domain-containing protein 5; AltName:
Full=Topoisomerase-related function protein 4-2;
Short=TRF4-2
gi|148878177|gb|AAI45738.1| Papd5 protein [Mus musculus]
gi|219519562|gb|AAI44797.1| Papd5 protein [Mus musculus]
Length = 633
Score = 40.8 bits (94), Expect = 1.00, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 60/130 (46%), Gaps = 15/130 (11%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
EEI+ + + P P E+ R V + + +I + +P V FGS YLP DIDL
Sbjct: 136 EEIS-DFYEYMSPRPEEEKMRMEVVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 194
Query: 74 GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
F W +L LE E +H V+ + +A V IIK L D+F V
Sbjct: 195 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 245
Query: 130 DIAFNQLGGL 139
DI+FN G+
Sbjct: 246 DISFNVQNGV 255
>gi|239835858|gb|ACS29269.1| pap1 [Meyerozyma guilliermondii]
Length = 564
Score = 40.8 bits (94), Expect = 1.00, Method: Compositional matrix adjust.
Identities = 42/206 (20%), Positives = 80/206 (38%), Gaps = 31/206 (15%)
Query: 53 QVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEH---------- 102
++FTFGS L Y P DID +++ + + M+ + E
Sbjct: 82 KIFTFGSYRLGVYGPGSDIDTLIVVPKHVVREDFFTIFDQMIRQRPELEEITAVPDAFVP 141
Query: 103 ---AEFRVKEVQYIQAEVKIIKCLVD------NFVVDIAFNQLGGLCTLCFLDEVDHLIN 153
EF + I A + + + +D N + +I N + L D++ L+
Sbjct: 142 IIMIEFSGISIDLIFARLNVSRVPLDMTLEDNNLLKNIDENDMRALNGTRVTDQILQLVP 201
Query: 154 ENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRF 213
+ +FK ++ IK W + + G G A LV I ++ + + ++ +F
Sbjct: 202 KVTVFKHALRCIKLWAQQRA-VYGNMFGFPGGVAWAMLVARICQLYPNAVSA--VIVEKF 258
Query: 214 LEFFSKFDWDNFCLSLWGPVPISLLP 239
++K++W P P+ L P
Sbjct: 259 FNIYTKWNW---------PQPVLLKP 275
>gi|51328369|gb|AAH80314.1| Papd5 protein [Mus musculus]
Length = 583
Score = 40.8 bits (94), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 60/130 (46%), Gaps = 15/130 (11%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
EEI+ + + P P E+ R V + + +I + +P V FGS YLP DIDL
Sbjct: 86 EEIS-DFYEYMSPRPEEEKMRMEVVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 144
Query: 74 GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
F W +L LE E +H V+ + +A V IIK L D+F V
Sbjct: 145 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 195
Query: 130 DIAFNQLGGL 139
DI+FN G+
Sbjct: 196 DISFNVQNGV 205
>gi|340506956|gb|EGR32991.1| hypothetical protein IMG5_064460 [Ichthyophthirius multifiliis]
Length = 347
Score = 40.8 bits (94), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 54/219 (24%), Positives = 90/219 (41%), Gaps = 30/219 (13%)
Query: 20 ELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSD 78
EL + P E R + ++I P C+V TFGS K YLP+ DID+ +
Sbjct: 54 ELTEYLAPTKEEHELRIKSFENLTQIIKSVIPDCEVKTFGSFSSKLYLPNSDIDIVIVKE 113
Query: 79 DQTLKDTWAHLVRDMLENEEKNEHAEF----RVKEVQYIQAEVKIIKCLVDNFVVDIAFN 134
++ K + + +L E+ E+ F +V +++++ K NF DI+FN
Sbjct: 114 GESNKYLYKKVADVVLTCEDIYENISFITNAKVPLIKFVE------KSTQTNF--DISFN 165
Query: 135 QLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILG-GHHGLISSYALVTLVL 193
+ G+ L EV + K I ++K C R L + G I S+ L ++L
Sbjct: 166 KEDGVKQ---LPEVQKCLQIYPEIKYLIFIMK--CILRQRDLNETYTGGIGSFLLFCMIL 220
Query: 194 YIFHVFNGSFAGPLEV-----------LYRFLEFFSKFD 221
+ +V L + +F+S FD
Sbjct: 221 AFLRELRKEYKDNNKVSEIKNITLGEYLLKMFKFYSNFD 259
>gi|47209824|emb|CAF91228.1| unnamed protein product [Tetraodon nigroviridis]
Length = 964
Score = 40.4 bits (93), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 39/116 (33%), Positives = 52/116 (44%), Gaps = 16/116 (13%)
Query: 25 IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
I P P E R V + ++I +P QV FGS YLP DIDL F
Sbjct: 461 ISPRPEEEAMRRDVVNRIEKVIKDLWPTAQVEIFGSFSTGLYLPTSDIDLVVFGK----- 515
Query: 84 DTWAHLVRDMLEN--EEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFV---VDIAFN 134
W H LE +++N + +K + +A V IIK L D+ VDI+FN
Sbjct: 516 --WDHPPLQELEQALKKRNVAGPYPIKVLD--KATVPIIK-LTDHETEVKVDISFN 566
>gi|440291374|gb|ELP84643.1| PAP-associated domain containing protein, putative [Entamoeba
invadens IP1]
Length = 475
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 61/244 (25%), Positives = 106/244 (43%), Gaps = 42/244 (17%)
Query: 10 RWLKAE--EITAE-----LIARIQPDPFSEERRNAVAAYVRRLII--QCFPCQVFTFGSV 60
+WL+ E +IT + L ++P+P E R V R+I + +V FGS
Sbjct: 5 KWLEYEGGDITLDDEFDILYHYVEPNPIEYEIRRYVLEKYTRVIENDKKSEIKVVPFGST 64
Query: 61 PLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDM----LENEEKNEHAEFRVKEVQYIQAE 116
K +LP DID + + R + +E+E++ ++A
Sbjct: 65 QSKLFLPSSDIDFTVVTKGGKTNMVLNSVARILSLYTMEDEKR------------ALRAT 112
Query: 117 VKIIKCLVD---NFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKA-WCYYE 172
V +IK L D V+DI+ N G+ T+ ++++ + N L + + +IK YE
Sbjct: 113 VPVIK-LTDRETGIVLDISHNNESGVDTVRWMEKE---MKSNALIRPLLFIIKTVLSSYE 168
Query: 173 SRI--LGGHHGLISSYALVTLVLYIFHVFNGSFAGPL--EVLYRFLEFF-SKFDWDNFCL 227
+ LGG + +Y+L +V F +L RFL+++ ++FD F L
Sbjct: 169 LNLPALGG----LGTYSLFMMVFCFFREKGSDLKDKRGGAILLRFLKYYATEFDSRKFGL 224
Query: 228 SLWG 231
S+ G
Sbjct: 225 SVTG 228
>gi|256818788|ref|NP_001157971.1| PAP-associated domain-containing protein 5 isoform b [Mus musculus]
Length = 637
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 60/130 (46%), Gaps = 15/130 (11%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
EEI+ + + P P E+ R V + + +I + +P V FGS YLP DIDL
Sbjct: 183 EEIS-DFYEYMSPRPEEEKMRMEVVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 241
Query: 74 GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
F W +L LE E +H V+ + +A V IIK L D+F V
Sbjct: 242 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 292
Query: 130 DIAFNQLGGL 139
DI+FN G+
Sbjct: 293 DISFNVQNGV 302
>gi|219518398|gb|AAI44798.1| Papd5 protein [Mus musculus]
Length = 590
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 60/130 (46%), Gaps = 15/130 (11%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
EEI+ + + P P E+ R V + + +I + +P V FGS YLP DIDL
Sbjct: 136 EEIS-DFYEYMSPRPEEEKMRMEVVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 194
Query: 74 GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
F W +L LE E +H V+ + +A V IIK L D+F V
Sbjct: 195 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 245
Query: 130 DIAFNQLGGL 139
DI+FN G+
Sbjct: 246 DISFNVQNGV 255
>gi|426382139|ref|XP_004057678.1| PREDICTED: PAP-associated domain-containing protein 5 isoform 2
[Gorilla gorilla gorilla]
Length = 664
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
EEI+ + + P P E+ R V + +I + +P V FGS YLP DIDL
Sbjct: 214 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 272
Query: 74 GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
F W +L LE E +H V+ + +A V IIK L D+F V
Sbjct: 273 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 323
Query: 130 DIAFNQLGGL 139
DI+FN G+
Sbjct: 324 DISFNVQNGV 333
>gi|410905163|ref|XP_003966061.1| PREDICTED: DNA polymerase sigma-like [Takifugu rubripes]
Length = 778
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 39/116 (33%), Positives = 52/116 (44%), Gaps = 16/116 (13%)
Query: 25 IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
I P P E R V + R+I +P +V FGS YLP DIDL F
Sbjct: 248 ISPRPEEEAMRRDVVNRIERVIKDLWPTARVEIFGSFSTGLYLPTSDIDLVVFGK----- 302
Query: 84 DTWAHLVRDMLEN--EEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFV---VDIAFN 134
W H LE +++N + +K + +A V IIK L D+ VDI+FN
Sbjct: 303 --WDHPPLQELEQALKKRNVAGPYPIKVLD--KATVPIIK-LTDHETEVKVDISFN 353
>gi|441597299|ref|XP_003263084.2| PREDICTED: PAP-associated domain-containing protein 5 isoform 2
[Nomascus leucogenys]
Length = 666
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
EEI+ + + P P E+ R V + +I + +P V FGS YLP DIDL
Sbjct: 216 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 274
Query: 74 GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
F W +L LE E +H V+ + +A V IIK L D+F V
Sbjct: 275 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 325
Query: 130 DIAFNQLGGL 139
DI+FN G+
Sbjct: 326 DISFNVQNGV 335
>gi|296231051|ref|XP_002760982.1| PREDICTED: PAP-associated domain-containing protein 5 isoform 2
[Callithrix jacchus]
Length = 664
Score = 40.0 bits (92), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
EEI+ + + P P E+ R V + +I + +P V FGS YLP DIDL
Sbjct: 214 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 272
Query: 74 GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
F W +L LE E +H V+ + +A V IIK L D+F V
Sbjct: 273 VVFG-------KWENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 323
Query: 130 DIAFNQLGGL 139
DI+FN G+
Sbjct: 324 DISFNVQNGV 333
>gi|402550493|pdb|4FHX|A Chain A, Crystal Structures Of The Cid1 Poly (U) Polymerase Reveal
The Mechanism For Utp Selectivity - H336n Mutant Bound
To Mgatp
Length = 349
Score = 40.0 bits (92), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 50/184 (27%), Positives = 77/184 (41%), Gaps = 23/184 (12%)
Query: 24 RIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTL 82
+I F E+R A +R + + P ++ FGS+ L + D+DL D +
Sbjct: 28 KISDKEFKEKR--AALDTLRLCLKRISPDAELVAFGSLESGLALKNSDMDLCVLMDSRVQ 85
Query: 83 KDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVD-------NFVVDIAFNQ 135
DT A L+ E+ F K +Q +A + IIK D +F DI FN
Sbjct: 86 SDTIA------LQFYEELIAEGFEGKFLQ--RARIPIIKLTSDTKNGFGASFQCDIGFNN 137
Query: 136 LGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVL-Y 194
+ L L + K ++L+K W +I + G +SSY V +VL Y
Sbjct: 138 RLAIHNTLLLSSYTKL---DARLKPMVLLVKHWA-KRKQINSPYFGTLSSYGYVLMVLYY 193
Query: 195 IFHV 198
+ HV
Sbjct: 194 LIHV 197
>gi|63101121|gb|AAY33178.1| PAPa [Candida parapsilosis]
gi|354544642|emb|CCE41367.1| hypothetical protein CPAR2_303560 [Candida parapsilosis]
Length = 552
Score = 40.0 bits (92), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 39/189 (20%), Positives = 78/189 (41%), Gaps = 22/189 (11%)
Query: 53 QVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKD----TWAHLVRDMLENEEKN--EHAEFR 106
++FTFGS L Y P DID +D + ++R E EE N + A
Sbjct: 82 KIFTFGSYKLGVYGPSSDIDALVVVPRHVTRDDFFTVFEKILRGRQELEEINCVKEAFVP 141
Query: 107 VKEVQYIQAEVKIIKCLVD-------------NFVVDIAFNQLGGLCTLCFLDEVDHLIN 153
+ ++++ + ++ +D N + +I + L DE+ L+
Sbjct: 142 IIKLEFAGISIDLLFAKLDIPRVPHDLTLDDKNLLKNIDEKDMRALNGTRVTDEILRLVP 201
Query: 154 ENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRF 213
++ +FK ++ +K W + I +G A LV I ++ + + +L +F
Sbjct: 202 KSTVFKNALRFVKMWAQQRA-IYANVYGFPGGVAWAMLVARICQLYPNAVSAV--ILEKF 258
Query: 214 LEFFSKFDW 222
+ +S++ W
Sbjct: 259 FQIYSQWSW 267
>gi|389738915|gb|EIM80110.1| hypothetical protein STEHIDRAFT_126102 [Stereum hirsutum FP-91666
SS1]
Length = 1326
Score = 40.0 bits (92), Expect = 1.6, Method: Composition-based stats.
Identities = 51/196 (26%), Positives = 86/196 (43%), Gaps = 29/196 (14%)
Query: 20 ELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAF-- 76
+ + ++ P P + V + RLI P ++ +FGS L + D+DL
Sbjct: 47 DFVIQLLPTPEELSVKEDVRKLLERLIRTIEPDSRLLSFGSTANGFSLRNSDMDLCCLID 106
Query: 77 SDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVD-------NFVV 129
SD++ ++ D+LE E K F VK + + A + I+K +D
Sbjct: 107 SDERLSAADLVTMLGDLLERETK-----FHVKPLPH--ARIPIVKLSLDPSPGLPLGIAC 159
Query: 130 DIAF-NQLGGLCT---LCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISS 185
DI F N+L T C+ +I+ + + ++ +K WC +I + G +SS
Sbjct: 160 DIGFENRLALENTRLLYCYA-----MIDPTRV-RTLVLFLKVWCK-RRKINSPYQGTLSS 212
Query: 186 YALVTLVLY-IFHVFN 200
Y V LV+Y + HV N
Sbjct: 213 YGYVLLVIYFLVHVKN 228
>gi|402550488|pdb|4FH3|A Chain A, Crystal Structures Of The Cid1 Poly (U) Polymerase Reveal
The Mechanism For Utp Selectivity
gi|402550489|pdb|4FH5|A Chain A, Crystal Structures Of The Cid1 Poly (U) Polymerase Reveal
The Mechanism For Utp Selectivity - Mgutp Bound
gi|402550490|pdb|4FHP|A Chain A, Crystal Structures Of The Cid1 Poly (U) Polymerase Reveal
The Mechanism For Utp Selectivity - Cautp Bound
gi|402550491|pdb|4FHV|A Chain A, Crystal Structures Of The Cid1 Poly (U) Polymerase Reveal
The Mechanism For Utp Selectivity - Mgctp Bound
gi|402550492|pdb|4FHW|A Chain A, Crystal Structures Of The Cid1 Poly (U) Polymerase Reveal
The Mechanism For Utp Selectivity - Mggtp Bound
gi|402550494|pdb|4FHY|A Chain A, Crystal Structures Of The Cid1 Poly (U) Polymerase Reveal
The Mechanism For Utp Selectivity - Mg 3'-Datp Bound
Length = 349
Score = 40.0 bits (92), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 50/184 (27%), Positives = 77/184 (41%), Gaps = 23/184 (12%)
Query: 24 RIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTL 82
+I F E+R A +R + + P ++ FGS+ L + D+DL D +
Sbjct: 28 KISDKEFKEKR--AALDTLRLCLKRISPDAELVAFGSLESGLALKNSDMDLCVLMDSRVQ 85
Query: 83 KDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVD-------NFVVDIAFNQ 135
DT A L+ E+ F K +Q +A + IIK D +F DI FN
Sbjct: 86 SDTIA------LQFYEELIAEGFEGKFLQ--RARIPIIKLTSDTKNGFGASFQCDIGFNN 137
Query: 136 LGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVL-Y 194
+ L L + K ++L+K W +I + G +SSY V +VL Y
Sbjct: 138 RLAIHNTLLLSSYTKL---DARLKPMVLLVKHWA-KRKQINSPYFGTLSSYGYVLMVLYY 193
Query: 195 IFHV 198
+ HV
Sbjct: 194 LIHV 197
>gi|320039014|gb|EFW20949.1| hypothetical protein CPSG_02791 [Coccidioides posadasii str.
Silveira]
Length = 1241
Score = 40.0 bits (92), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 22/101 (21%), Positives = 48/101 (47%), Gaps = 2/101 (1%)
Query: 125 DNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLIS 184
D+ + D+++ L L LD + I + F+++ I AW + L G +
Sbjct: 770 DDPIFDLSYQALTKLQAFRDLDYIRRSIPDLAAFRKAHRFITAWAKHRGVYLS-RFGYLG 828
Query: 185 SYALVTLVLYIFHVFNGSF-AGPLEVLYRFLEFFSKFDWDN 224
+ ++ +F +F G +++YRF ++++ FDW++
Sbjct: 829 GIHITMMLSRVFKLFCGEVRVTSTDMIYRFFQYYADFDWEH 869
>gi|390136629|pdb|4EP7|A Chain A, Functional Implications From The Cid1 Poly(U) Polymerase
Crystal Structure
gi|390136630|pdb|4EP7|B Chain B, Functional Implications From The Cid1 Poly(U) Polymerase
Crystal Structure
Length = 340
Score = 40.0 bits (92), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 50/184 (27%), Positives = 77/184 (41%), Gaps = 23/184 (12%)
Query: 24 RIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTL 82
+I F E+R A +R + + P ++ FGS+ L + D+DL D +
Sbjct: 19 KISDKEFKEKR--AALDTLRLCLKRISPDAELVAFGSLESGLALKNSDMDLCVLMDSRVQ 76
Query: 83 KDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVD-------NFVVDIAFNQ 135
DT A L+ E+ F K +Q +A + IIK D +F DI FN
Sbjct: 77 SDTIA------LQFYEELIAEGFEGKFLQ--RARIPIIKLTSDTKNGFGASFQCDIGFNN 128
Query: 136 LGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVL-Y 194
+ L L + K ++L+K W +I + G +SSY V +VL Y
Sbjct: 129 RLAIHNTLLLSSYTKL---DARLKPMVLLVKHWA-KRKQINSPYFGTLSSYGYVLMVLYY 184
Query: 195 IFHV 198
+ HV
Sbjct: 185 LIHV 188
>gi|335308290|ref|XP_003361170.1| PREDICTED: PAP-associated domain-containing protein 5-like [Sus
scrofa]
Length = 511
Score = 40.0 bits (92), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 44/139 (31%), Positives = 60/139 (43%), Gaps = 17/139 (12%)
Query: 9 GRW---LKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKT 64
G W L E ++ + P P E+ R V + +I + +P V FGS
Sbjct: 4 GGWTGSLGLHEEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGL 63
Query: 65 YLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCL 123
YLP DIDL F W +L LE E +H V+ + +A V IIK L
Sbjct: 64 YLPTSDIDLVVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-L 114
Query: 124 VDNFV---VDIAFNQLGGL 139
D+F VDI+FN G+
Sbjct: 115 TDSFTEVKVDISFNVQNGV 133
>gi|440900205|gb|ELR51393.1| PAP-associated domain-containing protein 5, partial [Bos grunniens
mutus]
Length = 563
Score = 40.0 bits (92), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
EEI ++ + P P E+ R V + +I + +P V FGS YLP DIDL
Sbjct: 66 EEI-SDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 124
Query: 74 GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
F W +L LE E +H V+ + +A V IIK L D+F V
Sbjct: 125 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 175
Query: 130 DIAFNQLGGL 139
DI+FN G+
Sbjct: 176 DISFNVQNGV 185
>gi|119910013|ref|XP_001256516.1| PREDICTED: PAP-associated domain-containing protein 5 [Bos taurus]
gi|297485254|ref|XP_002694925.1| PREDICTED: PAP-associated domain-containing protein 5 [Bos taurus]
gi|296478153|tpg|DAA20268.1| TPA: DNA polymerase sigma-like [Bos taurus]
Length = 467
Score = 40.0 bits (92), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
EEI ++ + P P E+ R V + +I + +P V FGS YLP DIDL
Sbjct: 17 EEI-SDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 75
Query: 74 GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
F W +L LE E +H V+ + +A V IIK L D+F V
Sbjct: 76 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 126
Query: 130 DIAFNQLGGL 139
DI+FN G+
Sbjct: 127 DISFNVQNGV 136
>gi|397498213|ref|XP_003819879.1| PREDICTED: PAP-associated domain-containing protein 5, partial [Pan
paniscus]
Length = 593
Score = 40.0 bits (92), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
EEI+ + + P P E+ R V + +I + +P V FGS YLP DIDL
Sbjct: 96 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 154
Query: 74 GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
F W +L LE E +H V+ + +A V IIK L D+F V
Sbjct: 155 VVFG-------KWENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 205
Query: 130 DIAFNQLGGL 139
DI+FN G+
Sbjct: 206 DISFNVQNGV 215
>gi|303317898|ref|XP_003068951.1| Endonuclease/Exonuclease/phosphatase family protein [Coccidioides
posadasii C735 delta SOWgp]
gi|240108632|gb|EER26806.1| Endonuclease/Exonuclease/phosphatase family protein [Coccidioides
posadasii C735 delta SOWgp]
Length = 1241
Score = 40.0 bits (92), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 22/101 (21%), Positives = 48/101 (47%), Gaps = 2/101 (1%)
Query: 125 DNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLIS 184
D+ + D+++ L L LD + I + F+++ I AW + L G +
Sbjct: 770 DDPIFDLSYQALTKLQAFRDLDYIRRSIPDLAAFRKAHRFITAWAKHRGVYLS-RFGYLG 828
Query: 185 SYALVTLVLYIFHVFNGSF-AGPLEVLYRFLEFFSKFDWDN 224
+ ++ +F +F G +++YRF ++++ FDW++
Sbjct: 829 GIHITMMLSRVFKLFCGEVRVTSTDMIYRFFQYYADFDWEH 869
>gi|291410211|ref|XP_002721395.1| PREDICTED: DNA polymerase sigma-like [Oryctolagus cuniculus]
Length = 522
Score = 40.0 bits (92), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
EEI+ + + P P E+ R V + +I + +P V FGS YLP DIDL
Sbjct: 25 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 83
Query: 74 GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
F W +L LE E +H V+ + +A V IIK L D+F V
Sbjct: 84 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 134
Query: 130 DIAFNQLGGL 139
DI+FN G+
Sbjct: 135 DISFNVQNGV 144
>gi|59800139|sp|Q8NDF8.2|PAPD5_HUMAN RecName: Full=PAP-associated domain-containing protein 5; AltName:
Full=Terminal uridylyltransferase 3; Short=TUTase 3;
AltName: Full=Topoisomerase-related function protein
4-2; Short=TRF4-2
Length = 572
Score = 40.0 bits (92), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
EEI+ + + P P E+ R V + +I + +P V FGS YLP DIDL
Sbjct: 122 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 180
Query: 74 GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
F W +L LE E +H V+ + +A V IIK L D+F V
Sbjct: 181 VVFG-------KWENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 231
Query: 130 DIAFNQLGGL 139
DI+FN G+
Sbjct: 232 DISFNVQNGV 241
>gi|345325980|ref|XP_001507597.2| PREDICTED: PAP-associated domain-containing protein 5-like
[Ornithorhynchus anatinus]
Length = 578
Score = 40.0 bits (92), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 58/119 (48%), Gaps = 12/119 (10%)
Query: 25 IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
+ P P E+ R V + +I + +P V FGS YLP DIDL F + L
Sbjct: 90 MSPRPEEEKMRMEVVNRIENVIKELWPTADVQIFGSFKTGLYLPTSDIDLVVFGKWENLP 149
Query: 84 DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFV---VDIAFNQLGGL 139
W L + +++ +EH+ VK + +A V IIK L D+F VDI+FN G+
Sbjct: 150 -LWT-LEEALRKHKVADEHS---VKVLD--KATVPIIK-LTDSFTEVKVDISFNVQNGV 200
>gi|355710188|gb|EHH31652.1| hypothetical protein EGK_12764, partial [Macaca mulatta]
Length = 564
Score = 40.0 bits (92), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
EEI ++ + P P E+ R V + +I + +P V FGS YLP DIDL
Sbjct: 67 EEI-SDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 125
Query: 74 GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
F W +L LE E +H V+ + +A V IIK L D+F V
Sbjct: 126 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 176
Query: 130 DIAFNQLGGL 139
DI+FN G+
Sbjct: 177 DISFNVQNGV 186
>gi|426382137|ref|XP_004057677.1| PREDICTED: PAP-associated domain-containing protein 5 isoform 1
[Gorilla gorilla gorilla]
Length = 631
Score = 39.7 bits (91), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
EEI+ + + P P E+ R V + +I + +P V FGS YLP DIDL
Sbjct: 134 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 192
Query: 74 GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
F W +L LE E +H V+ + +A V IIK L D+F V
Sbjct: 193 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 243
Query: 130 DIAFNQLGGL 139
DI+FN G+
Sbjct: 244 DISFNVQNGV 253
>gi|403292555|ref|XP_003937307.1| PREDICTED: PAP-associated domain-containing protein 5 [Saimiri
boliviensis boliviensis]
Length = 631
Score = 39.7 bits (91), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
EEI+ + + P P E+ R V + +I + +P V FGS YLP DIDL
Sbjct: 134 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 192
Query: 74 GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
F W +L LE E +H V+ + +A V IIK L D+F V
Sbjct: 193 VVFG-------KWENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 243
Query: 130 DIAFNQLGGL 139
DI+FN G+
Sbjct: 244 DISFNVQNGV 253
>gi|390477686|ref|XP_002760981.2| PREDICTED: PAP-associated domain-containing protein 5 isoform 1
[Callithrix jacchus]
Length = 631
Score = 39.7 bits (91), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
EEI+ + + P P E+ R V + +I + +P V FGS YLP DIDL
Sbjct: 134 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 192
Query: 74 GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
F W +L LE E +H V+ + +A V IIK L D+F V
Sbjct: 193 VVFG-------KWENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 243
Query: 130 DIAFNQLGGL 139
DI+FN G+
Sbjct: 244 DISFNVQNGV 253
>gi|256818782|ref|NP_001035375.2| PAP-associated domain-containing protein 5 isoform b [Homo sapiens]
Length = 651
Score = 39.7 bits (91), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
EEI+ + + P P E+ R V + +I + +P V FGS YLP DIDL
Sbjct: 201 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 259
Query: 74 GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
F W +L LE E +H V+ + +A V IIK L D+F V
Sbjct: 260 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 310
Query: 130 DIAFNQLGGL 139
DI+FN G+
Sbjct: 311 DISFNVQNGV 320
>gi|359319041|ref|XP_535307.4| PREDICTED: PAP-associated domain-containing protein 5 [Canis lupus
familiaris]
Length = 641
Score = 39.7 bits (91), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
EEI+ + + P P E+ R V + +I + +P V FGS YLP DIDL
Sbjct: 144 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 202
Query: 74 GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
F W +L LE E +H V+ + +A V IIK L D+F V
Sbjct: 203 VVFG-------KWENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 253
Query: 130 DIAFNQLGGL 139
DI+FN G+
Sbjct: 254 DISFNVQNGV 263
>gi|441597295|ref|XP_003263083.2| PREDICTED: PAP-associated domain-containing protein 5 isoform 1
[Nomascus leucogenys]
gi|348031139|emb|CCB84642.1| PAP associated domain containing 5 [Homo sapiens]
Length = 631
Score = 39.7 bits (91), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
EEI+ + + P P E+ R V + +I + +P V FGS YLP DIDL
Sbjct: 134 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 192
Query: 74 GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
F W +L LE E +H V+ + +A V IIK L D+F V
Sbjct: 193 VVFG-------KWENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 243
Query: 130 DIAFNQLGGL 139
DI+FN G+
Sbjct: 244 DISFNVQNGV 253
>gi|119603156|gb|EAW82750.1| PAP associated domain containing 5, isoform CRA_c [Homo sapiens]
Length = 371
Score = 39.7 bits (91), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
EEI+ + + P P E+ R V + +I + +P V FGS YLP DIDL
Sbjct: 30 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 88
Query: 74 GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
F W +L LE E +H V+ + +A V IIK L D+F V
Sbjct: 89 VVFG-------KWENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 139
Query: 130 DIAFNQLGGL 139
DI+FN G+
Sbjct: 140 DISFNVQNGV 149
>gi|19115813|ref|NP_594901.1| poly(A) polymerase Cid1 [Schizosaccharomyces pombe 972h-]
gi|15213942|sp|O13833.2|CID1_SCHPO RecName: Full=Poly(A) RNA polymerase protein cid1; AltName:
Full=Caffeine-induced death protein 1
gi|393715400|pdb|4E7X|A Chain A, Structural Basis For The Activity Of A Cytoplasmic Rna
Terminal U- Transferase
gi|393715401|pdb|4E7X|B Chain B, Structural Basis For The Activity Of A Cytoplasmic Rna
Terminal U- Transferase
gi|393715402|pdb|4E7X|C Chain C, Structural Basis For The Activity Of A Cytoplasmic Rna
Terminal U- Transferase
gi|393715403|pdb|4E7X|D Chain D, Structural Basis For The Activity Of A Cytoplasmic Rna
Terminal U- Transferase
gi|393715405|pdb|4E80|A Chain A, Structural Basis For The Activity Of A Cytoplasmic Rna
Terminal U- Transferase
gi|393715406|pdb|4E80|B Chain B, Structural Basis For The Activity Of A Cytoplasmic Rna
Terminal U- Transferase
gi|393715407|pdb|4E80|C Chain C, Structural Basis For The Activity Of A Cytoplasmic Rna
Terminal U- Transferase
gi|393715408|pdb|4E80|D Chain D, Structural Basis For The Activity Of A Cytoplasmic Rna
Terminal U- Transferase
gi|393715409|pdb|4E8F|A Chain A, Structural Basis For The Activity Of A Cytoplasmic Rna
Terminal U- Transferase
gi|393715410|pdb|4E8F|B Chain B, Structural Basis For The Activity Of A Cytoplasmic Rna
Terminal U- Transferase
gi|4324457|gb|AAD16889.1| caffeine-induced death protein 1 [Schizosaccharomyces pombe]
gi|5524947|emb|CAB50789.1| poly(A) polymerase Cid1 [Schizosaccharomyces pombe]
Length = 405
Score = 39.7 bits (91), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 50/184 (27%), Positives = 76/184 (41%), Gaps = 23/184 (12%)
Query: 24 RIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTL 82
+I F E+R A +R + + P ++ FGS+ L + D+DL D +
Sbjct: 56 KISDKEFKEKR--AALDTLRLCLKRISPDAELVAFGSLESGLALKNSDMDLCVLMDSRVQ 113
Query: 83 KDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVD-------NFVVDIAFNQ 135
DT A + L E F K +Q +A + IIK D +F DI FN
Sbjct: 114 SDTIALQFYEELIAE------GFEGKFLQ--RARIPIIKLTSDTKNGFGASFQCDIGFNN 165
Query: 136 LGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVL-Y 194
+ L L + K ++L+K W +I + G +SSY V +VL Y
Sbjct: 166 RLAIHNTLLLSSYTKL---DARLKPMVLLVKHWA-KRKQINSPYFGTLSSYGYVLMVLYY 221
Query: 195 IFHV 198
+ HV
Sbjct: 222 LIHV 225
>gi|380798533|gb|AFE71142.1| PAP-associated domain-containing protein 5 isoform a, partial
[Macaca mulatta]
Length = 618
Score = 39.7 bits (91), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
EEI+ + + P P E+ R V + +I + +P V FGS YLP DIDL
Sbjct: 121 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 179
Query: 74 GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
F W +L LE E +H V+ + +A V IIK L D+F V
Sbjct: 180 VVFG-------KWENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 230
Query: 130 DIAFNQLGGL 139
DI+FN G+
Sbjct: 231 DISFNVQNGV 240
>gi|301756837|ref|XP_002914273.1| PREDICTED: PAP-associated domain-containing protein 5-like, partial
[Ailuropoda melanoleuca]
Length = 593
Score = 39.7 bits (91), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
EEI+ + + P P E+ R V + +I + +P V FGS YLP DIDL
Sbjct: 143 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 201
Query: 74 GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
F W +L LE E +H V+ + +A V IIK L D+F V
Sbjct: 202 VVFG-------KWENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 252
Query: 130 DIAFNQLGGL 139
DI+FN G+
Sbjct: 253 DISFNVQNGV 262
>gi|256818780|ref|NP_001035374.2| PAP-associated domain-containing protein 5 isoform a [Homo sapiens]
gi|194374871|dbj|BAG62550.1| unnamed protein product [Homo sapiens]
Length = 698
Score = 39.7 bits (91), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
EEI+ + + P P E+ R V + +I + +P V FGS YLP DIDL
Sbjct: 201 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 259
Query: 74 GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
F W +L LE E +H V+ + +A V IIK L D+F V
Sbjct: 260 VVFG-------KWENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 310
Query: 130 DIAFNQLGGL 139
DI+FN G+
Sbjct: 311 DISFNVQNGV 320
>gi|402908342|ref|XP_003916909.1| PREDICTED: PAP-associated domain-containing protein 5 [Papio
anubis]
Length = 605
Score = 39.7 bits (91), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
EEI+ + + P P E+ R V + +I + +P V FGS YLP DIDL
Sbjct: 108 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 166
Query: 74 GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
F W +L LE E +H V+ + +A V IIK L D+F V
Sbjct: 167 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 217
Query: 130 DIAFNQLGGL 139
DI+FN G+
Sbjct: 218 DISFNVQNGV 227
>gi|426243516|ref|XP_004015600.1| PREDICTED: PAP-associated domain-containing protein 5 [Ovis aries]
Length = 588
Score = 39.7 bits (91), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
EEI ++ + P P E+ R V + +I + +P V FGS YLP DIDL
Sbjct: 138 EEI-SDFYEYMSPRPEEEKMRMEVVNRIEGVIKELWPSADVQIFGSFKTGLYLPTSDIDL 196
Query: 74 GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
F W +L LE E +H V+ + +A V IIK L D+F V
Sbjct: 197 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 247
Query: 130 DIAFNQLGGL 139
DI+FN G+
Sbjct: 248 DISFNVQNGV 257
>gi|432853107|ref|XP_004067543.1| PREDICTED: PAP-associated domain-containing protein 5-like [Oryzias
latipes]
Length = 679
Score = 39.7 bits (91), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 38/126 (30%), Positives = 55/126 (43%), Gaps = 10/126 (7%)
Query: 25 IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
I P P E+ R V ++ +I +P +V FGS YLP DIDL F +TL
Sbjct: 197 ISPRPEEEKMRLEVVDRIKGVIHDLWPSAEVQVFGSFSTGLYLPTSDIDLVVFGKWETLP 256
Query: 84 DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL--VDNFVVDIAFNQLGGLCT 141
W + + L + + +V + +A V IIK V VDI+FN G+
Sbjct: 257 -LWT--LEEALRKRNVADKSAIKVLD----KATVPIIKLTDSVTEVKVDISFNVESGVKA 309
Query: 142 LCFLDE 147
+ E
Sbjct: 310 ARLIKE 315
>gi|344301689|gb|EGW31994.1| Poly(A) polymerase PAPalpha [Spathaspora passalidarum NRRL Y-27907]
Length = 556
Score = 39.7 bits (91), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 45/206 (21%), Positives = 79/206 (38%), Gaps = 31/206 (15%)
Query: 53 QVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKD----TWAHLVRDMLENEEKNE------- 101
+VFTFGS L Y P DID ++ + ++R E +E
Sbjct: 82 KVFTFGSYRLGVYGPGSDIDTLVVVPKHVTREDFFTVFEQIIRKRPELQEIASVPDAFVP 141
Query: 102 --HAEFRVKEVQYIQAEVKIIKCLVD------NFVVDIAFNQLGGLCTLCFLDEVDHLIN 153
EF + I A + + + +D N + +I L L DE+ L+
Sbjct: 142 IIKIEFDGISIDLILARLNVPRVPLDMTLDDKNLLKNIDERDLRSLNGTRVTDEILQLVP 201
Query: 154 ENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRF 213
+ +FK ++ IK W + + G G A LV I ++ + + ++ +F
Sbjct: 202 KPTVFKHALRCIKLWAQQRA-VYGNVFGFPGGVAWAMLVARICQLYPNAVSA--VIVEKF 258
Query: 214 LEFFSKFDWDNFCLSLWGPVPISLLP 239
++K++W P P+ L P
Sbjct: 259 FNIYTKWNW---------PQPVLLKP 275
>gi|332845909|ref|XP_003315148.1| PREDICTED: PAP-associated domain-containing protein 5 [Pan
troglodytes]
Length = 586
Score = 39.7 bits (91), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
EEI+ + + P P E+ R V + +I + +P V FGS YLP DIDL
Sbjct: 89 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 147
Query: 74 GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
F W +L LE E +H V+ + +A V IIK L D+F V
Sbjct: 148 VVFG-------KWENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 198
Query: 130 DIAFNQLGGL 139
DI+FN G+
Sbjct: 199 DISFNVQNGV 208
>gi|281338901|gb|EFB14485.1| hypothetical protein PANDA_002140 [Ailuropoda melanoleuca]
Length = 632
Score = 39.7 bits (91), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
EEI+ + + P P E+ R V + +I + +P V FGS YLP DIDL
Sbjct: 135 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 193
Query: 74 GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
F W +L LE E +H V+ + +A V IIK L D+F V
Sbjct: 194 VVFG-------KWENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 244
Query: 130 DIAFNQLGGL 139
DI+FN G+
Sbjct: 245 DISFNVQNGV 254
>gi|119603155|gb|EAW82749.1| PAP associated domain containing 5, isoform CRA_b [Homo sapiens]
Length = 374
Score = 39.7 bits (91), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
EEI+ + + P P E+ R V + +I + +P V FGS YLP DIDL
Sbjct: 30 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 88
Query: 74 GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
F W +L LE E +H V+ + +A V IIK L D+F V
Sbjct: 89 VVFG-------KWENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 139
Query: 130 DIAFNQLGGL 139
DI+FN G+
Sbjct: 140 DISFNVQNGV 149
>gi|395505923|ref|XP_003757286.1| PREDICTED: PAP-associated domain-containing protein 5, partial
[Sarcophilus harrisii]
Length = 615
Score = 39.7 bits (91), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
EEI+ + + P P E+ R V + +I + +P V FGS YLP DIDL
Sbjct: 118 EEIS-DFYEYMSPRPEEEKMRMEVVNRIENVIKELWPSADVQIFGSFKTGLYLPTSDIDL 176
Query: 74 GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
F W +L LE E +H V+ + +A V IIK L D+F V
Sbjct: 177 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 227
Query: 130 DIAFNQLGGL 139
DI+FN G+
Sbjct: 228 DISFNVQNGV 237
>gi|297283970|ref|XP_002802516.1| PREDICTED: PAP-associated domain-containing protein 5 [Macaca
mulatta]
Length = 653
Score = 39.7 bits (91), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
EEI+ + + P P E+ R V + +I + +P V FGS YLP DIDL
Sbjct: 203 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 261
Query: 74 GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
F W +L LE E +H V+ + +A V IIK L D+F V
Sbjct: 262 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 312
Query: 130 DIAFNQLGGL 139
DI+FN G+
Sbjct: 313 DISFNVQNGV 322
>gi|344289184|ref|XP_003416325.1| PREDICTED: PAP-associated domain-containing protein 5-like
[Loxodonta africana]
Length = 595
Score = 39.7 bits (91), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
EEI ++ + P P E+ R V + +I + +P V FGS YLP DIDL
Sbjct: 98 EEI-SDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 156
Query: 74 GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
F W +L LE E +H V+ + +A V IIK L D+F V
Sbjct: 157 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 207
Query: 130 DIAFNQLGGL 139
DI+FN G+
Sbjct: 208 DISFNVQNGV 217
>gi|119603153|gb|EAW82747.1| PAP associated domain containing 5, isoform CRA_a [Homo sapiens]
gi|119603154|gb|EAW82748.1| PAP associated domain containing 5, isoform CRA_a [Homo sapiens]
Length = 527
Score = 39.7 bits (91), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
EEI+ + + P P E+ R V + +I + +P V FGS YLP DIDL
Sbjct: 30 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 88
Query: 74 GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
F W +L LE E +H V+ + +A V IIK L D+F V
Sbjct: 89 VVFG-------KWENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 139
Query: 130 DIAFNQLGGL 139
DI+FN G+
Sbjct: 140 DISFNVQNGV 149
>gi|410983511|ref|XP_003998082.1| PREDICTED: PAP-associated domain-containing protein 5 [Felis catus]
Length = 514
Score = 39.7 bits (91), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
EEI+ + + P P E+ R V + +I + +P V FGS YLP DIDL
Sbjct: 64 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 122
Query: 74 GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
F W +L LE E +H V+ + +A V IIK L D+F V
Sbjct: 123 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 173
Query: 130 DIAFNQLGGL 139
DI+FN G+
Sbjct: 174 DISFNVQNGV 183
>gi|297283968|ref|XP_001083145.2| PREDICTED: PAP-associated domain-containing protein 5 isoform 2
[Macaca mulatta]
Length = 700
Score = 39.7 bits (91), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
EEI+ + + P P E+ R V + +I + +P V FGS YLP DIDL
Sbjct: 203 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 261
Query: 74 GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
F W +L LE E +H V+ + +A V IIK L D+F V
Sbjct: 262 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 312
Query: 130 DIAFNQLGGL 139
DI+FN G+
Sbjct: 313 DISFNVQNGV 322
>gi|162317662|gb|AAI56330.1| PAP associated domain containing 5 [synthetic construct]
gi|162318878|gb|AAI57080.1| PAP associated domain containing 5 [synthetic construct]
Length = 442
Score = 39.7 bits (91), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 40/120 (33%), Positives = 54/120 (45%), Gaps = 14/120 (11%)
Query: 25 IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
+ P P E+ R V + +I + +P V FGS YLP DIDL F
Sbjct: 1 MSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLVVFGK----- 55
Query: 84 DTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---VDIAFNQLGGL 139
W +L LE E +H V+ + +A V IIK L D+F VDI+FN G+
Sbjct: 56 --WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKVDISFNVQNGV 111
>gi|297698707|ref|XP_002826459.1| PREDICTED: PAP-associated domain-containing protein 5 [Pongo
abelii]
Length = 588
Score = 39.3 bits (90), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 59/130 (45%), Gaps = 15/130 (11%)
Query: 15 EEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDL 73
EEI+ + + P P E+ R V + +I + +P V FGS YLP DIDL
Sbjct: 91 EEIS-DFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 149
Query: 74 GAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---V 129
F W +L LE E +H V+ + +A V IIK L D+F V
Sbjct: 150 VVFGK-------WENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKV 200
Query: 130 DIAFNQLGGL 139
DI+FN G+
Sbjct: 201 DISFNVQNGV 210
>gi|324975490|gb|ADY62673.1| PAPa [Candida orthopsilosis]
Length = 372
Score = 39.3 bits (90), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 41/189 (21%), Positives = 76/189 (40%), Gaps = 22/189 (11%)
Query: 53 QVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKD----TWAHLVRDMLENEEKNE------- 101
++FTFGS L Y P DID ++ + ++R E EE N
Sbjct: 82 KIFTFGSYRLGVYGPSSDIDALVVVPRHVTREDFFTVFEKILRGRPELEEINSVKEAFVP 141
Query: 102 --HAEFRVKEVQYIQAEVKIIKCLVD------NFVVDIAFNQLGGLCTLCFLDEVDHLIN 153
EF + + A++ I + D N + +I + L DE+ L+
Sbjct: 142 IIKLEFAGISIDLLFAKLDIPRVPHDLTLDDKNLLKNIDEKDMRALNGTRVTDEILRLVP 201
Query: 154 ENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRF 213
+ +FK ++ IK W + + +G A LV I ++ + + +L +F
Sbjct: 202 KPTVFKNALRFIKMWAQQRA-VYANVYGFPGGVAWAMLVARICQLYPNAVSA--VILEKF 258
Query: 214 LEFFSKFDW 222
+ +S+++W
Sbjct: 259 FQIYSQWNW 267
>gi|241855549|ref|XP_002416033.1| PAP-associated domain-containing protein, putative [Ixodes
scapularis]
gi|215510247|gb|EEC19700.1| PAP-associated domain-containing protein, putative [Ixodes
scapularis]
Length = 347
Score = 39.3 bits (90), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 51/199 (25%), Positives = 82/199 (41%), Gaps = 19/199 (9%)
Query: 25 IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
+QP P E R V ++ +I+ +P +V FGS YLP DID+ +TL
Sbjct: 12 MQPSPAEHEMRLGVIQRIKEVILSLWPQAEVEIFGSFRTGLYLPTSDIDVVVLGKWETLP 71
Query: 84 DTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVD---NFVVDIAFNQLGGL 139
W L + +L H + ++ + +A V I+K L D VDI+FN G+
Sbjct: 72 -MWT-LEKALL------THGIAEPRSIKVLDKASVPIVK-LTDARTTVKVDISFNMNNGV 122
Query: 140 CTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVF 199
+ + E ++L+ + + G ISSY+L+ + + +
Sbjct: 123 KSARLIKS----FKEKFPALAKLVLVLKQFLLQRDLNEVFTGGISSYSLILMTVSFLQLH 178
Query: 200 NGSFAGPLEVLYR-FLEFF 217
GP L LEFF
Sbjct: 179 PRGGDGPNPNLGTLLLEFF 197
>gi|324975502|gb|ADY62684.1| PAPa [Candida orthopsilosis]
Length = 547
Score = 39.3 bits (90), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 44/189 (23%), Positives = 78/189 (41%), Gaps = 22/189 (11%)
Query: 53 QVFTFGSVPLKTYLPDRDID-LGAFSDDQTLKD---TWAHLVRDMLENEEKNEHA----- 103
++FTFGS L Y P DID L T +D T+ ++R E +E N +
Sbjct: 81 KIFTFGSYRLGVYGPSSDIDALVVVPRHVTREDFFTTFDKIIRQRSELQEINGVSDAFVP 140
Query: 104 ----EFRVKEVQYIQAEVKIIKCLVD------NFVVDIAFNQLGGLCTLCFLDEVDHLIN 153
EF + I A + + + +D N + ++ L L DE+ L+
Sbjct: 141 IIKLEFDGISLDLIMARLNVPRVPLDMTLDDKNLLKNLDERDLRSLNGTRVTDEILQLVP 200
Query: 154 ENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRF 213
+ +FK ++ IK W E + G G A L I ++ + + ++ +F
Sbjct: 201 KPGVFKHALRCIKLWA-QERAVYGNVFGFPGGVAWAMLTARICQLYPNAVSAV--IVEKF 257
Query: 214 LEFFSKFDW 222
++K++W
Sbjct: 258 FNIYTKWNW 266
>gi|328772133|gb|EGF82172.1| hypothetical protein BATDEDRAFT_23561 [Batrachochytrium
dendrobatidis JAM81]
Length = 752
Score = 39.3 bits (90), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 32/120 (26%), Positives = 55/120 (45%), Gaps = 10/120 (8%)
Query: 25 IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
++P R A VR+++ Q + +V FGS K YLP D+D+ D L
Sbjct: 189 VRPTEAEHSLRKLTIARVRKIVKQIWADAEVHVFGSFQTKLYLPSSDVDIVVVGDSCVLP 248
Query: 84 DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL--VDNFVVDIAFNQLGGLCT 141
L + E+ + + V E + +V IIK + + +F +DI+FN + G+ +
Sbjct: 249 KCLRQLAKAF---EKADTLSRMEVIE----KTKVPIIKGVDKLTHFSLDISFNMVNGIKS 301
>gi|402890991|ref|XP_003908748.1| PREDICTED: poly(A) polymerase gamma [Papio anubis]
Length = 700
Score = 39.3 bits (90), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 38/183 (20%), Positives = 72/183 (39%), Gaps = 22/183 (12%)
Query: 46 IIQCFPCQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEF 105
++ ++FTFGS L + DID + + + + L++
Sbjct: 88 VVATVGGKIFTFGSYRLGVHTKGADIDALCVAPRHVERSDFFQSFFEKLKH--------- 138
Query: 106 RVKEVQYIQAEVKIIKCLVDNFVVDIAFNQLGGLCTLC-FLDEVDHLINENHLFKRSIIL 164
Q ++ ++ + D FV I F + G+ C DE+ HL+ F+ ++
Sbjct: 139 --------QDGIRNLRAVEDAFVPVIKF-EFDGIEVRCRVTDEILHLVPNKETFRLTLRA 189
Query: 165 IKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDN 224
+K W I G + + LV ++ + A L +++F FSK++W N
Sbjct: 190 VKLWAKRRG-IYSNMLGFLGGVSWAMLVARTCQLYPNAAASTL--VHKFFLVFSKWEWPN 246
Query: 225 FCL 227
L
Sbjct: 247 PVL 249
>gi|324975520|gb|ADY62700.1| PAPa [Candida orthopsilosis]
Length = 547
Score = 39.3 bits (90), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 44/189 (23%), Positives = 78/189 (41%), Gaps = 22/189 (11%)
Query: 53 QVFTFGSVPLKTYLPDRDID-LGAFSDDQTLKD---TWAHLVRDMLENEEKNEHA----- 103
++FTFGS L Y P DID L T +D T+ ++R E +E N +
Sbjct: 81 KIFTFGSYRLGVYGPSSDIDALVVVPRHVTREDFFTTFDKIIRQRPELQEINGVSDAFVP 140
Query: 104 ----EFRVKEVQYIQAEVKIIKCLVD------NFVVDIAFNQLGGLCTLCFLDEVDHLIN 153
EF + I A + + + +D N + ++ L L DE+ L+
Sbjct: 141 IIKLEFDGISLDLIMARLNVPRVPLDMTLDDKNLLKNLDERDLRSLNGTRVTDEILQLVP 200
Query: 154 ENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRF 213
+ +FK ++ IK W E + G G A L I ++ + + ++ +F
Sbjct: 201 KPGVFKHALRCIKLWA-QERAVYGNVFGFPGGVAWAMLTARICQLYPNAVSAV--IVEKF 257
Query: 214 LEFFSKFDW 222
++K++W
Sbjct: 258 FNIYTKWNW 266
>gi|324975487|gb|ADY62671.1| PAPa [Candida metapsilosis]
Length = 547
Score = 39.3 bits (90), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 44/189 (23%), Positives = 78/189 (41%), Gaps = 22/189 (11%)
Query: 53 QVFTFGSVPLKTYLPDRDID-LGAFSDDQTLKD---TWAHLVRDMLENEEKNEHA----- 103
++FTFGS L Y P DID L T +D T+ ++R E +E N +
Sbjct: 81 KIFTFGSYRLGVYGPSSDIDALVVVPRHVTREDFFTTFDQIIRKRPELQEINGVSDAFVP 140
Query: 104 ----EFRVKEVQYIQAEVKIIKCLVD------NFVVDIAFNQLGGLCTLCFLDEVDHLIN 153
EF + I A + + + +D N + ++ L L DE+ L+
Sbjct: 141 IIKLEFDGISLDLIMARLNVPRVPLDMTLDDKNLLKNLDERDLRSLNGTRVTDEILQLVP 200
Query: 154 ENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRF 213
+ +FK ++ IK W E + G G A L I ++ + + ++ +F
Sbjct: 201 KPGVFKHALRCIKLWA-QERAVYGNVFGFPGGVAWAMLTARICQLYPNAVSSV--IVEKF 257
Query: 214 LEFFSKFDW 222
++K++W
Sbjct: 258 FSIYTKWNW 266
>gi|224064673|ref|XP_002197521.1| PREDICTED: PAP-associated domain-containing protein 5 isoform 1
[Taeniopygia guttata]
Length = 443
Score = 39.3 bits (90), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 53/119 (44%), Gaps = 12/119 (10%)
Query: 25 IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
+ P P E R V + +I + +P V FGS YLP DIDL F +TL
Sbjct: 1 MSPRPEEETMRMEVVNRIENVIKELWPNADVQIFGSFKTGLYLPTSDIDLVVFGKWETLP 60
Query: 84 DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFV---VDIAFNQLGGL 139
W + + L + +V + +A V IIK L D+F VDI+FN G+
Sbjct: 61 -LWT--LEEALRKHNVADENSVKVLD----KATVPIIK-LTDSFTEVKVDISFNVQNGV 111
>gi|344305107|gb|EGW35339.1| hypothetical protein SPAPADRAFT_48344 [Spathaspora passalidarum
NRRL Y-27907]
Length = 615
Score = 39.3 bits (90), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 55/216 (25%), Positives = 89/216 (41%), Gaps = 26/216 (12%)
Query: 29 PFSEE--RRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDT 85
P S+E RN V ++ I + +P + FGS YLP DID+ S +T
Sbjct: 197 PSSDEIVTRNTVVNRLKTQIAKFWPGTEAHVFGSCATDLYLPGSDIDMVVIS------ET 250
Query: 86 WAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVD---NFVVDIAFNQLGGLCT 141
+ R L ++ K V+ I A+V IIK VD +D++F + G+
Sbjct: 251 GDYENRSRLYQLSSFLRSKKLAKNVEVIANAKVPIIK-FVDPESEIHIDVSFERTNGIDA 309
Query: 142 LCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNG 201
+ + LI L R ++LI R+ H G + YA + + + +
Sbjct: 310 AKRIRK--WLITTPGL--RELVLIVKQFLRSRRLNNVHVGGLGGYATIIMCYHFLRLHPK 365
Query: 202 SFAGPLEVLYR----FLEFFS----KFDWDNFCLSL 229
G +++L +EFF F +DN +SL
Sbjct: 366 VSTGSIDILDNLGVLLIEFFELYGRNFSYDNLIISL 401
>gi|49899785|gb|AAH76872.1| LOC445836 protein, partial [Xenopus laevis]
Length = 563
Score = 39.3 bits (90), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 40/120 (33%), Positives = 54/120 (45%), Gaps = 14/120 (11%)
Query: 25 IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
+ P P E+ R V + +I + +P V FGS YLP DIDL F
Sbjct: 80 MSPRPEEEKMRMEVVNRIENVIKELWPNADVQIFGSFKTGLYLPTSDIDLVVFGK----- 134
Query: 84 DTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---VDIAFNQLGGL 139
W +L LE E +H V+ + +A V IIK L D+F VDI+FN G+
Sbjct: 135 --WENLPLWTLE-EALRKHNVADENSVKVLDKATVPIIK-LTDSFTEVKVDISFNVQNGV 190
>gi|410077415|ref|XP_003956289.1| hypothetical protein KAFR_0C01610 [Kazachstania africana CBS 2517]
gi|372462873|emb|CCF57154.1| hypothetical protein KAFR_0C01610 [Kazachstania africana CBS 2517]
Length = 537
Score = 39.3 bits (90), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 54/245 (22%), Positives = 100/245 (40%), Gaps = 33/245 (13%)
Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
WL E + ++ I P E RN + +R + + +P + FGS YLP
Sbjct: 144 WLTLE--MKDFVSYISPSSTEIEDRNITISRIRDAVKELWPDADLHVFGSYSTDLYLPGS 201
Query: 70 DIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLV--DN 126
DID + ++ KD+ ++ L K + +V+ + +A V IIK +
Sbjct: 202 DIDC-VVNSERGNKDS-----KNCLYQLAKFLTTKKLATDVEVVSKARVPIIKFVEPHTG 255
Query: 127 FVVDIAFNQLGGLCTL----CFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGL 182
+D++F + GL +LD L R ++L+ + R+ H G
Sbjct: 256 IHIDVSFERTNGLEAAKLIRSWLDSTAGL--------RELVLVIKQFLHARRLNNVHTGG 307
Query: 183 ISSYALVTLVLYIFHVFNGSFAGPLE-------VLYRFLEFFSK-FDWDNFCLSLWGPVP 234
+ ++++ LV H+ ++ +L F E + K F +D+ +S+ P
Sbjct: 308 LGGFSIICLVFTFLHMHPRIITNEIDPIDNLGVLLIDFFELYGKNFGYDDVAISVLNGHP 367
Query: 235 ISLLP 239
S +P
Sbjct: 368 -SYIP 371
>gi|449282422|gb|EMC89255.1| PAP-associated domain-containing protein 5, partial [Columba livia]
Length = 501
Score = 39.3 bits (90), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 53/119 (44%), Gaps = 12/119 (10%)
Query: 25 IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
+ P P E R V + +I + +P V FGS YLP DIDL F +TL
Sbjct: 12 MSPRPEEERMRMEVVNRIENVIKELWPNADVQIFGSFKTGLYLPTSDIDLVVFGKWETLP 71
Query: 84 DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFV---VDIAFNQLGGL 139
W + + L + +V + +A V IIK L D+F VDI+FN G+
Sbjct: 72 -LWT--LEEALRKHNVADENSVKVLD----KATVPIIK-LTDSFTEVKVDISFNVQNGV 122
>gi|190345571|gb|EDK37480.2| hypothetical protein PGUG_01578 [Meyerozyma guilliermondii ATCC
6260]
Length = 588
Score = 39.3 bits (90), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 56/232 (24%), Positives = 95/232 (40%), Gaps = 26/232 (11%)
Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
WL E + + I P RN V ++ I + +P +V FGS YLP
Sbjct: 167 WLTLE--IKDFVNYISPSKLEITTRNNVIGRLKSTITKFWPDTEVHVFGSSATDLYLPGS 224
Query: 70 DIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVD--- 125
DID+ S D + R L + ++ K ++ I +A+V I+K VD
Sbjct: 225 DIDMVVISRDGDREQ------RSRLYQLSTHLRSKKLAKNIEVIAKAKVPIVK-FVDPDS 277
Query: 126 NFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISS 185
N +D++F + G+ + E L + L R ++L+ R+ H G +
Sbjct: 278 NIHIDVSFERSNGIDAAIKIREW--LASTPGL--RELVLVVKQFLRSRRLNNVHVGGLGG 333
Query: 186 YALVTLVLYIFHVF------NGSFAGPL-EVLYRFLEFFSK-FDWDNFCLSL 229
Y+ + L + + N S L +L F E + + F +DN L++
Sbjct: 334 YSTIILCYHFLKLHPRVATENMSILDNLGSLLIEFFELYGRNFSYDNLILAI 385
>gi|254579541|ref|XP_002495756.1| ZYRO0C02332p [Zygosaccharomyces rouxii]
gi|238938647|emb|CAR26823.1| ZYRO0C02332p [Zygosaccharomyces rouxii]
Length = 531
Score = 38.9 bits (89), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 54/241 (22%), Positives = 93/241 (38%), Gaps = 34/241 (14%)
Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
WL E + +A I P E RN +R + + +P + FGS YLP
Sbjct: 100 WLTLE--IRDFVAYISPSRQEIELRNKTIRTLRHAVRKLWPGADLQVFGSYATDLYLPGS 157
Query: 70 DID--LGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL--VD 125
DID + + + D+ + + L L+N + E K A V IIK +
Sbjct: 158 DIDCVINSKTGDKENRSSLYELAH-FLKNRKLATQVEVIAK------ARVPIIKFVEPTS 210
Query: 126 NFVVDIAFNQLGGLCTL----CFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHG 181
VD++F + GL +L + L R ++LI + R+ H G
Sbjct: 211 QIHVDVSFERTNGLEAAKLIRSWLQQTPGL--------RELVLIVKQFLHARRLNNVHTG 262
Query: 182 LISSYALVTLVLYIFHVFNGSFAGPLEVLYR-------FLEFFSK-FDWDNFCLSLWGPV 233
+ ++++ LV ++ G ++ Y F E + K F +D+ + +
Sbjct: 263 GLGGFSIICLVYAFLNLHPRIVTGEIDARYNLGVLLIDFFELYGKNFGYDDVAVVVADQR 322
Query: 234 P 234
P
Sbjct: 323 P 323
>gi|146419896|ref|XP_001485907.1| hypothetical protein PGUG_01578 [Meyerozyma guilliermondii ATCC
6260]
Length = 588
Score = 38.9 bits (89), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 56/232 (24%), Positives = 95/232 (40%), Gaps = 26/232 (11%)
Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
WL E + + I P RN V ++ I + +P +V FGS YLP
Sbjct: 167 WLTLE--IKDFVNYISPSKLEITTRNNVIGRLKSTITKFWPDTEVHVFGSSATDLYLPGS 224
Query: 70 DIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVD--- 125
DID+ S D + R L + ++ K ++ I +A+V I+K VD
Sbjct: 225 DIDMVVISRDGDREQ------RSRLYQLSTHLRSKKLAKNIEVIAKAKVPIVK-FVDPDS 277
Query: 126 NFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISS 185
N +D++F + G+ + E L + L R ++L+ R+ H G +
Sbjct: 278 NIHIDVSFERSNGIDAAIKIREW--LASTPGL--RELVLVVKQFLRSRRLNNVHVGGLGG 333
Query: 186 YALVTLVLYIFHVF------NGSFAGPL-EVLYRFLEFFSK-FDWDNFCLSL 229
Y+ + L + + N S L +L F E + + F +DN L++
Sbjct: 334 YSTIILCYHFLKLHPRVATENMSILDNLGSLLIEFFELYGRNFSYDNLILAI 385
>gi|444720754|gb|ELW61529.1| HEAT repeat-containing protein 3 [Tupaia chinensis]
Length = 1047
Score = 38.9 bits (89), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 40/120 (33%), Positives = 54/120 (45%), Gaps = 14/120 (11%)
Query: 25 IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
+ P P E+ R V + +I + +P V FGS YLP DIDL F
Sbjct: 657 MSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLVVFG------ 710
Query: 84 DTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVDNFV---VDIAFNQLGGL 139
W +L LE E +H V+ + +A V IIK L D+F VDI+FN G+
Sbjct: 711 -KWENLPLWTLE-EALRKHKVADEDSVKVLDKATVPIIK-LTDSFTEVKVDISFNVQNGV 767
>gi|327278603|ref|XP_003224050.1| PREDICTED: PAP-associated domain-containing protein 5-like [Anolis
carolinensis]
Length = 665
Score = 38.9 bits (89), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 53/119 (44%), Gaps = 12/119 (10%)
Query: 25 IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
+ P P + R V + +I + +P V FGS YLP DIDL F +TL
Sbjct: 180 MSPRPEEQRMRMEVVNRIENVIKELWPNADVQIFGSFKTGLYLPTSDIDLVVFGKWETLP 239
Query: 84 DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFV---VDIAFNQLGGL 139
W + + L + +V + +A V IIK L D+F VDI+FN G+
Sbjct: 240 -LWT--LEEALRKHNVADKGSVKVLD----KATVPIIK-LTDSFTEVKVDISFNVQNGV 290
>gi|449472874|ref|XP_004176276.1| PREDICTED: PAP-associated domain-containing protein 5 isoform 2
[Taeniopygia guttata]
Length = 490
Score = 38.9 bits (89), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 53/119 (44%), Gaps = 12/119 (10%)
Query: 25 IQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLK 83
+ P P E R V + +I + +P V FGS YLP DIDL F +TL
Sbjct: 1 MSPRPEEETMRMEVVNRIENVIKELWPNADVQIFGSFKTGLYLPTSDIDLVVFGKWETLP 60
Query: 84 DTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCLVDNFV---VDIAFNQLGGL 139
W + + L + +V + +A V IIK L D+F VDI+FN G+
Sbjct: 61 -LWT--LEEALRKHNVADENSVKVLD----KATVPIIK-LTDSFTEVKVDISFNVQNGV 111
>gi|389744511|gb|EIM85694.1| poly-A polymerase [Stereum hirsutum FP-91666 SS1]
Length = 628
Score = 38.9 bits (89), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 45/195 (23%), Positives = 72/195 (36%), Gaps = 24/195 (12%)
Query: 53 QVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQY 112
++FTFGS L + P DID L++ + + ML E V E
Sbjct: 84 KIFTFGSYRLGVHGPGSDIDTLCVVPKHVLREDFFDVFEQMLRETEGVTECSG-VPEAYV 142
Query: 113 IQAEVKIIKCLVDNFVVDIAF------------NQLGGLCTLCF--------LDEVDHLI 152
+VKI +D + +A N L L C DE+ L+
Sbjct: 143 PIVKVKISGIPIDFLMARLALSTIPDDLSLQDDNLLRNLDDRCIRSLGGSRVTDEILRLV 202
Query: 153 NENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYR 212
++F+ S+ IK W + I +G + A LV I ++ + AG ++ R
Sbjct: 203 PNVNVFRDSLRCIKLWAQRRA-IYSNVNGFLGGVAWAMLVARICQLYPNAIAG--AIVSR 259
Query: 213 FLEFFSKFDWDNFCL 227
F ++ W L
Sbjct: 260 FFIIMYQWSWPQPVL 274
>gi|367040851|ref|XP_003650806.1| hypothetical protein THITE_2110633 [Thielavia terrestris NRRL 8126]
gi|346998067|gb|AEO64470.1| hypothetical protein THITE_2110633 [Thielavia terrestris NRRL 8126]
Length = 759
Score = 38.9 bits (89), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 52/237 (21%), Positives = 98/237 (41%), Gaps = 22/237 (9%)
Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
WL E + + I+P F E R + +++ + F +V+ FGS P YLP
Sbjct: 392 WLHKEVV--DFYEYIKPRDFEERLRGELVEHLKTFCRKTFKDAEVYPFGSFPSGLYLPTA 449
Query: 70 DIDLGAFSDDQTLKDTWAHLVRDML---ENEEKNEHAEFRVKEVQYIQAEVKIIKCLV-- 124
D+DL SD + + L ++ KN + + + A+V ++K +
Sbjct: 450 DMDLAFISDSYAKGGVPRYGTKSFLYRFRSQLKNHRIAWEDEIELIVGAKVPLVKFIEHR 509
Query: 125 DNFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSII-LIKAWCYYESRILGGHHGLI 183
VDI+F GL + E E + +++ LIK + + +G I
Sbjct: 510 TGLKVDISFENRTGLTAI----ETFKAWREQYPGMPALVTLIKHFLLMRG-LNEPVNGGI 564
Query: 184 SSYALVTLVLYIFHVFNGSFAGPLEVLYR----FLEFF----SKFDWDNFCLSLWGP 232
++++ LV+ + + +G L+ + L FF +KF++ +S+ P
Sbjct: 565 GGFSVICLVVSMLQMMPEVQSGNLDTRHHLGQLLLHFFDLYGNKFNYQTVAISMNPP 621
>gi|238879008|gb|EEQ42646.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 603
Score = 38.9 bits (89), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 56/215 (26%), Positives = 91/215 (42%), Gaps = 24/215 (11%)
Query: 29 PFSEE--RRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDT 85
P SEE RN V + +++ I + +P + FGS YLP DID+ S +T
Sbjct: 182 PSSEEIVTRNNVISTLKKEIGKFWPGTETHVFGSCATDLYLPGSDIDMVVVS------ET 235
Query: 86 WAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCL--VDNFVVDIAFNQLGGLCTL 142
+ R L + K V+ I A+V IIK + V +D++F + GL
Sbjct: 236 GDYENRSRLYQLSTFLRTKKLAKNVEVIASAKVPIIKFVDPVSELHIDVSFERTNGLDAA 295
Query: 143 CFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHV---F 199
+ LI+ L R ++L+ R+ H G + YA + + + +
Sbjct: 296 KRIRR--WLISTPGL--RELVLVIKQFLRSRRLNNVHVGGLGGYATIIMCYHFLRLHPKL 351
Query: 200 NGSFAGPLE----VLYRFLEFFSK-FDWDNFCLSL 229
+ S L+ +L F E + + F +DN LSL
Sbjct: 352 STSSMDALDNLGVLLIEFFELYGRNFSYDNLILSL 386
>gi|167384281|ref|XP_001736885.1| PAP-associated domain-containing protein [Entamoeba dispar SAW760]
gi|165900593|gb|EDR26889.1| PAP-associated domain-containing protein, putative [Entamoeba
dispar SAW760]
Length = 400
Score = 38.9 bits (89), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 47/197 (23%), Positives = 86/197 (43%), Gaps = 35/197 (17%)
Query: 10 RWLKAEEITAELIARIQ-----PDPFSEE---RRNAVAAYVRRLIIQCFPCQVFTFGSVP 61
+WLK+ E +L +Q +P E R + Y + I++ + FGS
Sbjct: 5 QWLKSFEGELDLNQEVQLFIKFIEPNKNEYKIREELLTKYSK--ILEKEGYNIMPFGSTQ 62
Query: 62 LKTYLPDRDIDLGAFSDDQTLKDTWAHLVRD----MLENEEKNEHAEFRVKEVQYIQAEV 117
K +LP DID +++ + + +LE++++N +A V
Sbjct: 63 SKLFLPTSDIDFSVITNEYNTRKVLNSISSILSSYVLEDQKRN------------FKASV 110
Query: 118 KIIKCLVDN---FVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAW--CYYE 172
++K L D V+DI+ N G T+ F++E+ I ++ +R ++LIK+ CY
Sbjct: 111 PVLK-LTDKQTLIVLDISHNNTSGTKTVDFIEEI---IKKDDRIRRLVLLIKSILCCYDF 166
Query: 173 SRILGGHHGLISSYALV 189
+ G G S + +V
Sbjct: 167 HQPANGGLGTYSVFVMV 183
>gi|391342828|ref|XP_003745717.1| PREDICTED: PAP-associated domain-containing protein 5-like, partial
[Metaseiulus occidentalis]
Length = 512
Score = 38.5 bits (88), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 69/289 (23%), Positives = 108/289 (37%), Gaps = 46/289 (15%)
Query: 26 QPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKD 84
+P + R V V+ ++ Q +P Q FGS YLP DIDL D +TL
Sbjct: 106 KPTRTEHQVRQEVVNRVKEVVRQLWPQAQCEVFGSFCTGLYLPTSDIDLVILGDWETLPM 165
Query: 85 TWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL--VDNFVVDIAFNQLGGLCTL 142
H L E+ + +V + +A V I+K N VDI+FNQ G+ +
Sbjct: 166 FTLH---KALIQEKIASASTIKVLD----RASVPIVKFTEQSTNVKVDISFNQKNGVKSA 218
Query: 143 CFLDEVDHLINENHLFKRSIILIKAWCYYESRILGG-HHGLISSYALVTLVLYIF--HVF 199
+ + + + ++K Y R L G ISSY+L+ LV+ H+
Sbjct: 219 KLIKDFCKTFPP---LPKLVFVLKQ--YLLQRDLNEVFTGGISSYSLILLVVSFLQRHLR 273
Query: 200 NGSFAGPLE------VLYRFLEFFSK-FDWDNFCLSLWGPVPISLLPDVTAEPPRKDGGV 252
PL +L F E + + F++ + + KDGG
Sbjct: 274 IKELQSPLSNVNLGVLLLEFFELYGRYFNYAEVGIRI------------------KDGGS 315
Query: 253 LLLSKSFLDSCRYAYADFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRS 301
+ ++ A G S + DPL N++GRS
Sbjct: 316 YMSKEALQREMATAQGQTSGAGVIHD---TSSILCIEDPLTPGNDIGRS 361
>gi|324975506|gb|ADY62687.1| PAPa [Candida orthopsilosis]
Length = 552
Score = 38.5 bits (88), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 44/189 (23%), Positives = 78/189 (41%), Gaps = 22/189 (11%)
Query: 53 QVFTFGSVPLKTYLPDRDID-LGAFSDDQTLKD---TWAHLVRDMLENEEKNE------- 101
++FTFGS L Y P DID L T +D + ++R E EE N
Sbjct: 82 KIFTFGSYRLGVYGPSSDIDALVVVPRHVTREDFFTVFEKILRGRPELEEINSVKEAFVP 141
Query: 102 --HAEFRVKEVQYIQAEVKIIKCLVD------NFVVDIAFNQLGGLCTLCFLDEVDHLIN 153
EF + + A++ I + D N + +I + L DE+ L+
Sbjct: 142 IIKLEFAGISIDLLFAKLDIPRVPHDLTLDDKNLLKNIDEKDMRALNGTRVTDEILRLVP 201
Query: 154 ENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRF 213
+ +FK ++ IK W + + +G A LV I ++ + + +L +F
Sbjct: 202 KPTVFKNALRFIKMWAQQRA-VYANVYGFPGGVAWAMLVARICQLYPNAVSAV--ILEKF 258
Query: 214 LEFFSKFDW 222
+ +S+++W
Sbjct: 259 FQIYSQWNW 267
>gi|94490330|gb|ABF29402.1| nonribosomal peptide synthetase [Xylaria sp. BCC 1067]
Length = 6744
Score = 38.5 bits (88), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 23/74 (31%), Positives = 33/74 (44%), Gaps = 11/74 (14%)
Query: 179 HHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRFLEFFSKFDWDNFCLSLWGP------ 232
HH L +AL L+ + V+ GS + PL F+++ +K D+ F WG
Sbjct: 834 HHALSDGWALPLLLQQVSAVYEGSISLPLRPFNHFIDYMTKMDYKTF----WGRYFDDLQ 889
Query: 233 -VPISLLPDVTAEP 245
LLP VT P
Sbjct: 890 VAAFPLLPSVTYTP 903
>gi|68480208|ref|XP_715914.1| hypothetical protein CaO19.8059 [Candida albicans SC5314]
gi|68480321|ref|XP_715864.1| hypothetical protein CaO19.429 [Candida albicans SC5314]
gi|46437507|gb|EAK96852.1| hypothetical protein CaO19.429 [Candida albicans SC5314]
gi|46437559|gb|EAK96903.1| hypothetical protein CaO19.8059 [Candida albicans SC5314]
Length = 603
Score = 38.5 bits (88), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 56/215 (26%), Positives = 91/215 (42%), Gaps = 24/215 (11%)
Query: 29 PFSEE--RRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDT 85
P SEE RN V + +++ I + +P + FGS YLP DID+ S +T
Sbjct: 182 PSSEEIVTRNNVISTLKKEIGKFWPGTETHVFGSCATDLYLPGSDIDMVVVS------ET 235
Query: 86 WAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCL--VDNFVVDIAFNQLGGLCTL 142
+ R L + K V+ I A+V IIK + V +D++F + GL
Sbjct: 236 GDYENRSRLYQLSTFLRTKKLAKNVEVIASAKVPIIKFVDPVSELHIDVSFERTNGLDAA 295
Query: 143 CFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHV---F 199
+ LI+ L R ++L+ R+ H G + YA + + + +
Sbjct: 296 KRIRR--WLISTPGL--RELVLVIKQFLRSRRLNNVHVGGLGGYATIIMCYHFLRLHPKL 351
Query: 200 NGSFAGPLE----VLYRFLEFFSK-FDWDNFCLSL 229
+ S L+ +L F E + + F +DN LSL
Sbjct: 352 STSSMDALDNLGVLLIEFFELYGRNFSYDNLILSL 386
>gi|448519050|ref|XP_003868035.1| non-canonical poly(A) polymerase [Candida orthopsilosis Co 90-125]
gi|380352374|emb|CCG22600.1| non-canonical poly(A) polymerase [Candida orthopsilosis]
Length = 604
Score = 38.5 bits (88), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 56/232 (24%), Positives = 90/232 (38%), Gaps = 26/232 (11%)
Query: 11 WLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDR 69
WL E + ++ I P RN V ++R + +P + FGS YLP
Sbjct: 164 WLTME--MKDFVSYISPSRAEIVTRNNVINTLKREVSSFWPGTEAHVFGSCATDLYLPGS 221
Query: 70 DIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVD--- 125
DID+ S T + R L A+ K V+ I A+V IIK VD
Sbjct: 222 DIDMVVIS------STGDYENRSRLYQLSSFLRAKNLAKNVEVIASAKVPIIK-FVDPES 274
Query: 126 NFVVDIAFNQLGGLCTLCFLDEVDHLINENHLFKRSIILIKAWCYYESRILGGHHGLISS 185
N +DI+F + GL + L+ L R ++L+ ++ H G +
Sbjct: 275 NLPIDISFERTNGLDAARRIRRW--LLATPGL--RELVLVVKQFLRSRKLNNVHVGGLGG 330
Query: 186 YALVTLVLYIFH----VFNGSFAGPLEVLYRFLEFFS----KFDWDNFCLSL 229
YA + + + + + P + +EFF F +DN +S+
Sbjct: 331 YATIIMCYHFMQLHPKISTNTMNAPDNLGVLLIEFFELYGRNFSYDNLIISI 382
>gi|448531596|ref|XP_003870285.1| Pap1 poly(A) polymerase [Candida orthopsilosis Co 90-125]
gi|380354639|emb|CCG24155.1| Pap1 poly(A) polymerase [Candida orthopsilosis]
Length = 557
Score = 38.5 bits (88), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 44/189 (23%), Positives = 78/189 (41%), Gaps = 22/189 (11%)
Query: 53 QVFTFGSVPLKTYLPDRDID-LGAFSDDQTLKD---TWAHLVRDMLENEEKNE------- 101
++FTFGS L Y P DID L T +D + ++R E EE N
Sbjct: 87 KIFTFGSYRLGVYGPSSDIDALVVVPRHVTREDFFTVFEKILRGRPELEEINSVKEAFVP 146
Query: 102 --HAEFRVKEVQYIQAEVKIIKCLVD------NFVVDIAFNQLGGLCTLCFLDEVDHLIN 153
EF + + A++ I + D N + +I + L DE+ L+
Sbjct: 147 IIKLEFAGISIDLLFAKLDIPRVPHDLTLDDKNLLKNIDEKDMRALNGTRVTDEILRLVP 206
Query: 154 ENHLFKRSIILIKAWCYYESRILGGHHGLISSYALVTLVLYIFHVFNGSFAGPLEVLYRF 213
+ +FK ++ IK W + + +G A LV I ++ + + +L +F
Sbjct: 207 KPTVFKNALRFIKMWAQQRA-VYANVYGFPGGVAWAMLVARICQLYPNAVSAV--ILEKF 263
Query: 214 LEFFSKFDW 222
+ +S+++W
Sbjct: 264 FQIYSQWNW 272
>gi|336276454|ref|XP_003352980.1| hypothetical protein SMAC_03298 [Sordaria macrospora k-hell]
gi|380092465|emb|CCC09742.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 781
Score = 38.1 bits (87), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 41/143 (28%), Positives = 63/143 (44%), Gaps = 14/143 (9%)
Query: 9 GRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLP 67
G WL E I + I+P F + R V + R + FP V+ FGS P YLP
Sbjct: 446 GHWLHKEII--DFYEYIKPRAFEKRIRQEVLDEINRFVRSTFPDAGVYPFGSFPSGLYLP 503
Query: 68 DRDIDLGAFSDD-----QTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQ-AEVKIIK 121
D+D+ SD + DT + R L + K + F+ EV+ I A+V ++K
Sbjct: 504 TGDMDMVLCSDQYKRNYRAKYDTRRTMYR--LSDALKQQKLAFQ-NEVEIIAFAKVPLVK 560
Query: 122 CLVD--NFVVDIAFNQLGGLCTL 142
+ +D++F GL +
Sbjct: 561 WVDSRTGLKIDVSFENDTGLQAI 583
>gi|452823485|gb|EME30495.1| DNA polymerase sigma subunit [Galdieria sulphuraria]
Length = 417
Score = 38.1 bits (87), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 42/159 (26%), Positives = 66/159 (41%), Gaps = 16/159 (10%)
Query: 33 ERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLPDRDIDLGAFSDDQTLKDTWAHLVR 91
++R + V +I Q +P V FGS YLP DIDL S + HL+
Sbjct: 116 KQRKQLIERVTEIIRQIWPNSSVHVFGSFATNLYLPTSDIDLCILSSPENGSKRELHLLA 175
Query: 92 DMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCL--VDNFVVDIAFNQLGGLCTLCFLDEV 148
D+L + +++ V I +A V IIK DI+F + G+ + +
Sbjct: 176 DVLRRKTN------KMRRVMAIDKARVPIIKVTDRETGIQCDISFGRTNGIEN---VRHI 226
Query: 149 DHLINENHLFKRSIILIKAWCYYESRILGG-HHGLISSY 186
+ + +++IK C+ R L H G I SY
Sbjct: 227 QKYLKRYPSLRPLMMVIK--CFLHQRALNEVHEGGIGSY 263
>gi|401837953|gb|EJT41787.1| TRF5-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 642
Score = 37.7 bits (86), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 54/247 (21%), Positives = 102/247 (41%), Gaps = 33/247 (13%)
Query: 9 GRWLKAEEITAELIARIQPDPFSEERRNAVAAYVRRLIIQCFP-CQVFTFGSVPLKTYLP 67
WL +E + + I P + RN +R+ + Q + + FGS YLP
Sbjct: 173 AEWLTSE--IKDFVHYISPSKSEIKCRNRTIDKLRQAVKQLWSDADLHVFGSFATDLYLP 230
Query: 68 DRDID--LGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYIQAEVKIIKCL-- 123
DID + + D+ ++ L R + KNE R++ + ++ V IIK +
Sbjct: 231 GSDIDCVINSRHHDKEDRNYIYELARYL-----KNEGLAIRMEVI--VRTRVPIIKFIEP 283
Query: 124 VDNFVVDIAFNQLGGLCTLCFLDEVDHLINE---NHLFKRSIILIKAWCYYESRILGGHH 180
+ +D++F + GL E LI E + R ++L+ + R+ H
Sbjct: 284 LSQLHIDVSFERTNGL-------EAARLIREWLRDSPGLRELVLVIKQFLHSRRLNNVHT 336
Query: 181 GLISSYALVTLVLYIFHVFNGSFAGPLE-------VLYRFLEFFSK-FDWDNFCLSLWGP 232
G + + ++ LV ++ + ++ +L F E + K F +D+ +S+
Sbjct: 337 GGLGGFTVICLVYSFLNMHPRIKSNDIDTPDNLGVLLIDFFELYGKNFGYDDVAISISDD 396
Query: 233 VPISLLP 239
P S +P
Sbjct: 397 HP-SYIP 402
>gi|343427054|emb|CBQ70582.1| related to TRF4-topoisomerase I-related protein [Sporisorium
reilianum SRZ2]
Length = 697
Score = 37.7 bits (86), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 45/140 (32%), Positives = 61/140 (43%), Gaps = 12/140 (8%)
Query: 14 AEEITAELIA---RIQPDPFSEERRNAVAAYVRRLIIQCF-PCQVFTFGSVPLKTYLPDR 69
AE + ELIA + P E R V + R I F +V FGS K YLP
Sbjct: 98 AEALHRELIAFDDWMAPTVAEHETRCMVVELISRAIKSQFRDAEVHPFGSQETKLYLPQG 157
Query: 70 DIDLGAFSDDQTLKDTWAHLVRDMLENEEKNEHAEFRVKEVQYI-QAEVKIIKCLVD--N 126
D+DL S T + L R M ++ A +VQ I +A+V IIK +
Sbjct: 158 DLDLVVVSQSMANLRTQSAL-RTMAACLRRHNLAT----DVQVIAKAKVPIIKFVTTYAR 212
Query: 127 FVVDIAFNQLGGLCTLCFLD 146
VDI+ N GL T +++
Sbjct: 213 LKVDISLNHTNGLTTASYVN 232
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.326 0.143 0.449
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,666,672,430
Number of Sequences: 23463169
Number of extensions: 241218603
Number of successful extensions: 726104
Number of sequences better than 100.0: 348
Number of HSP's better than 100.0 without gapping: 158
Number of HSP's successfully gapped in prelim test: 190
Number of HSP's that attempted gapping in prelim test: 725528
Number of HSP's gapped (non-prelim): 495
length of query: 347
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 204
effective length of database: 9,003,962,200
effective search space: 1836808288800
effective search space used: 1836808288800
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 77 (34.3 bits)