BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 008474
(564 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|297733888|emb|CBI15135.3| unnamed protein product [Vitis vinifera]
Length = 622
Score = 785 bits (2027), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 441/564 (78%), Positives = 490/564 (86%), Gaps = 9/564 (1%)
Query: 1 MHDAEPDPGQSEGERDQSSQEVDVVDQREESEAKYTDSDEKEEYGQRVATSRRREVIESG 60
+ D +PDPG+SE ERDQSS++V+V DQREESE + T SDEKE YGQRV TSRRR+VIESG
Sbjct: 68 LRDVDPDPGESEAERDQSSEDVEVGDQREESEGRETGSDEKEGYGQRVVTSRRRDVIESG 127
Query: 61 SERSEENHYPDHEDEEVDQARSPRSPGGEEKDQTHISHSAAEIRDVFGDSDEEDVGEYAI 120
SERSEENHY DHEDEEVDQARSPRSPG EEKDQTH SHS AEIRDVFG+SD+E+ EYA+
Sbjct: 128 SERSEENHYADHEDEEVDQARSPRSPG-EEKDQTHNSHSVAEIRDVFGESDDEEAAEYAV 186
Query: 121 RNDIDQDSNRSPMEEEGSYEKSIRPEDIVPDEDMHYESEEEHVEPKHKEKPVGPPLELEI 180
NDI+Q+S RSPMEEEGSYEK++RPEDI+PDED Y SEEE+VE K KEKPVGPPL LE+
Sbjct: 187 DNDIEQNSIRSPMEEEGSYEKNLRPEDILPDEDAQYGSEEENVEIKSKEKPVGPPLVLEV 246
Query: 181 PLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVRWR 240
PLR PPA P KMNMIKVSNIMGI+PKPFDPKTYVEEDTFVTDESG+KKRIRLENNIVRWR
Sbjct: 247 PLRRPPAHPDKMNMIKVSNIMGIEPKPFDPKTYVEEDTFVTDESGSKKRIRLENNIVRWR 306
Query: 241 TVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQ 300
VKN DG TSYESNARFVRWSDGSLQL IGNEVLDI+VQDAQHDQ+HLFLRHGKGILQSQ
Sbjct: 307 NVKNPDGTTSYESNARFVRWSDGSLQLLIGNEVLDISVQDAQHDQSHLFLRHGKGILQSQ 366
Query: 301 GRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAESQNI 360
GRIL+KMRF+PSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKE+A SQ I
Sbjct: 367 GRILRKMRFMPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKEKAVSQTI 426
Query: 361 RANVLLNRKREKINRKYTQTVERRRQLSTGYLEDALEEDDETDYHDSRRSRRRFEEELEA 420
RAN LLNRKREK+NRKYTQTV+R RQLS G+LE+AL+EDDE DYHDSRR RRRFE++LE
Sbjct: 427 RANQLLNRKREKVNRKYTQTVDRGRQLSPGFLEEALDEDDEPDYHDSRRHRRRFEDDLEM 486
Query: 421 EVRAEKRIINAKKPQGHRDIPRKSSTLPAAKSSRRPVDFSESEREESEYETDGEEDERSP 480
+ RAEKRI N KK GH+DIPRKSS L A KSSRRPVDFS+SEREESEYE+DGEEDERS
Sbjct: 487 DTRAEKRIFNVKK--GHKDIPRKSS-LSAIKSSRRPVDFSDSEREESEYESDGEEDERSF 543
Query: 481 LRKRVEGPEQDYEEEEEEEEEEQEEEPDINRASDDEEEAVEPKQKARDSGSNHKRRGIES 540
RKR E E + E EE+EEE++ +E + + EEE EPKQK ++ KR+GIES
Sbjct: 544 SRKRAE--EPEEEYAEEDEEEDERDEEEAEANEESEEEVEEPKQKGKEYA---KRKGIES 598
Query: 541 DEDSPPRKMPTHRRMAVVYDSDEE 564
DEDSPPRK +HRRMA VY+SDEE
Sbjct: 599 DEDSPPRKTASHRRMAFVYESDEE 622
>gi|359491648|ref|XP_002280975.2| PREDICTED: uncharacterized protein LOC100248444 [Vitis vinifera]
Length = 623
Score = 781 bits (2017), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 438/564 (77%), Positives = 487/564 (86%), Gaps = 8/564 (1%)
Query: 1 MHDAEPDPGQSEGERDQSSQEVDVVDQREESEAKYTDSDEKEEYGQRVATSRRREVIESG 60
+ D +PDPG+SE ERDQSS++V+V DQREESE + T SDEKE YGQRV TSRRR+VIESG
Sbjct: 68 LRDVDPDPGESEAERDQSSEDVEVGDQREESEGRETGSDEKEGYGQRVVTSRRRDVIESG 127
Query: 61 SERSEENHYPDHEDEEVDQARSPRSPGGEEKDQTHISHSAAEIRDVFGDSDEEDVGEYAI 120
SERSEENHY DHEDEEVDQARSP GEEKDQTH SHS AEIRDVFG+SD+E+ EYA+
Sbjct: 128 SERSEENHYADHEDEEVDQARSPSRSPGEEKDQTHNSHSVAEIRDVFGESDDEEAAEYAV 187
Query: 121 RNDIDQDSNRSPMEEEGSYEKSIRPEDIVPDEDMHYESEEEHVEPKHKEKPVGPPLELEI 180
NDI+Q+S RSPMEEEGSYEK++RPEDI+PDED Y SEEE+VE K KEKPVGPPL LE+
Sbjct: 188 DNDIEQNSIRSPMEEEGSYEKNLRPEDILPDEDAQYGSEEENVEIKSKEKPVGPPLVLEV 247
Query: 181 PLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVRWR 240
PLR PPA P KMNMIKVSNIMGI+PKPFDPKTYVEEDTFVTDESG+KKRIRLENNIVRWR
Sbjct: 248 PLRRPPAHPDKMNMIKVSNIMGIEPKPFDPKTYVEEDTFVTDESGSKKRIRLENNIVRWR 307
Query: 241 TVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQ 300
VKN DG TSYESNARFVRWSDGSLQL IGNEVLDI+VQDAQHDQ+HLFLRHGKGILQSQ
Sbjct: 308 NVKNPDGTTSYESNARFVRWSDGSLQLLIGNEVLDISVQDAQHDQSHLFLRHGKGILQSQ 367
Query: 301 GRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAESQNI 360
GRIL+KMRF+PSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKE+A SQ I
Sbjct: 368 GRILRKMRFMPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKEKAVSQTI 427
Query: 361 RANVLLNRKREKINRKYTQTVERRRQLSTGYLEDALEEDDETDYHDSRRSRRRFEEELEA 420
RAN LLNRKREK+NRKYTQTV+R RQLS G+LE+AL+EDDE DYHDSRR RRRFE++LE
Sbjct: 428 RANQLLNRKREKVNRKYTQTVDRGRQLSPGFLEEALDEDDEPDYHDSRRHRRRFEDDLEM 487
Query: 421 EVRAEKRIINAKKPQGHRDIPRKSSTLPAAKSSRRPVDFSESEREESEYETDGEEDERSP 480
+ RAEKRI N KK GH+DIPRKSS L A KSSRRPVDFS+SEREESEYE+DGEEDERS
Sbjct: 488 DTRAEKRIFNVKK--GHKDIPRKSS-LSAIKSSRRPVDFSDSEREESEYESDGEEDERSF 544
Query: 481 LRKRVEGPEQDYEEEEEEEEEEQEEEPDINRASDDEEEAVEPKQKARDSGSNHKRRGIES 540
RKR E E + E EE+EEE++ +E + + EEE EPKQK ++ KR+GIES
Sbjct: 545 SRKRAE--EPEEEYAEEDEEEDERDEEEAEANEESEEEVEEPKQKGKEYA---KRKGIES 599
Query: 541 DEDSPPRKMPTHRRMAVVYDSDEE 564
DEDSPPRK +HRRMA VY+SDEE
Sbjct: 600 DEDSPPRKTASHRRMAFVYESDEE 623
>gi|224121454|ref|XP_002318586.1| PAF1 complex component [Populus trichocarpa]
gi|222859259|gb|EEE96806.1| PAF1 complex component [Populus trichocarpa]
Length = 620
Score = 768 bits (1983), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 423/564 (75%), Positives = 478/564 (84%), Gaps = 14/564 (2%)
Query: 1 MHDAEPDPGQSEGERDQSSQEVDVVDQREESEAKYTDSDEKEEYGQRVATSRRREVIESG 60
M + EPDPG+SEGER+QSSQEVD+ DQR+ES+AK +DSD KEEYG RVATSRRR+VIESG
Sbjct: 71 MREVEPDPGESEGEREQSSQEVDIGDQRDESDAKDSDSDAKEEYGHRVATSRRRDVIESG 130
Query: 61 SERSEENHYPDHEDEEVDQARSPRSPGGEEKDQTHISHSAAEIRDVFGDSDEEDVGEYAI 120
SE E+HY DHEDEEVDQARSP EEKDQ + SA IRDVFGDS++E+ YA+
Sbjct: 131 SE---EHHYVDHEDEEVDQARSPSKSPEEEKDQAQNAQSAPAIRDVFGDSEDEEEDAYAV 187
Query: 121 RNDIDQDSNRSPMEEEGSYEKSIRPEDIVPDEDMHYESEEEHVEPKHKEKPVGPPLELEI 180
RNDI+QDSNRSPMEEE SYEK++RPED++ DED YESEEE+ E K KEKP+GPPLE+E+
Sbjct: 188 RNDIEQDSNRSPMEEEESYEKNLRPEDMLADEDAQYESEEENREAKPKEKPLGPPLEIEM 247
Query: 181 PLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVRWR 240
P PPPADP KMNMIKVSNIMGIDP FDPKTYVEE TFVTDESGA+KRIRLENNIVRWR
Sbjct: 248 PFCPPPADPKKMNMIKVSNIMGIDPNQFDPKTYVEEKTFVTDESGAQKRIRLENNIVRWR 307
Query: 241 TVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQ 300
TVKN DG S ESNARFVRWSDGSLQL IGNEVLDI+VQDAQ DQ HLFLRH K +LQSQ
Sbjct: 308 TVKNPDGSNSVESNARFVRWSDGSLQLLIGNEVLDISVQDAQQDQTHLFLRHNKSLLQSQ 367
Query: 301 GRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAESQNI 360
GRIL+KMRF+PSSLSSNSHRLLTALVDSRH+K YKVKNCITDIDPEREKEEKE+AESQ I
Sbjct: 368 GRILRKMRFMPSSLSSNSHRLLTALVDSRHRKAYKVKNCITDIDPEREKEEKEKAESQTI 427
Query: 361 RANVLLNRKREKINRKYTQTVERRRQLSTGYLEDALEEDDETDYHDSRRSRRRFEEELEA 420
RANVLLNRKREK++RKYT VERRRQLS G+LE AL+EDDE DY++SRRS RRFEE+LE
Sbjct: 428 RANVLLNRKREKVSRKYTPNVERRRQLSPGFLEGALDEDDEPDYYNSRRS-RRFEEDLEV 486
Query: 421 EVRAEKRIINAKKPQGHRDIPRKSSTLPAAKSSRRPVDFSESEREESEYETDGEEDERSP 480
E RAEKRI+NAKK G RDIPRKS + A KSS+RPVDFS+SEREESEYETDG+E ERSP
Sbjct: 487 EARAEKRIMNAKK--GQRDIPRKSD-MSAVKSSKRPVDFSDSEREESEYETDGDEYERSP 543
Query: 481 LRKRVEGPEQDYEEEEEEEEEEQEEEPDINRASDDEEEAVEPKQKARDSGSNHKRRGIES 540
+ KRV+ PE +YEEEEE EE+ E +N AS++EE EPKQK ++ GS+ KR G ES
Sbjct: 544 VHKRVDEPEHEYEEEEEHYEEDTE----VNGASEEEE---EPKQKGKEFGSSRKRIGFES 596
Query: 541 DEDSPPRKMPTHRRMAVVYDSDEE 564
+EDSPPRK+ + RRM VVYDSDEE
Sbjct: 597 EEDSPPRKIQSGRRMTVVYDSDEE 620
>gi|147820590|emb|CAN72101.1| hypothetical protein VITISV_003891 [Vitis vinifera]
Length = 718
Score = 738 bits (1905), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 435/639 (68%), Positives = 482/639 (75%), Gaps = 83/639 (12%)
Query: 1 MHDAEPDPGQSEGERDQSSQEVDVVDQREESEAKYTDSDEKEEYGQRVATSRRREVIESG 60
+ D +PDPG+SE ERDQSS++V+V DQREESE + T SDEKE YGQRV TSRRR+VIESG
Sbjct: 88 LRDVDPDPGESEAERDQSSEDVEVGDQREESEGRETGSDEKEGYGQRVVTSRRRDVIESG 147
Query: 61 SERSEENHYPDHEDEEVDQARSPRSPGGEEKDQTHISHSAAEIRDVFGDSDEEDVGEYAI 120
SERSEENHY DHEDEEVDQARSP GEEKDQTH SHS AEIRDVFG+SD+E+ EYA+
Sbjct: 148 SERSEENHYADHEDEEVDQARSPSRSPGEEKDQTHNSHSVAEIRDVFGESDDEEAAEYAV 207
Query: 121 RNDIDQDSNRSPMEEEGSYEKSIRPEDIVPDEDMHYESEEEHVEPKHKEKPVGPPLELEI 180
NDI+Q+S RSPMEEEGSYEK++RPEDI+PDED Y SEEE+VE K KEKPVGPPL LE+
Sbjct: 208 DNDIEQNSIRSPMEEEGSYEKNLRPEDILPDEDAQYGSEEENVEIKSKEKPVGPPLVLEV 267
Query: 181 PLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVRWR 240
PLR PPA P KMNMIKVSNIMGI+PKPFDPKTYVEEDTFVTDESG+KKRIRLENNIVRWR
Sbjct: 268 PLRRPPAHPDKMNMIKVSNIMGIEPKPFDPKTYVEEDTFVTDESGSKKRIRLENNIVRWR 327
Query: 241 TVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGK------ 294
VKN DG TSYESNARFVRWSDGSLQL IGNEVLDI+VQDAQHDQ+HLFLRHGK
Sbjct: 328 NVKNPDGTTSYESNARFVRWSDGSLQLLIGNEVLDISVQDAQHDQSHLFLRHGKXNNAEL 387
Query: 295 ------------------------GILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRH 330
GILQSQGRIL+KMRF+PSSLSSNSHRLLTALVDSRH
Sbjct: 388 SICPFAXVTFMENLSLMSFMPIMQGILQSQGRILRKMRFMPSSLSSNSHRLLTALVDSRH 447
Query: 331 KKVYKVKNCITDIDPEREKEEKERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTG 390
KKVYKVKNCITDIDPEREKEEKE+A SQ IRAN LLNR REK+NRKYTQTV+R RQLS G
Sbjct: 448 KKVYKVKNCITDIDPEREKEEKEKAVSQTIRANQLLNRXREKVNRKYTQTVDRGRQLSPG 507
Query: 391 YLEDALEE---------------------------------------------DDETDYH 405
+LE+AL+E DDE DYH
Sbjct: 508 FLEEALDEKISWTXSEANSLHFMVLYVIDMDKSTXNGFSRNFNWAYSLSTXXQDDEPDYH 567
Query: 406 DSRRSRRRFEEELEAEVRAEKRIINAKKPQGHRDIPRKSSTLPAAKSSRRPVDFSESERE 465
DS R RRRFE++LE + AEKRI N KK GH+DIP KSS L A KSSRRPVDFS+SERE
Sbjct: 568 DSXRHRRRFEDDLEMDTXAEKRIFNVKK--GHKDIPXKSS-LSAIKSSRRPVDFSDSERE 624
Query: 466 ESEYETDGEEDERSPLRKRVEGPEQDYEEEEEEEEEEQEEEPDINRASDDEEEAVEPKQK 525
ESEYE+DGEEDERS RKR E E + E EE EEE++ +E + + EEE EPKQK
Sbjct: 625 ESEYESDGEEDERSFSRKRAE--EPEEEYAEEXEEEDEHDEEEAEANEESEEEVEEPKQK 682
Query: 526 ARDSGSNHKRRGIESDEDSPPRKMPTHRRMAVVYDSDEE 564
++ KR+GIESDEDSPPRK +HRRMA VYDSDEE
Sbjct: 683 GKEYA---KRKGIESDEDSPPRKSASHRRMAFVYDSDEE 718
>gi|449439305|ref|XP_004137426.1| PREDICTED: uncharacterized protein LOC101221752 [Cucumis sativus]
gi|449486962|ref|XP_004157455.1| PREDICTED: uncharacterized protein LOC101229895 [Cucumis sativus]
Length = 623
Score = 706 bits (1823), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 430/571 (75%), Positives = 483/571 (84%), Gaps = 21/571 (3%)
Query: 1 MHDAEPDPGQSEGERDQSSQEVDVVDQREE-SEAKYTDSDEKEEYGQRVATSRRREVIES 59
+ DA+PDPG+SEGERDQSSQEVDV REE SE K DSD++EEY QRV TSRR +++ES
Sbjct: 67 LQDADPDPGESEGERDQSSQEVDVGHHREEESEGKEVDSDDREEYDQRVVTSRRHDLVES 126
Query: 60 GSERSEENHYPDHEDEEVDQARSPRSPGGEEKDQTHISHSAAEIRDVFGDSDEEDVGEYA 119
SERSEENHY D+EDEEVDQ RSPR EE+D H SH AEIRDVFGDSDEE+ EYA
Sbjct: 127 ESERSEENHYIDNEDEEVDQTRSPR----EEED--HNSHPVAEIRDVFGDSDEEEEAEYA 180
Query: 120 IRNDIDQDSNRSPMEEEGSYEKSIRPEDIVPDEDMHYES------EEEHVEPKHKEKPVG 173
+ N+I QDS RSPMEEEGS+EKS RP+DI+PDED YES EEE+ E KHKEKPVG
Sbjct: 181 VGNEIKQDSARSPMEEEGSFEKSPRPDDIIPDEDARYESDARYESEEENFEVKHKEKPVG 240
Query: 174 PPLELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLE 233
PPLELEIPLR PPA P KMNMIKVSNIMGIDPKPFDPKTYVEED FVTDESGA KRIRLE
Sbjct: 241 PPLELEIPLRHPPARPEKMNMIKVSNIMGIDPKPFDPKTYVEEDIFVTDESGANKRIRLE 300
Query: 234 NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG 293
NNIVRWRTV+ DG T YESNARFVRWSDGSLQL IGNEVLDI VQDA+HDQAHLFLRHG
Sbjct: 301 NNIVRWRTVRKPDGTTKYESNARFVRWSDGSLQLLIGNEVLDINVQDAKHDQAHLFLRHG 360
Query: 294 KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKE 353
KGILQSQGR+++KMRFIPSSL+SNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKE
Sbjct: 361 KGILQSQGRLMRKMRFIPSSLTSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKE 420
Query: 354 RAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLEDALEEDDETDYHDSRRSRRR 413
RAESQ IRANVLLNRK+EK+++KY ++RRRQLS G+LEDALEE+DETDY DSRRSRRR
Sbjct: 421 RAESQTIRANVLLNRKKEKVSKKYAPVIDRRRQLSPGFLEDALEEEDETDYIDSRRSRRR 480
Query: 414 FEEELEAEVRAEKRIINAKKPQGHRDIPRKSSTLPAAKSSRRPVDFSESEREESEYETDG 473
FEE+LEAE RAEKRI+NAKK G RDIPRK S A+KSS RPVDFS+S+REESEYET+G
Sbjct: 481 FEEDLEAEARAEKRIMNAKK--GTRDIPRKPS-FHASKSSGRPVDFSDSDREESEYETEG 537
Query: 474 EEDERSPLRKRVEGPEQDYEEEEEEEEEEQEEEPDINRASDDEEEAVEPKQKARDSGSNH 533
E+D +P R + EE+ EE++E++EE ++N SD +EEA EPK K RD GS+H
Sbjct: 538 EDD--APGRVEE---PEQEYEEDGEEDDERDEEAEVNEVSDVDEEAEEPKHKVRDYGSSH 592
Query: 534 KRRGIESDEDSPPRKMPTHRRMAVVYDSDEE 564
KR+GIESDE+SPPRK+ THRRMAVVYDSD+E
Sbjct: 593 KRKGIESDEESPPRKVATHRRMAVVYDSDDE 623
>gi|255547219|ref|XP_002514667.1| RNA polymerase-associated protein LEO1, putative [Ricinus communis]
gi|223546271|gb|EEF47773.1| RNA polymerase-associated protein LEO1, putative [Ricinus communis]
Length = 629
Score = 697 bits (1798), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 418/567 (73%), Positives = 485/567 (85%), Gaps = 10/567 (1%)
Query: 1 MHDAEPDPGQSEGERDQSSQEVDVVDQREESEAKYTD--SDEKEEYGQRVATSRRREVIE 58
M + +PDP +SEGER+ SS+E+D+ D+RE SEAK D SDEKEEY RVAT R+R++IE
Sbjct: 70 MREVDPDPRESEGEREASSEEIDIGDEREVSEAKDVDVDSDEKEEYSHRVATGRKRDIIE 129
Query: 59 SGSERSEENHYPDHEDEEVDQARSP-RSPGGEEKDQTHISHSAAEIRDVFGDSDEEDVGE 117
SGSERSEE+ Y +HEDEEVDQARSP +SP E+ DQ HISHSAAEIRDVFG SD+E+
Sbjct: 130 SGSERSEEHQYDNHEDEEVDQARSPSKSPDMEKDDQHHISHSAAEIRDVFGASDDEEEPV 189
Query: 118 YAIRNDIDQDSNRSPMEEEGSYEKSIRPEDIVPDEDMHYESEEEHVEPKHKEKPVGPPLE 177
YA RN+I+Q RSPMEEEG YE+++RPED++ DED YESEEE++E K KEKPVGPPLE
Sbjct: 190 YATRNEIEQHLPRSPMEEEGRYERNLRPEDMLVDEDARYESEEENIEVKAKEKPVGPPLE 249
Query: 178 LEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIV 237
LEIP + PPADPT+M+MIK+SNIMGIDP FDPKTYVE+ TFVTDESG KRIRLENNIV
Sbjct: 250 LEIPFQHPPADPTRMHMIKISNIMGIDPNQFDPKTYVEDKTFVTDESGENKRIRLENNIV 309
Query: 238 RWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGIL 297
RWR V+N DG S ESNARFVRWSDGSLQL IGNEVL+I+V++ +HDQ HLFLRH K +L
Sbjct: 310 RWRPVRNPDGSKSVESNARFVRWSDGSLQLLIGNEVLNISVREDRHDQTHLFLRHNKSLL 369
Query: 298 QSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAES 357
QSQGRIL+KM+F+PSSLSSNSH+ LTALVDS ++KVYKVKNC+TDIDPEREKEEKERAES
Sbjct: 370 QSQGRILRKMKFMPSSLSSNSHKRLTALVDSSYRKVYKVKNCVTDIDPEREKEEKERAES 429
Query: 358 QNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLEDALEEDDETDYHDSRRSRRRFEEE 417
Q IRANVLLNRKREK+NRKYTQTVERRRQLS G+LE AL+EDDE DY+D RRSRRRFEE+
Sbjct: 430 QTIRANVLLNRKREKVNRKYTQTVERRRQLSPGFLEGALDEDDEPDYYDHRRSRRRFEED 489
Query: 418 LEAEVRAEKRIINAKKPQGHRDIPRKSSTLPAAKSSRRPVDFSESEREESEYETDGEEDE 477
LE E RAE+RI+NAKK G++DI RK S LP+AKSS+RPV+FS+SEREESEYE+DGEEDE
Sbjct: 490 LEVEARAERRIMNAKK--GNKDILRKPS-LPSAKSSKRPVNFSDSEREESEYESDGEEDE 546
Query: 478 RSPLRKRVEGPEQDYEEEEEEEEEEQEEEPDINRASDDEEEAVEPKQKARDSGSNHKRRG 537
RSP RKRVE PEQ+YEEEEEE E E E + AS++EEEA EPKQ+++D G KR G
Sbjct: 547 RSPPRKRVEEPEQEYEEEEEEREAEAEAD----GASEEEEEAEEPKQRSKDFGGIRKRMG 602
Query: 538 IESDEDSPPRKMPTHRRMAVVYDSDEE 564
IESDE+SPPRKMPTHRRMAVVYDSDEE
Sbjct: 603 IESDEESPPRKMPTHRRMAVVYDSDEE 629
>gi|356508502|ref|XP_003522995.1| PREDICTED: uncharacterized protein LOC100791454 [Glycine max]
Length = 629
Score = 680 bits (1755), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 408/568 (71%), Positives = 475/568 (83%), Gaps = 11/568 (1%)
Query: 3 DAEPDPGQSEGERDQSSQEVDVVDQREESEAKYTDSDEKEE-YGQRVATSRRRE-VIESG 60
D EPD +SEGER+QSSQEV+VV++ EESE + +DSD K+ Y Q TS+RR+ V+ESG
Sbjct: 67 DGEPD-RESEGEREQSSQEVEVVEREEESEGRDSDSDAKDGGYSQHGVTSKRRDDVVESG 125
Query: 61 SERSEENHYPDHED-EEVDQARSPRSPGGEEKDQTHISHSAAEIRDVFGDSDEED---VG 116
SERSEEN Y H+D EEVD+ARSP +EKD+T H+A EIRDVFGD D+++ +G
Sbjct: 126 SERSEENQYAHHDDGEEVDEARSPSGSPRDEKDETRDLHAAPEIRDVFGDFDDDEEEDIG 185
Query: 117 EYAIRNDIDQDSNRSPMEEEGSYEKSIRPEDIVPDEDMHYESEEEHVEPKHKEKPVGPPL 176
YA++ DI QDSNR +EEEGSY K++RPEDI+ DED YESE+E++E K KEKP+GPPL
Sbjct: 186 -YAVQQDIRQDSNRYAVEEEGSYGKNLRPEDILADEDNQYESEDENIEIKTKEKPLGPPL 244
Query: 177 ELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNI 236
ELE+PLRPPPA P KMNMIKVSNIMG+DPKPFDPKTYVEEDTFVTDESG +KRIRLENNI
Sbjct: 245 ELEVPLRPPPALPEKMNMIKVSNIMGVDPKPFDPKTYVEEDTFVTDESGTRKRIRLENNI 304
Query: 237 VRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGI 296
VRWRT +N DG TS ESNARFVRWSDGSLQL IGNEVLDI+VQDAQHDQAHLFLRHGKGI
Sbjct: 305 VRWRTTRNPDGTTSCESNARFVRWSDGSLQLLIGNEVLDISVQDAQHDQAHLFLRHGKGI 364
Query: 297 LQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAE 356
LQSQGR+L+KMRF+PSSLSSNSHRLLTALVDSRHKK +KVKNCITDIDPEREKEEKE+AE
Sbjct: 365 LQSQGRLLRKMRFMPSSLSSNSHRLLTALVDSRHKKAFKVKNCITDIDPEREKEEKEKAE 424
Query: 357 SQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLEDALEEDDETDYHDSRRSRRRFEE 416
SQNIRANVLLNRKREK+NRKYT VERRRQLS G+LEDAL+E+DE DY+D+ RS+RRFE+
Sbjct: 425 SQNIRANVLLNRKREKVNRKYTPAVERRRQLSPGFLEDALDEEDEADYYDNHRSQRRFED 484
Query: 417 ELEAEVRAEKRIINAKKPQGHRDIPRKSSTLPAAKSSRRPVDFSESEREESEYETDGEED 476
+LE E RAEKRI+NAKK +G +DIPRKSS P AKSSR P+ + + EREESEYET+ EED
Sbjct: 485 DLEVEARAEKRIMNAKKSRGPKDIPRKSS-FPPAKSSRNPMGYPDDEREESEYETEEEED 543
Query: 477 ERSPLRKRVEGPEQDYEEEEEEEEEEQEEEPDINRASDDEEEAVEPKQKARDSGSNHKRR 536
ER P RKR E + E+EEEEEE EE +N ASDD EE EPKQK+++ + KR+
Sbjct: 544 ERPPSRKRDED-TEPEYEDEEEEEEHYEEAEQVNDASDDGEEE-EPKQKSKEFRGSAKRK 601
Query: 537 GIESDEDSPPRKMPTHRRMAVVYDSDEE 564
G ESD+DS PRK+ THRRMAVVYDSDE+
Sbjct: 602 GFESDDDSSPRKITTHRRMAVVYDSDED 629
>gi|356516855|ref|XP_003527108.1| PREDICTED: uncharacterized protein LOC100810965 [Glycine max]
Length = 631
Score = 672 bits (1733), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 407/568 (71%), Positives = 465/568 (81%), Gaps = 16/568 (2%)
Query: 3 DAEPDPGQSEGERDQSSQEVDVVDQREESEAKYTDSDEKEE-YGQRVATSRRRE-VIESG 60
D EPD +SEGER+QSSQEV+VV++ EESE + +DSD K+ Y QR TS+RR+ +ESG
Sbjct: 74 DGEPD-RESEGEREQSSQEVEVVEREEESEGRDSDSDVKDGGYSQRGVTSKRRDDFVESG 132
Query: 61 SERSEENHYPDHED--EEVDQARSPRSPGGEEKDQTHISHSAAEIRDVFGDSDEEDVGE- 117
SE EN Y H+D EEVD+ARSP +EKD+T HSA EIRDVFGD D+++ E
Sbjct: 133 SE---ENQYAHHDDGEEEVDEARSPSGSPRDEKDETRDLHSAPEIRDVFGDFDDDEEEEM 189
Query: 118 -YAIRNDIDQDSNRSPMEEEGSYEKSIRPEDIVPDEDMHYESEEEHVEPKHKEKPVGPPL 176
YAI+ DI+QDSNR PMEEEGSY K++RPEDI+ DED YESEEE++E K KEKP+GPPL
Sbjct: 190 GYAIQQDIEQDSNRYPMEEEGSYGKNLRPEDILADEDHQYESEEENIEIKTKEKPLGPPL 249
Query: 177 ELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNI 236
ELE+PLRPPPA P KMNMIKVSNIMG+DPKPFDPKTYVEEDTFVTDESG +KRIRLENNI
Sbjct: 250 ELEVPLRPPPALPEKMNMIKVSNIMGVDPKPFDPKTYVEEDTFVTDESGTRKRIRLENNI 309
Query: 237 VRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGI 296
VRWRT KN DG TS ESNARFVRWSDGSLQL IGNEVLDI+VQDAQHDQAHLFLRHGKGI
Sbjct: 310 VRWRTTKNPDGTTSCESNARFVRWSDGSLQLLIGNEVLDISVQDAQHDQAHLFLRHGKGI 369
Query: 297 LQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAE 356
LQSQGR+L+KMRF+PSSLSSNSHRLLTALVDSRHKKV+KVKNCITDIDPEREKEEKE+AE
Sbjct: 370 LQSQGRLLRKMRFMPSSLSSNSHRLLTALVDSRHKKVFKVKNCITDIDPEREKEEKEKAE 429
Query: 357 SQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLEDALEEDDETDYHDSRRSRRRFEE 416
SQNIRANVLLNRKREK+NRKYT VERRRQLS G+LEDAL+E+DE DY+D+RRS+ RFE+
Sbjct: 430 SQNIRANVLLNRKREKVNRKYTPAVERRRQLSPGFLEDALDEEDEADYYDNRRSQHRFED 489
Query: 417 ELEAEVRAEKRIINAKKPQGHRDIPRKSSTLPAAKSSRRPVDFSESEREESEYETDGEED 476
+LEAE RAEKRI+NAKK G +DIPRKSS P AKSSR P+ + + + E EED
Sbjct: 490 DLEAEARAEKRIMNAKK--GPKDIPRKSS-FPPAKSSRNPMGYPD-DEREESEYETEEED 545
Query: 477 ERSPLRKRVEGPEQDYEEEEEEEEEEQEEEPDINRASDDEEEAVEPKQKARDSGSNHKRR 536
ER P RKR E + E+EEEEEE EE +N AS DEEE EPKQK ++ + KR+
Sbjct: 546 ERPPSRKRDED-TEPEYEDEEEEEEHYEEAEQVNDAS-DEEEEEEPKQKNKEFRGSAKRK 603
Query: 537 GIESDEDSPPRKMPTHRRMAVVYDSDEE 564
G ESDEDSPPRK THRRMAVVYDSDE+
Sbjct: 604 GFESDEDSPPRKTTTHRRMAVVYDSDED 631
>gi|297797071|ref|XP_002866420.1| hypothetical protein ARALYDRAFT_358320 [Arabidopsis lyrata subsp.
lyrata]
gi|297312255|gb|EFH42679.1| hypothetical protein ARALYDRAFT_358320 [Arabidopsis lyrata subsp.
lyrata]
Length = 720
Score = 602 bits (1552), Expect = e-169, Method: Compositional matrix adjust.
Identities = 375/613 (61%), Positives = 456/613 (74%), Gaps = 58/613 (9%)
Query: 3 DAEPDPGQSEGERDQSSQEVDVVDQREESEAKYTDSDEKEE-YGQRVATSRRREVIESGS 61
D E DPG+SEGER+QSSQEVDV D +EESEA+ +DSD KEE +G RVA +RR++V+ESGS
Sbjct: 115 DVELDPGESEGEREQSSQEVDVDDPQEESEARDSDSDNKEEEHGGRVAKNRRQDVVESGS 174
Query: 62 ERSEENHYPDHEDEEVDQARSPRSPG-----------------------GEEKDQTHISH 98
ERS E Y + EDEEVDQ RSPR G EEK++T ++
Sbjct: 175 ERSGEKQY-ESEDEEVDQTRSPRFLGCSIFLMDHFVSSTWKNAMVGTSPSEEKEETQVAQ 233
Query: 99 SAAEIRDVFGDSDEEDVGEYAIRNDID------------------QDSNRSPMEEEGSYE 140
S IR+VFG SD+ED EY +RND++ ++ +RSP+E+E E
Sbjct: 234 SDVNIRNVFGSSDDEDAEEY-VRNDVEHFTEYLFPFQCLLPLRNHRNVHRSPIEDEEGSE 292
Query: 141 KSIRPEDIV-----PDEDMHYESEEEHVEPKHKEKPVGPPLELEIPLRPPPADPTKMNMI 195
K +RP+D+V P+ED YESE EHVE +++EKPVGPPLE+E+P RPPP DP KMNMI
Sbjct: 293 KDLRPDDMVLDDMVPEEDPQYESEAEHVEARYREKPVGPPLEVEVPFRPPPGDPVKMNMI 352
Query: 196 KVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGMTSYESNA 255
KVSNIMGIDPKPFD KT+VEED FVTDE GAKKRIRLENNIVR R VK++DG T ESNA
Sbjct: 353 KVSNIMGIDPKPFDAKTFVEEDRFVTDEPGAKKRIRLENNIVRHRFVKSRDGKTYSESNA 412
Query: 256 RFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRFIPSSLS 315
RFVRWSDGSLQL IGNEVLDIT QDA+ DQ HLF++H KGILQSQGRILKKMRFIPSSL+
Sbjct: 413 RFVRWSDGSLQLLIGNEVLDITEQDAKQDQNHLFIKHEKGILQSQGRILKKMRFIPSSLT 472
Query: 316 SNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAESQNIRANVLLNRKREKINR 375
SNSHRLLTALVDSRHKK YKVKNC+TDIDPEREKE++E+AESQN++A+ L++ REKI R
Sbjct: 473 SNSHRLLTALVDSRHKKAYKVKNCVTDIDPEREKEKREKAESQNLKASTKLSQAREKIKR 532
Query: 376 KYTQTVERRRQLSTGYLEDALEEDDETDYHDSRRSRRRFEEELEAEVRAEKRIINAKKPQ 435
KYT E RRQLSTGYLEDAL+EDDETD++ S RS R +EE+LEAE + E+RI+NAKK
Sbjct: 533 KYTLPAE-RRQLSTGYLEDALDEDDETDHYGSHRSNRGYEEDLEAEAQRERRILNAKK-- 589
Query: 436 GHRDIPRKSSTLPAAKSSRRPVDFSESEREESEYETDGEEDERSPLRKRVEGPEQDYEEE 495
H+ IP +SS + +A+ SRR +++SESEREESEYET+ EE+E+SP R G E + E E
Sbjct: 590 SHKGIPGRSS-MTSARPSRRQMEYSESEREESEYETEEEEEEKSP-PPRGRGKESEDEYE 647
Query: 496 EEEEEEEQEEEPDINRAS---DDEEEAVEPKQKARDSGSNHKRRGIESD-EDSPPRKMPT 551
E+ EE+E+E NR S ++EEEA + + GS KR+GIESD E+SP RK PT
Sbjct: 648 EDAEEDEEEGGGKSNRYSDEDEEEEEAAGVRAEKDHRGSGRKRKGIESDEEESPQRKAPT 707
Query: 552 HRRMAVVYDSDEE 564
HRR AV+ DSDE+
Sbjct: 708 HRRKAVIDDSDED 720
>gi|30697488|ref|NP_200923.2| leo1-like family protein [Arabidopsis thaliana]
gi|332010045|gb|AED97428.1| leo1-like family protein [Arabidopsis thaliana]
Length = 623
Score = 582 bits (1501), Expect = e-163, Method: Compositional matrix adjust.
Identities = 360/570 (63%), Positives = 445/570 (78%), Gaps = 26/570 (4%)
Query: 3 DAEPDPGQSEGERDQSSQEVDVVDQREESEAKYTDSDEKEE-YGQRVATSRRREVIESGS 61
D E DPG+SEGER+QSSQE D +EESEA+ +DSD KEE +G RVA RR+EV+ESGS
Sbjct: 72 DVELDPGESEGEREQSSQEAD---PQEESEARDSDSDNKEEEHGGRVAKKRRQEVVESGS 128
Query: 62 ERSEENHYPDHEDEEVDQARSPRSPGGEEKDQTHISHSAAEIRDVFGDSDEEDVGEYAIR 121
ERS E HY + EDEEVDQ RSPRSP EEK++ ++ S IR+VFG SD+ED EY +R
Sbjct: 129 ERSGEKHY-ESEDEEVDQTRSPRSPS-EEKEEVQVAQSDVNIRNVFGSSDDEDAEEY-VR 185
Query: 122 NDIDQDSNRSPMEEEGSYEKSIRPED-----IVPDEDMHYESEEEHVEPKHKEKPVGPPL 176
ND++QD +RSP+E+E EK +RP+D I+P+ED YESE EHVE +++E+PVGPPL
Sbjct: 186 NDVEQDEHRSPIEDEEGSEKDLRPDDMVLDDIIPEEDPQYESEAEHVEARYRERPVGPPL 245
Query: 177 ELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNI 236
E+E+P RPPP DP KMNMIKVSNIMGIDPKPFD KT+VEEDTF+TDE GAK RIRL+NNI
Sbjct: 246 EVEVPFRPPPGDPVKMNMIKVSNIMGIDPKPFDAKTFVEEDTFMTDEPGAKNRIRLDNNI 305
Query: 237 VRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGI 296
VR R VK++DG T ESNARFVRWSDGSLQL IGNEVL+IT QDA+ DQ HLF++H KGI
Sbjct: 306 VRHRFVKSRDGKTYSESNARFVRWSDGSLQLLIGNEVLNITEQDAKEDQNHLFIKHEKGI 365
Query: 297 LQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAE 356
LQSQGRILKKMRF PSSL+SNSHRLLTA+V+SR KK +KVKNC+TDIDPEREKE++E+AE
Sbjct: 366 LQSQGRILKKMRFTPSSLTSNSHRLLTAIVESRQKKAFKVKNCVTDIDPEREKEKREKAE 425
Query: 357 SQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLEDALEEDDETDYHDSRRSRRRFEE 416
SQN++A+ L++ REKI RKY VE RRQLSTGY ED + + RS R +EE
Sbjct: 426 SQNLKASTKLSQAREKIKRKYPLPVE-RRQLSTGY-----LEDALDEDDEDYRSNRGYEE 479
Query: 417 ELEAEVRAEKRIINAKKPQGHRDIPRKSSTLPAAKSSRRPVDFSESEREESEYETDGEED 476
+LEAE + E+RI+NAKK H+ IP +SS + +A+ SRR +++SESEREESEYET+ EE+
Sbjct: 480 DLEAEAQRERRILNAKK--SHKGIPGRSS-MTSARPSRRQMEYSESEREESEYETEEEEE 536
Query: 477 ERSPLRKRVEGPEQDYEEEEEEEEEEQEEEPDINRASDDEEEAVEPKQKARDS-GSNHKR 535
E+SP R R + E +YEE+ EE+EEE+ + NR SD++EE E + +D GS KR
Sbjct: 537 EKSPARGRGKDSEDEYEEDAEEDEEERGKS---NRYSDEDEEEEEGGRAEKDHRGSGRKR 593
Query: 536 RGIESD-EDSPPRKMPTHRRMAVVYDSDEE 564
+GIESD E+SPPRK PTHRR AV+ DSDE+
Sbjct: 594 KGIESDEEESPPRKAPTHRRKAVIDDSDED 623
>gi|30697484|ref|NP_851237.1| leo1-like family protein [Arabidopsis thaliana]
gi|9759461|dbj|BAB10377.1| unnamed protein product [Arabidopsis thaliana]
gi|25083387|gb|AAN72069.1| putative protein [Arabidopsis thaliana]
gi|30725446|gb|AAP37745.1| At5g61156 [Arabidopsis thaliana]
gi|332010044|gb|AED97427.1| leo1-like family protein [Arabidopsis thaliana]
Length = 625
Score = 582 bits (1499), Expect = e-163, Method: Compositional matrix adjust.
Identities = 355/569 (62%), Positives = 441/569 (77%), Gaps = 22/569 (3%)
Query: 3 DAEPDPGQSEGERDQSSQEVDVVDQREESEAKYTDSDEKEE-YGQRVATSRRREVIESGS 61
D E DPG+SEGER+QSSQE D +EESEA+ +DSD KEE +G RVA RR+EV+ESGS
Sbjct: 72 DVELDPGESEGEREQSSQEAD---PQEESEARDSDSDNKEEEHGGRVAKKRRQEVVESGS 128
Query: 62 ERSEENHYPDHEDEEVDQARSPRSPGGEEKDQTHISHSAAEIRDVFGDSDEEDVGEYAIR 121
ERS E HY + EDEEVDQ RSPRSP EEK++ ++ S IR+VFG SD+ED EY +R
Sbjct: 129 ERSGEKHY-ESEDEEVDQTRSPRSPS-EEKEEVQVAQSDVNIRNVFGSSDDEDAEEY-VR 185
Query: 122 NDIDQDSNRSPMEEEGSYEKSIRPED-----IVPDEDMHYESEEEHVEPKHKEKPVGPPL 176
ND++QD +RSP+E+E EK +RP+D I+P+ED YESE EHVE +++E+PVGPPL
Sbjct: 186 NDVEQDEHRSPIEDEEGSEKDLRPDDMVLDDIIPEEDPQYESEAEHVEARYRERPVGPPL 245
Query: 177 ELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNI 236
E+E+P RPPP DP KMNMIKVSNIMGIDPKPFD KT+VEEDTF+TDE GAK RIRL+NNI
Sbjct: 246 EVEVPFRPPPGDPVKMNMIKVSNIMGIDPKPFDAKTFVEEDTFMTDEPGAKNRIRLDNNI 305
Query: 237 VRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGI 296
VR R VK++DG T ESNARFVRWSDGSLQL IGNEVL+IT QDA+ DQ HLF++H KGI
Sbjct: 306 VRHRFVKSRDGKTYSESNARFVRWSDGSLQLLIGNEVLNITEQDAKEDQNHLFIKHEKGI 365
Query: 297 LQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAE 356
LQSQGRILKKMRF PSSL+SNSHRLLTA+V+SR KK +KVKNC+TDIDPEREKE++E+AE
Sbjct: 366 LQSQGRILKKMRFTPSSLTSNSHRLLTAIVESRQKKAFKVKNCVTDIDPEREKEKREKAE 425
Query: 357 SQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLEDALEEDDETDYHDSRRSRRRFEE 416
SQN++A+ L++ REKI RKY VE RRQLSTGY ED + + RS R +EE
Sbjct: 426 SQNLKASTKLSQAREKIKRKYPLPVE-RRQLSTGY-----LEDALDEDDEDYRSNRGYEE 479
Query: 417 ELEAEVRAEKRIINAKKPQGHRDIPRKSSTLPAAKSSRRPVDFSESEREESEYETDGEED 476
+LEAE + E+RI+NAKK H+ IP +SS + +A+ SRR +++SESEREESEYET+ EE+
Sbjct: 480 DLEAEAQRERRILNAKK--SHKGIPGRSS-MTSARPSRRQMEYSESEREESEYETEEEEE 536
Query: 477 ERSPLRKRVEGPEQDYEEEEEEEEEEQEEEPDINRASDDEEEAVEPKQKARDSGSNHKRR 536
E+SP R R + E +YEE+ EE+EEE+ + + ++EEE + + GS KR+
Sbjct: 537 EKSPARGRGKDSEDEYEEDAEEDEEERGKSNRYSDEDEEEEEVAGGRAEKDHRGSGRKRK 596
Query: 537 GIESD-EDSPPRKMPTHRRMAVVYDSDEE 564
GIESD E+SPPRK PTHRR AV+ DSDE+
Sbjct: 597 GIESDEEESPPRKAPTHRRKAVIDDSDED 625
>gi|21929715|gb|AAM81969.1| VERNALIZATION INDEPENDENCE 4 [Arabidopsis thaliana]
Length = 625
Score = 574 bits (1479), Expect = e-161, Method: Compositional matrix adjust.
Identities = 351/569 (61%), Positives = 436/569 (76%), Gaps = 22/569 (3%)
Query: 3 DAEPDPGQSEGERDQSSQEVDVVDQREESEAKYTDSDEKEE-YGQRVATSRRREVIESGS 61
D E DPG+SEGER+QSSQE D +EESEA+ +DSD KEE +G RVA RR+EV+ESGS
Sbjct: 72 DVELDPGESEGEREQSSQEAD---PQEESEARDSDSDNKEEEHGGRVAKKRRQEVVESGS 128
Query: 62 ERSEENHYPDHEDEEVDQARSPRSPGGEEKDQTHISHSAAEIRDVFGDSDEEDVGEYAIR 121
RS E HY + EDEEVDQ RSPRSP EEK++ ++ S IR+VFG SD+ED EY +R
Sbjct: 129 XRSGEKHY-ESEDEEVDQTRSPRSPS-EEKEEVQVAQSDVNIRNVFGSSDDEDAEEY-VR 185
Query: 122 NDIDQDSNRSPMEEEGSYEKSIRPED-----IVPDEDMHYESEEEHVEPKHKEKPVGPPL 176
ND++QD +RSP+E+E EK +RP+D I+P+ED YESE EHVE +++E+PVGPPL
Sbjct: 186 NDVEQDEHRSPIEDEEGSEKHLRPDDMVLDDIIPEEDPQYESEAEHVEARYRERPVGPPL 245
Query: 177 ELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNI 236
E+E+P RPPP DP KMNMIKVSNIMGIDPKPFD KT+VEEDTF+TDE GAK RIRL+NNI
Sbjct: 246 EVEVPFRPPPGDPVKMNMIKVSNIMGIDPKPFDAKTFVEEDTFMTDEPGAKNRIRLDNNI 305
Query: 237 VRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGI 296
VR R VK++DG ESNARFVRWSDGSLQL IGNEVL+IT QDA+ DQ HLF++H KGI
Sbjct: 306 VRHRFVKSRDGKPYSESNARFVRWSDGSLQLLIGNEVLNITEQDAKEDQNHLFIKHEKGI 365
Query: 297 LQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAE 356
LQSQGRILKKMRF PSSL+SNSHRLLTA+V+SR KK +KVKNC+TDIDPEREKE++E+AE
Sbjct: 366 LQSQGRILKKMRFTPSSLTSNSHRLLTAIVESRQKKAFKVKNCVTDIDPEREKEKREKAE 425
Query: 357 SQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLEDALEEDDETDYHDSRRSRRRFEE 416
SQN++A+ L++ REKI RKY VE RRQLSTGY D + + RS +EE
Sbjct: 426 SQNLKASTKLSQAREKIKRKYPLPVE-RRQLSTGY-----LXDALDEDXEDXRSTXGYEE 479
Query: 417 ELEAEVRAEKRIINAKKPQGHRDIPRKSSTLPAAKSSRRPVDFSESEREESEYETDGEED 476
LEAE + E+RI+NAKK H+ IP +SS + +A+ SRR +++SESEREESEYET+ EE+
Sbjct: 480 XLEAEAQRERRILNAKK--XHKGIPGRSS-MTSARPSRRQMEYSESEREESEYETEEEEE 536
Query: 477 ERSPLRKRVEGPEQDYEEEEEEEEEEQEEEPDINRASDDEEEAVEPKQKARDSGSNHKRR 536
E+SP R R + E +YEE+ EE+EEE+ + + ++EEE + + GS KR+
Sbjct: 537 EKSPARGRGKDSEDEYEEDAEEDEEERGKSNRYSDEDEEEEEVAGGRAEKDHRGSGRKRK 596
Query: 537 GIESD-EDSPPRKMPTHRRMAVVYDSDEE 564
GIESD E+SPPRK PTHRR AV+ DSDE+
Sbjct: 597 GIESDEEESPPRKAPTHRRKAVIDDSDED 625
>gi|414592102|tpg|DAA42673.1| TPA: hypothetical protein ZEAMMB73_653521 [Zea mays]
Length = 678
Score = 558 bits (1437), Expect = e-156, Method: Compositional matrix adjust.
Identities = 339/575 (58%), Positives = 413/575 (71%), Gaps = 30/575 (5%)
Query: 2 HDAEPDPGQSEGERDQSSQEVDVVDQREESEAKYTDSDEKEEYGQRVATSRRREVIESGS 61
H A+ D G+S+G++ QSS E ++ DQR E +A+ DS E E Y QR + RRR VI S S
Sbjct: 121 HGADLDQGESDGDKVQSSPERELDDQRMEPDARGMDS-EDEGYQQRTVSGRRRGVIASES 179
Query: 62 ERSEENHYPD--HEDEEVDQARSPRSPGGEEKDQTHISHSAAEIRDVFGDSDEEDVGEYA 119
E SE+++Y EDEE Q R SP EE+D H +RDVFGDSDE++ Y
Sbjct: 180 EGSEDSYYAAGAQEDEEPRQTRKQSSPLEEERD-----HEV--VRDVFGDSDEDEPAPYR 232
Query: 120 IRNDIDQDSNRSPMEEEGSYEKSIRPEDIVPDEDMHYESEEEHVEPKHKEKPVGPPLELE 179
+R++ID+DS+RS ME+EG YEK ++P+D+V DEDM YES++ E K KEKPVGPPL+L
Sbjct: 233 VRHEIDEDSHRSHMEDEGHYEKDMQPDDVVADEDMRYESDDNR-ELKPKEKPVGPPLDLV 291
Query: 180 IPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVRW 239
+P + PPA P KMN+IKVSNIMGIDPKPFDPKTYVEED FVTDESG KKRIRLE+NIVRW
Sbjct: 292 VPFKQPPARPDKMNVIKVSNIMGIDPKPFDPKTYVEEDVFVTDESGTKKRIRLEDNIVRW 351
Query: 240 RTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQS 299
RT++N DG TS ESNARFV+W DG++QL IGNEVLDI+V +A HDQ+HLFLRHGKGILQS
Sbjct: 352 RTIRNADGTTSCESNARFVKWKDGTMQLLIGNEVLDISVHEAHHDQSHLFLRHGKGILQS 411
Query: 300 QGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAESQN 359
QGR+L KMRF+PSSLSS SHRLLTALVDS++KK K++ DPE+ K EKER QN
Sbjct: 412 QGRLLHKMRFMPSSLSSKSHRLLTALVDSQNKKTVKMQKWFETKDPEKAKMEKERVLGQN 471
Query: 360 IRANVLLNRKREKINRKYTQTVERRRQLSTGYLEDALEEDDETDYHDSRR--SRRRFEEE 417
IRA+ +L RKREK+NRKYTQ RRQLS G+LEDAL+EDDE Y DSRR RRRFE+E
Sbjct: 472 IRAHSILQRKREKVNRKYTQPARPRRQLSPGFLEDALDEDDEPGY-DSRRMPGRRRFEDE 530
Query: 418 LEAEVRAEKRIINAKKPQGHRDIPRKSSTLPAAKSSRRPVDFSESEREESEYETDGEEDE 477
LEAE AE+RIINAKK R++P K PA R+ ++S SEREESEYETDGE+ E
Sbjct: 531 LEAEALAERRIINAKKSNMSRNVPHKPPYPPARPPRRQADEYSGSEREESEYETDGEDIE 590
Query: 478 RSPLRKRVEGPEQDYEEEEEEEEEEQEEEPDINRASDDEEEAVEPKQKARDSGSNHKRRG 537
S R R E ++ EEEE EE+ E+EE P SD+E EA PK+K G +R
Sbjct: 591 HSLPRGRRE--DELDEEEEYEEDVEEEEAP----LSDEEMEA--PKRKRESGGGGQRREE 642
Query: 538 IES------DEDSPPRKMP--THRRMAVVYDSDEE 564
+ S +DSPPRK P +HRR AVV+D +E
Sbjct: 643 LVSEEDDEDHDDSPPRKQPAVSHRRKAVVFDDSDE 677
>gi|222422855|dbj|BAH19414.1| AT5G61150 [Arabidopsis thaliana]
Length = 531
Score = 552 bits (1422), Expect = e-154, Method: Compositional matrix adjust.
Identities = 298/459 (64%), Positives = 363/459 (79%), Gaps = 21/459 (4%)
Query: 3 DAEPDPGQSEGERDQSSQEVDVVDQREESEAKYTDSDEKEE-YGQRVATSRRREVIESGS 61
D E DPG+SEGER+QSSQE D +EESEA+ +DSD KEE +G RVA RR+EV+ESGS
Sbjct: 72 DVELDPGESEGEREQSSQEAD---PQEESEARDSDSDNKEEEHGGRVAKKRRQEVVESGS 128
Query: 62 ERSEENHYPDHEDEEVDQARSPRSPGGEEKDQTHISHSAAEIRDVFGDSDEEDVGEYAIR 121
ERS E HY + EDEEVDQ RSPRSP EEK++ ++ S IR+VFG SD+ED EY +R
Sbjct: 129 ERSGEKHY-ESEDEEVDQTRSPRSPS-EEKEEVQVAQSDVNIRNVFGSSDDEDAEEY-VR 185
Query: 122 NDIDQDSNRSPMEEEGSYEKSIRPED-----IVPDEDMHYESEEEHVEPKHKEKPVGPPL 176
ND++QD +RSP+E+E EK +RP+D I+P+ED YESE EHVE +++E+PVGPPL
Sbjct: 186 NDVEQDEHRSPIEDEEGSEKDLRPDDMVLDDIIPEEDPQYESEAEHVEARYRERPVGPPL 245
Query: 177 ELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNI 236
E+E+P RPPP DP KMNMIKVSNIMGIDPKPFD KT+VEEDTF+TDE GAK RIRL+NNI
Sbjct: 246 EVEVPFRPPPGDPVKMNMIKVSNIMGIDPKPFDAKTFVEEDTFMTDEPGAKNRIRLDNNI 305
Query: 237 VRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGI 296
VR R VK++DG T ESNARFVRWSDGSLQL IGNEVL+IT QDA+ DQ HLF++H KGI
Sbjct: 306 VRHRFVKSRDGKTYSESNARFVRWSDGSLQLLIGNEVLNITEQDAKEDQNHLFIKHEKGI 365
Query: 297 LQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAE 356
LQSQGRILKKMRF PSSL+SNSHRLLTA+V+SR KK +KVKNC+TDIDPEREKE++E+AE
Sbjct: 366 LQSQGRILKKMRFTPSSLTSNSHRLLTAIVESRQKKAFKVKNCVTDIDPEREKEKREKAE 425
Query: 357 SQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLEDALEEDDETDYHDSRRSRRRFEE 416
SQN++A+ L++ REKI RKY VE RRQLSTGY ED + + RS R +EE
Sbjct: 426 SQNLKASTKLSQAREKIKRKYPLPVE-RRQLSTGY-----LEDALDEDDEDYRSNRGYEE 479
Query: 417 ELEAEVRAEKRIINAKKPQGHRDIPRKSSTLPAAKSSRR 455
+LEAE + E+RI+NAKK H+ IP +SS + +A+ SRR
Sbjct: 480 DLEAEAQRERRILNAKK--SHKGIPGRSS-MTSARPSRR 515
>gi|414883382|tpg|DAA59396.1| TPA: hypothetical protein ZEAMMB73_865131 [Zea mays]
Length = 661
Score = 551 bits (1420), Expect = e-154, Method: Compositional matrix adjust.
Identities = 346/572 (60%), Positives = 418/572 (73%), Gaps = 28/572 (4%)
Query: 2 HDAEPDPGQSEGERDQSSQEVDVVDQREESEAKYTDSDEKEEYGQRVATSRRREVIESGS 61
H A+ D G+S+G++ SS E ++ DQR ES+AK DS E E Y QR +SRRR V+ S S
Sbjct: 108 HRADLDQGESDGDKVHSSPERELDDQRMESDAKGLDS-EDEGYQQRTVSSRRRGVVASES 166
Query: 62 ERSEENHYPD--HEDEEVDQARSPRSPGGEEKDQTHISHSAAEIRDVFGDSDEEDVGEYA 119
E SE+N+Y D EDEE Q R SP EE+D H +RDVFGDSDE++ Y
Sbjct: 167 EGSEDNYYADGAQEDEEPRQTRKQSSPMEEERD-----HEV--VRDVFGDSDEDEPAPYR 219
Query: 120 IRNDIDQDSNRSPMEEEGSYEKSIRPEDIVPDEDMHYESEEEHVEPKHKEKPVGPPLELE 179
R++ID++S+RSPME+E YEK ++P+D V DEDM YES+E E K KEKPVGPPL+L
Sbjct: 220 SRHEIDEESHRSPMEDEDQYEKDMQPDDTVADEDMRYESDENR-ELKPKEKPVGPPLDLV 278
Query: 180 IPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVRW 239
+P + PPA P KMN+IKVSNIMGIDPKPFDPKTYVEED FVTDESG KKRIRLE+NIVRW
Sbjct: 279 VPFKQPPARPDKMNVIKVSNIMGIDPKPFDPKTYVEEDVFVTDESGTKKRIRLEDNIVRW 338
Query: 240 RTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQS 299
RT++N DG TS ESNARFV+W DGS+QL IGNEVLDI+V +A HDQ+HLFLRHGKGILQS
Sbjct: 339 RTIRNADGTTSCESNARFVKWKDGSMQLLIGNEVLDISVHEAHHDQSHLFLRHGKGILQS 398
Query: 300 QGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAESQN 359
QGR+L KM+F+PSSL S SHRLLTALVDS++KK K++ DPE+ K EKER + QN
Sbjct: 399 QGRLLHKMQFMPSSLFSKSHRLLTALVDSQNKKTVKMQKWFETKDPEKAKMEKERIQGQN 458
Query: 360 IRANVLLNRKREKINRKYTQTVERRRQLSTGYLEDALEEDDETDYHDSRR--SRRRFEEE 417
IRA+ +L RKREK+NRKYTQ RRQLS G+LEDAL+ED+E DY SRR RRRFE+E
Sbjct: 459 IRAHSILQRKREKVNRKYTQPARPRRQLSPGFLEDALDEDEEPDY-GSRRMPGRRRFEDE 517
Query: 418 LEAEVRAEKRIINAKKPQGHRDIPRKSSTLPAAKSSRRPVDFSESEREESEYETDGEEDE 477
LEAE AE+RIINAKK R++PRK PA R+ ++SESEREESEYETDGE+ E
Sbjct: 518 LEAEALAERRIINAKKSNMSRNVPRKPPYPPARLPRRQADEYSESEREESEYETDGEDIE 577
Query: 478 RSPLRKRVEGPEQDYEEEEEEEEEEQEEEPDINRASDDEEEAVEPKQKARDS--GSNHKR 535
SP R R E + +EEEE EEE +EE P SD+E E PK+K R+S G H R
Sbjct: 578 HSPPRGR----EDELDEEEEYEEEVEEEAP----LSDEEMEMQAPKRK-RESGGGGGHMR 628
Query: 536 RGIES-DEDSPPRKMPT--HRRMAVVYDSDEE 564
+ S D++SPPRK PT HRR AVV+D +E
Sbjct: 629 EELASEDDESPPRKQPTVQHRRKAVVFDDSDE 660
>gi|414883383|tpg|DAA59397.1| TPA: hypothetical protein ZEAMMB73_865131 [Zea mays]
Length = 659
Score = 550 bits (1416), Expect = e-153, Method: Compositional matrix adjust.
Identities = 340/569 (59%), Positives = 415/569 (72%), Gaps = 24/569 (4%)
Query: 2 HDAEPDPGQSEGERDQSSQEVDVVDQREESEAKYTDSDEKEEYGQRVATSRRREVIESGS 61
H A+ D G+S+G++ SS E ++ DQR ES+AK DS E E Y QR +SRRR V+ S S
Sbjct: 108 HRADLDQGESDGDKVHSSPERELDDQRMESDAKGLDS-EDEGYQQRTVSSRRRGVVASES 166
Query: 62 ERSEENHYPD--HEDEEVDQARSPRSPGGEEKDQTHISHSAAEIRDVFGDSDEEDVGEYA 119
E SE+N+Y D EDEE Q R SP EE+D H +RDVFGDSDE++ Y
Sbjct: 167 EGSEDNYYADGAQEDEEPRQTRKQSSPMEEERD-----HEV--VRDVFGDSDEDEPAPYR 219
Query: 120 IRNDIDQDSNRSPMEEEGSYEKSIRPEDIVPDEDMHYESEEEHVEPKHKEKPVGPPLELE 179
R++ID++S+RSPME+E YEK ++P+D V DEDM YES+E E K KEKPVGPPL+L
Sbjct: 220 SRHEIDEESHRSPMEDEDQYEKDMQPDDTVADEDMRYESDENR-ELKPKEKPVGPPLDLV 278
Query: 180 IPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVRW 239
+P + PPA P KMN+IKVSNIMGIDPKPFDPKTYVEED FVTDESG KKRIRLE+NIVRW
Sbjct: 279 VPFKQPPARPDKMNVIKVSNIMGIDPKPFDPKTYVEEDVFVTDESGTKKRIRLEDNIVRW 338
Query: 240 RTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQS 299
RT++N DG TS ESNARFV+W DGS+QL IGNEVLDI+V +A HDQ+HLFLRHGKGILQS
Sbjct: 339 RTIRNADGTTSCESNARFVKWKDGSMQLLIGNEVLDISVHEAHHDQSHLFLRHGKGILQS 398
Query: 300 QGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAESQN 359
QGR+L KM+F+PSSL S SHRLLTALVDS++KK K++ DPE+ K EKER + QN
Sbjct: 399 QGRLLHKMQFMPSSLFSKSHRLLTALVDSQNKKTVKMQKWFETKDPEKAKMEKERIQGQN 458
Query: 360 IRANVLLNRKREKINRKYTQTVERRRQLSTGYLEDALEEDDETDYHDSRR--SRRRFEEE 417
IRA+ +L RKREK+NRKYTQ RRQLS G+LEDAL+ED+E DY SRR RRRFE+E
Sbjct: 459 IRAHSILQRKREKVNRKYTQPARPRRQLSPGFLEDALDEDEEPDY-GSRRMPGRRRFEDE 517
Query: 418 LEAEVRAEKRIINAKKPQGHRDIPRKSSTLPAAKSSRRPVDFSESEREESEYETDGEEDE 477
LEAE AE+RIINAKK R++PRK PA R+ ++SESEREESEYETDGE+ E
Sbjct: 518 LEAEALAERRIINAKKSNMSRNVPRKPPYPPARLPRRQADEYSESEREESEYETDGEDIE 577
Query: 478 RSPLRKRVEGPEQDYEEEEEEEEEEQEEEPDINRASDDEEEAVEPKQKARDSGSNHKRRG 537
SP R R E + +EEEE EEE +EE P SD+E EA + K+++ G + +
Sbjct: 578 HSPPRGR----EDELDEEEEYEEEVEEEAP----LSDEEMEAPKRKRESGGGGGHMREEL 629
Query: 538 IESDEDSPPRKMPT--HRRMAVVYDSDEE 564
D++SPPRK PT HRR AVV+D +E
Sbjct: 630 ASEDDESPPRKQPTVQHRRKAVVFDDSDE 658
>gi|242042718|ref|XP_002459230.1| hypothetical protein SORBIDRAFT_02g001060 [Sorghum bicolor]
gi|241922607|gb|EER95751.1| hypothetical protein SORBIDRAFT_02g001060 [Sorghum bicolor]
Length = 673
Score = 544 bits (1401), Expect = e-152, Method: Compositional matrix adjust.
Identities = 336/568 (59%), Positives = 414/568 (72%), Gaps = 28/568 (4%)
Query: 2 HDAEPDPGQSEGERDQSSQEVDVVDQREESEAKYTDSDEKEEYGQRVATSRRREVIESGS 61
H A+ D G+S+G++ QSS E ++ DQR E +A DS E E Y QR +SRRR V+ S S
Sbjct: 120 HRADLDQGESDGDKVQSSPERELDDQRMEPDAGGMDS-EDEGYQQRTVSSRRRGVVASES 178
Query: 62 ERSEENHYPD--HEDEEVDQARSPRSPGGEEKDQTHISHSAAEIRDVFGDSDEEDVGEYA 119
E SE+N+Y D EDEE Q R SP EE+D H +RDVFG+SDE++ Y
Sbjct: 179 EGSEDNYYADGAQEDEEPRQTRKQSSPMEEERD-----HEV--VRDVFGESDEDEPAPYR 231
Query: 120 IRNDIDQDSNRSPMEEEGSYEKSIRPEDIVPDEDMHYESEEEHVEPKHKEKPVGPPLELE 179
R++ID++S+RSPM++EG YEK ++P+D V DEDM YES++ E K KEKPVGPPL+L
Sbjct: 232 ARHEIDEESHRSPMDDEGQYEKDMQPDDDVADEDMRYESDDNR-ELKTKEKPVGPPLDLV 290
Query: 180 IPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVRW 239
+P + PPA P KMN+IKVSNIMGIDPKPF+PKTYVEED FVTDESG KKRIRLE+NIVRW
Sbjct: 291 VPFKQPPAQPDKMNVIKVSNIMGIDPKPFNPKTYVEEDVFVTDESGTKKRIRLEDNIVRW 350
Query: 240 RTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQS 299
RTV+N DG TS ESNARFV+W DGS+QL IGNEVLDI+V +A HDQ+HLFLRHGKGILQS
Sbjct: 351 RTVRNADGTTSCESNARFVKWKDGSMQLLIGNEVLDISVHEAHHDQSHLFLRHGKGILQS 410
Query: 300 QGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAESQN 359
QGR+L+KMRF+PSSLSS SHRLLTALVDS++KK K++ DPE+ K E+ER +SQ
Sbjct: 411 QGRLLRKMRFMPSSLSSKSHRLLTALVDSQNKKTVKMQKWFETKDPEKAKMERERNQSQT 470
Query: 360 IRANVLLNRKREKINRKYTQTVERRRQLSTGYLEDALEEDDETDYHDSRRS-RRRFEEEL 418
IRA+ +L RKREK+NRKYTQ RRQLS G+LEDAL+ED+E DY R + RRRFE+EL
Sbjct: 471 IRAHSILQRKREKVNRKYTQPARPRRQLSPGFLEDALDEDEEPDYGSRRMTGRRRFEDEL 530
Query: 419 EAEVRAEKRIINAKKPQGHRDIPRKSSTLPAAKSSRRPVDFSESEREESEYETDGEEDER 478
EAE AE+RIINAKK R++PRK PA R+ ++SESEREESEYETDGE+ E
Sbjct: 531 EAEALAERRIINAKKSNMSRNVPRKPLYPPARPPRRQADEYSESEREESEYETDGEDIEH 590
Query: 479 SPLRKRVEGPEQDYEEEEEEEEEEQEEEPDINRASDDEEEAVEPKQKARDSGSNHKRRGI 538
SP R R E + +EEEE EE+ +EE P SD+E EA PK+K G H+R +
Sbjct: 591 SPPRGR----EDELDEEEEYEEDVEEEAP----LSDEEMEA--PKRKRESVGGGHRREEL 640
Query: 539 ESD----EDSPPRKMPT--HRRMAVVYD 560
S+ +DSPPRK P HRR V++D
Sbjct: 641 VSEDDDDDDSPPRKQPAVQHRRKTVMFD 668
>gi|218187142|gb|EEC69569.1| hypothetical protein OsI_38878 [Oryza sativa Indica Group]
Length = 664
Score = 538 bits (1386), Expect = e-150, Method: Compositional matrix adjust.
Identities = 333/573 (58%), Positives = 425/573 (74%), Gaps = 31/573 (5%)
Query: 2 HDAEPDPGQSEGERDQSSQEVDVVDQREESEAKYTDSDEKEEYGQRVATSRRREVIESGS 61
H A+ D G+S+GE+ QSS E ++ D+ +++A DS E E Y QR SRRR V+ S S
Sbjct: 113 HRADLDQGESDGEKVQSSPERELSDRVMQNDAAGMDS-EDEAYQQRPVASRRRGVVASES 171
Query: 62 ERSEENHYPD--HEDEEVDQAR-SPRSPGGEEKDQTHISHSAAEIRDVFGDSDEEDVGEY 118
E SE+++Y HEDEE Q R + SP EE+DQ +RDVFG+SDE++ Y
Sbjct: 172 EGSEDDYYAGRAHEDEEPRQTRKASSSPVEEERDQE-------VVRDVFGESDEDEPAPY 224
Query: 119 AIRNDIDQDSNRSPMEEEGSYEKSIRPEDIVPDEDMHYESEEEHVEPKHKEKPVGPPLEL 178
+ +ID+DS+RSPME+EG YEK ++P+D+V DEDM YES+E E K KEKPVGPPL L
Sbjct: 225 RDQQEIDEDSHRSPMEDEGHYEKDLQPDDVVADEDMRYESDENR-ELKPKEKPVGPPLNL 283
Query: 179 EIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVR 238
+PL+ PPA P +MN+IKVSNIMGIDPKPFDPKTYVEED FVTDESG KKRIRLE+NIVR
Sbjct: 284 VVPLKQPPAQPERMNVIKVSNIMGIDPKPFDPKTYVEEDVFVTDESGTKKRIRLEDNIVR 343
Query: 239 WRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQ 298
WRTVKN +G TS ESNAR V+W DG++QL IGNEVLDI+V +A HDQ+HLFLR+GKG+LQ
Sbjct: 344 WRTVKNANGTTSCESNARIVKWKDGTMQLLIGNEVLDISVHEAHHDQSHLFLRNGKGVLQ 403
Query: 299 SQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAESQ 358
SQGR+L+KMRF+PSSLSS SHRLLTALVDS++KK K++ I DPER K+EKERA Q
Sbjct: 404 SQGRLLRKMRFMPSSLSSKSHRLLTALVDSQNKKTVKMQKWIESKDPERVKQEKERALGQ 463
Query: 359 NIRANVLLNRKREKINRKYTQTVERRRQLSTGYLEDALEEDDETDY-HDSRR--SRRRFE 415
NIRA+ +L RKREK++RKYTQ ++RRQLS G+LEDAL+ED+E D+ + SRR +R RF
Sbjct: 464 NIRAHSILQRKREKVSRKYTQPAKQRRQLSPGFLEDALDEDEEPDHQYGSRRMPARSRFG 523
Query: 416 EELEAEVRAEKRIINAKKPQGHRDIPRKSSTLPAAKSSRRPVDFSESEREESEYETDGEE 475
+ELEAE AE+RI++AKK R+IPRK S PA R+ ++SESEREESEYET+GE+
Sbjct: 524 DELEAEALAERRIVSAKKSSMGRNIPRKPS-FPARPPRRQENEYSESEREESEYETEGED 582
Query: 476 DERSPLRKRVEGPEQDYEEEEEEEEEEQEEEPDINRASDDEEEAVEPKQKARDSG---SN 532
E SP + R E+E +EE++ EE A+ +EE EPK++ R+SG ++
Sbjct: 583 IEHSPTQGR----------EDELDEEDEYEEDVEEEAAMSDEEIEEPKRR-RESGGASAS 631
Query: 533 HKRRGIESDEDSPPRK-MPTHRRMAVVYDSDEE 564
+R+ I+SD+DSPPRK HRR AVV+DSD+E
Sbjct: 632 QRRKEIDSDDDSPPRKQQAVHRRKAVVFDSDDE 664
>gi|115489280|ref|NP_001067127.1| Os12g0579000 [Oryza sativa Japonica Group]
gi|77556278|gb|ABA99074.1| Leo1, expressed [Oryza sativa Japonica Group]
gi|113649634|dbj|BAF30146.1| Os12g0579000 [Oryza sativa Japonica Group]
gi|215741260|dbj|BAG97755.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222617359|gb|EEE53491.1| hypothetical protein OsJ_36649 [Oryza sativa Japonica Group]
Length = 663
Score = 531 bits (1368), Expect = e-148, Method: Compositional matrix adjust.
Identities = 332/573 (57%), Positives = 423/573 (73%), Gaps = 32/573 (5%)
Query: 2 HDAEPDPGQSEGERDQSSQEVDVVDQREESEAKYTDSDEKEEYGQRVATSRRREVIESGS 61
H A+ D G+S+GE+ QSS ++ D+ +++A DS E E Y QR SRRR V+ S S
Sbjct: 113 HRADLDQGESDGEKVQSSPG-ELSDRVMQNDAAGMDS-EDEAYQQRPVASRRRGVVASES 170
Query: 62 ERSEENHYPD--HEDEEVDQAR-SPRSPGGEEKDQTHISHSAAEIRDVFGDSDEEDVGEY 118
E SE+++Y HEDEE Q R + SP EE+DQ +RDVFG+SDE++ Y
Sbjct: 171 EGSEDDYYAGRAHEDEEPRQTRKASSSPVEEERDQE-------VVRDVFGESDEDEPAPY 223
Query: 119 AIRNDIDQDSNRSPMEEEGSYEKSIRPEDIVPDEDMHYESEEEHVEPKHKEKPVGPPLEL 178
+ +ID+DS+RSPME+EG YEK ++P+D+V DEDM YES+E E K KEKPVGPPL L
Sbjct: 224 RDQQEIDEDSHRSPMEDEGHYEKDLQPDDVVADEDMRYESDENR-ELKPKEKPVGPPLNL 282
Query: 179 EIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVR 238
+PL+ PPA P +MN+IKVSNIMGIDPKPFDPKTYVEED FVTDESG KKRIRLE+NIVR
Sbjct: 283 VVPLKQPPAQPERMNVIKVSNIMGIDPKPFDPKTYVEEDVFVTDESGTKKRIRLEDNIVR 342
Query: 239 WRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQ 298
WRTVKN +G TS ESNAR V+W DG++QL IGNEVLDI+V +A HDQ+HLFLR+GKG+LQ
Sbjct: 343 WRTVKNANGTTSCESNARIVKWKDGTMQLLIGNEVLDISVHEAHHDQSHLFLRNGKGVLQ 402
Query: 299 SQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAESQ 358
SQGR+L+KMRF+PSSLSS SHRLLTALVDS++KK K++ I DPER K+EKERA Q
Sbjct: 403 SQGRLLRKMRFMPSSLSSKSHRLLTALVDSQNKKTVKMQKWIESKDPERVKQEKERALGQ 462
Query: 359 NIRANVLLNRKREKINRKYTQTVERRRQLSTGYLEDALEEDDETDY-HDSRR--SRRRFE 415
NIRA+ +L RKREK++RKYTQ +RRQLS G+LEDAL+ED+E D+ + SRR +R RF
Sbjct: 463 NIRAHSILQRKREKVSRKYTQPARQRRQLSPGFLEDALDEDEEPDHQYGSRRMPARSRFG 522
Query: 416 EELEAEVRAEKRIINAKKPQGHRDIPRKSSTLPAAKSSRRPVDFSESEREESEYETDGEE 475
+ELEAE AE+RI++AKK R+IPRK S PA R+ ++SESEREESEYET+GE+
Sbjct: 523 DELEAEALAERRIVSAKKSSMGRNIPRKPS-FPARPPRRQENEYSESEREESEYETEGED 581
Query: 476 DERSPLRKRVEGPEQDYEEEEEEEEEEQEEEPDINRASDDEEEAVEPKQKARDSG---SN 532
E SP + R E+E +EE++ EE A+ +EE EPK++ R+SG ++
Sbjct: 582 IEHSPTQGR----------EDELDEEDEYEEDVEEEAAMSDEEIEEPKRR-RESGGASAS 630
Query: 533 HKRRGIESDEDSPPRK-MPTHRRMAVVYDSDEE 564
+R+ I+SD+DSPPRK HRR AVV+DSD+E
Sbjct: 631 QRRKEIDSDDDSPPRKQQAVHRRKAVVFDSDDE 663
>gi|33146453|dbj|BAC79561.1| putative VERNALIZATION INDEPENDENCE 4 [Oryza sativa Japonica Group]
gi|50510031|dbj|BAD30643.1| putative VERNALIZATION INDEPENDENCE 4 [Oryza sativa Japonica Group]
gi|218198971|gb|EEC81398.1| hypothetical protein OsI_24626 [Oryza sativa Indica Group]
gi|222636312|gb|EEE66444.1| hypothetical protein OsJ_22827 [Oryza sativa Japonica Group]
Length = 667
Score = 530 bits (1365), Expect = e-148, Method: Compositional matrix adjust.
Identities = 331/573 (57%), Positives = 422/573 (73%), Gaps = 31/573 (5%)
Query: 2 HDAEPDPGQSEGERDQSSQEVDVVDQREESEAKYTDSDEKEEYGQRVATSRRREVIESGS 61
H A+ D G+S+GE+ QSS E + D+ +++A DS E Y Q S RR V+ S S
Sbjct: 116 HQADLDQGESDGEKVQSSPEREFSDRVMQNDAAGMDS-EDGGYQQWPVASGRRGVVASES 174
Query: 62 ERSEENHYPD--HEDEEVDQAR-SPRSPGGEEKDQTHISHSAAEIRDVFGDSDEEDVGEY 118
E SE+++Y HEDEE Q R +P SP EE+D H +RDVFG+SDE+ Y
Sbjct: 175 EGSEDDYYAGRGHEDEEPHQTRKTPSSPVEEERD-----HEV--VRDVFGESDEDGPAPY 227
Query: 119 AIRNDIDQDSNRSPMEEEGSYEKSIRPEDIVPDEDMHYESEEEHVEPKHKEKPVGPPLEL 178
+++ID+DS+RSPME+EG YEK ++PED+V DEDM YES+E E K KEKPVGPPL L
Sbjct: 228 RDQHEIDEDSHRSPMEDEGHYEKDLQPEDVVADEDMRYESDENR-ELKPKEKPVGPPLNL 286
Query: 179 EIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVR 238
+PL+ PPA P +MN+IKVSNIMGIDPKPFDPKTYVEED FVTDESG KKRIRLE+NIVR
Sbjct: 287 VVPLKQPPAQPDRMNVIKVSNIMGIDPKPFDPKTYVEEDVFVTDESGTKKRIRLEDNIVR 346
Query: 239 WRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQ 298
WRTVKN +G TS ESNAR V+W DG++QL IGNEVLDI+V +A HDQ+HLFLR+GKG+LQ
Sbjct: 347 WRTVKNANGTTSCESNARIVKWKDGTMQLLIGNEVLDISVHEAHHDQSHLFLRNGKGVLQ 406
Query: 299 SQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAESQ 358
SQGR+L+KMRF+PSSLSS SHRLLTALVDS++KK K++ I DPE+ K+EKERA Q
Sbjct: 407 SQGRLLRKMRFMPSSLSSKSHRLLTALVDSQNKKTVKMQKWIESKDPEKVKQEKERALGQ 466
Query: 359 NIRANVLLNRKREKINRKYTQTVERRRQLSTGYLEDALEEDDETDY-HDSRR--SRRRFE 415
NIRA+ +L RK+EK++RKYTQ +RRQLS G+LEDAL+ED+E D+ + SRR +R RFE
Sbjct: 467 NIRAHSILQRKKEKVSRKYTQPARQRRQLSPGFLEDALDEDEEPDHQYGSRRMPARSRFE 526
Query: 416 EELEAEVRAEKRIINAKKPQGHRDIPRKSSTLPAAKSSRRPVDFSESEREESEYETDGEE 475
+ELEAE AE+RI++AKK R+IPRK S PA R+ ++SESEREESEYET+GE+
Sbjct: 527 DELEAEALAERRIVSAKKSSMGRNIPRKPS-FPARPPRRQANEYSESEREESEYETEGED 585
Query: 476 DERSPLRKRVEGPEQDYEEEEEEEEEEQEEEPDINRASDDEEEAVEPKQKARDSG---SN 532
E SP + R E+E +EE++ EE A+ +EE EPK++ R+SG ++
Sbjct: 586 IEHSPTQGR----------EDELDEEDEYEEDVEEEAAMSDEEIEEPKRR-RESGGGSAS 634
Query: 533 HKRRGIESDEDSPPRK-MPTHRRMAVVYDSDEE 564
+R+ I+SD+DSPPRK HRR AVV+DSD+E
Sbjct: 635 QRRKEIDSDDDSPPRKQQAVHRRKAVVFDSDDE 667
>gi|357111781|ref|XP_003557689.1| PREDICTED: uncharacterized protein LOC100820835 [Brachypodium
distachyon]
Length = 669
Score = 520 bits (1339), Expect = e-145, Method: Compositional matrix adjust.
Identities = 324/571 (56%), Positives = 403/571 (70%), Gaps = 32/571 (5%)
Query: 2 HDAEPDPGQSEGERDQSSQEVDVVDQREESEAKYTDSDEKEEYGQR-VATSRRREVIESG 60
H A+ D G+S+ ER QSS E D+ DQ +++ + DE YGQR VA+ RRR V S
Sbjct: 116 HPADLDQGESDAERVQSSPERDLSDQVVQTDPRVDSEDEG--YGQRAVASRRRRGVAASD 173
Query: 61 SERSEENHYPDH--EDEEVDQARSPRSPGGEEKDQTHISHSAAEIRDVFGDSDEED-VGE 117
SE SE+N+Y EDEE Q R P SP EE+D H +RDVFGDSDEED
Sbjct: 174 SEGSEDNYYAGQAPEDEEA-QTRKPISPMEEERD-----HEV--VRDVFGDSDEEDEPAP 225
Query: 118 YAIRNDIDQDSNRSPMEEEGSYEKSIRPEDIVPDEDMHYESEEEHVEPKHKEKPVGPPLE 177
Y ++ID+DS+RSPME+EG YEK ++PED+V DE+M YES++ E K KEKPVGPPL
Sbjct: 226 YRAPDEIDEDSHRSPMEDEGQYEKDLQPEDVVADEEMRYESDDNR-ELKIKEKPVGPPLH 284
Query: 178 LEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIV 237
L +PL+ PPA P +MN+IKVSNIMGIDP+PFDPKTYVEED +VTDE+G KK+IRLE+NIV
Sbjct: 285 LHVPLQKPPARPERMNVIKVSNIMGIDPRPFDPKTYVEEDVYVTDETGTKKKIRLEDNIV 344
Query: 238 RWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGIL 297
RWRTVK DG TS ESNARFV+W DG++QL IGNEVLDI+V +A HDQ+HLFLR GKG+L
Sbjct: 345 RWRTVKKADGSTSIESNARFVKWKDGTMQLLIGNEVLDISVNEANHDQSHLFLRSGKGVL 404
Query: 298 QSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAES 357
QSQGR+L+KMRF+PSSLSS SHRLLTALVDS++KK K++ DPER K+EKERAE
Sbjct: 405 QSQGRLLQKMRFMPSSLSSKSHRLLTALVDSQNKKTVKMQKWFDAKDPERAKQEKERAEG 464
Query: 358 QNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLEDALEEDDETDYHDSRR---SRRRF 414
Q+IRA+ +L RKREK+NRKYTQ +RRQLS G+LEDAL+ED+ETD H S R SR RF
Sbjct: 465 QSIRAHSILQRKREKVNRKYTQPARQRRQLSPGFLEDALDEDEETDGHYSARRMTSRGRF 524
Query: 415 EEELEAEVRAEKRIINAKKPQGHRDIPRKSSTLPAAKSSRRPVDFSESEREESEYETDGE 474
E++LEAE AE+RIINAKK R PR+ S P +++ RR EESEYET+GE
Sbjct: 525 EDDLEAEALAERRIINAKKSSLSRGAPRRPS-FPPSRAPRRQEYSESEREEESEYETEGE 583
Query: 475 EDERSPLRKRVEGPEQDYEEEEEEEEEEQEEEPDINRASDDEEEAVEPKQKARDSGSNHK 534
+ E SP R E+E +EE+E EE+ ++ A+ ++E EPK+K S +
Sbjct: 584 DIEHSPTGAR---------EDELDEEDEYEEDLEVEEAAMSDDEIQEPKKKRDSVASGSQ 634
Query: 535 RRGIESDEDSPP----RKMPTHRRMAVVYDS 561
RRG E D D ++ HRR AVV+DS
Sbjct: 635 RRGREVDSDDDDSPPRKQQAVHRRKAVVFDS 665
>gi|326527249|dbj|BAK04566.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 665
Score = 519 bits (1337), Expect = e-144, Method: Compositional matrix adjust.
Identities = 315/571 (55%), Positives = 403/571 (70%), Gaps = 32/571 (5%)
Query: 2 HDAEPDPGQSEGERDQSSQEVDVVDQREESEAKYTDSDEKEEYGQRVATSRRREVIESGS 61
H A+ D G+S+ E+ QSS E ++ D +++A+ +S+E + Y Q+ SRRREV+ S S
Sbjct: 117 HPADLDQGESDPEKVQSSPEREMSDGVMQTDARGMESEE-DGYEQQAVPSRRREVVASES 175
Query: 62 ERSEENHYPDH--EDEEVDQARSPRSPGGEEKDQTHISHSAAEIRDVFGDSDEEDVGEYA 119
E SE+N+Y EDEE Q R SP EE+D H +RDVFGDSDE++ Y
Sbjct: 176 EGSEDNYYAGQAPEDEEAPQ-RKLSSPMEEERD-----HEV--VRDVFGDSDEDEPAPYR 227
Query: 120 IRNDIDQDSNRSPMEEEGSYEKSIRPEDIVPDEDMHYESEEEHVEPKHKEKPVGPPLELE 179
++ID+DSN PME+EG YEK ++P+D+V D+DM YES++ EPK KEKPVGPPL L
Sbjct: 228 APDEIDEDSN--PMEDEGQYEKELQPDDVVADDDMRYESDDNR-EPKTKEKPVGPPLNLV 284
Query: 180 IPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVRW 239
+P + PPA P +MN+IKVSNIMGIDP+PFDPKTYVEED +VTDESG KK+IRLE+NIVRW
Sbjct: 285 VPRQQPPARPDRMNVIKVSNIMGIDPRPFDPKTYVEEDVYVTDESGTKKKIRLEDNIVRW 344
Query: 240 RTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQS 299
RTVK DG TS ESNARFV+W DG++QL IGNEVLDI+V +A HDQ+HLFLR GKG+LQS
Sbjct: 345 RTVKKADGTTSVESNARFVKWKDGTMQLLIGNEVLDISVHEANHDQSHLFLRSGKGVLQS 404
Query: 300 QGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAESQN 359
QGR+L+KMRF+PSSLSS SHRLLTALVDS++KK K++ DPER K+EKE+AE Q+
Sbjct: 405 QGRLLQKMRFMPSSLSSKSHRLLTALVDSQNKKTVKMQKWYDAKDPERVKQEKEKAEGQS 464
Query: 360 IRANVLLNRKREKINRKYTQTVERRRQLSTGYLEDALEEDDETD-YHDSRR--SRRRFEE 416
IRA+ +L RKREK+NRKYTQ +RRQLS G+LEDAL+ED+ETD + SRR SR RFE+
Sbjct: 465 IRAHSILQRKREKVNRKYTQPARQRRQLSPGFLEDALDEDEETDNQYSSRRMASRGRFED 524
Query: 417 ELEAEVRAEKRIINAKKPQGHRDIPRKSSTLPAAKSSRRPVDFSESEREESEYETDGEED 476
+LEAE AE+RIINAKK R +PRK S P ++ +RR E E+
Sbjct: 525 DLEAEALAERRIINAKKSNMSRGVPRKPS-FPPSRPARRQEYSESEREESEYETEG-EDI 582
Query: 477 ERSPLRKRVEGPEQDYEEEEEEEEEEQEEEPDINRASDDEEEAVEPKQKARD-SGSNHKR 535
E SP R +E +EE+E +E+ + R SD E EPK+K +G + +R
Sbjct: 583 EHSPTGGR--------GDELDEEDEYEEDPEEDARMSD--EGIQEPKRKRESAAGGSQRR 632
Query: 536 RGIESDEDSPPRKMP--THRRMAVVYDSDEE 564
R ++S EDSPPRK P HRR AVV++SD+E
Sbjct: 633 REVDSGEDSPPRKQPAMVHRRKAVVFESDDE 663
>gi|326522012|dbj|BAK04134.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 665
Score = 519 bits (1337), Expect = e-144, Method: Compositional matrix adjust.
Identities = 315/571 (55%), Positives = 403/571 (70%), Gaps = 32/571 (5%)
Query: 2 HDAEPDPGQSEGERDQSSQEVDVVDQREESEAKYTDSDEKEEYGQRVATSRRREVIESGS 61
H A+ D G+S+ E+ QSS E ++ D +++A+ +S+E + Y Q+ SRRREV+ S S
Sbjct: 117 HPADLDQGESDPEKVQSSPEREMSDGVMQTDARGMESEE-DGYEQQAVPSRRREVVASES 175
Query: 62 ERSEENHYPDH--EDEEVDQARSPRSPGGEEKDQTHISHSAAEIRDVFGDSDEEDVGEYA 119
E SE+N+Y EDEE Q R SP EE+D H +RDVFGDSDE++ Y
Sbjct: 176 EGSEDNYYAGQAPEDEEAPQ-RKLSSPMEEERD-----HEV--VRDVFGDSDEDEPAPYR 227
Query: 120 IRNDIDQDSNRSPMEEEGSYEKSIRPEDIVPDEDMHYESEEEHVEPKHKEKPVGPPLELE 179
++ID+DSN PME+EG YEK ++P+D+V D+DM YES++ EPK KEKPVGPPL L
Sbjct: 228 APDEIDEDSN--PMEDEGQYEKELQPDDVVADDDMRYESDDNR-EPKTKEKPVGPPLNLV 284
Query: 180 IPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVRW 239
+P + PPA P +MN+IKVSNIMGIDP+PFDPKTYVEED +VTDESG KK+IRLE+NIVRW
Sbjct: 285 VPRQQPPARPDRMNVIKVSNIMGIDPRPFDPKTYVEEDVYVTDESGTKKKIRLEDNIVRW 344
Query: 240 RTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQS 299
RTVK DG TS ESNARFV+W DG++QL IGNEVLDI+V +A HDQ+HLFLR GKG+LQS
Sbjct: 345 RTVKKADGTTSVESNARFVKWKDGTMQLLIGNEVLDISVHEANHDQSHLFLRSGKGVLQS 404
Query: 300 QGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAESQN 359
QGR+L+KMRF+PSSLSS SHRLLTALVDS++KK K++ DPER K+EKE+AE Q+
Sbjct: 405 QGRLLQKMRFMPSSLSSKSHRLLTALVDSQNKKTVKMQKWYDAKDPERVKQEKEKAEGQS 464
Query: 360 IRANVLLNRKREKINRKYTQTVERRRQLSTGYLEDALEEDDETD-YHDSRR--SRRRFEE 416
IRA+ +L RKREK+NRKYTQ +RRQLS G+LEDAL+ED+ETD + SRR SR RFE+
Sbjct: 465 IRAHSILQRKREKVNRKYTQPARQRRQLSPGFLEDALDEDEETDNQYSSRRMASRGRFED 524
Query: 417 ELEAEVRAEKRIINAKKPQGHRDIPRKSSTLPAAKSSRRPVDFSESEREESEYETDGEED 476
+LEAE AE+RIINAKK R +PRK S P ++ +RR E E+
Sbjct: 525 DLEAEALAERRIINAKKSNMSRGVPRKPS-FPPSRPARRQEYSESEREESEYETEG-EDI 582
Query: 477 ERSPLRKRVEGPEQDYEEEEEEEEEEQEEEPDINRASDDEEEAVEPKQKARD-SGSNHKR 535
E SP R +E +EE+E +E+ + R SD E EPK+K +G + +R
Sbjct: 583 EHSPTGGR--------GDELDEEDEYEEDPEEDARMSD--EGIQEPKRKRESAAGGSQRR 632
Query: 536 RGIESDEDSPPRKMP--THRRMAVVYDSDEE 564
R ++S EDSPPRK P HRR AVV++SD+E
Sbjct: 633 REVDSGEDSPPRKQPAMVHRRKAVVFESDDE 663
>gi|359491295|ref|XP_002284385.2| PREDICTED: RNA polymerase-associated protein LEO1 [Vitis vinifera]
Length = 387
Score = 499 bits (1285), Expect = e-138, Method: Compositional matrix adjust.
Identities = 294/374 (78%), Positives = 326/374 (87%), Gaps = 9/374 (2%)
Query: 191 KMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGMTS 250
+MNMIKVSN MGI+PKPFDPK YVEEDTFVTDESG+KKR+RLENNIVRWR VKN DG TS
Sbjct: 23 QMNMIKVSNTMGIEPKPFDPKMYVEEDTFVTDESGSKKRMRLENNIVRWRNVKNPDGTTS 82
Query: 251 YESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRFI 310
YESNARFVRWSDGSLQL IGNEVLDI+VQDAQHDQ+HLFLRHGKGILQSQGRIL+KMRF+
Sbjct: 83 YESNARFVRWSDGSLQLLIGNEVLDISVQDAQHDQSHLFLRHGKGILQSQGRILRKMRFM 142
Query: 311 PSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAESQNIRANVLLNRKR 370
PSSLSSNSHRLLTALVDSRHKKVYK+KNCITDIDPEREKEEKE+A SQ IRAN LLNRKR
Sbjct: 143 PSSLSSNSHRLLTALVDSRHKKVYKIKNCITDIDPEREKEEKEKAVSQTIRANQLLNRKR 202
Query: 371 EKINRKYTQTVERRRQLSTGYLEDALEEDDETDYHDSRRSRRRFEEELEAEVRAEKRIIN 430
EK+NRKYTQTV+R RQLS G+LE+AL+EDDE DYHDS R RRRFE++LE + +AEKRI N
Sbjct: 203 EKVNRKYTQTVDRGRQLSPGFLEEALDEDDEPDYHDSHRHRRRFEDDLEMDTQAEKRIFN 262
Query: 431 AKKPQGHRDIPRKSSTLPAAKSSRRPVDFSESEREESEYETDGEEDERSPLRKRVEGPEQ 490
KK GH+DIP KSS L A KSSRRPVDFS+SEREESEYE+DGEEDERS RKR E PE+
Sbjct: 263 VKK--GHKDIPHKSS-LSAIKSSRRPVDFSDSEREESEYESDGEEDERSFSRKRAEEPEE 319
Query: 491 DYEEEEEEEEEEQEEEPDINRASDDEEEAVEPKQKARDSGSNHKRRGIESDEDSPPRKMP 550
+Y EEEEEE+E EEE A+++ EE EPKQK ++ KR+GIESDEDSPPRK
Sbjct: 320 EYAEEEEEEDEHDEEE---AEANEESEEVEEPKQKGKEYA---KRKGIESDEDSPPRKSA 373
Query: 551 THRRMAVVYDSDEE 564
+HRRMA VYDSDEE
Sbjct: 374 SHRRMAFVYDSDEE 387
>gi|297733900|emb|CBI15147.3| unnamed protein product [Vitis vinifera]
Length = 364
Score = 499 bits (1285), Expect = e-138, Method: Compositional matrix adjust.
Identities = 294/373 (78%), Positives = 325/373 (87%), Gaps = 9/373 (2%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGMTSY 251
MNMIKVSN MGI+PKPFDPK YVEEDTFVTDESG+KKR+RLENNIVRWR VKN DG TSY
Sbjct: 1 MNMIKVSNTMGIEPKPFDPKMYVEEDTFVTDESGSKKRMRLENNIVRWRNVKNPDGTTSY 60
Query: 252 ESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRFIP 311
ESNARFVRWSDGSLQL IGNEVLDI+VQDAQHDQ+HLFLRHGKGILQSQGRIL+KMRF+P
Sbjct: 61 ESNARFVRWSDGSLQLLIGNEVLDISVQDAQHDQSHLFLRHGKGILQSQGRILRKMRFMP 120
Query: 312 SSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAESQNIRANVLLNRKRE 371
SSLSSNSHRLLTALVDSRHKKVYK+KNCITDIDPEREKEEKE+A SQ IRAN LLNRKRE
Sbjct: 121 SSLSSNSHRLLTALVDSRHKKVYKIKNCITDIDPEREKEEKEKAVSQTIRANQLLNRKRE 180
Query: 372 KINRKYTQTVERRRQLSTGYLEDALEEDDETDYHDSRRSRRRFEEELEAEVRAEKRIINA 431
K+NRKYTQTV+R RQLS G+LE+AL+EDDE DYHDS R RRRFE++LE + +AEKRI N
Sbjct: 181 KVNRKYTQTVDRGRQLSPGFLEEALDEDDEPDYHDSHRHRRRFEDDLEMDTQAEKRIFNV 240
Query: 432 KKPQGHRDIPRKSSTLPAAKSSRRPVDFSESEREESEYETDGEEDERSPLRKRVEGPEQD 491
KK GH+DIP KSS L A KSSRRPVDFS+SEREESEYE+DGEEDERS RKR E PE++
Sbjct: 241 KK--GHKDIPHKSS-LSAIKSSRRPVDFSDSEREESEYESDGEEDERSFSRKRAEEPEEE 297
Query: 492 YEEEEEEEEEEQEEEPDINRASDDEEEAVEPKQKARDSGSNHKRRGIESDEDSPPRKMPT 551
Y EEEEEE+E EEE A+++ EE EPKQK ++ KR+GIESDEDSPPRK +
Sbjct: 298 YAEEEEEEDEHDEEE---AEANEESEEVEEPKQKGKEYA---KRKGIESDEDSPPRKSAS 351
Query: 552 HRRMAVVYDSDEE 564
HRRMA VYDSDEE
Sbjct: 352 HRRMAFVYDSDEE 364
>gi|414592104|tpg|DAA42675.1| TPA: hypothetical protein ZEAMMB73_653521 [Zea mays]
Length = 433
Score = 476 bits (1225), Expect = e-131, Method: Compositional matrix adjust.
Identities = 277/442 (62%), Positives = 329/442 (74%), Gaps = 20/442 (4%)
Query: 133 MEEEGSYEKSIRPEDIVPDEDMHYESEEEHVEPKHKEKPVGPPLELEIPLRPPPADPTKM 192
ME+EG YEK ++P+D+V DEDM YES++ E K KEKPVGPPL+L +P + PPA P KM
Sbjct: 1 MEDEGHYEKDMQPDDVVADEDMRYESDDNR-ELKPKEKPVGPPLDLVVPFKQPPARPDKM 59
Query: 193 NMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGMTSYE 252
N+IKVSNIMGIDPKPFDPKTYVEED FVTDESG KKRIRLE+NIVRWRT++N DG TS E
Sbjct: 60 NVIKVSNIMGIDPKPFDPKTYVEEDVFVTDESGTKKRIRLEDNIVRWRTIRNADGTTSCE 119
Query: 253 SNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRFIPS 312
SNARFV+W DG++QL IGNEVLDI+V +A HDQ+HLFLRHGKGILQSQGR+L KMRF+PS
Sbjct: 120 SNARFVKWKDGTMQLLIGNEVLDISVHEAHHDQSHLFLRHGKGILQSQGRLLHKMRFMPS 179
Query: 313 SLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAESQNIRANVLLNRKREK 372
SLSS SHRLLTALVDS++KK K++ DPE+ K EKER QNIRA+ +L RKREK
Sbjct: 180 SLSSKSHRLLTALVDSQNKKTVKMQKWFETKDPEKAKMEKERVLGQNIRAHSILQRKREK 239
Query: 373 INRKYTQTVERRRQLSTGYLEDALEEDDETDYHDSRR--SRRRFEEELEAEVRAEKRIIN 430
+NRKYTQ RRQLS G+LEDAL+EDDE Y DSRR RRRFE+ELEAE AE+RIIN
Sbjct: 240 VNRKYTQPARPRRQLSPGFLEDALDEDDEPGY-DSRRMPGRRRFEDELEAEALAERRIIN 298
Query: 431 AKKPQGHRDIPRKSSTLPAAKSSRRPVDFSESEREESEYETDGEEDERSPLRKRVEGPEQ 490
AKK R++P K PA R+ ++S SEREESEYETDGE+ E S R R E ++
Sbjct: 299 AKKSNMSRNVPHKPPYPPARPPRRQADEYSGSEREESEYETDGEDIEHSLPRGRRE--DE 356
Query: 491 DYEEEEEEEEEEQEEEPDINRASDDEEEAVEPKQKARDSGSNHKRRGIES------DEDS 544
EEEE EE+ E+EE P SD+E EA PK+K G +R + S +DS
Sbjct: 357 LDEEEEYEEDVEEEEAP----LSDEEMEA--PKRKRESGGGGQRREELVSEEDDEDHDDS 410
Query: 545 PPRKMP--THRRMAVVYDSDEE 564
PPRK P +HRR AVV+D +E
Sbjct: 411 PPRKQPAVSHRRKAVVFDDSDE 432
>gi|226500858|ref|NP_001145989.1| uncharacterized protein LOC100279518 [Zea mays]
gi|219885229|gb|ACL52989.1| unknown [Zea mays]
Length = 508
Score = 454 bits (1167), Expect = e-125, Method: Compositional matrix adjust.
Identities = 237/381 (62%), Positives = 289/381 (75%), Gaps = 11/381 (2%)
Query: 2 HDAEPDPGQSEGERDQSSQEVDVVDQREESEAKYTDSDEKEEYGQRVATSRRREVIESGS 61
H A+ D G+S+G++ QSS E ++ DQR E +A+ DS E E Y QR + RRR VI S S
Sbjct: 121 HGADLDQGESDGDKVQSSPERELDDQRMEPDARGMDS-EDEGYQQRTVSGRRRGVIASES 179
Query: 62 ERSEENHYPD--HEDEEVDQARSPRSPGGEEKDQTHISHSAAEIRDVFGDSDEEDVGEYA 119
E SE+++Y EDEE Q R SP EE+D +RDVFGDSDE++ Y
Sbjct: 180 EGSEDSYYAAGAQEDEEPRQTRKQSSPLEEERDHE-------VVRDVFGDSDEDEPAPYR 232
Query: 120 IRNDIDQDSNRSPMEEEGSYEKSIRPEDIVPDEDMHYESEEEHVEPKHKEKPVGPPLELE 179
+R++ID+DS+RS ME+EG YEK ++P+D+V DEDM YES++ E K KEKPVGPPL+L
Sbjct: 233 VRHEIDEDSHRSHMEDEGHYEKDMQPDDVVADEDMRYESDDNR-ELKPKEKPVGPPLDLV 291
Query: 180 IPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVRW 239
+P + PPA P KMN+IKVSNIMGIDPKPFDPKTYVEED FVTDESG KKRIRLE+NIVRW
Sbjct: 292 VPFKQPPARPDKMNVIKVSNIMGIDPKPFDPKTYVEEDVFVTDESGTKKRIRLEDNIVRW 351
Query: 240 RTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQS 299
RT++N DG TS ESNARFV+W DG++QL IGNEVLDI+V +A HDQ+HLFLRHGKGILQS
Sbjct: 352 RTIRNADGTTSCESNARFVKWKDGTMQLLIGNEVLDISVHEAHHDQSHLFLRHGKGILQS 411
Query: 300 QGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAESQN 359
QGR+L KMRF+PSSLSS SHRLLTALVDS++KK K++ DPE+ K EKER QN
Sbjct: 412 QGRLLHKMRFMPSSLSSKSHRLLTALVDSQNKKTVKMQKWFETKDPEKAKMEKERVLGQN 471
Query: 360 IRANVLLNRKREKINRKYTQT 380
IRA+ +L RKREK+NRKYT
Sbjct: 472 IRAHSILQRKREKVNRKYTAC 492
>gi|115470253|ref|NP_001058725.1| Os07g0109600 [Oryza sativa Japonica Group]
gi|113610261|dbj|BAF20639.1| Os07g0109600, partial [Oryza sativa Japonica Group]
Length = 476
Score = 423 bits (1088), Expect = e-115, Method: Compositional matrix adjust.
Identities = 227/370 (61%), Positives = 280/370 (75%), Gaps = 12/370 (3%)
Query: 2 HDAEPDPGQSEGERDQSSQEVDVVDQREESEAKYTDSDEKEEYGQRVATSRRREVIESGS 61
H A+ D G+S+GE+ QSS E + D+ +++A DS E Y Q S RR V+ S S
Sbjct: 116 HQADLDQGESDGEKVQSSPEREFSDRVMQNDAAGMDS-EDGGYQQWPVASGRRGVVASES 174
Query: 62 ERSEENHYPD--HEDEEVDQAR-SPRSPGGEEKDQTHISHSAAEIRDVFGDSDEEDVGEY 118
E SE+++Y HEDEE Q R +P SP EE+D H +RDVFG+SDE+ Y
Sbjct: 175 EGSEDDYYAGRGHEDEEPHQTRKTPSSPVEEERD-----HEV--VRDVFGESDEDGPAPY 227
Query: 119 AIRNDIDQDSNRSPMEEEGSYEKSIRPEDIVPDEDMHYESEEEHVEPKHKEKPVGPPLEL 178
+++ID+DS+RSPME+EG YEK ++PED+V DEDM YES+E E K KEKPVGPPL L
Sbjct: 228 RDQHEIDEDSHRSPMEDEGHYEKDLQPEDVVADEDMRYESDENR-ELKPKEKPVGPPLNL 286
Query: 179 EIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVR 238
+PL+ PPA P +MN+IKVSNIMGIDPKPFDPKTYVEED FVTDESG KKRIRLE+NIVR
Sbjct: 287 VVPLKQPPAQPDRMNVIKVSNIMGIDPKPFDPKTYVEEDVFVTDESGTKKRIRLEDNIVR 346
Query: 239 WRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQ 298
WRTVKN +G TS ESNAR V+W DG++QL IGNEVLDI+V +A HDQ+HLFLR+GKG+LQ
Sbjct: 347 WRTVKNANGTTSCESNARIVKWKDGTMQLLIGNEVLDISVHEAHHDQSHLFLRNGKGVLQ 406
Query: 299 SQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAESQ 358
SQGR+L+KMRF+PSSLSS SHRLLTALVDS++KK K++ I DPE+ K+EKERA Q
Sbjct: 407 SQGRLLRKMRFMPSSLSSKSHRLLTALVDSQNKKTVKMQKWIESKDPEKVKQEKERALGQ 466
Query: 359 NIRANVLLNR 368
NIRA+ +L R
Sbjct: 467 NIRAHSILQR 476
>gi|414592103|tpg|DAA42674.1| TPA: hypothetical protein ZEAMMB73_653521 [Zea mays]
Length = 493
Score = 420 bits (1080), Expect = e-115, Method: Compositional matrix adjust.
Identities = 223/367 (60%), Positives = 277/367 (75%), Gaps = 12/367 (3%)
Query: 2 HDAEPDPGQSEGERDQSSQEVDVVDQREESEAKYTDSDEKEEYGQRVATSRRREVIESGS 61
H A+ D G+S+G++ QSS E ++ DQR E +A+ DS E E Y QR + RRR VI S S
Sbjct: 121 HGADLDQGESDGDKVQSSPERELDDQRMEPDARGMDS-EDEGYQQRTVSGRRRGVIASES 179
Query: 62 ERSEENHYPD--HEDEEVDQARSPRSPGGEEKDQTHISHSAAEIRDVFGDSDEEDVGEYA 119
E SE+++Y EDEE Q R SP EE+D +RDVFGDSDE++ Y
Sbjct: 180 EGSEDSYYAAGAQEDEEPRQTRKQSSPLEEERDHE-------VVRDVFGDSDEDEPAPYR 232
Query: 120 IRNDIDQDSNRSPMEEEGSYEKSIRPEDIVPDEDMHYESEEEHVEPKHKEKPVGPPLELE 179
+R++ID+DS+RS ME+EG YEK ++P+D+V DEDM YES++ E K KEKPVGPPL+L
Sbjct: 233 VRHEIDEDSHRSHMEDEGHYEKDMQPDDVVADEDMRYESDDNR-ELKPKEKPVGPPLDLV 291
Query: 180 IPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVRW 239
+P + PPA P KMN+IKVSNIMGIDPKPFDPKTYVEED FVTDESG KKRIRLE+NIVRW
Sbjct: 292 VPFKQPPARPDKMNVIKVSNIMGIDPKPFDPKTYVEEDVFVTDESGTKKRIRLEDNIVRW 351
Query: 240 RTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQS 299
RT++N DG TS ESNARFV+W DG++QL IGNEVLDI+V +A HDQ+HLFLRHGKGILQS
Sbjct: 352 RTIRNADGTTSCESNARFVKWKDGTMQLLIGNEVLDISVHEAHHDQSHLFLRHGKGILQS 411
Query: 300 QGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAESQN 359
QGR+L KMRF+PSSLSS SHRLLTALVDS++KK K++ DPE+ K EKER S++
Sbjct: 412 QGRLLHKMRFMPSSLSSKSHRLLTALVDSQNKKTVKMQKWFETKDPEKAKMEKERV-SKS 470
Query: 360 IRANVLL 366
+ + L+
Sbjct: 471 LLLHYLV 477
>gi|194703254|gb|ACF85711.1| unknown [Zea mays]
Length = 493
Score = 412 bits (1058), Expect = e-112, Method: Compositional matrix adjust.
Identities = 223/367 (60%), Positives = 277/367 (75%), Gaps = 12/367 (3%)
Query: 2 HDAEPDPGQSEGERDQSSQEVDVVDQREESEAKYTDSDEKEEYGQRVATSRRREVIESGS 61
H A+ D G+S+G++ QSS E ++ DQR E +A+ DS E E Y QR + RRR VI S S
Sbjct: 121 HGADLDQGESDGDKVQSSPERELDDQRMEPDARGMDS-EDEGYQQRTVSGRRRGVISSES 179
Query: 62 ERSEENHYPD--HEDEEVDQARSPRSPGGEEKDQTHISHSAAEIRDVFGDSDEEDVGEYA 119
E SE+++Y EDEE Q R SP EE+D +RDVFGDSDE++ Y
Sbjct: 180 EGSEDSYYAAGAQEDEEPRQTRKQSSPLEEERDHE-------VVRDVFGDSDEDEPAPYR 232
Query: 120 IRNDIDQDSNRSPMEEEGSYEKSIRPEDIVPDEDMHYESEEEHVEPKHKEKPVGPPLELE 179
+R++ID+DS+RS ME+EG YEK ++P+D+V DEDM YES++ E K KEKPVGPPL+L
Sbjct: 233 VRHEIDEDSHRSHMEDEGHYEKDMQPDDVVADEDMRYESDDNR-ELKPKEKPVGPPLDLV 291
Query: 180 IPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVRW 239
+P + PPA P KMN+IKVSNIMGIDPKPFDPKTYVEED FVTDESG KKRIRLE+NIVRW
Sbjct: 292 VPFKQPPARPDKMNVIKVSNIMGIDPKPFDPKTYVEEDVFVTDESGTKKRIRLEDNIVRW 351
Query: 240 RTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQS 299
RT++N DG TS ESNARFV+W DG++QL IGNEVLDI+V +A HDQ+HLFLRHGKGILQS
Sbjct: 352 RTIRNADGTTSCESNARFVKWKDGTMQLLIGNEVLDISVHEAHHDQSHLFLRHGKGILQS 411
Query: 300 QGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAESQN 359
QGR+L KMRF+PSSLSS SHRLLTALVDS++KK K++ DPE+ K EKER S++
Sbjct: 412 QGRLLHKMRFMPSSLSSKSHRLLTALVDSQNKKTVKMQKWFETKDPEKAKMEKERV-SKS 470
Query: 360 IRANVLL 366
+ + L+
Sbjct: 471 LLLHYLV 477
>gi|168061473|ref|XP_001782713.1| Paf1 complex protein [Physcomitrella patens subsp. patens]
gi|162665806|gb|EDQ52478.1| Paf1 complex protein [Physcomitrella patens subsp. patens]
Length = 607
Score = 337 bits (865), Expect = 7e-90, Method: Compositional matrix adjust.
Identities = 224/426 (52%), Positives = 289/426 (67%), Gaps = 30/426 (7%)
Query: 36 TDSDEKEEYGQRVATSRRR----EVIESGSERSEENHYP-DHEDEEV-DQARSPRSPGGE 89
DSD E G+ V + R EV SGS+RS + D+EDE+V ++AR E
Sbjct: 83 ADSDRGSEDGREVESEGERDGEVEVSASGSDRSHQVQIAGDYEDEDVQEEARGQSEGEEE 142
Query: 90 EKDQTHISH--SAAEIRDVFGDSDEEDVGEYAIRNDIDQDSNRSPME------------E 135
E+ AE RDVFG+SDEED E + + + RSP E +
Sbjct: 143 EQQDDEPRERIEVAEARDVFGESDEEDDQEPEQEPEQEAEERRSPEESLLGSPERVRSDD 202
Query: 136 EGSYEKSIRPEDIVPDEDM----HYESEEEHVEPKHKEKPVGPPLELEIPLRPPPADPTK 191
EGS R ED+V D+D YES++E + EKP+GPP+EL++PLRPPP P +
Sbjct: 203 EGSEGDPARVEDVVGDDDREGSDRYESDDERYIEQKPEKPMGPPIELQVPLRPPPGMPDQ 262
Query: 192 MNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGMTSY 251
+N++++SNIM ID KPFDPKTY E+ F TD G K+ IRLE NIVRWR V N+DG TSY
Sbjct: 263 LNIVRMSNIMNIDMKPFDPKTYEAEEMFTTDVEGKKQHIRLEENIVRWRPVHNRDGSTSY 322
Query: 252 ESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRFIP 311
ESNARFV+WSDGS+QL +G+EVLD++VQ+A DQ+HLFLRH K ILQSQG++L+KMRF+P
Sbjct: 323 ESNARFVKWSDGSMQLLLGSEVLDLSVQNATQDQSHLFLRHPKSILQSQGQVLRKMRFMP 382
Query: 312 SSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAESQNIRANVLLNRKRE 371
SSL+S SHRLLTALVDS+HKKV+KVK IT+ DPE+EK EKE+ Q IR L RK+E
Sbjct: 383 SSLTSKSHRLLTALVDSKHKKVFKVKAVITETDPEKEKHEKEKQAEQRIRNKEDLQRKQE 442
Query: 372 KINRKY----TQTVERRRQLSTGYLEDALEEDDETDYHDSRRSRRRFEEELEAEVRAEKR 427
K+ R+Y + +ER R LS GYLEDALEE+DE D +D RRS RR++++LE E RAE+R
Sbjct: 443 KLGRRYEPPRPRVIERERGLSPGYLEDALEEEDEPDQYD-RRS-RRYQDQLEQEQRAERR 500
Query: 428 IINAKK 433
I++AK+
Sbjct: 501 IMHAKR 506
>gi|37718798|gb|AAR01669.1| putative Paf1/RNA polymerase II complex protein [Oryza sativa
Japonica Group]
gi|108711161|gb|ABF98956.1| Leo1 [Oryza sativa Japonica Group]
gi|125545771|gb|EAY91910.1| hypothetical protein OsI_13595 [Oryza sativa Indica Group]
gi|125587969|gb|EAZ28633.1| hypothetical protein OsJ_12643 [Oryza sativa Japonica Group]
Length = 605
Score = 327 bits (839), Expect = 8e-87, Method: Compositional matrix adjust.
Identities = 215/479 (44%), Positives = 307/479 (64%), Gaps = 55/479 (11%)
Query: 103 IRDVFGDSDEEDVGEYAIRNDIDQDSNRSPMEEEGSYEKSIRPEDIVPDEDMHYESEEEH 162
+ D F D+DE+ + Y R+D ++ ++ S M +EG YE+ + PED+V DED HYE +E +
Sbjct: 165 LHDAFDDNDEDGLAPYGSRDD-NKHAHESLMNDEGPYEE-LLPEDMV-DEDKHYEPDE-N 220
Query: 163 VEPKHKEKPVGPPLELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTD 222
+E + K+KP+GPPL L +P PP P +MN+IKVSNIMG++PKPFDP+TYVEED TD
Sbjct: 221 IEHELKDKPLGPPLNLVVPRMLPPGQPDRMNVIKVSNIMGVNPKPFDPETYVEEDAL-TD 279
Query: 223 ESGAKKRIRLENNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQ 282
ESG++K+IRLE+NIVRW+ VKN DG S ESNARFV+W DGS+QL IGNEVLDI+V D+
Sbjct: 280 ESGSRKKIRLEDNIVRWKIVKNADGTESRESNARFVKWKDGSIQLLIGNEVLDISVNDSN 339
Query: 283 HDQAHLFLRHGKGILQSQGRILKKMRFIPSSLSS----NSHRLLTALVDSRH-------- 330
HD ++LFLR+GK I+QS +P L S + H+L T +
Sbjct: 340 HDNSNLFLRNGKAIIQS----------LPLPLLSIDHHHVHKLTTDFFSGAYAITRKAST 389
Query: 331 --KKVYKVKNCITDIDPEREKEEKERAESQNIRANVLLNRKREKINRKYTQTVERRRQLS 388
+K K++ I + DPE+ KEE+E+AE +NIRA+ L RKRE++ RKY+Q + +RR+L+
Sbjct: 390 EDEKTIKMQTWIDENDPEKVKEEREKAEGENIRASSSLQRKREQVKRKYSQPLRKRRKLT 449
Query: 389 TGYLEDALEEDDETDY-HDSRR--SRRRFEEELEAEVRAEKRIINAKKPQGHRDIPRKSS 445
G+LEDALEED+ ++ RR FE+ LEAE +++ + NAKK + +P S
Sbjct: 450 PGFLEDALEEDEAPGVGYNQRRGPGHAHFEDSLEAEALSKRHVTNAKKANVGKAVP--SP 507
Query: 446 TLPAAKSSRRPVDFSESEREESEYETDGEEDERSPLRKRVEGPEQDYEEEEEEEEEEQEE 505
++P + + S+SE EESEYETD E+ + SP R ++ +EEEE+ +E
Sbjct: 508 SVPKHQVNEY----SKSESEESEYETDVEDIDNSPTNGR-------EDDMDEEEEDPEEV 556
Query: 506 EPDINRASDDEEEAVEPKQKARDSGSNHKRRGIESDEDSPPRKMPTHRRMAVVYDSDEE 564
D + + ++ EE K+ R+G SD++SPPRK P +RR VV+DSD+E
Sbjct: 557 IGDTSMSDENNEEQEHVKE----------RKGFNSDDESPPRKQPLNRRKTVVFDSDDE 605
>gi|302809908|ref|XP_002986646.1| hypothetical protein SELMODRAFT_446684 [Selaginella moellendorffii]
gi|300145534|gb|EFJ12209.1| hypothetical protein SELMODRAFT_446684 [Selaginella moellendorffii]
Length = 605
Score = 322 bits (825), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 185/362 (51%), Positives = 239/362 (66%), Gaps = 42/362 (11%)
Query: 100 AAEIRDVFGDSDEEDVGEYAIRNDIDQDSNRSPM-------EEEGSYEKSIRPEDIVPDE 152
A IR GDSD++D E + D R P + EGSY K + D+
Sbjct: 113 GARIRAALGDSDDDDQEEGGV------DGPRKPSSPEGEGSDLEGSYHKDSKRG--AEDD 164
Query: 153 DMHYESEEEHVEPKHKEKPVGPPLELEIPLRPPPADP---------------------TK 191
+ Y S+EE E K KP G PL +E+PLR PP +
Sbjct: 165 EEQYYSDEERAEIK---KPKGEPLTVEVPLRQPPTQAHNASSLALVSFSLSRELTGLCFQ 221
Query: 192 MNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGMTSY 251
MN++++SNIM I+ +PFDPKTYVEED FV DE+G ++RIR+E N+VRWR V+N+DG S
Sbjct: 222 MNLVRISNIMDIEQRPFDPKTYVEEDGFV-DETG-RRRIRIEENVVRWRYVRNRDGSRSA 279
Query: 252 ESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRFIP 311
ESNARFV+WSDGS+QL +GNEVLD+ VQD Q D++HLF+R KG+LQ+QGR+ +KMRF+P
Sbjct: 280 ESNARFVKWSDGSMQLLLGNEVLDLAVQDGQQDESHLFIRQPKGLLQAQGRLARKMRFMP 339
Query: 312 SSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAESQNIRANVLLNRKRE 371
SSL S SHRLLTALVDS HKKV+KVKN ITD DPE++KE+KE+A Q I++ L +K+E
Sbjct: 340 SSLRSKSHRLLTALVDSTHKKVFKVKNVITDFDPEKDKEQKEKAAEQRIKSKEDLQKKQE 399
Query: 372 KINRKYTQTVERRRQLSTGYLEDALEEDDETDYHDSRRSRRRFEEELEAEVRAEKRIINA 431
K RKY T ER QLS GYLE ALEE+DE + RS RR++++L+AE RAE+RI
Sbjct: 400 KTMRKYPPTREREPQLSPGYLEGALEEEDEDYDDEG-RSNRRYQDQLDAESRAERRINQV 458
Query: 432 KK 433
K+
Sbjct: 459 KR 460
>gi|307111187|gb|EFN59422.1| hypothetical protein CHLNCDRAFT_137949 [Chlorella variabilis]
Length = 440
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 110/333 (33%), Positives = 171/333 (51%), Gaps = 45/333 (13%)
Query: 69 YPDHEDEEVDQARSPRSPGGEEKDQTHISHSAAEIRDVFGDSDEEDVGEYAIRNDIDQDS 128
+ D+E QA R+ G E + + +AA ++FG SDE+D
Sbjct: 10 FGSDSDDEAPQAEKARAGGDEAQAERQAGKAAAT--NLFGSSDEDD-------------- 53
Query: 129 NRSPMEEEGSYEKSIRPEDIVPDEDMHYESEEEHVEPKHKEKPV---GPPLELEI-PLRP 184
E G+ P E E+E+ P P GPP+ +++
Sbjct: 54 -----EPPGASH---------PVTQALGEDEDEY--PARSPSPGAARGPPVYVDLETFDV 97
Query: 185 PPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRL-ENNIVRWRTVK 243
P D + ++K+ NI+G++ +PFDP T+ DE G K R+RL + N +RWR
Sbjct: 98 QPQD--SLRLVKLPNILGVEARPFDPDTFDAGAEEEVDERGFK-RVRLRDQNCIRWRWGV 154
Query: 244 NKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRI 303
+++G ESNARFVRWSDGSLQL +G+EV+D+ D ++ ++ +R G++Q QG++
Sbjct: 155 DEEGNQVRESNARFVRWSDGSLQLVVGDEVMDVKEIDTTNEHTYMCVRL-PGLIQGQGQL 213
Query: 304 LKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAESQNIRAN 363
+KM F P+SL SN H+ L A D H K +V+ T +DP++ K E+ERAE++ IR+
Sbjct: 214 TRKMVFQPASLQSNLHKQLRAAADKLHVKTQRVRATTTLVDPKKLKAEQERAEAERIRSR 273
Query: 364 VLLNRKREKINRKYTQTVERR----RQLSTGYL 392
L ++ K RK+ R +QL+ YL
Sbjct: 274 EKLAERQTKTMRKFGLPAMPRARMPQQLNADYL 306
>gi|384249595|gb|EIE23076.1| Leo1-like protein [Coccomyxa subellipsoidea C-169]
Length = 404
Score = 145 bits (367), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 92/189 (48%), Positives = 124/189 (65%), Gaps = 5/189 (2%)
Query: 194 MIKVSNIMGIDPKPFDPKTY---VEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGMTS 250
++K+SNI+ ID KPFDP TY EE+ +V D+ + R+R + N++RWR +DG +
Sbjct: 99 LVKLSNIINIDMKPFDPATYKEAEEEEEYVDDQGTRRVRVR-DQNVIRWRRTVKEDGSSG 157
Query: 251 YESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGK-GILQSQGRILKKMRF 309
ESNARFVRWSDGSLQL IG+EVLD+ D + D LFLRH + L+S G+I KK+ F
Sbjct: 158 IESNARFVRWSDGSLQLMIGDEVLDVAELDIEKDNNFLFLRHMQAAYLESMGQIQKKLAF 217
Query: 310 IPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAESQNIRANVLLNRK 369
P+SL+S H+ L V+ RH KV +V++ T DP REK E+ER E IRA L R+
Sbjct: 218 RPASLNSRFHQRLREAVERRHTKVNRVRSVATVEDPAREKAERERREEARIRATEQLERR 277
Query: 370 REKINRKYT 378
+ K R+YT
Sbjct: 278 QNKEMRRYT 286
>gi|255083715|ref|XP_002508432.1| PAF1 complex protein [Micromonas sp. RCC299]
gi|226523709|gb|ACO69690.1| PAF1 complex protein [Micromonas sp. RCC299]
Length = 401
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 75/178 (42%), Positives = 109/178 (61%), Gaps = 6/178 (3%)
Query: 172 VGPPLELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYV-EEDTFVTDESGAKKRI 230
GPPLE+ + L P T +++++N++G++ +PFDP+TY +EDTFV +E K RI
Sbjct: 96 FGPPLEINVNLYDPLPPTTTAKLVRLTNVVGVETEPFDPETYQPQEDTFVDNE--GKTRI 153
Query: 231 RLE-NNIVRWRTVKN-KDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHL 288
NI+RWR V + + G +SNAR V+WSDGS QL IG+E ++T D +L
Sbjct: 154 STPFTNIIRWRKVIDPETGEEKVQSNARMVKWSDGSWQLLIGDEAFNVTETDLGDRNEYL 213
Query: 289 FLRH-GKGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDP 345
F RH G G++Q Q RI K+ F P++L S +HR LT +D+RH K + + I DP
Sbjct: 214 FARHPGAGLIQGQKRISSKLAFTPANLDSATHRRLTRAIDTRHVKTKQTQKFIAVTDP 271
>gi|221119154|ref|XP_002158694.1| PREDICTED: RNA polymerase-associated protein LEO1-like [Hydra
magnipapillata]
Length = 436
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 81/231 (35%), Positives = 128/231 (55%), Gaps = 16/231 (6%)
Query: 173 GPPLELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTY---VEEDTFVTDESGAKKR 229
G + +E+P R + ++ IK+ N + +D KPFDP Y +++D + +E A+ +
Sbjct: 133 GLSIAIELP-RCSMSLGKELTFIKLPNFLSVDTKPFDPNFYEDEIDDDEVLDEEGRARLK 191
Query: 230 IRLENNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLF 289
+R+EN I RWR K+KDG ESN R VRWSDGSL + +GNE+ D+ +Q HLF
Sbjct: 192 LRVENTI-RWRYAKDKDGNEVKESNCRMVRWSDGSLSMHLGNEIFDVFKMPVTGEQNHLF 250
Query: 290 LRHGKGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPERE 348
++ G G LQ+Q KK+ F P S +S +H+ +T + R K K+K T++DPE +
Sbjct: 251 VQQGTG-LQAQAVFKKKLIFRPHSTASQTHKKMTMSIADRMSKAQKIKMMPATELDPESQ 309
Query: 349 KEEKERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLEDALEED 399
++ + E + +R+ +RE R+ R R S G + LE+D
Sbjct: 310 RKTMLKKEDERLRSAT----RRESAQRRI-----RERSHSKGLTSNYLEQD 351
>gi|242018542|ref|XP_002429733.1| RNA polymerase-associated protein LEO1, putative [Pediculus humanus
corporis]
gi|212514745|gb|EEB16995.1| RNA polymerase-associated protein LEO1, putative [Pediculus humanus
corporis]
Length = 748
Score = 135 bits (339), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 112/364 (30%), Positives = 187/364 (51%), Gaps = 52/364 (14%)
Query: 49 ATSRRREVI--ESGSERSEENHYPDHEDEEVDQARSPRSPGGEEKDQTHISHSAAEIRDV 106
++R+R VI +SGS+ ++N P+++ E+ GE+ ++ + + A +
Sbjct: 324 TSTRKRAVISDDSGSDTGKKNADPENDRED-----------GEDSEKDLVIDAEA----L 368
Query: 107 FGD-----SDEEDVGEYAIRNDIDQDSNRSPMEE---EGSYEKS----IRPEDIVPDEDM 154
FGD SDEE + ++ ++ + N+S EE E EKS I + DED
Sbjct: 369 FGDASDISSDEEGRKKKGLKKHVESEDNKSQGEESDAESQKEKSDDEKIDMDGDEKDEDK 428
Query: 155 HYESEEEHVEPKHKEKPVGPPLELEIPLRPPPADPTK-MNMIKVSNIMGIDPKPFDPKTY 213
EE H E + +++EIP D + ++ +K+ N + ++ +PF+P TY
Sbjct: 429 ENIPEEPHPETR---------IDVEIP--KITTDLGRDLHFVKLPNFLSVETRPFEPSTY 477
Query: 214 ---VEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIG 270
++E+ + +E A+ ++++EN I RW+ V ++G +SNARFV+WSDGS+ L +G
Sbjct: 478 EDEIDEEETLDEEGRARLKLKVENTI-RWKEVTKQNGEFEKQSNARFVKWSDGSMSLHLG 536
Query: 271 NEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRH 330
+E+ D+ Q Q D HLF+R G G LQ Q K+ F P S S +HR +T + R
Sbjct: 537 SEIFDVYKQPLQGDHNHLFIRQGTG-LQGQAVFRTKLTFRPHSTESFTHRKMTMSLADRS 595
Query: 331 KKVYKVKNC-ITDIDPEREKEEKERAESQNIRANVLLNRKREKINRKYTQTVERRRQLST 389
+K +K + DPE + E + E + ++A+V K+ +I K R LS
Sbjct: 596 QKTAGIKVLSLVGKDPEAHRGELIKKEEERLKASVRRESKQRRIREKSAH-----RGLSA 650
Query: 390 GYLE 393
GYLE
Sbjct: 651 GYLE 654
>gi|348572288|ref|XP_003471925.1| PREDICTED: RNA polymerase-associated protein LEO1-like [Cavia
porcellus]
Length = 658
Score = 132 bits (333), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 73/205 (35%), Positives = 115/205 (56%), Gaps = 9/205 (4%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE-NNIVRWRTVKNKDGMT 249
+ +K+ N + ++P+PFDP+ Y +E + R++L+ N +RWRT +++DG
Sbjct: 365 LYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKVENTIRWRTRRDEDGTE 424
Query: 250 SYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRF 309
+ ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G G LQ Q K+ F
Sbjct: 425 TKESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQGTG-LQGQAVFKTKLTF 483
Query: 310 IPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEKERAESQNIRANVLLNR 368
P S S +HR +T + R K K++ + DPE ++ E + E + +RA++
Sbjct: 484 RPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTEMIKKEEERLRASI---- 539
Query: 369 KREKINRKYTQTVERRRQLSTGYLE 393
+RE R+ + + +R LS YLE
Sbjct: 540 RRESQQRRLREK-QHQRGLSASYLE 563
>gi|161611649|gb|AAI55870.1| leo1 protein [Xenopus (Silurana) tropicalis]
Length = 711
Score = 132 bits (331), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 79/239 (33%), Positives = 128/239 (53%), Gaps = 10/239 (4%)
Query: 159 EEEHVEPKHKEKPVGPPLELEIPLRPPPAD-PTKMNMIKVSNIMGIDPKPFDPKTYVEED 217
+E+ ++ H E+ P +E+ + D + +K+ N + ++P+PFDP+ Y +E
Sbjct: 375 DEDGMDQDHPEEEAVPETRIEVEIPKVNTDLGNDLYFVKLPNFLSVEPRPFDPQYYEDEF 434
Query: 218 TFVTDESG-AKKRIRLE-NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLD 275
+ R++L+ N +RWRT K+++G ESNAR V+WSDGS+ L +GNEV D
Sbjct: 435 EDEEMLDEEGRTRLKLKVENTIRWRTRKDEEGSDVRESNARIVKWSDGSMSLHLGNEVFD 494
Query: 276 ITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYK 335
+ Q D HLF+R G G LQ Q K+ F P S S +HR +T + R K K
Sbjct: 495 VYKAPLQGDHNHLFIRQGTG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQK 553
Query: 336 VKNC-ITDIDPEREKEEKERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE 393
++ + DPE ++ E + E + +RA++ +RE R+ + + +R LS YLE
Sbjct: 554 IRILPMAGRDPESQRSEMIKKEEERLRASI----RRESQQRRMREK-QHQRGLSANYLE 607
>gi|340375700|ref|XP_003386372.1| PREDICTED: RNA polymerase-associated protein LEO1-like [Amphimedon
queenslandica]
Length = 423
Score = 131 bits (329), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 89/269 (33%), Positives = 139/269 (51%), Gaps = 29/269 (10%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLE-- 233
+E+EIP R T ++ +K+ N + I+ +PFDP Y +E D R RL+
Sbjct: 166 IEVEIP-RVSADLGTDLHFVKLPNFLSIETRPFDPVLYEDEGDDDGDVLDEDGRARLKLK 224
Query: 234 -NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRH 292
N +RWR N++G ESN+R V+WSDGS+ L +G+E+ DI Q Q + +HLF+R
Sbjct: 225 VENTIRWRKTVNENGEEVKESNSRIVQWSDGSMSLYLGSEIFDIHCQPLQREYSHLFVRQ 284
Query: 293 GKGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEE 351
G G LQ Q K+ F P S S +HR +T + R + K++ + + DPE + E
Sbjct: 285 GTG-LQGQAVFKNKISFRPHSTDSQTHRKMTLSIADRFSRAQKIRVLPVVEKDPESRRSE 343
Query: 352 KERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE-DALEEDDETDYHDSRRS 410
+ E + +R +V L +++K+ +K+ RR L+ YLE D L +DE
Sbjct: 344 MIKDEEKKLRESVKLRSQQQKVKKKFP-----RRGLTASYLEPDILMGEDE--------- 389
Query: 411 RRRFEEELEA-----EVRAEKRIINAKKP 434
F+E L A + R ++ + KKP
Sbjct: 390 ---FQESLSAIKSSVKSRMKQAVTRGKKP 415
>gi|301613789|ref|XP_002936391.1| PREDICTED: RNA polymerase-associated protein LEO1 [Xenopus
(Silurana) tropicalis]
Length = 797
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 79/239 (33%), Positives = 128/239 (53%), Gaps = 10/239 (4%)
Query: 159 EEEHVEPKHKEKPVGPPLELEIPLRPPPAD-PTKMNMIKVSNIMGIDPKPFDPKTYVEED 217
+E+ ++ H E+ P +E+ + D + +K+ N + ++P+PFDP+ Y +E
Sbjct: 461 DEDGMDQDHPEEEAVPETRIEVEIPKVNTDLGNDLYFVKLPNFLSVEPRPFDPQYYEDEF 520
Query: 218 TFVTDESG-AKKRIRLE-NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLD 275
+ R++L+ N +RWRT K+++G ESNAR V+WSDGS+ L +GNEV D
Sbjct: 521 EDEEMLDEEGRTRLKLKVENTIRWRTRKDEEGSDVRESNARIVKWSDGSMSLHLGNEVFD 580
Query: 276 ITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYK 335
+ Q D HLF+R G G LQ Q K+ F P S S +HR +T + R K K
Sbjct: 581 VYKAPLQGDHNHLFIRQGTG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQK 639
Query: 336 VKNC-ITDIDPEREKEEKERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE 393
++ + DPE ++ E + E + +RA++ +RE R+ + + +R LS YLE
Sbjct: 640 IRILPMAGRDPESQRSEMIKKEEERLRASI----RRESQQRRMREK-QHQRGLSANYLE 693
>gi|281205997|gb|EFA80186.1| RNA polymerase II complex component [Polysphondylium pallidum
PN500]
Length = 479
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 81/240 (33%), Positives = 132/240 (55%), Gaps = 16/240 (6%)
Query: 172 VGPPLELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIR 231
+GPP+ L + P +K+ +KV NI+GI P+PF KT+ E DE +++
Sbjct: 152 IGPPINLVHINKDPLPRRSKLMKLKVLNILGIQPQPFSIKTFNE------DEPVERRKSN 205
Query: 232 LENNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLR 291
+E+ ++RWR + G+ ESN RFVRW DGS L IGNEVL++ + D +++
Sbjct: 206 VES-VIRWRWGVGEGGVEMKESNTRFVRWEDGSSHLFIGNEVLEVKEHELGGDHIYIYSS 264
Query: 292 HGKGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEE 351
G L+ +G++ ++ P+ ++S HR L SR KKV ++K T IDPE++KE
Sbjct: 265 Q-DGFLECEGKLDSRLNIRPTDITSKVHRRLAESATSRSKKVGRIKTIQTTIDPEKDKEA 323
Query: 352 KERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLEDALEEDDETDYHDSRRSR 411
KER E++ IR+ R +E + + + + L + YL + E+D+++ Y S R
Sbjct: 324 KERFEAEIIRS----RRLKENKVKSHAEKI----GLDSKYLTEGQEQDEDSGYGYSNEDR 375
>gi|291190148|ref|NP_001167340.1| RNA polymerase-associated protein LEO1 [Salmo salar]
gi|223649308|gb|ACN11412.1| RNA polymerase-associated protein LEO1 [Salmo salar]
Length = 756
Score = 130 bits (328), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 77/216 (35%), Positives = 121/216 (56%), Gaps = 10/216 (4%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE-NNIVRWRTVKNKDGMT 249
+ +K+ N + ++P+PFD + Y +E + R++L+ N +RWR K+++G
Sbjct: 455 LYFVKLPNFLSVEPRPFDSQYYEDEFEDEEMLDEEGRTRLKLKVENTIRWRAKKDEEGNE 514
Query: 250 SYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRF 309
S ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G G LQ Q K+ F
Sbjct: 515 SRESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQGTG-LQGQAVFKTKLTF 573
Query: 310 IPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEKERAESQNIRANVLLNR 368
P S S +HR +T + R K K++ + DPE ++ E + E + +RA++
Sbjct: 574 RPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPESQRNEMIKKEEERLRASI---- 629
Query: 369 KREKINRKYTQTVERRRQLSTGYLE-DALEEDDETD 403
+RE R+ + + +R LS GYLE D ++D+E D
Sbjct: 630 RREGQQRRMREK-QHQRGLSAGYLEPDRYDDDEEGD 664
>gi|363737709|ref|XP_001232678.2| PREDICTED: RNA polymerase-associated protein LEO1-like [Gallus
gallus]
Length = 657
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 81/232 (34%), Positives = 127/232 (54%), Gaps = 11/232 (4%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE- 233
+E+EIP + + +K+ N + ++P+PFDP+ Y +E + R++L+
Sbjct: 349 IEVEIP-KVNTDLGNDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKV 407
Query: 234 NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG 293
N +RWR ++++G ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G
Sbjct: 408 ENTIRWRMRRDEEGNEIRESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQG 467
Query: 294 KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEK 352
G LQ Q K+ F P S S +HR +T + R K K++ + DPE ++ E
Sbjct: 468 TG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPESQRTEM 526
Query: 353 ERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE-DALEEDDETD 403
+ E + +RA++ +RE R+ + + +R LS YLE D EE+DE D
Sbjct: 527 IKKEEERLRASI----RRESQQRRMREK-QHQRGLSANYLEPDRYEEEDEGD 573
>gi|326926603|ref|XP_003209488.1| PREDICTED: RNA polymerase-associated protein LEO1-like [Meleagris
gallopavo]
Length = 657
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 81/232 (34%), Positives = 127/232 (54%), Gaps = 11/232 (4%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE- 233
+E+EIP + + +K+ N + ++P+PFDP+ Y +E + R++L+
Sbjct: 349 IEVEIP-KVNTDLGNDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKV 407
Query: 234 NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG 293
N +RWR ++++G ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G
Sbjct: 408 ENTIRWRMRRDEEGNEIRESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQG 467
Query: 294 KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEK 352
G LQ Q K+ F P S S +HR +T + R K K++ + DPE ++ E
Sbjct: 468 TG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPESQRTEM 526
Query: 353 ERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE-DALEEDDETD 403
+ E + +RA++ +RE R+ + + +R LS YLE D EE+DE D
Sbjct: 527 IKKEEERLRASI----RRESQQRRMREK-QHQRGLSANYLEPDRYEEEDEGD 573
>gi|46309501|ref|NP_996950.1| RNA polymerase-associated protein LEO1 [Danio rerio]
gi|82185999|sp|Q6NYV9.1|LEO1_DANRE RecName: Full=RNA polymerase-associated protein LEO1
gi|42542974|gb|AAH66443.1| Leo1, Paf1/RNA polymerase II complex component, homolog (S.
cerevisiae) [Danio rerio]
Length = 696
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 79/230 (34%), Positives = 129/230 (56%), Gaps = 11/230 (4%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE- 233
+E+EIP + + + +K+ N + ++P+PFDP+ Y +E + R++L+
Sbjct: 336 IEVEIP-KVSTDLGSDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKV 394
Query: 234 NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG 293
N +RWR+ ++++G ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G
Sbjct: 395 ENTIRWRSRRDEEGNEVKESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQG 454
Query: 294 KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEK 352
G LQ Q K+ F P S S +HR +T + R K K++ + DPE ++ E
Sbjct: 455 TG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPESQRNEM 513
Query: 353 ERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE-DALEEDDE 401
+ E + +RA++ +RE R+ + + +R L+ GYLE D +ED+E
Sbjct: 514 IKKEEERLRASI----RRESQQRRMREK-QHQRGLNAGYLEPDRYDEDEE 558
>gi|224062463|ref|XP_002198179.1| PREDICTED: RNA polymerase-associated protein LEO1-like [Taeniopygia
guttata]
Length = 656
Score = 129 bits (324), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 80/232 (34%), Positives = 127/232 (54%), Gaps = 11/232 (4%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE- 233
+E+EIP + + +K+ N + ++P+PFDP+ Y +E + R++L+
Sbjct: 348 IEVEIP-KVNTDLGNDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKV 406
Query: 234 NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG 293
N +RWR ++++G ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G
Sbjct: 407 ENTIRWRMRRDEEGNEIRESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQG 466
Query: 294 KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEK 352
G LQ Q K+ F P S S +HR +T + R K K++ + DPE ++ E
Sbjct: 467 TG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPESQRTEM 525
Query: 353 ERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE-DALEEDDETD 403
+ E + +RA++ +RE R+ + + +R LS YLE D +E+DE D
Sbjct: 526 IKKEEERLRASI----RRESQQRRMREK-QHQRGLSASYLEPDRYDEEDEGD 572
>gi|353230140|emb|CCD76311.1| senescence downregulated leo1-like [Schistosoma mansoni]
Length = 471
Score = 129 bits (324), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 82/226 (36%), Positives = 124/226 (54%), Gaps = 13/226 (5%)
Query: 156 YESEEEHVEPKHKEKPVGPPLELEIPLRPPPADPTK-MNMIKVSNIMGIDPKPFDPKTY- 213
+ S E VE +P + ++ PL AD + ++++K+ N + ++ +PFDP Y
Sbjct: 167 HRSNAEDVEDDSAHEPDETRISVDFPL--IRADLGREVHLVKMPNFLSVETRPFDPNFYE 224
Query: 214 --VEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGN 271
++ED + +E + ++++EN I RWR K DG +ESNAR VRWSDGSL L +G+
Sbjct: 225 DELDEDEILDEEGRTRLKLKVENTI-RWRIGKTPDGEMIHESNARIVRWSDGSLSLHLGD 283
Query: 272 EVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHK 331
E+ DI D Q D +LF+R G G LQ Q + K+ F P S S +HR +T + +
Sbjct: 284 EIFDIHKVDIQSDYNYLFIREGSG-LQGQSSLKTKLTFRPHSTDSFTHRKITLSLADKTN 342
Query: 332 KVYKVKNC-ITDIDPEREKEEKERAESQNIRANV----LLNRKREK 372
K+ KVK +T DPE + R E + +RA + L R REK
Sbjct: 343 KLQKVKILPVTGADPESNRNVLVRKEEEKLRATLRRESQLRRMREK 388
>gi|147900849|ref|NP_001085280.1| RNA polymerase-associated protein LEO1 [Xenopus laevis]
gi|82177886|sp|Q52KV5.1|LEO1_XENLA RecName: Full=RNA polymerase-associated protein LEO1
gi|62825913|gb|AAH94174.1| LOC443607 protein [Xenopus laevis]
Length = 703
Score = 129 bits (324), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 76/221 (34%), Positives = 120/221 (54%), Gaps = 10/221 (4%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE- 233
+E+EIP + +K+ N + ++P+PFDP+ Y +E + R++L+
Sbjct: 389 IEVEIPKVNTDL-GNDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKV 447
Query: 234 NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG 293
N +RWRT K+++G +SNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G
Sbjct: 448 ENTIRWRTRKDEEGNDVRDSNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQG 507
Query: 294 KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEK 352
G LQ Q K+ F P S S +HR +T + R K K++ + DPE ++ E
Sbjct: 508 TG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPESQRSEM 566
Query: 353 ERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE 393
+ E + +RA++ +RE R+ + + +R LS YLE
Sbjct: 567 IKKEEERLRASI----RRESQQRRMREK-QHQRGLSANYLE 602
>gi|256088040|ref|XP_002580168.1| senescence downregulated leo1-like [Schistosoma mansoni]
Length = 468
Score = 129 bits (324), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 82/226 (36%), Positives = 124/226 (54%), Gaps = 13/226 (5%)
Query: 156 YESEEEHVEPKHKEKPVGPPLELEIPLRPPPADPTK-MNMIKVSNIMGIDPKPFDPKTY- 213
+ S E VE +P + ++ PL AD + ++++K+ N + ++ +PFDP Y
Sbjct: 164 HRSNAEDVEDDSAHEPDETRISVDFPL--IRADLGREVHLVKMPNFLSVETRPFDPNFYE 221
Query: 214 --VEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGN 271
++ED + +E + ++++EN I RWR K DG +ESNAR VRWSDGSL L +G+
Sbjct: 222 DELDEDEILDEEGRTRLKLKVENTI-RWRIGKTPDGEMIHESNARIVRWSDGSLSLHLGD 280
Query: 272 EVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHK 331
E+ DI D Q D +LF+R G G LQ Q + K+ F P S S +HR +T + +
Sbjct: 281 EIFDIHKVDIQSDYNYLFIREGSG-LQGQSSLKTKLTFRPHSTDSFTHRKITLSLADKTN 339
Query: 332 KVYKVKNC-ITDIDPEREKEEKERAESQNIRANV----LLNRKREK 372
K+ KVK +T DPE + R E + +RA + L R REK
Sbjct: 340 KLQKVKILPVTGADPESNRNVLVRKEEEKLRATLRRESQLRRMREK 385
>gi|443719062|gb|ELU09383.1| hypothetical protein CAPTEDRAFT_221877 [Capitella teleta]
Length = 631
Score = 129 bits (324), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 85/238 (35%), Positives = 131/238 (55%), Gaps = 23/238 (9%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTY---VEEDTFVTDESGAKKRIRL 232
+E+EIP + A + +K+ N + ++ +PFDP+ Y ++ED + +E + ++++
Sbjct: 301 IEVEIP-KINTALGKNLYFVKLPNFLSVETRPFDPQVYEDEIDEDEVMDEEGRTRLKLKV 359
Query: 233 ENNIVRWRTVKNKDGMTSY--------ESNARFVRWSDGSLQLQIGNEVLDITVQDAQHD 284
EN I RWRT+K+ DG ESNAR+VRWSDGS+ + +GNE+ D+ Q D
Sbjct: 360 ENTI-RWRTMKDDDGNDVLDEFGEPVRESNARYVRWSDGSMSMHLGNEIFDVHPMTLQGD 418
Query: 285 QAHLFLRHGKGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDI 343
HLF+R G G LQ Q K+ F P S S +HR +T + R K KVK I
Sbjct: 419 FNHLFIRQGTG-LQGQAVFKTKLTFRPHSTDSFTHRKMTMSMVDRSSKTQKVKVLPIAGK 477
Query: 344 DPEREKEEKERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLEDALEEDDE 401
DPE ++ E + E + +RA++ +RE R+ + +R L+ YLED +DDE
Sbjct: 478 DPESQRSEMIKKEEERLRASI----RRESQQRRIRERAH-QRGLNPNYLED---DDDE 527
>gi|449272115|gb|EMC82203.1| RNA polymerase-associated protein LEO1 [Columba livia]
Length = 656
Score = 129 bits (323), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 80/232 (34%), Positives = 127/232 (54%), Gaps = 11/232 (4%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE- 233
+E+EIP + + +K+ N + ++P+PFDP+ Y +E + R++L+
Sbjct: 348 IEVEIP-KVNTDLGNDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKV 406
Query: 234 NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG 293
N +RWR ++++G ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G
Sbjct: 407 ENTIRWRMRRDEEGNEIRESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQG 466
Query: 294 KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEK 352
G LQ Q K+ F P S S +HR +T + R K K++ + DPE ++ E
Sbjct: 467 TG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPESQRTEM 525
Query: 353 ERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE-DALEEDDETD 403
+ E + +RA++ +RE R+ + + +R LS YLE D +E+DE D
Sbjct: 526 IKKEEERLRASI----RRESQQRRMREK-QHQRGLSANYLEPDRYDEEDEGD 572
>gi|226479790|emb|CAX73191.1| RNA polymerase-associated protein LEO1 [Schistosoma japonicum]
Length = 448
Score = 129 bits (323), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 73/189 (38%), Positives = 109/189 (57%), Gaps = 10/189 (5%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTY---VEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGM 248
++++K+ N + ++ +PFDP Y ++ED + +E + ++++EN I RWR K DG
Sbjct: 180 VHLVKMPNFLSVETRPFDPNFYEDELDEDEILDEEGRTRLKLKVENTI-RWRIGKTPDGE 238
Query: 249 TSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMR 308
+ESNAR VRWSDGSL L +G+E+ DI D Q D +LF+R G G LQ Q + K+
Sbjct: 239 MVHESNARIVRWSDGSLSLHLGDEIFDIHKVDIQSDYNYLFIREGSG-LQGQSSLKTKLT 297
Query: 309 FIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEKERAESQNIRANV--- 364
F P S S +HR +T + + K+ KVK +T DPE + R E + +RA +
Sbjct: 298 FRPHSTDSFTHRKITLSLADKTNKLQKVKILPVTGADPESNRNMLVRKEEEKLRATLRRE 357
Query: 365 -LLNRKREK 372
L R REK
Sbjct: 358 SQLRRMREK 366
>gi|348541543|ref|XP_003458246.1| PREDICTED: RNA polymerase-associated protein LEO1-like [Oreochromis
niloticus]
Length = 630
Score = 128 bits (321), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 120/221 (54%), Gaps = 10/221 (4%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE- 233
+E+EIP + + + +K+ N + ++P+PFDP+ Y +E + R++L+
Sbjct: 317 IEVEIP-KVSTDLGSDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKV 375
Query: 234 NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG 293
N +RWR ++++G + ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G
Sbjct: 376 ENTIRWRAKRDEEGNETRESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQG 435
Query: 294 KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEK 352
G LQ Q K+ F P S S +HR +T + R K K++ + DPE + E
Sbjct: 436 TG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPESHRNEM 494
Query: 353 ERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE 393
+ E + +RA++ ++ +I K Q R LS+ YLE
Sbjct: 495 IKKEEERLRASIRRESQQRRIREKQHQ-----RGLSSSYLE 530
>gi|410961223|ref|XP_003987183.1| PREDICTED: RNA polymerase-associated protein LEO1 [Felis catus]
Length = 665
Score = 128 bits (321), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 76/221 (34%), Positives = 121/221 (54%), Gaps = 10/221 (4%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE- 233
+E+EIP + + +K+ N + ++P+PFDP+ Y +E + R++L+
Sbjct: 357 IEVEIP-KVNTDLGNDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKV 415
Query: 234 NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG 293
N +RWRT ++++G ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G
Sbjct: 416 ENTIRWRTRRDEEGNEIKESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQG 475
Query: 294 KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEK 352
G LQ Q K+ F P S S +HR +T + R K K++ + DPE ++ E
Sbjct: 476 TG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTEM 534
Query: 353 ERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE 393
+ E + +RA++ +RE R+ + + +R LS YLE
Sbjct: 535 IKKEEERLRASI----RRESQQRRMREK-QHQRGLSASYLE 570
>gi|351715227|gb|EHB18146.1| RNA polymerase-associated protein LEO1 [Heterocephalus glaber]
Length = 665
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 72/205 (35%), Positives = 114/205 (55%), Gaps = 9/205 (4%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE-NNIVRWRTVKNKDGMT 249
+ +K+ N + ++P+PFDP+ Y +E + R++L+ N +RWRT ++++G
Sbjct: 373 LYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKVENTIRWRTRRDEEGNE 432
Query: 250 SYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRF 309
ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G G LQ Q K+ F
Sbjct: 433 IKESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQGTG-LQGQAVFKTKLTF 491
Query: 310 IPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEKERAESQNIRANVLLNR 368
P S S +HR +T + R K K++ + DPE ++ E + E + +RA++
Sbjct: 492 RPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTEMIKKEEERLRASI---- 547
Query: 369 KREKINRKYTQTVERRRQLSTGYLE 393
+RE R+ + + +R LS YLE
Sbjct: 548 RRESQQRRMREK-QHQRGLSASYLE 571
>gi|417403742|gb|JAA48668.1| Putative rna polymerase-associated protein leo1 [Desmodus rotundus]
Length = 667
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 72/205 (35%), Positives = 114/205 (55%), Gaps = 9/205 (4%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE-NNIVRWRTVKNKDGMT 249
+ +K+ N + ++P+PFDP+ Y +E + R++L+ N +RWRT ++++G
Sbjct: 374 LYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKVENTIRWRTRRDEEGNE 433
Query: 250 SYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRF 309
ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G G LQ Q K+ F
Sbjct: 434 IKESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQGTG-LQGQAVFKTKLTF 492
Query: 310 IPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEKERAESQNIRANVLLNR 368
P S S +HR +T + R K K++ + DPE ++ E + E + +RA++
Sbjct: 493 RPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTEMIKKEEERLRASI---- 548
Query: 369 KREKINRKYTQTVERRRQLSTGYLE 393
+RE R+ + + +R LS YLE
Sbjct: 549 RRESQQRRMREK-QHQRGLSASYLE 572
>gi|327282415|ref|XP_003225938.1| PREDICTED: RNA polymerase-associated protein LEO1-like [Anolis
carolinensis]
Length = 679
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 78/234 (33%), Positives = 126/234 (53%), Gaps = 11/234 (4%)
Query: 164 EPKHKEKPVGPPLELEIPLRPPPAD-PTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTD 222
E + +E+PV P +E+ + D + +K+ N + ++P+PFDP+ Y +E
Sbjct: 357 EDQQEEEPV-PETRIEVEIPKVNTDLGNDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEM 415
Query: 223 ESG-AKKRIRLE-NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQD 280
+ R++L+ N +RWR ++++G ESNAR V+WSDGS+ L +GNEV D+
Sbjct: 416 LDEEGRTRLKLKVENTIRWRIRRDEEGNEIRESNARIVKWSDGSMSLHLGNEVFDVYKAP 475
Query: 281 AQHDQAHLFLRHGKGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC- 339
Q D HLF+R G G LQ Q K+ F P S S +HR +T + R K K++
Sbjct: 476 LQGDHNHLFIRQGTG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILP 534
Query: 340 ITDIDPEREKEEKERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE 393
+ DPE ++ E + E + +RA++ +RE R+ + + +R LS YLE
Sbjct: 535 MAGRDPESQRTEMIKKEEERLRASI----RRESQQRRMREK-QHQRGLSASYLE 583
>gi|410912441|ref|XP_003969698.1| PREDICTED: RNA polymerase-associated protein LEO1-like [Takifugu
rubripes]
Length = 643
Score = 127 bits (319), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 122/221 (55%), Gaps = 10/221 (4%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE- 233
+E+EIP + + + +K+ N + ++P+PFDP+ Y +E + R++L+
Sbjct: 330 IEVEIP-KVSTDLGSDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKV 388
Query: 234 NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG 293
N +RWR ++++G + ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G
Sbjct: 389 ENTIRWRAKRDEEGNEARESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQG 448
Query: 294 KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEK 352
G LQ Q K+ F P S S +HR +T + R K K++ + DPE + E
Sbjct: 449 TG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPESHRNEM 507
Query: 353 ERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE 393
+ E + +RA++ +RE R+ + + +R LS+ YLE
Sbjct: 508 IKKEEERLRASI----RRESQQRRMREK-QHQRGLSSSYLE 543
>gi|54035548|gb|AAH83358.1| Leo1, Paf1/RNA polymerase II complex component, homolog (S.
cerevisiae) [Mus musculus]
Length = 667
Score = 126 bits (317), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 121/221 (54%), Gaps = 10/221 (4%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE- 233
+E+EIP + + +K+ N + ++P+PFDP+ Y +E + R++L+
Sbjct: 359 IEVEIP-KVNTDLGNDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKV 417
Query: 234 NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG 293
N +RWR ++++G ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G
Sbjct: 418 ENTIRWRIRRDEEGNEIKESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQG 477
Query: 294 KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEK 352
G LQ Q K+ F P S S++HR +T + R K K++ + DPE ++ E
Sbjct: 478 TG-LQGQAVFKTKLTFRPHSTDSDTHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTEM 536
Query: 353 ERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE 393
+ E + +RA++ +RE R+ + + +R LS YLE
Sbjct: 537 IKKEEERLRASI----RRESQQRRMREK-QHQRGLSASYLE 572
>gi|241623021|ref|XP_002407483.1| RNA polymerase-associated protein LEO1, putative [Ixodes
scapularis]
gi|215500992|gb|EEC10486.1| RNA polymerase-associated protein LEO1, putative [Ixodes
scapularis]
Length = 699
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 76/223 (34%), Positives = 124/223 (55%), Gaps = 13/223 (5%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTY---VEEDTFVTDESGAKKRIRL 232
+++EIP R ++++ +K+ N + +D +P+DP+ Y V+ED + +E A+ ++++
Sbjct: 349 IDVEIP-RIVTNLGSEVHFVKLPNFLSVDTRPYDPQWYEDEVDEDEVLDEEGRARLKLKV 407
Query: 233 ENNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQD-AQHDQAHLFLR 291
EN VRWR + +K G +SNAR ++WSDGS+ L +G+E+ D+ Q Q D HLF+R
Sbjct: 408 ENT-VRWRYIYDKLGNPVRQSNARIIKWSDGSMSLHLGSEIFDVHKQSLMQGDHNHLFIR 466
Query: 292 HGKGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKE 350
G G LQ Q K+ F P S S +HR +T + R +K K++ DPE +
Sbjct: 467 QGTG-LQGQAVFRTKLSFRPHSTDSFTHRKMTLSLAGRSQKTQKIRVLPQVGKDPEAHRS 525
Query: 351 EKERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE 393
E + E +RA++ K+ +I K R LS+ YLE
Sbjct: 526 EMIKREEDRLRASIRRESKQRRIREKA-----HSRGLSSSYLE 563
>gi|156394980|ref|XP_001636890.1| predicted protein [Nematostella vectensis]
gi|156223997|gb|EDO44827.1| predicted protein [Nematostella vectensis]
Length = 464
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 84/236 (35%), Positives = 132/236 (55%), Gaps = 12/236 (5%)
Query: 187 ADPTKMNMIKVSNIMGIDPKPFDPKTY---VEEDTFVTDESGAKKRIRLENNIVRWRTVK 243
A +++ K+ N + ++ KPFDP Y VE+D + +E A+ ++++EN I RWR K
Sbjct: 160 ALGSELYFSKLPNFLSLETKPFDPALYEDDVEDDEILDEEGRARLKLKVENTI-RWRYGK 218
Query: 244 NKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRI 303
D ESNAR VRWSDGSL L +G EV DI + D HLF++ G G LQ Q
Sbjct: 219 YSDDNEIKESNARIVRWSDGSLSLLVGAEVFDIQKTRIEGDFNHLFVQQGTG-LQGQAVF 277
Query: 304 LKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEKERAESQNIRA 362
+K+ F P S +S +HR +T + R K K+K DPE ++ E + E + +R+
Sbjct: 278 KEKLTFRPHSTASQTHRKMTLSIADRALKAQKIKMLPAAGQDPEAQRSEMIKKEEERLRS 337
Query: 363 NVLLNRKREKINRKYTQTVERRRQLSTGYLEDALEEDDETDYHDSRRSRRRFEEEL 418
+ L ++ ++ R+ +Q R L++ YLE A E+DDE + +++++EEL
Sbjct: 338 QLRLENQQRRM-RERSQA----RGLNSSYLE-AGEDDDEDLEQSLLKIKKKYKEEL 387
>gi|291402994|ref|XP_002717801.1| PREDICTED: Leo1, Paf1/RNA polymerase II complex component, homolog
[Oryctolagus cuniculus]
Length = 666
Score = 126 bits (316), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 71/205 (34%), Positives = 113/205 (55%), Gaps = 9/205 (4%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE-NNIVRWRTVKNKDGMT 249
+ +K+ N + ++P+PFDP+ Y +E + R++L+ N +RWR ++++G
Sbjct: 373 LYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKVENTIRWRIRRDEEGNE 432
Query: 250 SYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRF 309
ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G G LQ Q K+ F
Sbjct: 433 IKESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQGTG-LQGQAVFKTKLTF 491
Query: 310 IPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEKERAESQNIRANVLLNR 368
P S S +HR +T + R K K++ + DPE ++ E + E + +RA++
Sbjct: 492 RPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTEMIKKEEERLRASI---- 547
Query: 369 KREKINRKYTQTVERRRQLSTGYLE 393
+RE R+ + + +R LS YLE
Sbjct: 548 RRESQQRRMREK-QHQRGLSASYLE 571
>gi|395503256|ref|XP_003755986.1| PREDICTED: RNA polymerase-associated protein LEO1 [Sarcophilus
harrisii]
Length = 667
Score = 126 bits (316), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 79/230 (34%), Positives = 126/230 (54%), Gaps = 11/230 (4%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE- 233
+E+EIP + + +K+ N + ++P+PFDP+ Y +E + R++L+
Sbjct: 360 IEVEIP-KVNTDLGNDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKV 418
Query: 234 NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG 293
N +RWR ++++G ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G
Sbjct: 419 ENTIRWRMRRDEEGNDIRESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQG 478
Query: 294 KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEK 352
G LQ Q K+ F P S S +HR +T + R K K++ + DPE ++ E
Sbjct: 479 TG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTEM 537
Query: 353 ERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE-DALEEDDE 401
+ E + +RA++ +RE R+ + + +R LS YLE D +E+DE
Sbjct: 538 IKKEEERLRASI----RRESQQRRMREK-QHQRGLSANYLEPDRYDEEDE 582
>gi|426233296|ref|XP_004010653.1| PREDICTED: RNA polymerase-associated protein LEO1 isoform 1 [Ovis
aries]
Length = 668
Score = 126 bits (316), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 120/221 (54%), Gaps = 10/221 (4%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE- 233
+E+EIP + + +K+ N + ++P+PFDP+ Y +E + R++L+
Sbjct: 360 IEVEIP-KVNTDLGNDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKV 418
Query: 234 NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG 293
N +RWR ++++G ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G
Sbjct: 419 ENTIRWRIRRDEEGNEIKESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQG 478
Query: 294 KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEK 352
G LQ Q K+ F P S S +HR +T + R K K++ + DPE ++ E
Sbjct: 479 TG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTEM 537
Query: 353 ERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE 393
+ E + +RA++ +RE R+ + + +R LS YLE
Sbjct: 538 IKKEEERLRASI----RRESQQRRMREK-QHQRGLSASYLE 573
>gi|296214028|ref|XP_002753537.1| PREDICTED: RNA polymerase-associated protein LEO1 isoform 1
[Callithrix jacchus]
Length = 666
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 120/221 (54%), Gaps = 10/221 (4%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE- 233
+E+EIP + + +K+ N + ++P+PFDP+ Y +E + R++L+
Sbjct: 358 IEVEIP-KVNTDLGNDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKV 416
Query: 234 NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG 293
N +RWR ++++G ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G
Sbjct: 417 ENTIRWRIRRDEEGNEIKESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQG 476
Query: 294 KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEK 352
G LQ Q K+ F P S S +HR +T + R K K++ + DPE ++ E
Sbjct: 477 TG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTEM 535
Query: 353 ERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE 393
+ E + +RA++ +RE R+ + + +R LS YLE
Sbjct: 536 IKKEEERLRASI----RRESQQRRMREK-QHQRGLSASYLE 571
>gi|129277507|ref|NP_001076073.1| RNA polymerase-associated protein LEO1 [Bos taurus]
gi|127802055|gb|AAI20400.1| LEO1 protein [Bos taurus]
gi|296483087|tpg|DAA25202.1| TPA: Leo1, Paf1/RNA polymerase II complex component, homolog [Bos
taurus]
Length = 584
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 71/205 (34%), Positives = 113/205 (55%), Gaps = 9/205 (4%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE-NNIVRWRTVKNKDGMT 249
+ +K+ N + ++P+PFDP+ Y +E + R++L+ N +RWR ++++G
Sbjct: 375 LYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKVENTIRWRIRRDEEGNE 434
Query: 250 SYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRF 309
ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G G LQ Q K+ F
Sbjct: 435 IKESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQGTG-LQGQAVFKTKLTF 493
Query: 310 IPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEKERAESQNIRANVLLNR 368
P S S +HR +T + R K K++ + DPE ++ E + E + +RA++
Sbjct: 494 RPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTEMIKKEEERLRASI---- 549
Query: 369 KREKINRKYTQTVERRRQLSTGYLE 393
+RE R+ + + +R LS YLE
Sbjct: 550 RRESQQRRMREK-QHQRGLSASYLE 573
>gi|426233298|ref|XP_004010654.1| PREDICTED: RNA polymerase-associated protein LEO1 isoform 2 [Ovis
aries]
Length = 584
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 71/205 (34%), Positives = 113/205 (55%), Gaps = 9/205 (4%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE-NNIVRWRTVKNKDGMT 249
+ +K+ N + ++P+PFDP+ Y +E + R++L+ N +RWR ++++G
Sbjct: 375 LYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKVENTIRWRIRRDEEGNE 434
Query: 250 SYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRF 309
ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G G LQ Q K+ F
Sbjct: 435 IKESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQGTG-LQGQAVFKTKLTF 493
Query: 310 IPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEKERAESQNIRANVLLNR 368
P S S +HR +T + R K K++ + DPE ++ E + E + +RA++
Sbjct: 494 RPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTEMIKKEEERLRASI---- 549
Query: 369 KREKINRKYTQTVERRRQLSTGYLE 393
+RE R+ + + +R LS YLE
Sbjct: 550 RRESQQRRMREK-QHQRGLSASYLE 573
>gi|74000154|ref|XP_535485.2| PREDICTED: RNA polymerase-associated protein LEO1 isoform 2 [Canis
lupus familiaris]
Length = 665
Score = 125 bits (315), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 120/221 (54%), Gaps = 10/221 (4%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE- 233
+E+EIP + + +K+ N + ++P+PFDP+ Y +E + R++L+
Sbjct: 357 IEVEIP-KVNTDLGNDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKV 415
Query: 234 NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG 293
N +RWR ++++G ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G
Sbjct: 416 ENTIRWRIRRDEEGNEIKESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQG 475
Query: 294 KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEK 352
G LQ Q K+ F P S S +HR +T + R K K++ + DPE ++ E
Sbjct: 476 TG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTEM 534
Query: 353 ERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE 393
+ E + +RA++ +RE R+ + + +R LS YLE
Sbjct: 535 IKKEEERLRASI----RRESQQRRMREK-QHQRGLSASYLE 570
>gi|311244725|ref|XP_003121552.1| PREDICTED: RNA polymerase-associated protein LEO1 [Sus scrofa]
Length = 666
Score = 125 bits (315), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 120/221 (54%), Gaps = 10/221 (4%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE- 233
+E+EIP + + +K+ N + ++P+PFDP+ Y +E + R++L+
Sbjct: 358 IEVEIP-KVNTDLGNDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKV 416
Query: 234 NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG 293
N +RWR ++++G ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G
Sbjct: 417 ENTIRWRIRRDEEGNEIKESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQG 476
Query: 294 KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEK 352
G LQ Q K+ F P S S +HR +T + R K K++ + DPE ++ E
Sbjct: 477 TG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTEM 535
Query: 353 ERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE 393
+ E + +RA++ +RE R+ + + +R LS YLE
Sbjct: 536 IKKEEERLRASI----RRESQQRRMREK-QHQRGLSASYLE 571
>gi|395822198|ref|XP_003784410.1| PREDICTED: RNA polymerase-associated protein LEO1 [Otolemur
garnettii]
Length = 667
Score = 125 bits (315), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 120/221 (54%), Gaps = 10/221 (4%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE- 233
+E+EIP + + +K+ N + ++P+PFDP+ Y +E + R++L+
Sbjct: 359 IEVEIP-KVNTDLGNDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKV 417
Query: 234 NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG 293
N +RWR ++++G ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G
Sbjct: 418 ENTIRWRIRRDEEGNEIKESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQG 477
Query: 294 KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEK 352
G LQ Q K+ F P S S +HR +T + R K K++ + DPE ++ E
Sbjct: 478 TG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTEM 536
Query: 353 ERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE 393
+ E + +RA++ +RE R+ + + +R LS YLE
Sbjct: 537 IKKEEERLRASI----RRESQQRRMREK-QHQRGLSASYLE 572
>gi|440902162|gb|ELR52987.1| RNA polymerase-associated protein LEO1 [Bos grunniens mutus]
Length = 668
Score = 125 bits (315), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 120/221 (54%), Gaps = 10/221 (4%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE- 233
+E+EIP + + +K+ N + ++P+PFDP+ Y +E + R++L+
Sbjct: 360 IEVEIP-KVNTDLGNDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKV 418
Query: 234 NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG 293
N +RWR ++++G ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G
Sbjct: 419 ENTIRWRIRRDEEGNEIKESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQG 478
Query: 294 KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEK 352
G LQ Q K+ F P S S +HR +T + R K K++ + DPE ++ E
Sbjct: 479 TG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTEM 537
Query: 353 ERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE 393
+ E + +RA++ +RE R+ + + +R LS YLE
Sbjct: 538 IKKEEERLRASI----RRESQQRRMREK-QHQRGLSASYLE 573
>gi|402874333|ref|XP_003900995.1| PREDICTED: RNA polymerase-associated protein LEO1 [Papio anubis]
Length = 666
Score = 125 bits (315), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 120/221 (54%), Gaps = 10/221 (4%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE- 233
+E+EIP + + +K+ N + ++P+PFDP+ Y +E + R++L+
Sbjct: 358 IEVEIP-KVNTDLGNDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKV 416
Query: 234 NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG 293
N +RWR ++++G ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G
Sbjct: 417 ENTIRWRIRRDEEGNEIKESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQG 476
Query: 294 KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEK 352
G LQ Q K+ F P S S +HR +T + R K K++ + DPE ++ E
Sbjct: 477 TG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTEM 535
Query: 353 ERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE 393
+ E + +RA++ +RE R+ + + +R LS YLE
Sbjct: 536 IKKEEERLRASI----RRESQQRRMREK-QHQRGLSASYLE 571
>gi|53850624|ref|NP_001005548.1| RNA polymerase-associated protein LEO1 [Rattus norvegicus]
gi|81884075|sp|Q641X2.1|LEO1_RAT RecName: Full=RNA polymerase-associated protein LEO1
gi|51980343|gb|AAH82098.1| Leo1, Paf1/RNA polymerase II complex component, homolog (S.
cerevisiae) [Rattus norvegicus]
Length = 678
Score = 125 bits (315), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 120/221 (54%), Gaps = 10/221 (4%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE- 233
+E+EIP + + +K+ N + ++P+PFDP+ Y +E + R++L+
Sbjct: 370 IEVEIP-KVNTDLGNDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKV 428
Query: 234 NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG 293
N +RWR ++++G ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G
Sbjct: 429 ENTIRWRMRRDEEGNEIKESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQG 488
Query: 294 KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEK 352
G LQ Q K+ F P S S +HR +T + R K K++ + DPE ++ E
Sbjct: 489 TG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTEM 547
Query: 353 ERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE 393
+ E + +RA++ +RE R+ + + +R LS YLE
Sbjct: 548 IKKEEERLRASI----RRESQQRRMREK-QHQRGLSASYLE 583
>gi|380787999|gb|AFE65875.1| RNA polymerase-associated protein LEO1 [Macaca mulatta]
gi|383411713|gb|AFH29070.1| RNA polymerase-associated protein LEO1 [Macaca mulatta]
Length = 666
Score = 125 bits (315), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 120/221 (54%), Gaps = 10/221 (4%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE- 233
+E+EIP + + +K+ N + ++P+PFDP+ Y +E + R++L+
Sbjct: 358 IEVEIP-KVNTDLGNDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKV 416
Query: 234 NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG 293
N +RWR ++++G ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G
Sbjct: 417 ENTIRWRIRRDEEGNEIKESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQG 476
Query: 294 KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEK 352
G LQ Q K+ F P S S +HR +T + R K K++ + DPE ++ E
Sbjct: 477 TG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTEM 535
Query: 353 ERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE 393
+ E + +RA++ +RE R+ + + +R LS YLE
Sbjct: 536 IKKEEERLRASI----RRESQQRRMREK-QHQRGLSASYLE 571
>gi|87299619|ref|NP_001034611.1| RNA polymerase-associated protein LEO1 [Mus musculus]
gi|341940897|sp|Q5XJE5.2|LEO1_MOUSE RecName: Full=RNA polymerase-associated protein LEO1
Length = 667
Score = 125 bits (315), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 119/221 (53%), Gaps = 10/221 (4%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE- 233
+E+EIP + +K+ N + ++P+PFDP+ Y +E + R++L+
Sbjct: 359 IEVEIPKVNTDL-GNDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKV 417
Query: 234 NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG 293
N +RWR ++++G ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G
Sbjct: 418 ENTIRWRIRRDEEGNEIKESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQG 477
Query: 294 KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEK 352
G LQ Q K+ F P S S +HR +T + R K K++ + DPE ++ E
Sbjct: 478 TG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTEM 536
Query: 353 ERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE 393
+ E + +RA++ +RE R+ + + +R LS YLE
Sbjct: 537 IKKEEERLRASI----RRESQQRRMREK-QHQRGLSASYLE 572
>gi|335773231|gb|AEH58323.1| RNA polymerase-associated protein LEO1-like protein, partial [Equus
caballus]
Length = 592
Score = 125 bits (315), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 120/221 (54%), Gaps = 10/221 (4%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE- 233
+E+EIP + + +K+ N + ++P+PFDP+ Y +E + R++L+
Sbjct: 319 IEVEIP-KVNTDLGNDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKV 377
Query: 234 NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG 293
N +RWR ++++G ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G
Sbjct: 378 ENTIRWRIRRDEEGNEIKESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQG 437
Query: 294 KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEK 352
G LQ Q K+ F P S S +HR +T + R K K++ + DPE ++ E
Sbjct: 438 TG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTEM 496
Query: 353 ERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE 393
+ E + +RA++ +RE R+ + + +R LS YLE
Sbjct: 497 IKKEEERLRASI----RRESQQRRMREK-QHQRGLSASYLE 532
>gi|301776933|ref|XP_002923889.1| PREDICTED: RNA polymerase-associated protein LEO1-like, partial
[Ailuropoda melanoleuca]
Length = 631
Score = 125 bits (315), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 120/221 (54%), Gaps = 10/221 (4%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE- 233
+E+EIP + + +K+ N + ++P+PFDP+ Y +E + R++L+
Sbjct: 357 IEVEIP-KVNTDLGNDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKV 415
Query: 234 NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG 293
N +RWR ++++G ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G
Sbjct: 416 ENTIRWRIRRDEEGNEIKESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQG 475
Query: 294 KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEK 352
G LQ Q K+ F P S S +HR +T + R K K++ + DPE ++ E
Sbjct: 476 TG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTEM 534
Query: 353 ERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE 393
+ E + +RA++ +RE R+ + + +R LS YLE
Sbjct: 535 IKKEEERLRASI----RRESQQRRMREK-QHQRGLSASYLE 570
>gi|281345934|gb|EFB21518.1| hypothetical protein PANDA_013112 [Ailuropoda melanoleuca]
Length = 612
Score = 125 bits (315), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 120/221 (54%), Gaps = 10/221 (4%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE- 233
+E+EIP + + +K+ N + ++P+PFDP+ Y +E + R++L+
Sbjct: 338 IEVEIP-KVNTDLGNDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKV 396
Query: 234 NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG 293
N +RWR ++++G ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G
Sbjct: 397 ENTIRWRIRRDEEGNEIKESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQG 456
Query: 294 KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEK 352
G LQ Q K+ F P S S +HR +T + R K K++ + DPE ++ E
Sbjct: 457 TG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTEM 515
Query: 353 ERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE 393
+ E + +RA++ +RE R+ + + +R LS YLE
Sbjct: 516 IKKEEERLRASI----RRESQQRRMREK-QHQRGLSASYLE 551
>gi|194388628|dbj|BAG60282.1| unnamed protein product [Homo sapiens]
Length = 644
Score = 125 bits (315), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 120/221 (54%), Gaps = 10/221 (4%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE- 233
+E+EIP + + +K+ N + ++P+PFDP+ Y +E + R++L+
Sbjct: 336 IEVEIP-KVNTDLGNDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKV 394
Query: 234 NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG 293
N +RWR ++++G ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G
Sbjct: 395 ENTIRWRIRRDEEGNEIKESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQG 454
Query: 294 KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEK 352
G LQ Q K+ F P S S +HR +T + R K K++ + DPE ++ E
Sbjct: 455 TG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTEM 513
Query: 353 ERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE 393
+ E + +RA++ +RE R+ + + +R LS YLE
Sbjct: 514 IKKEEERLRASI----RRESQQRRMREK-QHQRGLSASYLE 549
>gi|20270337|ref|NP_620147.1| RNA polymerase-associated protein LEO1 [Homo sapiens]
gi|350535427|ref|NP_001233372.1| RNA polymerase-associated protein LEO1 [Pan troglodytes]
gi|397515297|ref|XP_003827890.1| PREDICTED: RNA polymerase-associated protein LEO1-like [Pan
paniscus]
gi|74751545|sp|Q8WVC0.1|LEO1_HUMAN RecName: Full=RNA polymerase-associated protein LEO1; AltName:
Full=Replicative senescence down-regulated leo1-like
protein
gi|17390326|gb|AAH18147.1| Leo1, Paf1/RNA polymerase II complex component, homolog (S.
cerevisiae) [Homo sapiens]
gi|31979171|gb|AAP68819.1| replicative senescence downregulated leo1-like protein [Homo
sapiens]
gi|119597839|gb|EAW77433.1| Leo1, Paf1/RNA polymerase II complex component, homolog (S.
cerevisiae), isoform CRA_b [Homo sapiens]
gi|343958524|dbj|BAK63117.1| RNA polymerase-associated protein LEO1 [Pan troglodytes]
gi|410209632|gb|JAA02035.1| Leo1, Paf1/RNA polymerase II complex component, homolog [Pan
troglodytes]
gi|410255092|gb|JAA15513.1| Leo1, Paf1/RNA polymerase II complex component, homolog [Pan
troglodytes]
gi|410333321|gb|JAA35607.1| Leo1, Paf1/RNA polymerase II complex component, homolog [Pan
troglodytes]
Length = 666
Score = 125 bits (315), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 120/221 (54%), Gaps = 10/221 (4%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE- 233
+E+EIP + + +K+ N + ++P+PFDP+ Y +E + R++L+
Sbjct: 358 IEVEIP-KVNTDLGNDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKV 416
Query: 234 NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG 293
N +RWR ++++G ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G
Sbjct: 417 ENTIRWRIRRDEEGNEIKESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQG 476
Query: 294 KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEK 352
G LQ Q K+ F P S S +HR +T + R K K++ + DPE ++ E
Sbjct: 477 TG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTEM 535
Query: 353 ERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE 393
+ E + +RA++ +RE R+ + + +R LS YLE
Sbjct: 536 IKKEEERLRASI----RRESQQRRMREK-QHQRGLSASYLE 571
>gi|355778049|gb|EHH63085.1| Replicative senescence down-regulated leo1-like protein [Macaca
fascicularis]
Length = 694
Score = 125 bits (314), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 120/221 (54%), Gaps = 10/221 (4%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE- 233
+E+EIP + + +K+ N + ++P+PFDP+ Y +E + R++L+
Sbjct: 386 IEVEIP-KVNTDLGNDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKV 444
Query: 234 NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG 293
N +RWR ++++G ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G
Sbjct: 445 ENTIRWRIRRDEEGNEIKESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQG 504
Query: 294 KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEK 352
G LQ Q K+ F P S S +HR +T + R K K++ + DPE ++ E
Sbjct: 505 TG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTEM 563
Query: 353 ERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE 393
+ E + +RA++ +RE R+ + + +R LS YLE
Sbjct: 564 IKKEEERLRASI----RRESQQRRMREK-QHQRGLSASYLE 599
>gi|148694372|gb|EDL26319.1| mCG12252, isoform CRA_b [Mus musculus]
Length = 615
Score = 125 bits (314), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 120/221 (54%), Gaps = 10/221 (4%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE- 233
+E+EIP + + +K+ N + ++P+PFDP+ Y +E + R++L+
Sbjct: 307 IEVEIP-KVNTDLGNDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKV 365
Query: 234 NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG 293
N +RWR ++++G ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G
Sbjct: 366 ENTIRWRIRRDEEGNEIKESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQG 425
Query: 294 KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEK 352
G LQ Q K+ F P S S +HR +T + R K K++ + DPE ++ E
Sbjct: 426 TG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTEM 484
Query: 353 ERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE 393
+ E + +RA++ +RE R+ + + +R LS YLE
Sbjct: 485 IKKEEERLRASI----RRESQQRRMREK-QHQRGLSASYLE 520
>gi|355692723|gb|EHH27326.1| Replicative senescence down-regulated leo1-like protein [Macaca
mulatta]
Length = 666
Score = 125 bits (313), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 74/221 (33%), Positives = 118/221 (53%), Gaps = 10/221 (4%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE- 233
+E+EIP + + +K+ N + ++P+PFDP+ Y +E + R++L+
Sbjct: 358 IEVEIP-KVNTDLGNDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKV 416
Query: 234 NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG 293
N +RWR ++++G ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G
Sbjct: 417 ENTIRWRIRRDEEGNEIKESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQG 476
Query: 294 KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEK 352
G LQ Q K+ F P S S +HR +T + R K K++ + DPE ++ E
Sbjct: 477 TG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTEM 535
Query: 353 ERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE 393
+ E + +RA++ ++ ++ K Q R LS YLE
Sbjct: 536 IKKEEERLRASIRRESQQCRMREKQHQ-----RGLSASYLE 571
>gi|324505370|gb|ADY42310.1| RNA polymerase-associated protein LEO1 [Ascaris suum]
Length = 498
Score = 124 bits (312), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 80/217 (36%), Positives = 122/217 (56%), Gaps = 15/217 (6%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTY---VEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGM 248
+ +K N + I+P+PFD + Y +ED + +E A+ ++R+EN I RWR ++ G
Sbjct: 228 LYFVKFPNFLSIEPRPFDHEHYEDEFDEDEQMDEEGRARLKLRVENTI-RWRYAFDEGGN 286
Query: 249 TSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMR 308
SN++ VRWSDG++ L +G+E D+ VQ QHD HLF+R G G LQ +K+
Sbjct: 287 IVKLSNSKIVRWSDGTMSLFLGSEAFDVQVQPMQHDN-HLFMRQGAG-LQGHAVFKEKLT 344
Query: 309 FIPSSLSSNSHRLLTALVDSRHKKVYKVK--NCITDIDPEREKEEKERAESQNIRANVLL 366
F P S S +HR +T + R K KVK N + +DPE +K E R E + +RA
Sbjct: 345 FRPVSTDSITHRKVTLSMADRSNKSQKVKVLNAVG-VDPESQKAELVRREEERLRAQ--- 400
Query: 367 NRKREKINR-KYTQTVERRRQLSTGYLEDALEEDDET 402
NR+ ++ R K +V R LS +LE+ ++ DE+
Sbjct: 401 NRREAQLRRNKMRPSV--RTGLSANFLEEREDDSDES 435
>gi|354465266|ref|XP_003495101.1| PREDICTED: RNA polymerase-associated protein LEO1 [Cricetulus
griseus]
Length = 757
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 120/221 (54%), Gaps = 10/221 (4%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE- 233
+E+EIP + + +K+ N + ++P+PFDP+ Y +E + R++L+
Sbjct: 359 IEVEIP-KVNTDLGNDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKV 417
Query: 234 NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG 293
N +RWR ++++G ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G
Sbjct: 418 ENTIRWRVRRDEEGNEIKESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQG 477
Query: 294 KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEK 352
G LQ Q K+ F P S S +HR +T + R K K++ + DPE ++ E
Sbjct: 478 TG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTEM 536
Query: 353 ERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE 393
+ E + +RA++ +RE R+ + + +R LS YLE
Sbjct: 537 IKKEEERLRASI----RRESQQRRMREK-QHQRGLSASYLE 572
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 31/78 (39%), Positives = 46/78 (58%), Gaps = 3/78 (3%)
Query: 235 NIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGK 294
N +RWR ++++G ESNAR V+WSDGS+ L +G+EV DI Q + LF+R
Sbjct: 636 NTIRWRICRDEEGSKIKESNARIVKWSDGSMSLHLGSEVFDIYKAPLQDNHNQLFIREDT 695
Query: 295 GILQSQGRILKKMRFIPS 312
G+ +G+ + K R S
Sbjct: 696 GL---RGQAIFKSRLTFS 710
>gi|197097524|ref|NP_001127657.1| RNA polymerase-associated protein LEO1 [Pongo abelii]
gi|75040941|sp|Q5R4D6.1|LEO1_PONAB RecName: Full=RNA polymerase-associated protein LEO1
gi|55733399|emb|CAH93380.1| hypothetical protein [Pongo abelii]
Length = 666
Score = 124 bits (310), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 76/222 (34%), Positives = 122/222 (54%), Gaps = 12/222 (5%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDT---FVTDESGAKKRIRL 232
+E+EIP + + +K+ N + ++P+PFDP+ Y +E + E + ++++
Sbjct: 358 IEVEIP-KVNTDLGNDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDGEGRTRLKLKV 416
Query: 233 ENNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRH 292
EN I RWR ++++G ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R
Sbjct: 417 ENTI-RWRIRRDEEGNEIKESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQ 475
Query: 293 GKGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEE 351
G G LQ Q K+ F P S S +HR +T + R K K++ + DPE ++ E
Sbjct: 476 GTG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTE 534
Query: 352 KERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE 393
+ E + +RA++ +RE R+ + + +R LS YLE
Sbjct: 535 MIKKEEERLRASI----RRESQQRRMREK-QHQRGLSASYLE 571
>gi|431895992|gb|ELK05410.1| RNA polymerase-associated protein LEO1 [Pteropus alecto]
Length = 667
Score = 124 bits (310), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 74/221 (33%), Positives = 119/221 (53%), Gaps = 10/221 (4%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE- 233
+E+EIP + + +K+ N + ++P+PFDP+ Y +E + R++L+
Sbjct: 359 IEVEIP-KVNTDLGNDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKV 417
Query: 234 NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG 293
N +RWR ++++G ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G
Sbjct: 418 ENTIRWRIRRDEEGNEIKESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQG 477
Query: 294 KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEK 352
G LQ Q K+ F P S S +HR +T + R K K++ + DPE ++ E
Sbjct: 478 TG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTEM 536
Query: 353 ERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE 393
+ E + +R ++ +RE R+ + + +R LS YLE
Sbjct: 537 IKKEEERLRVSI----RRESQQRRMREK-QHQRGLSASYLE 572
>gi|344297800|ref|XP_003420584.1| PREDICTED: RNA polymerase-associated protein LEO1-like [Loxodonta
africana]
Length = 810
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 71/205 (34%), Positives = 113/205 (55%), Gaps = 9/205 (4%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE-NNIVRWRTVKNKDGMT 249
+ +K+ N + ++P+PFDP+ Y +E + R++L+ N +RWR ++++G
Sbjct: 517 LYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKVENTIRWRIRRDEEGNE 576
Query: 250 SYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRF 309
ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G G LQ Q K+ F
Sbjct: 577 IKESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQGTG-LQGQAVFKTKLTF 635
Query: 310 IPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEKERAESQNIRANVLLNR 368
P S S +HR +T + R K K++ + DPE ++ E + E + +RA++
Sbjct: 636 RPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTEMIKKEEERLRASI---- 691
Query: 369 KREKINRKYTQTVERRRQLSTGYLE 393
+RE R+ + + +R LS YLE
Sbjct: 692 RRESQQRRMREK-QHQRGLSASYLE 715
>gi|391329276|ref|XP_003739101.1| PREDICTED: RNA polymerase-associated protein LEO1-like [Metaseiulus
occidentalis]
Length = 459
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 79/227 (34%), Positives = 130/227 (57%), Gaps = 20/227 (8%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTY---VEEDTFVTDESGAKKRIRL 232
+E+E+P R +++ +K+ N + ++ +P+DP+TY V+ED + +E A+ ++++
Sbjct: 148 IEVEVP-RIQTDLGKEVHFVKLPNFLSVETRPYDPETYEDEVDEDEVLDEEGRARLKLKV 206
Query: 233 ENNIVRWRTVKN-KDGMTSYESNARFVRWSDGSLQLQIGNEVLDI---TVQDAQHDQAHL 288
EN I RWR V + K G +SN+R VRWSDGSL L +G+E+ D+ ++ H+ HL
Sbjct: 207 ENTI-RWRRVYDPKTGDPIKQSNSRVVRWSDGSLSLHLGSEIFDVHKHSLMTGDHN--HL 263
Query: 289 FLRHGKGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCIT-DIDPER 347
F+R G G LQ Q K+ F P S S +HR +T + R +K KV+ T DPER
Sbjct: 264 FIRQGTG-LQGQSVFRTKLSFRPHSTDSFTHRKMTLSLAGRSQKANKVRVLPTVGADPER 322
Query: 348 EKEEKERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLED 394
+ E + E + +RA + + K+ +I +R R ++ G+LED
Sbjct: 323 NRGEMIKKEEEKLRAFMRRDNKQRRIR-------DRARGMNVGFLED 362
>gi|321460827|gb|EFX71865.1| hypothetical protein DAPPUDRAFT_255226 [Daphnia pulex]
Length = 556
Score = 122 bits (307), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 74/208 (35%), Positives = 116/208 (55%), Gaps = 10/208 (4%)
Query: 190 TKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG---AKKRIRLENNIVRWRTVKNKD 246
++ + +K+ N + ++P+P+DP+TY +E A+ ++++EN I RWRT +K+
Sbjct: 261 SEFHFVKLPNFLSVEPRPYDPETYEDELEEEETLDEEGRARLKLKVENTI-RWRTAFDKE 319
Query: 247 GMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKK 306
G +SNAR V+WSDGSL L +G+E+ D+ Q Q D HLF+R G G LQ Q K
Sbjct: 320 GNAFKQSNARMVKWSDGSLSLHLGSEIFDVYRQPLQGDYNHLFIRQGTG-LQGQAVFRTK 378
Query: 307 MRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEKERAESQNIRANVL 365
+ F P S S +HR +T + R K +K + DPE + EK + E + +RA+V
Sbjct: 379 LTFRPHSTDSFTHRKMTKSLAERSTKTSSIKIIGVFGADPEANRGEKIKKEEERLRASV- 437
Query: 366 LNRKREKINRKYTQTVERRRQLSTGYLE 393
+RE ++ + R +S GYLE
Sbjct: 438 ---RRESKQKRVRERTATGRGMSGGYLE 462
>gi|270007111|gb|EFA03559.1| hypothetical protein TcasGA2_TC013564 [Tribolium castaneum]
Length = 625
Score = 121 bits (304), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 74/229 (32%), Positives = 126/229 (55%), Gaps = 19/229 (8%)
Query: 150 PDEDMHYESEEEHVEPKHKEKPVGPPLELEIPLRPPPADPTK-MNMIKVSNIMGIDPKPF 208
PDED E+E E + P +++ + D + ++ +K+ N + ++ +PF
Sbjct: 329 PDEDEDKENEPEQI----------PETRIDVEIPKISCDLGRDIHFVKLPNFLSVETRPF 378
Query: 209 DPKTY---VEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGMTSYESNARFVRWSDGSL 265
DP+TY ++E+ + +E A+ ++++EN I RWR +K+G ESNAR VRWSDGS
Sbjct: 379 DPETYEDEIDEEETLDEEGRARLKLKVENTI-RWREAVDKEGNVKKESNARLVRWSDGSY 437
Query: 266 QLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRFIPSSLSSNSHRLLTAL 325
L +G+E+ D+ Q Q D HLF+R G G LQ Q K+ F P S S +HR +T
Sbjct: 438 SLHLGSEIFDVYKQPLQGDHNHLFIRQGTG-LQGQAVFRTKLSFRPHSTESFTHRKMTLS 496
Query: 326 VDSRHKKVYKVKNCITDI--DPEREKEEKERAESQNIRANVLLNRKREK 372
+ R +K +K ++ + DP+ +++ + E + +R V +++ R K
Sbjct: 497 LADRSQKTSGIK-ILSQVGADPDHDRKLLLKKEEEKLRQTVRVSKPRRK 544
>gi|91082773|ref|XP_966460.1| PREDICTED: similar to AGAP003242-PA [Tribolium castaneum]
Length = 626
Score = 121 bits (304), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 74/229 (32%), Positives = 126/229 (55%), Gaps = 19/229 (8%)
Query: 150 PDEDMHYESEEEHVEPKHKEKPVGPPLELEIPLRPPPADPTK-MNMIKVSNIMGIDPKPF 208
PDED E+E E + P +++ + D + ++ +K+ N + ++ +PF
Sbjct: 329 PDEDEDKENEPEQI----------PETRIDVEIPKISCDLGRDIHFVKLPNFLSVETRPF 378
Query: 209 DPKTY---VEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGMTSYESNARFVRWSDGSL 265
DP+TY ++E+ + +E A+ ++++EN I RWR +K+G ESNAR VRWSDGS
Sbjct: 379 DPETYEDEIDEEETLDEEGRARLKLKVENTI-RWREAVDKEGNVKKESNARLVRWSDGSY 437
Query: 266 QLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRFIPSSLSSNSHRLLTAL 325
L +G+E+ D+ Q Q D HLF+R G G LQ Q K+ F P S S +HR +T
Sbjct: 438 SLHLGSEIFDVYKQPLQGDHNHLFIRQGTG-LQGQAVFRTKLSFRPHSTESFTHRKMTLS 496
Query: 326 VDSRHKKVYKVKNCITDI--DPEREKEEKERAESQNIRANVLLNRKREK 372
+ R +K +K ++ + DP+ +++ + E + +R V +++ R K
Sbjct: 497 LADRSQKTSGIK-ILSQVGADPDHDRKLLLKKEEEKLRQTVRVSKPRRK 544
>gi|149691935|ref|XP_001501549.1| PREDICTED: RNA polymerase-associated protein LEO1 isoform 1 [Equus
caballus]
Length = 666
Score = 121 bits (304), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 97/327 (29%), Positives = 159/327 (48%), Gaps = 30/327 (9%)
Query: 84 RSPGGEEKDQTHISHSAAEIRDVFGDSDEEDVGEYAIRNDIDQDS-----NRSPMEEEGS 138
R P +E++Q H S SA G E++V +N I DS P + G+
Sbjct: 258 RRPSDDEEEQDHKSESAR------GSDSEDEVLRMKRKNAIASDSEADSDTEVPKDNSGT 311
Query: 139 YEKSIRPEDIVPDEDMHYE--------SEEEHVEPKHKEKPVGPPLELEIPLRPPPAD-P 189
+ +DI D + E + + +E+P+ P +E+ + D
Sbjct: 312 MDLFGGADDISSGSDGEDKPPTPGQPVDENGLPQDQQEEEPI-PETRIEVEIPKVNTDLG 370
Query: 190 TKMNMIKVSNIMGIDPKPFDPKTY-VEEDTFVTDESGAKKRIRLE-NNIVRWRTVKNKDG 247
+ +K+ N + ++P+PFDP+ Y E + + + R++L+ N +RWR ++++G
Sbjct: 371 NDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKVENTIRWRIRRDEEG 430
Query: 248 MTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKM 307
ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G G LQ Q K+
Sbjct: 431 NEIKESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQGTG-LQGQAVFKTKL 489
Query: 308 RFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEKERAESQNIRANVLL 366
F P S S +HR +T + R K K++ + DPE ++ E + E + +RA++
Sbjct: 490 TFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTEMIKKEEERLRASI-- 547
Query: 367 NRKREKINRKYTQTVERRRQLSTGYLE 393
+RE R+ + + +R LS YLE
Sbjct: 548 --RRESQQRRMREK-QHQRGLSASYLE 571
>gi|289741825|gb|ADD19660.1| uncharacterized conserved protein [Glossina morsitans morsitans]
Length = 680
Score = 121 bits (303), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 69/192 (35%), Positives = 111/192 (57%), Gaps = 13/192 (6%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTYV----EEDTFVTDESGAKKRIRLE-NNIVRWRTVKNKD 246
++ +K+ N + ++ +PFDP+TY EE+T DE G ++RI+L+ +N +RWR N
Sbjct: 373 IHFVKLPNFLSVETRPFDPETYEDEIDEEETM--DEEG-RQRIKLKVSNTIRWREYMNNS 429
Query: 247 GMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKK 306
G ESNARFVRWSDGS+ L +GNE+ D+ Q D H+F+R G G LQ Q K
Sbjct: 430 GKMIKESNARFVRWSDGSMSLHLGNEIFDVYRQPLHGDHNHMFIRQGTG-LQGQAVFRTK 488
Query: 307 MRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDI--DPEREKEEKERAESQNIRANV 364
+ F P S S +H+ +T + R +KV +K +T + DP ++ + E + +R
Sbjct: 489 LTFRPHSTESFTHKKMTMSLADRSQKVSGIK-ILTQVGKDPTADRLYNLKKEEEKLR-QA 546
Query: 365 LLNRKREKINRK 376
+ N+ + ++ +K
Sbjct: 547 MRNQHKPQVKKK 558
>gi|390347046|ref|XP_781595.3| PREDICTED: RNA polymerase-associated protein LEO1-like
[Strongylocentrotus purpuratus]
Length = 675
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 78/221 (35%), Positives = 119/221 (53%), Gaps = 18/221 (8%)
Query: 193 NMIKVSNIMGIDPKPFDPKTY-----VEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDG 247
+ K+ N + ++ +PFDP+ Y +ED + +E + ++++EN I RWR V +++G
Sbjct: 411 HFAKLPNFLSVETRPFDPQFYEDEVHADEDEVLDEEGRTRLKLKVENTI-RWRKVFDEEG 469
Query: 248 M-TSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKK 306
ESNAR VRWSDGSL L +G EV D+ Q D H F+R G G LQ K
Sbjct: 470 SEIGKESNARIVRWSDGSLSLHLGQEVFDVYKMTQQGDFNHFFIRQGTG-LQGLAVFRTK 528
Query: 307 MRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEKERAESQNIRANVL 365
+ F P S S +HR +T + R K K+K I+ DPE ++ E + E + +RA++
Sbjct: 529 LSFRPHSTDSATHRKMTLSMADRCSKTQKIKVVPISGNDPEAQRGEMIKREEERLRASLR 588
Query: 366 LNRKREKINRKYTQTVERRRQLSTGYL----EDALEEDDET 402
++ +I + Q R LS GYL ED EE+D++
Sbjct: 589 RQNQQRRIRERSHQ-----RGLSAGYLEPDREDGFEEEDDS 624
>gi|126278272|ref|XP_001380639.1| PREDICTED: RNA polymerase-associated protein LEO1 [Monodelphis
domestica]
Length = 679
Score = 120 bits (302), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 77/230 (33%), Positives = 123/230 (53%), Gaps = 11/230 (4%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE- 233
+E+EIP + + +K+ N + ++P+PFDP+ Y +E + R++L+
Sbjct: 371 IEVEIP-KVNTDLGNDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKV 429
Query: 234 NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG 293
N +RWR ++++G ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G
Sbjct: 430 ENTIRWRMRRDEEGNDIRESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQG 489
Query: 294 KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEK 352
G LQ Q K+ F P S S +HR +T + R K K++ + DPE ++ E
Sbjct: 490 TG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTEM 548
Query: 353 ERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE-DALEEDDE 401
+ E + + + +RE R+ + + +R LS YLE D +E+DE
Sbjct: 549 IKKEEERFQKGI----RRESQQRRMREK-QHQRGLSANYLEPDRYDEEDE 593
>gi|344243752|gb|EGV99855.1| RNA polymerase-associated protein LEO1 [Cricetulus griseus]
Length = 760
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 71/205 (34%), Positives = 113/205 (55%), Gaps = 9/205 (4%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE-NNIVRWRTVKNKDGMT 249
+ +K+ N + ++P+PFDP+ Y +E + R++L+ N +RWR ++++G
Sbjct: 216 LYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKVENTIRWRVRRDEEGNE 275
Query: 250 SYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRF 309
ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G G LQ Q K+ F
Sbjct: 276 IKESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQGTG-LQGQAVFKTKLTF 334
Query: 310 IPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEKERAESQNIRANVLLNR 368
P S S +HR +T + R K K++ + DPE ++ E + E + +RA++
Sbjct: 335 RPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTEMIKKEEERLRASI---- 390
Query: 369 KREKINRKYTQTVERRRQLSTGYLE 393
+RE R+ + + +R LS YLE
Sbjct: 391 RRESQQRRMREK-QHQRGLSASYLE 414
Score = 67.0 bits (162), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 47/162 (29%), Positives = 76/162 (46%), Gaps = 24/162 (14%)
Query: 235 NIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGK 294
N +RWR ++++G ESNAR V+WSDGS+ L +G+EV DI Q + LF+R
Sbjct: 478 NTIRWRICRDEEGSKIKESNARIVKWSDGSMSLHLGSEVFDIYKAPLQDNHNQLFIREDT 537
Query: 295 GILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKER 354
G+ +G+ + K R S S R+L + DPE + + +
Sbjct: 538 GL---RGQAIFKSRLTFRS-SKTQIRILP----------------MAGRDPEWPRTDLIQ 577
Query: 355 AESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLEDAL 396
A +RA L + I KY T+E+ ++ + + D +
Sbjct: 578 ACRHRVRAGDTL----QGIALKYGVTMEQIKRANKLFTNDCI 615
>gi|358340991|dbj|GAA48773.1| RNA polymerase-associated protein LEO1 [Clonorchis sinensis]
Length = 483
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 65/177 (36%), Positives = 100/177 (56%), Gaps = 6/177 (3%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTY---VEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGM 248
M+++K+ N + ++ +PFDP Y ++ED + +E + ++++EN I RWR KN G
Sbjct: 158 MHLVKMPNFLSVETRPFDPNFYEDELDEDEVLDEEGRTRLKLKVENTI-RWRVAKNDAGD 216
Query: 249 TSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMR 308
+ESNAR VRWSDGSL L +G+E+ DI D D +LF+R G G LQ Q + K+
Sbjct: 217 VVHESNARIVRWSDGSLSLHLGDEIFDIHKVDIHADYNYLFIREGSG-LQGQSSLRTKLT 275
Query: 309 FIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEKERAESQNIRANV 364
F P S S +HR +T + + K KVK + DPE + + E + +R +
Sbjct: 276 FRPHSTDSFTHRKITLSLADKTNKSQKVKILPVAGADPESNRIALVKREEEKLRVTL 332
>gi|149019155|gb|EDL77796.1| Leo1, Paf1/RNA polymerase II complex component, homolog (S.
cerevisiae), isoform CRA_a [Rattus norvegicus]
Length = 379
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 71/205 (34%), Positives = 113/205 (55%), Gaps = 9/205 (4%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE-NNIVRWRTVKNKDGMT 249
+ +K+ N + ++P+PFDP+ Y +E + R++L+ N +RWR ++++G
Sbjct: 86 LYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKVENTIRWRMRRDEEGNE 145
Query: 250 SYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRF 309
ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G G LQ Q K+ F
Sbjct: 146 IKESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQGTG-LQGQAVFKTKLTF 204
Query: 310 IPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEKERAESQNIRANVLLNR 368
P S S +HR +T + R K K++ + DPE ++ E + E + +RA++
Sbjct: 205 RPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTEMIKKEEERLRASI---- 260
Query: 369 KREKINRKYTQTVERRRQLSTGYLE 393
+RE R+ + + +R LS YLE
Sbjct: 261 RRESQQRRMREK-QHQRGLSASYLE 284
>gi|145356413|ref|XP_001422426.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582668|gb|ABP00743.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 368
Score = 119 bits (297), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 89/257 (34%), Positives = 133/257 (51%), Gaps = 30/257 (11%)
Query: 129 NRSPMEEEGSYEKSIR-PEDIVPDEDMHYESEEEHVEPKHKEKPVGPPLELEIPL----R 183
NR E +++ S+ P+D DED +P + K G PL LE+P R
Sbjct: 31 NRDDRGRESTFKSSLELPDDAENDED----------DPATRAK--GEPLRLELPSAEAGR 78
Query: 184 PPPADPTKMNMIKVSNIMGIDPKPFDPKTY---VEEDTFVTDESGAKKRIRLEN-NIVRW 239
D TK+ K+SNI G +PKPFD T+ E + FV D G RIR N N VRW
Sbjct: 79 ETIEDETKI--AKLSNIFGAEPKPFDAATHRDDGEREEFV-DVDGVT-RIRTSNENTVRW 134
Query: 240 RTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQS 299
R + ESNA +VRW DG+ L +G+E L + + L++R +G++++
Sbjct: 135 RM---NESTGEAESNAYYVRWDDGTTHLVVGDEYLRCDERAVADADSFLYVRK-QGLMKA 190
Query: 300 QGRILKKMRFIPSSLSSNSHRLLTALVDSRH-KKVYKVKNCITDIDPEREKEEKERAESQ 358
Q R+ KM F P++L S +H+ LTA +D +H + + ++ +DPEREKE + +
Sbjct: 191 QRRLTTKMTFNPATLDSRTHKRLTAAIDKKHGSRATRTMAYVSRVDPEREKEALDAELER 250
Query: 359 NIRANVLLNRKREKINR 375
R + L RK+ K+ R
Sbjct: 251 AARESAALERKQAKMMR 267
>gi|405960501|gb|EKC26422.1| RNA polymerase-associated protein LEO1 [Crassostrea gigas]
Length = 507
Score = 119 bits (297), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 72/201 (35%), Positives = 112/201 (55%), Gaps = 15/201 (7%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTY---VEEDTFVTDESGAKKRIRL 232
+E+EIP R ++ +K+ N + ++ +P+D +TY ++ED + +E A+ ++++
Sbjct: 181 IEVEIP-RIMTNLGKILHYVKLPNFLSVETRPYDNQTYEDEIDEDEVLDEEGRARLKLKV 239
Query: 233 ENNIVRWRTVKNKDGMTSY--------ESNARFVRWSDGSLQLQIGNEVLDITVQDAQHD 284
EN I RWR N+DG + ESNAR VRWSDGS+ L +GNE+ D+ Q D
Sbjct: 240 ENTI-RWRKKNNEDGTEATDELGKPIRESNARVVRWSDGSMSLHLGNEIFDVHTMPIQGD 298
Query: 285 QAHLFLRHGKGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDI 343
HLF+R G G LQ Q K+ F P S S +HR +T + R K KVK I+
Sbjct: 299 FNHLFVRQGTG-LQGQSIFKTKLAFRPHSTESATHRKMTLSLADRSTKAQKVKVLPISGR 357
Query: 344 DPEREKEEKERAESQNIRANV 364
DP+ + E + E + +RA++
Sbjct: 358 DPDAHRSEMIKKEEEKLRASI 378
>gi|330841393|ref|XP_003292683.1| hypothetical protein DICPUDRAFT_157421 [Dictyostelium purpureum]
gi|325077054|gb|EGC30793.1| hypothetical protein DICPUDRAFT_157421 [Dictyostelium purpureum]
Length = 481
Score = 118 bits (296), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 81/224 (36%), Positives = 133/224 (59%), Gaps = 20/224 (8%)
Query: 175 PLELEIPLRPPPADPT-KMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLE 233
P+EL I + P P K+ IK+SNI+GI PKPFDP T+ E DE ++++ +
Sbjct: 155 PIEL-IHIETPKVRPVEKVMKIKLSNILGIQPKPFDPLTFEE------DELNSEQKNKNF 207
Query: 234 NNI---VRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFL 290
NNI +RWR + +G + ESN RFV W DGS L IGNEVL+I QD +++Q +++
Sbjct: 208 NNIESVIRWRWGLDSNGRPAKESNTRFVTWPDGSAHLFIGNEVLEIKEQDLRNEQFYVYS 267
Query: 291 RHGKGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKE 350
+G ++ + +I ++ P+++ S H+ L+ V + KV ++K+ T +DPE+EKE
Sbjct: 268 SQ-EGFIECESKIDSRINIRPANIRSKVHQRLSESVAKKSLKVSRIKSIQTTLDPEKEKE 326
Query: 351 EKERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLED 394
++E+ + IR N +K+E+ N+ + + L++ YLED
Sbjct: 327 QREKELLETIREN---KKKQEEENKP-----RKNKALTSDYLED 362
>gi|328874769|gb|EGG23134.1| RNA polymerase II complex component [Dictyostelium fasciculatum]
Length = 653
Score = 118 bits (295), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 77/244 (31%), Positives = 135/244 (55%), Gaps = 16/244 (6%)
Query: 149 VPDEDMHYESEEEHVEPKHKEKPVGPPLELEIPLRPPPADPTKMNMIKVSNIMGIDPKPF 208
+ D+D Y E + VE ++ + PP+ L + P K+ +KV NI+GI P PF
Sbjct: 315 MVDDDTGYGGEYQEVE--ELKEVIRPPVHLT-HIDSTPLPKAKIMKLKVLNILGIQPIPF 371
Query: 209 DPKTYVEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQ 268
PK+Y E+D +E + R +++RWR ++ G + ESN R ++WSDGS L
Sbjct: 372 HPKSY-EKD----EEIAERGRPSNVESVIRWRWALDQHGRPTKESNTRLIQWSDGSTHLF 426
Query: 269 IGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDS 328
IGNE L++ + DQ +++ + ++ +G++ K+ P+ + S +H L + S
Sbjct: 427 IGNEALEVKEHELAGDQFYVYNKQ-DNFIECEGKVDSKIAIRPTKVRSKAHMRLRESIAS 485
Query: 329 RHKKVYKVKNCITDIDPEREKEEKERAESQNIRANVLLNRKREKINRKYTQTVERRRQLS 388
R +K K+K+ +T +DPE+EK ++ERAE++ IR + ++ K+N+ + ++ L
Sbjct: 486 RTQKSAKIKSIVTTVDPEKEKVQRERAEAEKIR--IARGQRDTKVNK-----LAQKLNLD 538
Query: 389 TGYL 392
T YL
Sbjct: 539 TKYL 542
>gi|47230639|emb|CAF99832.1| unnamed protein product [Tetraodon nigroviridis]
Length = 665
Score = 118 bits (295), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 79/241 (32%), Positives = 120/241 (49%), Gaps = 25/241 (10%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE- 233
+E+EIP + + + +K+ N + ++P+PFDP+ Y +E + R++L+
Sbjct: 329 IEVEIP-KVSTDLGSDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKV 387
Query: 234 NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG 293
N +RWR ++++G + ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G
Sbjct: 388 ENTIRWRAKRDEEGNETRESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQG 447
Query: 294 KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEE- 351
G LQ Q K+ F P S S +HR +T + R K K++ + DPE + E
Sbjct: 448 TG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPESHRNEM 506
Query: 352 --------KERAESQNIRANVLLN-------RKREKINRKYTQTVERRRQ----LSTGYL 392
R A+V L+ R R I R+ Q R +Q LS+ YL
Sbjct: 507 IKRQKLFASRRQSGVCYHAHVPLHFLQKEEERLRASIRRESQQRRMREKQHQRGLSSSYL 566
Query: 393 E 393
E
Sbjct: 567 E 567
>gi|322800100|gb|EFZ21206.1| hypothetical protein SINV_12323 [Solenopsis invicta]
Length = 727
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 68/217 (31%), Positives = 121/217 (55%), Gaps = 12/217 (5%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTY---VEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGM 248
++ +K+ N + ++ +PFD +TY ++E+ + +E A+ ++++EN + RW+ + N +G
Sbjct: 440 IHFVKLPNFLSVETRPFDHETYEDEIDEEETLDEEGRARLKLKVENTL-RWKELFNDEGK 498
Query: 249 TSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMR 308
+SNARFV+WSDGS+ L +G+E+ D+ Q Q D HL++R G G LQ Q K+
Sbjct: 499 MVKQSNARFVKWSDGSMSLHLGSEIFDVYKQPLQGDHNHLYIRQGTG-LQGQAVFRTKLT 557
Query: 309 FIPSSLSSNSHRLLTALVDSRHKKVYKVKNCI-TDIDPEREKEEKERAESQNIRANVLLN 367
F P S S +HR +T + R +K +K ++P++ + E + E + +R +
Sbjct: 558 FRPHSTESFTHRKMTMSLADRSQKTSGIKVLSQVGMNPDQNRYEMIKKEEEKLRMAM--- 614
Query: 368 RKREKINRKYTQTVERRRQLSTG-YLEDALEEDDETD 403
R + +K T ++ + STG Y DA +D D
Sbjct: 615 --RVQKTKKSTGSIRNTNRSSTGAYSGDAYHDDGSDD 649
>gi|345479071|ref|XP_001607741.2| PREDICTED: hypothetical protein LOC100123957 [Nasonia vitripennis]
Length = 813
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 64/185 (34%), Positives = 107/185 (57%), Gaps = 6/185 (3%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTY---VEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGM 248
++ +K+ N + ++ +PFD +TY ++E+ + +E A+ ++++EN I RWR + DG
Sbjct: 518 IHFVKLPNFLSVETRPFDHETYEDEIDEEETLDEEGRARLKLKVENTI-RWREEFSDDGK 576
Query: 249 TSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMR 308
+SNARFVRWSDGS+ L +G+E+ D+ Q Q D HLF+R G G LQ Q K+
Sbjct: 577 PERQSNARFVRWSDGSMSLHLGSEIFDVYKQPLQGDHNHLFIRQGTG-LQGQAVFRTKLT 635
Query: 309 FIPSSLSSNSHRLLTALVDSRHKKVYKVKNCI-TDIDPEREKEEKERAESQNIRANVLLN 367
F P S S +HR +T + R +K +K ++P++ K E + E + +R + +
Sbjct: 636 FRPHSTESFTHRKMTMSLADRSQKTSGIKVLSHVGMNPDQNKYEMIKKEEEKLRMAMRVQ 695
Query: 368 RKREK 372
K +K
Sbjct: 696 SKSKK 700
>gi|339237373|ref|XP_003380241.1| RNA polymerase-associated protein LEO1 [Trichinella spiralis]
gi|316976952|gb|EFV60143.1| RNA polymerase-associated protein LEO1 [Trichinella spiralis]
Length = 440
Score = 117 bits (293), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 76/211 (36%), Positives = 120/211 (56%), Gaps = 19/211 (9%)
Query: 191 KMNMIKVSNIMGIDPKPFDPKTY---VEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDG 247
+++ +K+ N + ++P+ FD TY ++ED + +E ++ ++++EN I RWR ++ G
Sbjct: 132 QLHFVKLPNFISVEPREFDADTYEDEIDEDEVLDEEGRSRLKLKVENTI-RWRNTIDEHG 190
Query: 248 MTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKM 307
+ ESNA+ +RWSDGS+ L IG+E+ DI+V Q D HLF+R G G LQ Q K+
Sbjct: 191 NSVAESNAKIIRWSDGSMSLYIGSEIFDISVHSLQGDHHHLFIRQGPG-LQGQAIFDTKV 249
Query: 308 RFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDI--DPEREKEEKERAESQNIRANVL 365
F P S S +HR +T + R K KVK ++ I DP+ K++ + E + +RA
Sbjct: 250 TFRPHSTESFTHRKMTLSMADRSTKSQKVK-VLSMIGEDPDVLKQKMIKQEEERLRAKA- 307
Query: 366 LNRKREKINRKYTQTVERR---RQLSTGYLE 393
+RE R+ V+ R R LS YLE
Sbjct: 308 ---RRETQQRR----VKERPPVRGLSANYLE 331
>gi|198453028|ref|XP_002137588.1| GA27305 [Drosophila pseudoobscura pseudoobscura]
gi|198132184|gb|EDY68146.1| GA27305 [Drosophila pseudoobscura pseudoobscura]
Length = 719
Score = 116 bits (291), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 72/197 (36%), Positives = 106/197 (53%), Gaps = 13/197 (6%)
Query: 193 NMIKVSNIMGIDPKPFDPKTYV----EEDTFVTDESGAKKRIRLE-NNIVRWRTVKNKDG 247
+ IK+ N + + PFDP+TY EE+T DE G ++RI+L+ +N +RWR N G
Sbjct: 394 HFIKLPNFLSVVTHPFDPETYEDEIDEEETM--DEEG-RQRIKLKVSNTIRWREFMNNKG 450
Query: 248 MTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKM 307
ESNARFVRWSDGS+ L +GNE+ D Q D HLF+R G G LQ Q K+
Sbjct: 451 DMVRESNARFVRWSDGSMSLHLGNEIFDAYRQPLLGDHNHLFIRQGTG-LQGQAVFRTKL 509
Query: 308 RFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDI--DPEREKEEKERAESQNIRANVL 365
F P S S +H+ +T + R K +K +T + DP ++ + + E +R +
Sbjct: 510 TFRPHSTESFTHKKMTMSLADRSSKTSGIK-ILTQVGKDPTTDRPSQLKEEEAKLR-QAM 567
Query: 366 LNRKREKINRKYTQTVE 382
N+ + + +K T E
Sbjct: 568 RNQHKAQPKKKKAGTGE 584
>gi|195146294|ref|XP_002014122.1| GL24508 [Drosophila persimilis]
gi|194103065|gb|EDW25108.1| GL24508 [Drosophila persimilis]
Length = 719
Score = 116 bits (291), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 72/197 (36%), Positives = 106/197 (53%), Gaps = 13/197 (6%)
Query: 193 NMIKVSNIMGIDPKPFDPKTYV----EEDTFVTDESGAKKRIRLE-NNIVRWRTVKNKDG 247
+ IK+ N + + PFDP+TY EE+T DE G ++RI+L+ +N +RWR N G
Sbjct: 394 HFIKLPNFLSVVTHPFDPETYEDEIDEEETM--DEEG-RQRIKLKVSNTIRWREFMNNKG 450
Query: 248 MTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKM 307
ESNARFVRWSDGS+ L +GNE+ D Q D HLF+R G G LQ Q K+
Sbjct: 451 DMVRESNARFVRWSDGSMSLHLGNEIFDAYRQPLLGDHNHLFIRQGTG-LQGQAVFRTKL 509
Query: 308 RFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDI--DPEREKEEKERAESQNIRANVL 365
F P S S +H+ +T + R K +K +T + DP ++ + + E +R +
Sbjct: 510 TFRPHSTESFTHKKMTMSLADRSSKTSGIK-ILTQVGKDPTTDRPSQLKEEEAKLR-QAM 567
Query: 366 LNRKREKINRKYTQTVE 382
N+ + + +K T E
Sbjct: 568 RNQHKAQPKKKKAGTGE 584
>gi|328700005|ref|XP_001951223.2| PREDICTED: RNA polymerase-associated protein LEO1-like
[Acyrthosiphon pisum]
Length = 616
Score = 115 bits (289), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 69/207 (33%), Positives = 117/207 (56%), Gaps = 13/207 (6%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTY---VEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGM 248
++ +K+ N + ++ +P+D +TY ++E+ + +E A+ ++++EN I RWR V +KDG
Sbjct: 353 LHFVKLPNFLSVETRPYDRETYEDEIDEEETLDEEGRARVKLKVENTI-RWREVFDKDGN 411
Query: 249 TSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMR 308
ESN R ++WSDGS+ L +G+E+ D+ Q D HLF+R G G LQ Q K+
Sbjct: 412 VVKESNTRVIKWSDGSMSLHLGSEIFDVYKQPLVGDHNHLFIRQGTG-LQGQAVFRTKLS 470
Query: 309 FIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDI--DPEREKEEKERAESQNIRANVLL 366
F P S S +HR +T + R +K +K ++ + DP+ + EK + E + +R ++
Sbjct: 471 FRPHSTESFTHRKITLSLADRSQKSSGIK-VLSQVGADPDANRYEKIKREEEKLRQSM-- 527
Query: 367 NRKREKINRKYTQTVERRRQLSTGYLE 393
RK K+ R + LS+ YLE
Sbjct: 528 -RKETKVKRARERGAN--SNLSSSYLE 551
>gi|307178673|gb|EFN67312.1| Another transcription unit protein [Camponotus floridanus]
Length = 1128
Score = 115 bits (288), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 67/216 (31%), Positives = 117/216 (54%), Gaps = 10/216 (4%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTY---VEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGM 248
++ +K+ N + ++ +PFD +TY ++E+ + +E A+ ++++EN + RW+ N DG
Sbjct: 841 IHFVKLPNFLSVETRPFDHETYEDEIDEEETLDEEGRARLKLKVENTL-RWKETFNDDGK 899
Query: 249 TSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMR 308
ESNARFV+WSDGS+ L +G+E+ D+ Q Q D HL++R G G LQ Q K+
Sbjct: 900 VVKESNARFVKWSDGSMSLHLGSEIFDVYKQPLQGDHNHLYIRQGTG-LQGQAVFRTKLT 958
Query: 309 FIPSSLSSNSHRLLTALVDSRHKKVYKVKNCI-TDIDPEREKEEKERAESQNIRANVLLN 367
F P S S +HR +T + R +K +K ++P++ + E + E + +R + +
Sbjct: 959 FRPHSTESFTHRKMTMSLADRSQKTSGIKVLSQVGMNPDQNRYEMIKKEEEKLRMAMRVQ 1018
Query: 368 RKREKINRKYTQTVERRRQLSTGYLEDALEEDDETD 403
K +K T + + + Y DA +D D
Sbjct: 1019 NK----TKKSTSGIRNTNRTTGAYSGDAYHDDGSDD 1050
>gi|313233264|emb|CBY24379.1| unnamed protein product [Oikopleura dioica]
Length = 476
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 81/237 (34%), Positives = 121/237 (51%), Gaps = 27/237 (11%)
Query: 180 IPLRPPPADPT---KMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE-N 234
IP+ P + ++N ++ N + ++PKPFD + Y +E D + R++L+
Sbjct: 163 IPVEIPKINTNLGEQINFVRFPNFLSVEPKPFDAEHYEDEHEEDDDLDEEGRTRLKLKVE 222
Query: 235 NIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGK 294
N +RWR V DG ESNA+ VRWSDGS L +GNE DI + D HLF+R G
Sbjct: 223 NTIRWRKVTQPDGTEKIESNAKIVRWSDGSQSLILGNEKYDIQSMKLKGDFNHLFIRQGT 282
Query: 295 GILQSQGRILKKMRFIPSSLSSNSH-RLLTALVDSRHKKVYKVK------NCITDIDPER 347
G LQ Q K+ F PSS +S +H +++ L + + KVK NC P +
Sbjct: 283 G-LQGQAVFQTKLTFRPSSTNSLTHKKIMGRLAQRINNPMQKVKVIPVSGNC-----PHK 336
Query: 348 EKEEKERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLEDALEEDDETDY 404
E+E R + RA + L ++R+K+ + E R+ + GYLED DD+ Y
Sbjct: 337 ERERLIREADERGRAELRLTQQRQKLRER-----EMRKNMGAGYLED----DDDNSY 384
>gi|332019967|gb|EGI60427.1| RNA polymerase-associated protein LEO1 [Acromyrmex echinatior]
Length = 713
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 67/217 (30%), Positives = 118/217 (54%), Gaps = 12/217 (5%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTY---VEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGM 248
++ +K+ N + ++ +PFD TY ++E+ + +E A+ ++++EN + RW+ + DG
Sbjct: 428 IHFVKLPNFLSVETRPFDHDTYEDEIDEEETLDEEGRARLKLKVENTL-RWKESFSDDGK 486
Query: 249 TSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMR 308
+SNARFV+WSDGS+ L +G+E+ D+ Q Q D HL++R G G LQ Q K+
Sbjct: 487 MVKQSNARFVKWSDGSMSLHLGSEIFDVYKQPLQGDHNHLYIRQGTG-LQGQAVFRTKLT 545
Query: 309 FIPSSLSSNSHRLLTALVDSRHKKVYKVKNCI-TDIDPEREKEEKERAESQNIRANVLLN 367
F P S S +HR +T + R +K +K ++P++ + E + E + +R + +
Sbjct: 546 FRPHSTESFTHRKMTMSLADRSQKTSGIKVLSQVGMNPDQNRYEMIKKEEEKLRMAMRVQ 605
Query: 368 RKREKINRKYTQTVERRRQLSTG-YLEDALEEDDETD 403
K +K + + R +TG Y DA +D D
Sbjct: 606 NKTKK-----STSGSRNTNRATGAYSGDAYHDDGSDD 637
>gi|326504334|dbj|BAJ90999.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 596
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 68/207 (32%), Positives = 116/207 (56%), Gaps = 13/207 (6%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTY---VEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGM 248
++ +K+ N + ++ +P+D +TY ++E+ + +E A+ ++++EN I RWR V + DG
Sbjct: 333 LHFVKLPNFLSVETRPYDRETYEDEIDEEETLDEEGRARVKLKVENTI-RWREVFDNDGN 391
Query: 249 TSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMR 308
ESN R ++WSDGS+ L +G+E+ D+ Q D HLF+R G G LQ Q K+
Sbjct: 392 VVKESNTRVIKWSDGSMSLHLGSEIFDVYKQPLVGDHNHLFIRQGTG-LQGQAVFRTKLS 450
Query: 309 FIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDI--DPEREKEEKERAESQNIRANVLL 366
F P S S +HR +T + R +K +K ++ + DP+ + EK + E + +R ++
Sbjct: 451 FRPHSTESFTHRKITLSLADRSQKSSGIK-VLSQVGADPDANRYEKIKREEEKLRQSM-- 507
Query: 367 NRKREKINRKYTQTVERRRQLSTGYLE 393
RK K+ R + LS+ YLE
Sbjct: 508 -RKETKVKRARERGAN--SNLSSSYLE 531
>gi|307199108|gb|EFN79818.1| Another transcription unit protein [Harpegnathos saltator]
Length = 717
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 60/185 (32%), Positives = 107/185 (57%), Gaps = 6/185 (3%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTY---VEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGM 248
++ +K+ N + ++ +PFD +TY ++E+ + +E A+ ++++EN + RW+ N +G
Sbjct: 430 IHFVKLPNFLSVETRPFDHETYEDEIDEEETLDEEGRARLKLKVENTL-RWKDTFNDEGK 488
Query: 249 TSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMR 308
ESNARFV+WSDGS+ L +G+E+ D+ Q Q D HL++R G G LQ Q K+
Sbjct: 489 IVKESNARFVKWSDGSMSLHLGSEIFDVYKQPLQGDHNHLYIRQGTG-LQGQAVFRTKLT 547
Query: 309 FIPSSLSSNSHRLLTALVDSRHKKVYKVKNCI-TDIDPEREKEEKERAESQNIRANVLLN 367
F P S S +HR +T + R +K +K ++P++ + E + E + +R + +
Sbjct: 548 FRPHSTESFTHRKMTMSLADRSQKTSGIKVLSQVGMNPDQNRYEMIKKEEEKLRMAMRVQ 607
Query: 368 RKREK 372
K +K
Sbjct: 608 SKTKK 612
>gi|1658503|gb|AAB18341.1| Atu [Drosophila melanogaster]
Length = 724
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 75/194 (38%), Positives = 106/194 (54%), Gaps = 15/194 (7%)
Query: 176 LELEIPLRPPPADPTK-MNMIKVSNIMGIDPKPFDPKTYV----EEDTFVTDESGAKKRI 230
+++EIP AD K + IK+ N + + PFDP+TY EE+T DE G ++RI
Sbjct: 379 IDVEIPR--ISADLGKEQHFIKLPNFLSVVTHPFDPETYEDEIDEEETM--DEEG-RQRI 433
Query: 231 RLE-NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLF 289
+L+ +N +RWR N G ESNARFVRWSDGS+ L +GNE+ D Q D HLF
Sbjct: 434 KLKVSNTIRWREYMNNKGDMVRESNARFVRWSDGSMSLHLGNEIFDAYRQPLLGDHNHLF 493
Query: 290 LRHGKGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDI--DPER 347
+R G G LQ Q K+ F P S S +H+ +T + R K +K +T + DP
Sbjct: 494 VRQGTG-LQGQSVFRTKLTFRPHSTESFTHKKMTMSLADRSSKTSGIK-ILTQVGKDPTT 551
Query: 348 EKEEKERAESQNIR 361
++ + R E +R
Sbjct: 552 DRPTQLREEEAKLR 565
>gi|195343725|ref|XP_002038442.1| GM10820 [Drosophila sechellia]
gi|194133463|gb|EDW54979.1| GM10820 [Drosophila sechellia]
Length = 726
Score = 114 bits (285), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 69/175 (39%), Positives = 96/175 (54%), Gaps = 12/175 (6%)
Query: 194 MIKVSNIMGIDPKPFDPKTYV----EEDTFVTDESGAKKRIRLE-NNIVRWRTVKNKDGM 248
IK+ N + + PFDP+TY EE+T DE G ++RI+L+ +N +RWR N G
Sbjct: 397 FIKLPNFLSVVTHPFDPETYEDEIDEEETM--DEEG-RQRIKLKVSNTIRWREYMNNKGD 453
Query: 249 TSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMR 308
ESNARFVRWSDGS+ L +GNE+ D Q D HLF+R G G LQ Q K+
Sbjct: 454 MVRESNARFVRWSDGSMSLHLGNEIFDAYRQPLLGDHNHLFVRQGTG-LQGQSVFRTKLT 512
Query: 309 FIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDI--DPEREKEEKERAESQNIR 361
F P S S +H+ +T + R K +K +T + DP ++ + R E +R
Sbjct: 513 FRPHSTESFTHKKMTMSLADRSSKTSGIK-ILTQVGKDPTTDRPTQLREEEAKLR 566
>gi|313214003|emb|CBY40790.1| unnamed protein product [Oikopleura dioica]
Length = 479
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 81/237 (34%), Positives = 121/237 (51%), Gaps = 27/237 (11%)
Query: 180 IPLRPPPADPT---KMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE-N 234
IP+ P + ++N ++ N + ++PKPFD + Y +E D + R++L+
Sbjct: 163 IPVEIPKINTNLGEQINFVRFPNFLSVEPKPFDAEHYEDEHEEDDDLDEEGRTRLKLKVE 222
Query: 235 NIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGK 294
N +RWR V DG ESNA+ VRWSDGS L +GNE DI + D HLF+R G
Sbjct: 223 NTIRWRKVTQPDGTEKIESNAKIVRWSDGSQSLILGNEKYDIQSMKLKGDFNHLFIRQGT 282
Query: 295 GILQSQGRILKKMRFIPSSLSSNSH-RLLTALVDSRHKKVYKVK------NCITDIDPER 347
G LQ Q K+ F PSS +S +H +++ L + + KVK NC P +
Sbjct: 283 G-LQGQAVFQTKLTFRPSSTNSLTHKKIMGRLAQRINNPMQKVKVIPVSGNC-----PHK 336
Query: 348 EKEEKERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLEDALEEDDETDY 404
E+E R + RA + L ++R+K+ + E R+ + GYLED DD+ Y
Sbjct: 337 ERERLIREVDERGRAELRLTQQRQKLRER-----EMRKNMGAGYLED----DDDNSY 384
>gi|194745526|ref|XP_001955238.1| GF16336 [Drosophila ananassae]
gi|190628275|gb|EDV43799.1| GF16336 [Drosophila ananassae]
Length = 694
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 74/194 (38%), Positives = 106/194 (54%), Gaps = 15/194 (7%)
Query: 176 LELEIPLRPPPADPTK-MNMIKVSNIMGIDPKPFDPKTYV----EEDTFVTDESGAKKRI 230
+++EIP AD K + IK+ N + + PFDP+TY EE+T DE G ++RI
Sbjct: 347 IDVEIPR--ISADLGKEQHFIKLPNFLSVVTHPFDPETYEDEIDEEETM--DEEG-RQRI 401
Query: 231 RLE-NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLF 289
+L+ +N +RWR N G ESNARFVRWSDGS+ L +GNE+ D Q D HLF
Sbjct: 402 KLKVSNTIRWREFMNNKGEMVRESNARFVRWSDGSMSLHLGNEIFDAYRQPLLGDHNHLF 461
Query: 290 LRHGKGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDI--DPER 347
+R G G LQ Q K+ F P S S +H+ +T + R K +K +T + DP
Sbjct: 462 IRQGTG-LQGQSVFRTKLTFRPHSTESFTHKKMTMSLADRSSKTSGIK-ILTQVGKDPTT 519
Query: 348 EKEEKERAESQNIR 361
++ + + E +R
Sbjct: 520 DRYSQLKEEEAKLR 533
>gi|17737783|ref|NP_524240.1| another transcription unit [Drosophila melanogaster]
gi|48474240|sp|Q94546.2|ATU_DROME RecName: Full=Another transcription unit protein
gi|7296712|gb|AAF51991.1| another transcription unit [Drosophila melanogaster]
gi|18447320|gb|AAL68232.1| LD30655p [Drosophila melanogaster]
gi|220945978|gb|ACL85532.1| Atu-PA [synthetic construct]
gi|220952606|gb|ACL88846.1| Atu-PA [synthetic construct]
Length = 725
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 69/175 (39%), Positives = 96/175 (54%), Gaps = 12/175 (6%)
Query: 194 MIKVSNIMGIDPKPFDPKTYV----EEDTFVTDESGAKKRIRLE-NNIVRWRTVKNKDGM 248
IK+ N + + PFDP+TY EE+T DE G ++RI+L+ +N +RWR N G
Sbjct: 397 FIKLPNFLSVVTHPFDPETYEDEIDEEETM--DEEG-RQRIKLKVSNTIRWREYMNNKGD 453
Query: 249 TSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMR 308
ESNARFVRWSDGS+ L +GNE+ D Q D HLF+R G G LQ Q K+
Sbjct: 454 MVRESNARFVRWSDGSMSLHLGNEIFDAYRQPLLGDHNHLFVRQGTG-LQGQSVFRTKLT 512
Query: 309 FIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDI--DPEREKEEKERAESQNIR 361
F P S S +H+ +T + R K +K +T + DP ++ + R E +R
Sbjct: 513 FRPHSTESFTHKKMTMSLADRSSKTSGIK-ILTQVGKDPTTDRPTQLREEEAKLR 566
>gi|195111763|ref|XP_002000447.1| GI10238 [Drosophila mojavensis]
gi|193917041|gb|EDW15908.1| GI10238 [Drosophila mojavensis]
Length = 677
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 68/176 (38%), Positives = 97/176 (55%), Gaps = 12/176 (6%)
Query: 193 NMIKVSNIMGIDPKPFDPKTYV----EEDTFVTDESGAKKRIRLE-NNIVRWRTVKNKDG 247
+ IK+ N + + PFDP+TY EE+T DE G ++RI+L+ +N +RWR N G
Sbjct: 348 HFIKLPNFLSVVTHPFDPETYEDEIDEEETM--DEEG-RQRIKLKVSNTIRWREYMNNKG 404
Query: 248 MTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKM 307
ESNARFVRWSDGS+ L +GNE+ D Q D HLF+R G G LQ Q K+
Sbjct: 405 EMVRESNARFVRWSDGSMSLHLGNEIFDAYRQPLVGDHNHLFIRQGTG-LQGQSVFRTKL 463
Query: 308 RFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDI--DPEREKEEKERAESQNIR 361
F P S S +H+ +T + R K +K +T + DP ++ + + E +R
Sbjct: 464 TFRPHSTESFTHKKITMSLADRSTKTSGIK-ILTQVGKDPTTDRYSQLKEEEAKLR 518
>gi|194898813|ref|XP_001978958.1| GG12952 [Drosophila erecta]
gi|190650661|gb|EDV47916.1| GG12952 [Drosophila erecta]
Length = 725
Score = 113 bits (282), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 80/212 (37%), Positives = 111/212 (52%), Gaps = 15/212 (7%)
Query: 160 EEHVEPKHKEKPVGPPLELEIPLRPP--PADPTK-MNMIKVSNIMGIDPKPFDPKTYV-- 214
E VEP KE P E I + P AD K + IK+ N + + PFDP+TY
Sbjct: 360 ESQVEPAPKEDEPEPLPETRIDVEIPRISADLGKEQHFIKLPNFLSVVTHPFDPETYEDE 419
Query: 215 --EEDTFVTDESGAKKRIRLE-NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGN 271
EE+T DE G ++RI+L+ +N +RWR + G ESNARFVRWSDGS+ L +GN
Sbjct: 420 IDEEETM--DEEG-RQRIKLKVSNTIRWREFMDNKGDMVRESNARFVRWSDGSMSLHLGN 476
Query: 272 EVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHK 331
E+ D Q D HLF+R G G LQ Q K+ F P S S +H+ +T + R
Sbjct: 477 EIFDAYRQPLLGDHNHLFVRQGTG-LQGQSVFRTKLTFRPHSTESFTHKKMTMSLADRSS 535
Query: 332 KVYKVKNCITDI--DPEREKEEKERAESQNIR 361
K +K +T + DP ++ + + E +R
Sbjct: 536 KTSGIK-ILTQVGKDPTTDRPTQLKEEEAKLR 566
>gi|383856691|ref|XP_003703841.1| PREDICTED: another transcription unit protein-like [Megachile
rotundata]
Length = 551
Score = 113 bits (282), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 68/216 (31%), Positives = 117/216 (54%), Gaps = 10/216 (4%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTY---VEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGM 248
++ +K+ N + ++ +PFD +TY ++E+ + +E A+ ++++EN + RW+ + +G
Sbjct: 264 IHFVKLPNFLSVETRPFDTETYEDEIDEEETLDEEGRARLKLKVENTL-RWKETFDDEGK 322
Query: 249 TSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMR 308
ESNARFV+WSDGS+ L +G+E+ D+ Q Q D HL++R G G LQ Q K+
Sbjct: 323 VVKESNARFVKWSDGSMSLHLGSEIFDVYKQPLQGDHNHLYIRQGTG-LQGQAVFRTKLT 381
Query: 309 FIPSSLSSNSHRLLTALVDSRHKKVYKVKNCI-TDIDPEREKEEKERAESQNIRANVLLN 367
F P S S +HR +T + R +K +K +P++ + E + E + +R + +
Sbjct: 382 FRPHSTESFTHRKMTMSLADRSQKTSGIKVLSQVGTNPDQNRYEMIKKEEEKLRMAMRVQ 441
Query: 368 RKREKINRKYTQTVERRRQLSTGYLEDALEEDDETD 403
K +K T R +S GY DA +D D
Sbjct: 442 SKTKKSTAGRGTT---GRSVS-GYGGDAYHDDGSDD 473
>gi|302835004|ref|XP_002949064.1| hypothetical protein VOLCADRAFT_89365 [Volvox carteri f.
nagariensis]
gi|300265809|gb|EFJ49999.1| hypothetical protein VOLCADRAFT_89365 [Volvox carteri f.
nagariensis]
Length = 489
Score = 113 bits (282), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 64/205 (31%), Positives = 109/205 (53%), Gaps = 7/205 (3%)
Query: 172 VGPPLELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIR 231
G P++ PL P + +++ + ++G++P PF P+T+ E+ DE G K
Sbjct: 105 TGAPIDYAAPLLSVPPPQGDVYLLREA-LIGVEPTPFSPETFAGEEEIFVDEKGIMKIKP 163
Query: 232 LENNIVRWRTV--KNKDGMTSY--ESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAH 287
++ +RWR V + DG ESNARFVRWSDGSLQL +G+EV D+ D +A+
Sbjct: 164 VDRTKIRWRYVTRQGPDGSEELVPESNARFVRWSDGSLQLLLGDEVFDVVTADISQQRAY 223
Query: 288 LFLRHGKGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPER 347
+ G++Q Q + KM F ++L+S + + VD R K+ +V+ + DP +
Sbjct: 224 MVAYS--GVVQGQAALTHKMGFRLATLNSKLYARMREEVDKRTVKLKRVQQHVEAFDPSK 281
Query: 348 EKEEKERAESQNIRANVLLNRKREK 372
EKE + + + +R L ++++
Sbjct: 282 EKERRAKETEELLRNRAQLEARQQR 306
>gi|195395989|ref|XP_002056616.1| GJ11042 [Drosophila virilis]
gi|194143325|gb|EDW59728.1| GJ11042 [Drosophila virilis]
Length = 683
Score = 113 bits (282), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 68/176 (38%), Positives = 97/176 (55%), Gaps = 12/176 (6%)
Query: 193 NMIKVSNIMGIDPKPFDPKTYV----EEDTFVTDESGAKKRIRLE-NNIVRWRTVKNKDG 247
+ IK+ N + + PFDP+TY EE+T DE G ++RI+L+ +N +RWR N G
Sbjct: 353 HFIKLPNFLSVVTHPFDPETYEDEIDEEETM--DEEG-RQRIKLKVSNTIRWREYMNNKG 409
Query: 248 MTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKM 307
ESNARFVRWSDGS+ L +GNE+ D Q D HLF+R G G LQ Q K+
Sbjct: 410 EMVRESNARFVRWSDGSMSLHLGNEIFDAYRQPLVGDHNHLFIRQGTG-LQGQSVFRTKL 468
Query: 308 RFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDI--DPEREKEEKERAESQNIR 361
F P S S +H+ +T + R K +K +T + DP ++ + + E +R
Sbjct: 469 TFRPHSTESFTHKKITMSLADRSTKTSGIK-ILTQVGKDPTTDRYSQLKEEEAKLR 523
>gi|195568485|ref|XP_002102245.1| GD19799 [Drosophila simulans]
gi|194198172|gb|EDX11748.1| GD19799 [Drosophila simulans]
Length = 577
Score = 113 bits (282), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 81/212 (38%), Positives = 110/212 (51%), Gaps = 15/212 (7%)
Query: 160 EEHVEPKHKEKPVGPPLELEIPLRPP--PADPTK-MNMIKVSNIMGIDPKPFDPKTYV-- 214
E V P KE P E I + P AD K + IK+ N + + PFDP+TY
Sbjct: 254 ESQVAPAPKEDEPEPLPETRIDVEIPRISADLGKEQHFIKLPNFLSVVTHPFDPETYEDE 313
Query: 215 --EEDTFVTDESGAKKRIRLE-NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGN 271
EE+T DE G ++RI+L+ +N +RWR N G ESNARFVRWSDGS+ L +GN
Sbjct: 314 IDEEETM--DEEG-RQRIKLKVSNTIRWREYMNNKGDMVRESNARFVRWSDGSMSLHLGN 370
Query: 272 EVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHK 331
E+ D Q D HLF+R G G LQ Q K+ F P S S +H+ +T + R
Sbjct: 371 EIFDAYRQPLLGDHNHLFVRQGTG-LQGQSVFRTKLTFRPHSTESFTHKKMTMSLADRSS 429
Query: 332 KVYKVKNCITDI--DPEREKEEKERAESQNIR 361
K +K +T + DP ++ + R E +R
Sbjct: 430 KTSGIK-ILTQVGKDPTTDRPTQLREEEAKLR 460
>gi|403302053|ref|XP_003941681.1| PREDICTED: RNA polymerase-associated protein LEO1 [Saimiri
boliviensis boliviensis]
Length = 668
Score = 112 bits (281), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 74/224 (33%), Positives = 117/224 (52%), Gaps = 14/224 (6%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTF----VTDESGAKKRIR 231
+E+EIP + + +K+ N + ++P +Y DT + S KR++
Sbjct: 358 IEVEIP-KVNTDLGNDLYFVKLPNFLSVEPSSL-ILSYELTDTASLRDILKISTTLKRLK 415
Query: 232 LE-NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFL 290
L+ N +RWR ++++G ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+
Sbjct: 416 LKVENTIRWRIRRDEEGNEIKESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFI 475
Query: 291 RHGKGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREK 349
R G G LQ Q K+ F P S S +HR +T + R K K++ + DPE ++
Sbjct: 476 RQGTG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQR 534
Query: 350 EEKERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE 393
E + E + +RA++ +RE R+ + + +R LS YLE
Sbjct: 535 TEMIKKEEERLRASI----RRESQQRRMREK-QHQRGLSASYLE 573
>gi|195054351|ref|XP_001994089.1| GH22951 [Drosophila grimshawi]
gi|193895959|gb|EDV94825.1| GH22951 [Drosophila grimshawi]
Length = 683
Score = 112 bits (280), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 69/191 (36%), Positives = 102/191 (53%), Gaps = 12/191 (6%)
Query: 193 NMIKVSNIMGIDPKPFDPKTYV----EEDTFVTDESGAKKRIRLE-NNIVRWRTVKNKDG 247
+ IK+ N + + PFDP+TY EE+T DE G ++RI+L+ +N +RWR N G
Sbjct: 353 HFIKLPNFLSVVTHPFDPETYEDEIDEEETM--DEEG-RQRIKLKVSNTIRWREYMNNKG 409
Query: 248 MTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKM 307
ESNAR+VRWSDGS+ L +GNE+ D Q D HLF+R G G LQ Q K+
Sbjct: 410 EMVRESNARYVRWSDGSMSLHLGNEIFDAYRQPLVGDHNHLFIRQGTG-LQGQSVFRTKL 468
Query: 308 RFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDI--DPEREKEEKERAESQNIRANVL 365
F P S S +H+ +T + R K +K +T + DP ++ + + E +R +
Sbjct: 469 TFRPHSTESFTHKKITMSLADRSTKTSGIK-ILTQVGKDPTTDRYSQLKEEEAKLRLAMR 527
Query: 366 LNRKREKINRK 376
K + +K
Sbjct: 528 TQHKAQPKKKK 538
>gi|432113984|gb|ELK36041.1| RNA polymerase-associated protein LEO1 [Myotis davidii]
Length = 819
Score = 112 bits (280), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 58/163 (35%), Positives = 91/163 (55%), Gaps = 4/163 (2%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE-NNIVRWRTVKNKDGMT 249
+ +K+ N + ++P+PFDP+ Y +E + R++L+ N +RWR ++++G
Sbjct: 372 LYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKVENTIRWRIRRDEEGNE 431
Query: 250 SYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRF 309
ESNAR V+W+DGS+ L +GNEV D+ Q D HLF+R G G LQ Q K+ F
Sbjct: 432 IKESNARIVKWTDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQGTG-LQGQAVFKTKLTF 490
Query: 310 IPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEE 351
P S S +HR +T + R K K++ + DPE ++ E
Sbjct: 491 RPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTE 533
>gi|328909537|gb|AEB61436.1| RNA polymerase-associated protein LEO1-like protein, partial [Equus
caballus]
Length = 329
Score = 112 bits (279), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 59/163 (36%), Positives = 91/163 (55%), Gaps = 4/163 (2%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE-NNIVRWRTVKNKDGMT 249
+ +K+ N + ++P+PFDP+ Y +E + R++L+ N +RWR ++++G
Sbjct: 139 LYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKVENTIRWRIRRDEEGNE 198
Query: 250 SYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRF 309
ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G G LQ Q K+ F
Sbjct: 199 IKESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQGTG-LQGQAVFKTKLTF 257
Query: 310 IPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEE 351
P S S +HR +T + R K K++ + DPE ++ E
Sbjct: 258 RPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTE 300
>gi|384495511|gb|EIE86002.1| hypothetical protein RO3G_10712 [Rhizopus delemar RA 99-880]
Length = 417
Score = 111 bits (278), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 78/247 (31%), Positives = 124/247 (50%), Gaps = 17/247 (6%)
Query: 176 LELEIPLRPPP-ADPTKMNMIKVSNIMGIDPKPFDP---KTYVEEDTFVTDESGAKKRIR 231
+ L +P P P +D K + K+ + ++ PF P T +EE E ++ IR
Sbjct: 104 VALHMPQLPLPYSDNNKYYLAKLPRFLDVEVDPFVPEEFNTKIEEGLTPAQE---RESIR 160
Query: 232 LE-NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFL 290
+ N +RWR + N+ G + +SNA V W DG + L +G+E D+ + D+ L
Sbjct: 161 EQIENTIRWRRIINEHGEEAMQSNAHLVEWEDGKVSLMLGDECFDVGSKPVGADEHVFLL 220
Query: 291 RHG--KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPERE 348
H G L+SQ M F PS L+S++HR +TA + R K +K K T+ DPE
Sbjct: 221 AHQTESGALESQVEFTDHMTFRPSGLTSDTHRHITAQIADRQVKKHKTKMYYTEKDPELI 280
Query: 349 KEEKERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLEDALEEDDETDYHDSR 408
K+E E E++ ++A L +R+K + +YT + RR G + A D+ YH
Sbjct: 281 KQELELQEAERLKAQKKLEAQRKKADMRYTDSARRR---EFGDYDTA----DDYGYHQIN 333
Query: 409 RSRRRFE 415
R++ R+E
Sbjct: 334 RAQDRYE 340
>gi|170030495|ref|XP_001843124.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167867365|gb|EDS30748.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 712
Score = 111 bits (277), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 101/177 (57%), Gaps = 12/177 (6%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTYV----EEDTFVTDESGAKKRIRLE-NNIVRWRTVKNKD 246
++ +K+ N + ++ +PFD TY EE+T DE G ++R++L+ N +RWR + +
Sbjct: 397 LHFVKLPNFLSVETRPFDADTYEDEIDEEETL--DEEG-RQRVKLKVGNTIRWRNNIDDE 453
Query: 247 GMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKK 306
G E+NAR VRWSDGS+ L +G+E+ D+ Q Q D HLF+R G G LQ Q K
Sbjct: 454 GNGVRETNARLVRWSDGSMSLHLGSEIFDVYKQPLQGDHNHLFIRQGTG-LQGQSVFRTK 512
Query: 307 MRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDI--DPEREKEEKERAESQNIR 361
+ F P S S +H+ +T + R K +K +T + DP+ +++ + E + +R
Sbjct: 513 LTFRPHSTESFTHKKMTMSLADRSTKTSGIK-ILTQVGFDPDTNRKQNIKKEEEKLR 568
>gi|328792720|ref|XP_624295.3| PREDICTED: hypothetical protein LOC551906 [Apis mellifera]
Length = 624
Score = 111 bits (277), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 59/186 (31%), Positives = 108/186 (58%), Gaps = 8/186 (4%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTY---VEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGM 248
++ +K+ N + ++ +PFD +TY ++E+ + +E A+ ++++EN + RW+ + G
Sbjct: 338 IHFVKLPNFLSVETRPFDTETYEDEIDEEETLDEEGRARLKLKVENTL-RWKETFDDQGK 396
Query: 249 TSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMR 308
ESNARFV+WSDGS+ L +G+E+ D+ Q Q D HL++R G G LQ Q K+
Sbjct: 397 VIKESNARFVKWSDGSMSLHLGSEIFDVYKQPLQGDHNHLYIRQGTG-LQGQAVFRTKLT 455
Query: 309 FIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDI--DPEREKEEKERAESQNIRANVLL 366
F P S S +HR +T + R +K +K ++ + +P++ + E + E + +R + +
Sbjct: 456 FRPHSTESFTHRKMTMSLADRSQKTSGIK-VLSQVGTNPDQNRYEMIKKEEEKLRMAMRV 514
Query: 367 NRKREK 372
K +K
Sbjct: 515 QSKTKK 520
>gi|157112516|ref|XP_001651815.1| hypothetical protein AaeL_AAEL006172 [Aedes aegypti]
gi|108878036|gb|EAT42261.1| AAEL006172-PA [Aedes aegypti]
Length = 678
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 64/177 (36%), Positives = 102/177 (57%), Gaps = 12/177 (6%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTYV----EEDTFVTDESGAKKRIRLE-NNIVRWRTVKNKD 246
++ +K+ N + ++ +PFD +TY EE+T DE G ++R++L+ N +RWR + +
Sbjct: 360 LHFVKLPNFLSVETRPFDVETYEDEIDEEETL--DEEG-RQRLKLKVGNTIRWRNNIDDE 416
Query: 247 GMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKK 306
G E+NAR VRWSDGS+ L +G+E+ D+ Q Q D HLF+R G G LQ Q K
Sbjct: 417 GNGIRETNARLVRWSDGSMSLHLGSEIFDVYKQPLQGDHNHLFIRQGTG-LQGQSVFRTK 475
Query: 307 MRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDI--DPEREKEEKERAESQNIR 361
+ F P S S +H+ +T + R K +K +T + DP+ ++E R E + +R
Sbjct: 476 LTFRPHSTESFTHKKMTMSLADRSTKTSGIK-ILTQVGFDPDANRKENIRKEEEKLR 531
>gi|195451713|ref|XP_002073044.1| GK13925 [Drosophila willistoni]
gi|194169129|gb|EDW84030.1| GK13925 [Drosophila willistoni]
Length = 653
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 67/176 (38%), Positives = 96/176 (54%), Gaps = 12/176 (6%)
Query: 193 NMIKVSNIMGIDPKPFDPKTYV----EEDTFVTDESGAKKRIRLE-NNIVRWRTVKNKDG 247
+ IK+ N + + PFDP TY EE+T DE G ++RI+L+ +N +RWR N G
Sbjct: 342 HFIKLPNFLSVVTHPFDPDTYEDEIDEEETM--DEEG-RQRIKLKVSNTIRWREFMNNKG 398
Query: 248 MTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKM 307
ESNAR+VRWSDGS+ L +GNE+ D Q D HLF+R G G LQ Q K+
Sbjct: 399 EMERESNARYVRWSDGSMSLHLGNEIFDAYRQPLLGDHNHLFIRQGTG-LQGQSVFRTKL 457
Query: 308 RFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDI--DPEREKEEKERAESQNIR 361
F P S S +H+ +T + R K +K +T + DP ++ + + E +R
Sbjct: 458 TFRPHSTESFTHKKMTMSLADRSTKTSGIK-ILTQVGKDPTTDRPTQLKEEEARLR 512
>gi|380023451|ref|XP_003695536.1| PREDICTED: uncharacterized protein LOC100872303 [Apis florea]
Length = 625
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 59/186 (31%), Positives = 108/186 (58%), Gaps = 8/186 (4%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTY---VEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGM 248
++ +K+ N + ++ +PFD +TY ++E+ + +E A+ ++++EN + RW+ + G
Sbjct: 339 IHFVKLPNFLSVETRPFDTETYEDEIDEEETLDEEGRARLKLKVENTL-RWKETFDDQGK 397
Query: 249 TSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMR 308
ESNARFV+WSDGS+ L +G+E+ D+ Q Q D HL++R G G LQ Q K+
Sbjct: 398 VIKESNARFVKWSDGSMSLHLGSEIFDVYKQPLQGDHNHLYIRQGTG-LQGQAVFRTKLT 456
Query: 309 FIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDI--DPEREKEEKERAESQNIRANVLL 366
F P S S +HR +T + R +K +K ++ + +P++ + E + E + +R + +
Sbjct: 457 FRPHSTESFTHRKMTMSLADRSQKTSGIK-VLSQVGTNPDQNRYEMIKKEEEKLRMAMRV 515
Query: 367 NRKREK 372
K +K
Sbjct: 516 QSKTKK 521
>gi|303287000|ref|XP_003062789.1| PAF1 complex protein Leo1 [Micromonas pusilla CCMP1545]
gi|226455425|gb|EEH52728.1| PAF1 complex protein Leo1 [Micromonas pusilla CCMP1545]
Length = 293
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 65/179 (36%), Positives = 100/179 (55%), Gaps = 11/179 (6%)
Query: 169 EKP-VGPPLELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTY-VEEDTFVTDESGA 226
EKP P L +++P P + + K +N+ ++P+P+DPKT+ +E +T +
Sbjct: 101 EKPEPAPDLHIDVPYIDPIPEGVTAKIFKQTNVCRVEPQPYDPKTFQLETETITHADGTT 160
Query: 227 KKRIRLENNIVRWRTVKNKDGMT--------SYESNARFVRWSDGSLQLQIGNEVLDITV 278
+ R +N++RWR V ++DG ESNA+FVRWSDGSL L IG EVL +T
Sbjct: 161 ETRPADVHNVIRWRHVLDEDGKQVMGEDGEPKIESNAKFVRWSDGSLTLHIGEEVLRVTE 220
Query: 279 QDAQHDQAHLFLRHGKGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVK 337
D + +++ RH G+LQ I +++ P +L S +H LT VD+RH K KV+
Sbjct: 221 TDMSGEHQYVYARH-PGVLQGVKHIESRLQMAPRNLDSATHSRLTRRVDARHVKSKKVQ 278
>gi|347969528|ref|XP_312950.5| AGAP003242-PA [Anopheles gambiae str. PEST]
gi|333468563|gb|EAA08401.5| AGAP003242-PA [Anopheles gambiae str. PEST]
Length = 678
Score = 110 bits (274), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 60/176 (34%), Positives = 105/176 (59%), Gaps = 10/176 (5%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTY---VEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGM 248
++ +K+ N + ++ +PFD TY ++E+ + +E + ++++EN I RWR ++ G
Sbjct: 361 IHFVKLPNFLSVETRPFDVDTYEDEIDEEETLDEEGRQRLKLKVENTI-RWRNNFDRQGN 419
Query: 249 TSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMR 308
E+NARFV+WSDGS+ L +G+E+ D+ Q Q D HLF+R G G LQ Q K+
Sbjct: 420 ALRETNARFVKWSDGSMSLHLGSEIFDVYKQPLQGDHNHLFIRQGTG-LQGQSVFRTKLT 478
Query: 309 FIPSSLSSNSH-RLLTALVDSRHKKVYKVKNCITDI--DPEREKEEKERAESQNIR 361
F P S S +H ++L +L D R +K +K +T + DP+ ++++ + E + +R
Sbjct: 479 FRPHSTESFTHQKMLISLAD-RSQKTSGIK-ILTQVGFDPDADRKQNLKKEEEKLR 532
>gi|195502021|ref|XP_002098042.1| GE10143 [Drosophila yakuba]
gi|194184143|gb|EDW97754.1| GE10143 [Drosophila yakuba]
Length = 732
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 78/209 (37%), Positives = 109/209 (52%), Gaps = 15/209 (7%)
Query: 163 VEPKHKEKPVGPPLELEIPLRPP--PADPTK-MNMIKVSNIMGIDPKPFDPKTYV----E 215
VEP KE P E I + P AD K + IK+ N + + PFD +TY E
Sbjct: 370 VEPAPKEDEPEPLPETRIDVEIPRISADLGKEQHFIKLPNFLSVVTHPFDSETYEDEIDE 429
Query: 216 EDTFVTDESGAKKRIRLE-NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVL 274
E+T DE G ++RI+L+ +N +RWR + G ESNARFVRWSDGS+ L +GNE+
Sbjct: 430 EETM--DEEG-RQRIKLKVSNTIRWREFMDNKGDMVRESNARFVRWSDGSMSLHLGNEIF 486
Query: 275 DITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVY 334
D Q D HLF+R G G LQ Q K+ F P S S +H+ +T + R K
Sbjct: 487 DAYRQPLLGDHNHLFIRQGTG-LQGQSVFRTKLTFRPHSTESFTHKKMTMSLADRSSKTS 545
Query: 335 KVKNCITDI--DPEREKEEKERAESQNIR 361
+K +T + DP ++ + + E +R
Sbjct: 546 GIK-ILTQVGKDPTTDRPTQLKEEEAKLR 573
>gi|170594523|ref|XP_001902013.1| Leo1-like protein [Brugia malayi]
gi|158590957|gb|EDP29572.1| Leo1-like protein [Brugia malayi]
Length = 476
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 67/176 (38%), Positives = 96/176 (54%), Gaps = 9/176 (5%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG---AKKRIRLENNIVRWRTVKNKDGM 248
+ +K N + I+P+PFD Y +E A+ ++R+EN I RWR V + G
Sbjct: 202 LYFVKFPNFLSIEPRPFDQDHYEDEFDEDDLLDEEGRARLKLRVENTI-RWRYVLDDAGN 260
Query: 249 TSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMR 308
+ N++ VRWSDG++ L +G E+ DITVQ Q D HLFLR G G LQ +K+
Sbjct: 261 IQKQGNSKVVRWSDGTMSLYLGGEIFDITVQPMQLDN-HLFLRQGAG-LQGHAVFKEKLV 318
Query: 309 FIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDI--DPEREKEEKERAESQNIRA 362
F P S S +HR +T + R K KVK +T + +PE +K E R E + +RA
Sbjct: 319 FRPHSTDSITHRKVTLSMADRSNKSQKVK-VLTAVGSNPESKKAEIVRKEEERMRA 373
>gi|393912546|gb|EFO28440.2| hypothetical protein LOAG_00036 [Loa loa]
Length = 499
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 67/176 (38%), Positives = 96/176 (54%), Gaps = 9/176 (5%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG---AKKRIRLENNIVRWRTVKNKDGM 248
+ +K N + I+P+PFD Y +E A+ ++R+EN I RWR V + G
Sbjct: 226 LYFVKFPNFLSIEPRPFDQDHYEDEFDEDDLLDEEGRARLKLRVENTI-RWRYVLDDAGN 284
Query: 249 TSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMR 308
+ N++ VRWSDG++ L +G E+ DITVQ Q D HLFLR G G LQ +K+
Sbjct: 285 IQKQGNSKVVRWSDGTMSLYLGGEIFDITVQPMQLDN-HLFLRQGAG-LQGHAVFKEKLI 342
Query: 309 FIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDI--DPEREKEEKERAESQNIRA 362
F P S S +HR +T + R K KVK +T + +PE +K E R E + +RA
Sbjct: 343 FRPHSTDSITHRKVTLSMADRSNKSQKVK-VLTAVGSNPESKKAEIVRKEEERMRA 397
>gi|210075238|ref|XP_002143008.1| YALI0B07095p [Yarrowia lipolytica]
gi|199425147|emb|CAG82826.4| YALI0B07095p [Yarrowia lipolytica CLIB122]
Length = 413
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 74/235 (31%), Positives = 119/235 (50%), Gaps = 25/235 (10%)
Query: 180 IPLRPPPADPT-KMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLEN---N 235
+P PP P K+ + +V + IDP PFD +++ ++ K +RL+ N
Sbjct: 123 LPRFPPAIKPNEKLFLSRVPAFLQIDPHPFDSSEFLKSAEEGNKDADTSKSLRLKTQAEN 182
Query: 236 IVRWRTVKNKDGMT-SYESNARFVRWSDGSLQLQIGNEVLDIT---VQDAQHDQAHLFLR 291
+RW+ K DG T + ESNARFV+WSDGSL LQ+G E+ D+ V+D L L
Sbjct: 183 TLRWKYGKGGDGSTVTKESNARFVKWSDGSLSLQLGEELFDVIRKPVEDTS-----LALS 237
Query: 292 HG-KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKE 350
H + +LQ+ G + + ++P+S S +H+ LT + R K V + T DP++ +
Sbjct: 238 HPEQEVLQTSGVLSENFSYVPTSTKSLTHKRLTEALAKRTLKSAAVSSWDTTDDPDKVQR 297
Query: 351 EKERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLEDALEEDDETDYH 405
E E+AE+ A L KR + + +T DA+ ED+E +++
Sbjct: 298 EAEKAENMANAARRKLESKRRQAEEDFGKTA-----------ADAMAEDEEMEHN 341
>gi|357620947|gb|EHJ72955.1| hypothetical protein KGM_06508 [Danaus plexippus]
Length = 687
Score = 105 bits (262), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 58/178 (32%), Positives = 100/178 (56%), Gaps = 8/178 (4%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTY---VEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGM 248
++ +K+ N + ++ +P+DP TY ++E+ + +E A+ ++++EN K K+G
Sbjct: 408 LHFVKLPNFLSVETRPYDPNTYEDEIDEEETLDEEGRARLKLKVENTYATA-CHKLKEGN 466
Query: 249 TSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMR 308
ESNAR V+WSDGS+ L +G+E+ D+ Q D HLF+R G G LQ Q K+
Sbjct: 467 AVKESNARMVKWSDGSMSLHLGSEIFDVYKQPLHGDHNHLFVRQGTG-LQGQAVFRTKLS 525
Query: 309 FIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDI--DPEREKEEKERAESQNIRANV 364
F P S S +HR +T V R K +K ++ + DP+ +++ + + E +RA +
Sbjct: 526 FRPHSTDSFTHRKMTLSVADRSTKTSAIK-ILSQVGSDPDADRKYQLKKEEMELRAAM 582
>gi|66808387|ref|XP_637916.1| RNA polymerase II complex component [Dictyostelium discoideum AX4]
gi|74853579|sp|Q54MB8.1|LEO1_DICDI RecName: Full=RNA polymerase-associated protein LEO1
gi|60466344|gb|EAL64404.1| RNA polymerase II complex component [Dictyostelium discoideum AX4]
Length = 487
Score = 105 bits (261), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 94/155 (60%), Gaps = 5/155 (3%)
Query: 191 KMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGMTS 250
K+ +K+ NI+GI PKPFDP T+ +E+ DE K + +E+ ++RWR + +G +
Sbjct: 181 KIMKLKLLNILGIQPKPFDPITFEDEEAMNGDE---KSKFNVES-VIRWRWGLDLNGRPA 236
Query: 251 YESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRFI 310
ESN R V WSDGS L IGNEVL+I Q Q++Q +++ G ++ +G+I ++
Sbjct: 237 KESNTRLVTWSDGSSHLYIGNEVLEIKEQPLQNEQFYVYSSQ-DGFIECEGKIDSRLSIR 295
Query: 311 PSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDP 345
P+++ S H+ L+ V R KV K+K+ T +DP
Sbjct: 296 PTNIKSKVHQRLSENVAKRTVKVSKIKSIHTTLDP 330
>gi|397515698|ref|XP_003828084.1| PREDICTED: RNA polymerase-associated protein LEO1-like [Pan
paniscus]
Length = 347
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 65/190 (34%), Positives = 103/190 (54%), Gaps = 13/190 (6%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTY---VEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGM 248
+ +K+ + I+PKPFDP+ Y E++ + +E + ++++EN I RWR ++K+G
Sbjct: 72 LYFVKLPKFLSIEPKPFDPQFYEDEFEDEKVLDEEDRIRLKLKVENTI-RWRIRRDKEGN 130
Query: 249 TSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMR 308
ESNAR V+WSD S+ L +GNEV D+ + HLF+R G LQ Q K+
Sbjct: 131 KIKESNARMVKWSDRSMSLHLGNEVFDVYKAPLLGNYNHLFIREDTG-LQGQAVFKSKLT 189
Query: 309 FIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPE-------REKEEKERAESQNI 360
F P S S ++R +T + +R K K++ + DPE ++KEE+ RA +Q
Sbjct: 190 FRPHSRDSATYRKMTLPLANRSSKTQKIRILPMAGRDPEGQHTEVMKKKEERLRASTQRE 249
Query: 361 RANVLLNRKR 370
+ L KR
Sbjct: 250 SQGIHLREKR 259
>gi|410049414|ref|XP_003952747.1| PREDICTED: RNA polymerase-associated protein LEO1-like [Pan
troglodytes]
Length = 347
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 65/190 (34%), Positives = 103/190 (54%), Gaps = 13/190 (6%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTY---VEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGM 248
+ +K+ + I+PKPFDP+ Y E++ + +E + ++++EN I RWR ++K+G
Sbjct: 72 LYFVKLPKFLSIEPKPFDPQFYEDEFEDEKVLDEEDRIRLKLKVENTI-RWRIRRDKEGN 130
Query: 249 TSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMR 308
ESNAR V+WSD S+ L +GNEV D+ + HLF+R G LQ Q K+
Sbjct: 131 KIKESNARMVKWSDRSMSLHLGNEVFDVYKAPLLGNYNHLFIREDTG-LQGQAVFKSKLT 189
Query: 309 FIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPE-------REKEEKERAESQNI 360
F P S S ++R +T + +R K K++ + DPE ++KEE+ RA +Q
Sbjct: 190 FRPHSRDSATYRKMTLPLANRSSKTQKIRILPMAGRDPEGQHTEVMKKKEERLRASTQRE 249
Query: 361 RANVLLNRKR 370
+ L KR
Sbjct: 250 SQGIHLREKR 259
>gi|412992201|emb|CCO19914.1| RNA polymerase-associated protein LEO1 [Bathycoccus prasinos]
Length = 433
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 62/184 (33%), Positives = 103/184 (55%), Gaps = 11/184 (5%)
Query: 196 KVSNIMGIDPKPFDPKTYVEE-DTFVTDESGAKKRIRLENNIVRWRTVKNKDGMTS-YES 253
K +NI ++ K FD T+ EE D + DE G + E N+ RW +D T+ +S
Sbjct: 145 KATNIFDVNAKKFDENTFQEEEDKTIVDEFGRSRLKAKEGNVARW----TRDERTNELKS 200
Query: 254 NARFVRWSDGSLQLQIGNEVLDI---TVQDAQHDQAHLFLRHGKGILQSQGRILKKMRFI 310
NA+FV WSDGS L +G EV ++ V Q ++F RH +G+++ + +I K+M+F
Sbjct: 201 NAKFVEWSDGSRTLHLGEEVFNVMESAVSSKQGLNQYVFARH-QGLMEGKAKISKRMQFR 259
Query: 311 PSSLSSNSHRLLTALVDSRHKKVYK-VKNCITDIDPEREKEEKERAESQNIRANVLLNRK 369
P++ S++H+ L ++ +H K ++++DPE+EK+ +R Q R L RK
Sbjct: 260 PATTDSHTHKRLRDAIEKKHGARAKGTIRFVSNVDPEKEKDASDRLFEQRAREEAQLQRK 319
Query: 370 REKI 373
R+K+
Sbjct: 320 RKKM 323
>gi|297296459|ref|XP_001083776.2| PREDICTED: RNA polymerase-associated protein LEO1-like [Macaca
mulatta]
Length = 735
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 68/191 (35%), Positives = 101/191 (52%), Gaps = 16/191 (8%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTYVEE--DTFVTDESGAKKRIRLE---NNIVRWRTVKNKD 246
+ +K+ + I+PKPFDP+ Y +E D V DE + RIRL+ N +RWR ++++
Sbjct: 373 LYFVKLPKFLSIEPKPFDPQFYEDEFEDEKVLDE---EDRIRLKLKVENTIRWRIRRDEE 429
Query: 247 GMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKK 306
G ESNAR V+WSD S+ L +GNEV D+ + HLF+R G LQ Q K
Sbjct: 430 GNKIKESNARMVKWSDRSMSLHLGNEVFDVYKAPLLGNYIHLFIREDTG-LQGQAIFKSK 488
Query: 307 MRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPERE------KEEKERAESQN 359
+ F P S S ++R +T + +R K K++ + DPE + KEE+ RA +
Sbjct: 489 LTFRPHSRDSATYRKMTLPLANRSSKTQKIRILPMAGRDPEGQHTEVMKKEERLRASTHR 548
Query: 360 IRANVLLNRKR 370
+ L KR
Sbjct: 549 ESQAIHLREKR 559
>gi|328771472|gb|EGF81512.1| hypothetical protein BATDEDRAFT_87709 [Batrachochytrium
dendrobatidis JAM81]
Length = 408
Score = 103 bits (257), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 77/245 (31%), Positives = 119/245 (48%), Gaps = 22/245 (8%)
Query: 174 PPLELEIPLRPPPADPT----KMNMIKVSNIMGIDPKPFDPKTYVEEDT-FVTDESGAKK 228
P L +E L P P + +IK N + I+P P+D TY + D DE G+++
Sbjct: 117 PELVVETSL-PTVGGPRPENENLFLIKPPNFLHIEPIPYDRATYAQLDNDLEKDEDGSRE 175
Query: 229 RIRLE-NNIVRWRT-VKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQA 286
R RL N +RWR K+ D S NAR VRW D S L +G E ++ V Q+
Sbjct: 176 RARLHVENTIRWRYDAKHPDQKNS---NARLVRWEDNSFSLLVGEEQFEVAVTSLQNQHH 232
Query: 287 HLFLRHGK-GILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDP 345
+L ++H + G +Q++ R K M F P S S +H+ +T + +H K + + T DP
Sbjct: 233 YLAVQHSEEGYIQNRARFNKLMSFRPFSAQSLTHKRVTMAIVGKHSKKNRTRLITTTEDP 292
Query: 346 EREKEEKERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLEDALEEDDETDYH 405
E+ K+ E+AE++ ++A R K+ K T R R Y + DD++ Y
Sbjct: 293 EKAKQLVEKAETERLKA-------RRKLESKRASTSARERGGRVDYYNGS---DDDSHYR 342
Query: 406 DSRRS 410
R+
Sbjct: 343 SGNRN 347
>gi|441616941|ref|XP_004088409.1| PREDICTED: RNA polymerase-associated protein LEO1-like [Nomascus
leucogenys]
Length = 349
Score = 102 bits (254), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 59/177 (33%), Positives = 97/177 (54%), Gaps = 6/177 (3%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTY---VEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGM 248
+ K+ + I+PKPFDP+ Y E++ + +E + ++++EN I RWR ++K+G
Sbjct: 71 LYFAKLPKFLSIEPKPFDPQFYEDEFEDEKVLDEEDRIRLKLKVENTI-RWRIRQDKEGN 129
Query: 249 TSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMR 308
ESNAR V+WSD S+ L +GNEV D+ + HLF+R G LQ Q K+
Sbjct: 130 KIKESNARMVKWSDRSMSLHLGNEVFDVYKAPLLGNYIHLFIREDTG-LQGQAVFKSKLT 188
Query: 309 FIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEKERAESQNIRANV 364
F P S S ++R +T + +R K K++ + DPE + E + + + +RA+
Sbjct: 189 FRPHSRDSATYRKMTLPLANRSSKTQKIRILPMAGRDPEGQHTEVMKKKEERLRAST 245
>gi|395746737|ref|XP_003778500.1| PREDICTED: RNA polymerase-associated protein LEO1-like [Pongo
abelii]
Length = 353
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 59/177 (33%), Positives = 98/177 (55%), Gaps = 6/177 (3%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTY---VEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGM 248
+ +K+ + I+PKPFDP+ Y E++ + +E + ++++EN I RWR ++K+G
Sbjct: 78 LYFVKLPKFLSIEPKPFDPQFYEDEFEDEKVLDEEDRIRLKLKVENTI-RWRIRRDKEGN 136
Query: 249 TSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMR 308
ESNAR V+WSD S+ L +GNEV D+ + HLF+R G LQ Q K+
Sbjct: 137 KIKESNARMVKWSDRSMSLHLGNEVFDVYKAPLLGNYIHLFIREDTG-LQGQAVFKSKLT 195
Query: 309 FIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEKERAESQNIRANV 364
F P S S ++R +T + +R K K++ + DPE + E + + + +RA+
Sbjct: 196 FRPHSRDSATYRKMTLPLANRSSKTQKIRILPMAGRDPEGQHMEVMKKKEERLRAST 252
>gi|149019156|gb|EDL77797.1| Leo1, Paf1/RNA polymerase II complex component, homolog (S.
cerevisiae), isoform CRA_b [Rattus norvegicus]
Length = 264
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 65/176 (36%), Positives = 99/176 (56%), Gaps = 9/176 (5%)
Query: 220 VTDESGAKKRIRLE-NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITV 278
+ DE G + R++L+ N +RWR ++++G ESNAR V+WSDGS+ L +GNEV D+
Sbjct: 1 MLDEEG-RTRLKLKVENTIRWRMRRDEEGNEIKESNARIVKWSDGSMSLHLGNEVFDVYK 59
Query: 279 QDAQHDQAHLFLRHGKGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKN 338
Q D HLF+R G G LQ Q K+ F P S S +HR +T + R K K++
Sbjct: 60 APLQGDHNHLFIRQGTG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRI 118
Query: 339 C-ITDIDPEREKEEKERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE 393
+ DPE ++ E + E + +RA++ +RE R+ + + +R LS YLE
Sbjct: 119 LPMAGRDPECQRTEMIKKEEERLRASI----RRESQQRRMREK-QHQRGLSASYLE 169
>gi|148694371|gb|EDL26318.1| mCG12252, isoform CRA_a [Mus musculus]
Length = 264
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 65/176 (36%), Positives = 99/176 (56%), Gaps = 9/176 (5%)
Query: 220 VTDESGAKKRIRLE-NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITV 278
+ DE G + R++L+ N +RWR ++++G ESNAR V+WSDGS+ L +GNEV D+
Sbjct: 1 MLDEEG-RTRLKLKVENTIRWRIRRDEEGNEIKESNARIVKWSDGSMSLHLGNEVFDVYK 59
Query: 279 QDAQHDQAHLFLRHGKGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKN 338
Q D HLF+R G G LQ Q K+ F P S S +HR +T + R K K++
Sbjct: 60 APLQGDHNHLFIRQGTG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRI 118
Query: 339 C-ITDIDPEREKEEKERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE 393
+ DPE ++ E + E + +RA++ +RE R+ + + +R LS YLE
Sbjct: 119 LPMAGRDPECQRTEMIKKEEERLRASI----RRESQQRRMREK-QHQRGLSASYLE 169
>gi|409040956|gb|EKM50442.1| hypothetical protein PHACADRAFT_263737 [Phanerochaete carnosa
HHB-10118-sp]
Length = 542
Score = 99.0 bits (245), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 66/211 (31%), Positives = 101/211 (47%), Gaps = 44/211 (20%)
Query: 194 MIKVSNIMGIDPKPFDPKTYV----EEDTFVTDESGAKK----RIRLENNIVRWRTVKNK 245
+I++ N + +D KPF P TY+ EE+ ES +K ++R+EN I RW +K++
Sbjct: 138 VIRIPNFIKVDSKPFHPDTYIGPEQEEEAAHVGESSREKDMSIKLRVENTI-RWHWIKDE 196
Query: 246 DGMTSYESNARFVRWSDGSLQLQIGNEVLDIT-----------------VQDAQHDQ--- 285
G +SN+R +RWSDGSL L++G E+ DIT Q +Q
Sbjct: 197 HGHDHRQSNSRIIRWSDGSLSLKLGKELFDITQTIDTSGAIPRQAIGGNTQSSQQASQAP 256
Query: 286 -------------AHLFLRHGKG-ILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHK 331
+L +H + ILQ + I M P+ + S +HR+L V +H
Sbjct: 257 AAESSASAKAQGLTYLVAQHKRAEILQCEAMIAGTMSLRPTGMQSETHRMLVRAVGQKHT 316
Query: 332 KVYKVKNCI-TDIDPEREKEEKERAESQNIR 361
KV +++ +DPEREK E R S+ R
Sbjct: 317 KVARLRMAPDPTMDPEREKLELLRQVSKKTR 347
>gi|17538230|ref|NP_502135.1| Protein B0035.11 [Caenorhabditis elegans]
gi|3873697|emb|CAA97406.1| Protein B0035.11 [Caenorhabditis elegans]
Length = 430
Score = 99.0 bits (245), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 67/222 (30%), Positives = 119/222 (53%), Gaps = 20/222 (9%)
Query: 150 PDEDMHYESEEEHVEPKHKEKPVGPPLELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFD 209
PDE E +EHV H + V + LE PP + +++ N + + PFD
Sbjct: 134 PDEIDEDEKAKEHV---HDTEMVTGRVTLEYAADPP-------HFVRMPNFLSVATHPFD 183
Query: 210 PKTYVE----EDTFVTDESGAKKRIRLENNIVRWRTVKNKDGMTSYESNARFVRWSDGSL 265
P+ Y E E + DE + ++R+EN + RWR K+++G ESNA+ V+W DG++
Sbjct: 184 PQHYEEDEDDEQAKLDDEGRTRLKLRVENTL-RWRVRKDENGKEIRESNAKIVKWDDGTM 242
Query: 266 QLQIGNEVLDIT-VQDAQHDQAHLFLRHGKGILQSQGRILKKMRFIPSSLSSNSHRLLTA 324
L +GNE+ +++ V ++ HL+++ ++ +Q + +M F P S S +HR +T
Sbjct: 243 SLYLGNEIFEVSLVPLNSNNLPHLYVKQ-PTLMSAQAVLTHRMTFRPHSTDSQTHRKVTL 301
Query: 325 LVDSRHKKVYKVKNCITDI--DPEREKEEKERAESQNIRANV 364
+ R +K +VK + D+ +PE + E R E +++RA++
Sbjct: 302 NMADRSRKNAQVK-VMDDVGQNPEITRRENARKEEESLRAHI 342
>gi|341890600|gb|EGT46535.1| hypothetical protein CAEBREN_18382 [Caenorhabditis brenneri]
Length = 427
Score = 99.0 bits (245), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 73/247 (29%), Positives = 132/247 (53%), Gaps = 24/247 (9%)
Query: 195 IKVSNIMGIDPKPFDPKTYVEE---DTFVTDESG-AKKRIRLENNIVRWRTVKNKDGMTS 250
+++ N + + PFDP+ Y E+ D DE G + ++R+EN + RWR K++ G
Sbjct: 167 VRMPNFLSVATHPFDPQHYEEDEDDDQAKLDEEGRTRLKLRVENTL-RWRISKDEHGREV 225
Query: 251 YESNARFVRWSDGSLQLQIGNEVLDITVQDA-QHDQAHLFLRHGKGILQSQGRILKKMRF 309
+SNA+ V+W DG++ L +G+E+ D+T+ +++ HL+++H I+ +Q + +M F
Sbjct: 226 KQSNAKIVKWEDGTMSLYLGSEIFDVTLMPLNENNLPHLYVKHSN-IMSAQSVLSHRMNF 284
Query: 310 IPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDI---DPEREKEEKERAESQNIRANVLL 366
P S S +HR +T + R +K +VK + D +PE + E R E +++RA++
Sbjct: 285 RPHSTDSQTHRKVTLNMADRSRKNAQVK--VMDAVGQNPEITRRENARKEEESLRAHI-- 340
Query: 367 NRKREKINRKYTQTVERRRQLSTGYLEDALEEDDETDYHDSRRSRRRFEEELEAEVRAEK 426
R+ + + + + RR + G+ D EED T SRR +++ + A +E
Sbjct: 341 -RRTQMVRNNF----KVRRPVYGGHYSD--EEDMPT---SSRRGKKKEAPIIGASSESED 390
Query: 427 RIINAKK 433
N KK
Sbjct: 391 EGDNGKK 397
>gi|449547950|gb|EMD38917.1| hypothetical protein CERSUDRAFT_112625 [Ceriporiopsis subvermispora
B]
Length = 540
Score = 99.0 bits (245), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 68/210 (32%), Positives = 99/210 (47%), Gaps = 39/210 (18%)
Query: 180 IPLRPPP-ADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKK--------RI 230
IP P P K ++++ N + +D KPF P TYV D D A+ ++
Sbjct: 123 IPNIPVPRTSDGKYWVMRMPNFVKVDSKPFHPDTYVGPDQEDEDAQPAESLREKSMSIKL 182
Query: 231 RLENNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQ----------- 279
++EN VRWR VK+ DG +SN R +RWSDGSL L++G E+ D+T
Sbjct: 183 KVENT-VRWRWVKDADGRDRRQSNGRIIRWSDGSLSLKLGKELFDVTQTIDNSGAIPRSA 241
Query: 280 ---DAQHDQA-------------HLFLRHGK-GILQSQGRILKKMRFIPSSLSSNSHRLL 322
Q QA +L +H + GILQ + + M P+ + S +HR+L
Sbjct: 242 IGGSQQASQAPLPSGSTKSQGLTYLVAQHKRAGILQCEALVTGYMSLRPTGMQSETHRML 301
Query: 323 TALVDSRHKKVYKVKNCI-TDIDPEREKEE 351
V +H KV +++ DPEREK E
Sbjct: 302 VRAVGQKHHKVARLRMAPDPTTDPEREKME 331
>gi|406601538|emb|CCH46844.1| Midasin [Wickerhamomyces ciferrii]
Length = 420
Score = 98.2 bits (243), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 68/220 (30%), Positives = 110/220 (50%), Gaps = 12/220 (5%)
Query: 151 DEDMHYESEEEHVE-PKHKEKPVGPPLELEI-PLRPPPA---DPTKMNMIKVSNIMGIDP 205
++D+ S++E +E +EKP P + + +R PP + K+ K+ N + +DP
Sbjct: 88 NDDVQRYSDQEGLEFAVEQEKPTKEPTVIGLDVVRHPPGFQHEDEKLYYTKIPNFLQVDP 147
Query: 206 KPFDPKTYVEEDTFVTDESGAKKRIRLEN----NIVRWRTVKNKDGMTSYESNARFVRWS 261
FD K Y+EE ES +K I L N +RWR K+ +G +ESNA+ + WS
Sbjct: 148 TRFDGKAYLEE-LVENSESLSKDDIELNRLKAENTIRWRYAKDSNGELIHESNAQVIEWS 206
Query: 262 DGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRFIPSSLSSNSHRL 321
DGSL L++G+E DI + D G+L + G ++ +P+S +S H+
Sbjct: 207 DGSLSLKLGDEYFDI-YKTPLTDTFLTTFNEESGLLGTDGVFTDSLKLVPTSTNSKIHKK 265
Query: 322 LTALVDSRHKKVYKVKNCITDIDPEREKEEKERAESQNIR 361
LT V + K ++ D DP++E E+ + Q IR
Sbjct: 266 LTNAVQATQFKP-SAQSVFIDKDPDQEARRLEKQQEQLIR 304
>gi|403418704|emb|CCM05404.1| predicted protein [Fibroporia radiculosa]
Length = 531
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 108/382 (28%), Positives = 166/382 (43%), Gaps = 86/382 (22%)
Query: 102 EIRDVFGDSDEEDVGEYAIRNDIDQDSNRSPMEEEGSYEKSIRPEDIVPD------EDMH 155
E+ D+FG++++++V N + D +P E P D + +DM
Sbjct: 43 EMHDLFGEAEDDNV------NIVKHDEATTPASSE-----PAEPLDGLTSPERRHRQDME 91
Query: 156 Y-ESEEEHVEPKHKEKPVGPPLELEIPLRPPP---ADPTKMNMIKVSNIMGIDPKPFDPK 211
Y E+E++ EP + LE + P+ P + +I++ N + +D KPF P
Sbjct: 92 YAEAEQDEGEPV-----INQVLEADAPIPNIPVPRSSDGNYWVIRMPNFVKVDSKPFHPD 146
Query: 212 TYV-----EEDTF---VTDESGAKKRIRLENNIVRWRTVKNKDGMTSYESNARFVRWSDG 263
TY+ +ED E ++++EN VRWR VK+ +G +SN R VRWSDG
Sbjct: 147 TYIGPEQEDEDAQHAETVREKSMTIKLKVENT-VRWRWVKDGNGEDRRQSNGRIVRWSDG 205
Query: 264 SLQLQIGNEVLDITVQ----DAQHDQA---------------------------HLFLRH 292
+L LQ+G E+ DIT Q A QA +L +H
Sbjct: 206 TLSLQLGKELFDITQQVDTSGAMPRQAFGGSQQGSQSQPTASQPTAPGRSQGLTYLVAQH 265
Query: 293 GKG-ILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCI-TDIDPEREKE 350
+ ILQ + + M P+ + S +HR+L V +H KV +++ DPEREK
Sbjct: 266 KRAEILQCEALVTGYMSLRPTGMQSETHRMLVRAVGQKHNKVARLRMAPDPTTDPEREKL 325
Query: 351 EKERAESQNIRA-----NVLLNRKREKINRKYTQTVERRRQLSTGYLEDALEEDDETDYH 405
E + ++ R +VL R+R YT R+R TG D + DDE D
Sbjct: 326 EMMKQAARKPRKVRGEDDVLGARRRRA---SYT----RKR---TG---DDMWSDDEEDAV 372
Query: 406 DSRRSRRRFEEELEAEVRAEKR 427
S + +E A R KR
Sbjct: 373 FGGGSEDEYGDEGAAGGRGAKR 394
>gi|50308103|ref|XP_454052.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49643187|emb|CAG99139.1| KLLA0E02311p [Kluyveromyces lactis]
Length = 417
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 64/184 (34%), Positives = 99/184 (53%), Gaps = 12/184 (6%)
Query: 187 ADPTKMNMIKVSNIMGIDPKPFDPKTY---VEE--DTFVTDESGAKKRIRLENNIVRWRT 241
AD + KV + IDP PFDP ++ VE+ + + E R+ +E N +RWR
Sbjct: 114 ADDKTIYYAKVPQFLTIDPVPFDPPSFQKKVEDRVSKYSSKEDQLGDRL-IEENTIRWRY 172
Query: 242 VKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRH-GKGILQS- 299
++KD ESNA+ V WSDG+ L++G+E DI V + ++ +L + H + ++Q
Sbjct: 173 SRDKDERVFKESNAQIVEWSDGTFSLKVGDEYTDILVNNTEN--TYLTVSHEEQELIQCV 230
Query: 300 -QGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEKERAES 357
G I K M F+P+S +S H++L+ V R ++ N IT DPE EK E E+
Sbjct: 231 EGGEITKSMMFVPTSTNSKVHKILSKAVARRDERGQHGPNTYITRTDPELEKRELEKKHD 290
Query: 358 QNIR 361
Q +R
Sbjct: 291 QILR 294
>gi|320167212|gb|EFW44111.1| hypothetical protein CAOG_02136 [Capsaspora owczarzaki ATCC 30864]
Length = 465
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 78/236 (33%), Positives = 121/236 (51%), Gaps = 17/236 (7%)
Query: 170 KPVGPPLELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKK- 228
+P+GP L+ E+ P A P + +++K+ N + ID F P Y E+GA +
Sbjct: 168 RPLGPMLK-EMTASIPQALP-RSHLVKLPNFISIDNNEFIPSEYQPR------EAGADEG 219
Query: 229 -RIRLE-NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQA 286
IRL+ N +RWR+V + G ESNAR VRWS+G+ L +G+EV DI ++ D
Sbjct: 220 DHIRLDVENTIRWRSVDDGAGGAERESNARLVRWSNGTQSLFLGSEVFDIADKNILADNN 279
Query: 287 HLFLRHGKGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPE 346
HLF + ++ + + + F P S +S +HR LTA V SR K+ K T E
Sbjct: 280 HLFEVLPQYLI-AHAVVDSRFIFKPLSATSATHRKLTASVASRAAKMNKSSVMQTKAVSE 338
Query: 347 REKEEKERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLEDALEEDDET 402
+ + R E + IRA +RE R+ + + LS +LED L++ D++
Sbjct: 339 QSRLSAIRQEDETIRA----RSRREGRQRQRLEQAQ-NSGLSADFLEDRLDDGDDS 389
>gi|430811236|emb|CCJ31330.1| unnamed protein product [Pneumocystis jirovecii]
Length = 458
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 70/194 (36%), Positives = 108/194 (55%), Gaps = 7/194 (3%)
Query: 191 KMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-----AKKRIRLE-NNIVRWRTVKN 244
++ +K+ N + I KPFD ++Y+EE +E+ +RIRL+ N +RWR VKN
Sbjct: 129 QLYYVKMPNFLTIVQKPFDRESYLEEAQSEREETTIHQYDTNQRIRLKVENTIRWRYVKN 188
Query: 245 KDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRH-GKGILQSQGRI 303
KDG S +SNARF++WSDGSL L +G+E+ +++ + +L L H + +L S+ R
Sbjct: 189 KDGTYSKQSNARFIKWSDGSLSLLLGSELFSAVTKNSFSEHTYLCLSHESQNLLMSRKRF 248
Query: 304 LKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAESQNIRAN 363
K M F+P S++H+ LT + + K + + DPE+ K E ER E + IR
Sbjct: 249 TKNMTFLPIDTGSSTHKRLTEAILRGNMKKCSIVEFVNVEDPEKVKREAERIEEEKIRFR 308
Query: 364 VLLNRKREKINRKY 377
L KR + KY
Sbjct: 309 RRLETKRRAQDAKY 322
>gi|255717194|ref|XP_002554878.1| KLTH0F15928p [Lachancea thermotolerans]
gi|238936261|emb|CAR24441.1| KLTH0F15928p [Lachancea thermotolerans CBS 6340]
Length = 423
Score = 95.9 bits (237), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 66/198 (33%), Positives = 102/198 (51%), Gaps = 25/198 (12%)
Query: 180 IPLRPPPADPTK-MNMIKVSNIMGIDPKPFDPKTY-------VEEDTFVTDESGAKKRIR 231
+P P+ K + KV + IDP PFDP + + T V D+ R+
Sbjct: 103 VPYSTEPSQGDKTIYYTKVPQFLTIDPIPFDPPAFQSKIEERAAQRTSVEDQ--IDDRL- 159
Query: 232 LENNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQ-------HD 284
+E N VRWR ++K+ ESNA V+WSDG+ L++G+E D+ + D + HD
Sbjct: 160 IEENTVRWRYSRDKNQRVFKESNAHIVQWSDGTYSLRLGDEYTDVLINDTEDTYLAVSHD 219
Query: 285 QAHLFLRHGKGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDI 343
Q L ++ KG G + K M FIP+S +S SH+ L+ V + ++ + N I +
Sbjct: 220 QQEL-IQSVKG-----GEVTKNMIFIPTSTNSKSHQRLSKAVARKEQQEHHGPNTYILRV 273
Query: 344 DPEREKEEKERAESQNIR 361
DPE E++E E+ + Q IR
Sbjct: 274 DPELEQKELEKKQGQIIR 291
>gi|238582405|ref|XP_002389923.1| hypothetical protein MPER_10888 [Moniliophthora perniciosa FA553]
gi|215452715|gb|EEB90853.1| hypothetical protein MPER_10888 [Moniliophthora perniciosa FA553]
Length = 277
Score = 95.1 bits (235), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 68/221 (30%), Positives = 109/221 (49%), Gaps = 47/221 (21%)
Query: 189 PTKMN----MIKVSNIMGIDPKPFDPKTYV-----EEDTF---VTDESGAKKRIRLENNI 236
PT N +I++ N + +D KPF P TY+ +EDT E ++++EN +
Sbjct: 2 PTSSNGESWVIRMPNFVKLDSKPFHPDTYIGPEQEDEDTQHAETLREKSMTIKLKVENTL 61
Query: 237 VRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDIT------------------- 277
RWR +K++ G +SN+R +RWSDG+L L++G E+ DIT
Sbjct: 62 -RWRWIKDQSGNDKRQSNSRIIRWSDGTLSLRLGKELFDITQSVDTAGGVPRNINPSASQ 120
Query: 278 ----------VQDAQHDQAHLFLRHGKG-ILQSQGRILKKMRFIPSSLSSNSHRLLTALV 326
+Q AQ +L +H + +LQ++ I M P+ + S +HR+L V
Sbjct: 121 SQSQSQSQNPLQKAQ-GLTYLVAQHKRSQVLQAEAVITGHMTLRPTGMQSETHRMLVRAV 179
Query: 327 DSRHKKVYKVKNCI-TDIDPEREKEE--KERAESQNIRANV 364
+H KV +++ +DPEREK E K+ A+ R+NV
Sbjct: 180 GQKHNKVARLRMAPDPQVDPEREKMELMKQDAKKSKRRSNV 220
>gi|403214355|emb|CCK68856.1| hypothetical protein KNAG_0B04210 [Kazachstania naganishii CBS
8797]
Length = 463
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 77/240 (32%), Positives = 119/240 (49%), Gaps = 23/240 (9%)
Query: 134 EEEGSYEKSIRPEDIVPDEDMHYESEEEHVEPKHKEKPVGPPLELEIPLRPPPADPTKMN 193
EEE Y + ED+ DM S+ E E + KE V ++ +P + P D N
Sbjct: 98 EEEEMYNRKFYGEDV----DM---SDTEQAEHEFKEANV-ELVKHIVPYKLVPDDAIGTN 149
Query: 194 MIKVSNI---MGIDPKPFDPKTYVEE-----DTFVTDESGAKKRIRLENNIVRWRTVKNK 245
I V+ + + IDP PFDP ++ + D ++ E R+ ++ N VRWR ++
Sbjct: 150 DIYVAKVPPFLMIDPAPFDPTSFEGKVADRLDGSLSKEDRLGDRL-IDENTVRWRYSRDT 208
Query: 246 DGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKG-ILQS--QGR 302
+ ESN + V+WSDGS L++GNE DI + D D L + H + ++Q G
Sbjct: 209 NQSVFKESNTQIVQWSDGSYSLKLGNEYTDILINDT--DNTFLTVSHDQQELMQCVEGGL 266
Query: 303 ILKKMRFIPSSLSSNSHRLLTALVDSRH-KKVYKVKNCITDIDPEREKEEKERAESQNIR 361
+ K + FIP+S SS H+ L+ V R+ + I +DPE EK+E ER + Q +
Sbjct: 267 VNKTLMFIPTSTSSKIHQKLSKAVARRNVMQSAGPGTYIVSVDPEIEKKELERKQGQIFK 326
>gi|323352097|gb|EGA84634.1| Leo1p [Saccharomyces cerevisiae VL3]
Length = 464
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 64/193 (33%), Positives = 104/193 (53%), Gaps = 17/193 (8%)
Query: 196 KVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKK-----RIRLENNIVRWRTVKNKDGMTS 250
++ N + IDP PFDP ++ + S +++ R+ ++ N VRWR ++KD
Sbjct: 172 RIPNFLTIDPIPFDPPSFEAKVNERASNSASREDQLDDRL-IDENTVRWRYSRDKDQHVF 230
Query: 251 YESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKG-ILQSQ--GRILKKM 307
ESN + V+WSDG+ L++G E DI V D + L + H + ++Q G I K +
Sbjct: 231 KESNTQIVQWSDGTYSLKVGEECTDILVNDTSN--TFLTVSHDQQELIQCYEGGEIKKTL 288
Query: 308 RFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEKERAESQNIRANVLL 366
FIP+S +S H+ L+ V R+++ K I +DPE EK+E ER +SQ +L
Sbjct: 289 MFIPTSTNSKIHQKLSKAVIRRNQRQSKGPGTYIVSMDPEVEKKELERKQSQ-----ILR 343
Query: 367 NRKREKINRKYTQ 379
+R+R ++ K Q
Sbjct: 344 DRRRRQLKEKEKQ 356
>gi|410079412|ref|XP_003957287.1| hypothetical protein KAFR_0D05050 [Kazachstania africana CBS 2517]
gi|372463872|emb|CCF58152.1| hypothetical protein KAFR_0D05050 [Kazachstania africana CBS 2517]
Length = 419
Score = 93.6 bits (231), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 64/192 (33%), Positives = 102/192 (53%), Gaps = 14/192 (7%)
Query: 180 IPLRPPPA--DPTKMNMIKVSNIMGIDPKPFDPKTY---VEEDT--FVTDESGAKKRIRL 232
+P + P D + KV + IDP PFDP ++ V+E T F + E R+ +
Sbjct: 105 VPYKTFPENEDDKTIYYTKVPAFLSIDPLPFDPPSFENKVKERTSNFSSVEDQLGDRL-I 163
Query: 233 ENNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRH 292
+ N +RWR ++ + ESNA+ V+WSDGS L++G E DI V D L + H
Sbjct: 164 DENTIRWRYSRDAEQRVFKESNAQIVQWSDGSYSLKLGAEYTDILVNGT--DNTFLTVSH 221
Query: 293 GKG-ILQS--QGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKV-KNCITDIDPERE 348
+ ++Q G+I K + FIP+S +S H+ L+ + R ++V+ I ++DPE E
Sbjct: 222 DQQELMQCVDGGQINKTLIFIPTSTNSKIHQRLSKAIARRDQRVHSGPGTMIVNLDPEVE 281
Query: 349 KEEKERAESQNI 360
K E E+ ++Q I
Sbjct: 282 KRELEKKQTQII 293
>gi|302685904|ref|XP_003032632.1| hypothetical protein SCHCODRAFT_107897 [Schizophyllum commune H4-8]
gi|300106326|gb|EFI97729.1| hypothetical protein SCHCODRAFT_107897, partial [Schizophyllum
commune H4-8]
Length = 471
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 70/235 (29%), Positives = 108/235 (45%), Gaps = 36/235 (15%)
Query: 161 EHVEPKHKEKPVGPPLELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTY------V 214
E EP E PV E E+ L P + + + N + +D KPF +TY +
Sbjct: 85 EDTEPLQDEAPVALK-EAEVSLPNLPLPRSTDGSVAMPNFLTLDTKPFHSETYEGPENEM 143
Query: 215 EEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVL 274
E ++++EN I RWR KN+ G +SN+R +RWSDGSL L++G E+
Sbjct: 144 AGSAAEIREQSLSIKLKVENTI-RWRWAKNQLGEDVRQSNSRIIRWSDGSLSLRLGKELF 202
Query: 275 DI--TVQD--------------------AQHDQ----AHLFLRHGKG-ILQSQGRILKKM 307
DI TV + +QH + +L +H +LQ++ I M
Sbjct: 203 DIQQTVDNSATTTRQTMGGASQRPSQAPSQHTRPQGLTYLVAQHKHSQVLQAEAPITGFM 262
Query: 308 RFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCI-TDIDPEREKEEKERAESQNIR 361
PS + S SHRLL + V + K +V+ +DPE+EK E R +++ +
Sbjct: 263 SLTPSDMQSESHRLLVSAVSQKRNKSVRVRMAPDPTVDPEKEKAELIRQDAKRTK 317
>gi|365763070|gb|EHN04601.1| Leo1p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 464
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 63/190 (33%), Positives = 103/190 (54%), Gaps = 17/190 (8%)
Query: 196 KVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKK-----RIRLENNIVRWRTVKNKDGMTS 250
++ N + IDP PFDP ++ + S +++ R+ ++ N VRWR ++KD
Sbjct: 172 RIPNFLTIDPIPFDPPSFEAKVNERASNSASREDQLDDRL-IDENTVRWRYSRDKDQHVF 230
Query: 251 YESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKG-ILQSQ--GRILKKM 307
ESN + V+WSDG+ L++G E DI V D + L + H + ++Q G I K +
Sbjct: 231 KESNTQIVQWSDGTYSLKVGEECTDILVNDTSN--TFLTVSHDQQELIQCYEGGEIKKTL 288
Query: 308 RFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEKERAESQNIRANVLL 366
FIP+S +S H+ L+ V R+++ K I +DPE EK+E ER +SQ +L
Sbjct: 289 MFIPTSTNSKIHQKLSKAVIRRNQRQSKGPGTYIVSMDPEVEKKELERKQSQ-----ILR 343
Query: 367 NRKREKINRK 376
+R+R ++ K
Sbjct: 344 DRRRRQLKEK 353
>gi|207341150|gb|EDZ69282.1| YOR123Cp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 464
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 63/190 (33%), Positives = 103/190 (54%), Gaps = 17/190 (8%)
Query: 196 KVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKK-----RIRLENNIVRWRTVKNKDGMTS 250
++ N + IDP PFDP ++ + S +++ R+ ++ N VRWR ++KD
Sbjct: 172 RIPNFLTIDPIPFDPPSFEAKVNERASNSASREDQLDDRL-IDENTVRWRYSRDKDQHVF 230
Query: 251 YESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKG-ILQSQ--GRILKKM 307
ESN + V+WSDG+ L++G E DI V D + L + H + ++Q G I K +
Sbjct: 231 KESNTQIVQWSDGTYSLKVGEECTDILVNDTSN--TFLTVSHDQQELIQCYEGGEIKKTL 288
Query: 308 RFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEKERAESQNIRANVLL 366
FIP+S +S H+ L+ V R+++ K I +DPE EK+E ER +SQ +L
Sbjct: 289 MFIPTSTNSKIHQKLSKAVIRRNQRQSKGPGTYIVSMDPEVEKKELERKQSQ-----ILR 343
Query: 367 NRKREKINRK 376
+R+R ++ K
Sbjct: 344 DRRRRQLKEK 353
>gi|259149606|emb|CAY86410.1| Leo1p [Saccharomyces cerevisiae EC1118]
Length = 464
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 63/190 (33%), Positives = 103/190 (54%), Gaps = 17/190 (8%)
Query: 196 KVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKK-----RIRLENNIVRWRTVKNKDGMTS 250
++ N + IDP PFDP ++ + S +++ R+ ++ N VRWR ++KD
Sbjct: 172 RIPNFLTIDPIPFDPPSFEAKVNERASNSASREDQLDDRL-IDENTVRWRYSRDKDQHVF 230
Query: 251 YESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKG-ILQSQ--GRILKKM 307
ESN + V+WSDG+ L++G E DI V D + L + H + ++Q G I K +
Sbjct: 231 KESNTQIVQWSDGTYSLKVGEECTDILVNDTSN--TFLTVSHDQQELIQCYEGGEIKKTL 288
Query: 308 RFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEKERAESQNIRANVLL 366
FIP+S +S H+ L+ V R+++ K I +DPE EK+E ER +SQ +L
Sbjct: 289 MFIPTSTNSKIHQKLSKAVIRRNQRQSKGPGTYIVSMDPEVEKKELERKQSQ-----ILR 343
Query: 367 NRKREKINRK 376
+R+R ++ K
Sbjct: 344 DRRRRQLKEK 353
>gi|401623631|gb|EJS41724.1| leo1p [Saccharomyces arboricola H-6]
Length = 460
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 63/193 (32%), Positives = 104/193 (53%), Gaps = 17/193 (8%)
Query: 196 KVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKK-----RIRLENNIVRWRTVKNKDGMTS 250
++ N + IDP PFDP ++ + S +K+ R+ ++ N VRWR ++KD
Sbjct: 171 RIPNFLTIDPIPFDPPSFEAKVNERASNSASKEDQLDDRL-IDENTVRWRYSRDKDQHVF 229
Query: 251 YESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKG-ILQSQ--GRILKKM 307
ESN + V+WSDGS L++G E DI + D + L + H + ++Q G I K +
Sbjct: 230 KESNTQIVQWSDGSYSLKVGEECTDILINDTSN--TFLTVSHDQQELIQCYEGGEIKKTL 287
Query: 308 RFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEKERAESQNIRANVLL 366
FIP+S +S H+ L+ V ++++ K I ++DPE EK E E+ +SQ +L
Sbjct: 288 MFIPTSTNSKIHQKLSKAVIRKNQRQNKGPGTYIVNMDPEVEKRELEKKQSQ-----ILR 342
Query: 367 NRKREKINRKYTQ 379
+R+R ++ K Q
Sbjct: 343 DRRRRQLKEKEKQ 355
>gi|395331775|gb|EJF64155.1| Leo1-domain-containing protein [Dichomitus squalens LYAD-421 SS1]
Length = 544
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 70/240 (29%), Positives = 111/240 (46%), Gaps = 57/240 (23%)
Query: 177 ELEIPLRPPP-ADPTKMNMIKVSNIMGIDPKPFDPKTYV-----EEDT----FVTDESGA 226
++ IP P P + +I++ N + +D KPF P TY+ +ED F ++S +
Sbjct: 110 DVAIPNIPVPRSSDGNYWVIRMPNFIKLDSKPFHPDTYIGPEQEDEDAQHPEFSREKSMS 169
Query: 227 KKRIRLENNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDIT--------- 277
K +++EN VRWR K++ G +SN+R VRWSDGSL LQ+G E+ DI+
Sbjct: 170 VK-LKVENT-VRWRWSKDEFGQDRRQSNSRIVRWSDGSLSLQLGKELFDISQSIDTSGAV 227
Query: 278 --------------------------------VQDAQHDQA--HLFLRHGKG-ILQSQGR 302
V A H Q +L +H + ILQ +
Sbjct: 228 SRQALGSTSGPLSQSQSSQAIPSLSQSQSQTAVSGAGHAQGLTYLVAQHKRAEILQCEAV 287
Query: 303 ILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCI-TDIDPEREKEEKERAESQNIR 361
+ M P+ + S +HR+L V +H +V +++ +DPEREK E +A ++ R
Sbjct: 288 VTGYMSIRPTGMQSETHRMLVRAVGQKHSRVARLRMAPDPTMDPEREKLELAKAAARKPR 347
>gi|6324697|ref|NP_014766.1| Leo1p [Saccharomyces cerevisiae S288c]
gi|585391|sp|P38439.1|LEO1_YEAST RecName: Full=RNA polymerase-associated protein LEO1
gi|443971|emb|CAA54391.1| Leo1 protein [Saccharomyces cerevisiae]
gi|1050827|emb|CAA62122.1| ORF O3278 [Saccharomyces cerevisiae]
gi|1164967|emb|CAA64042.1| YOR3278c [Saccharomyces cerevisiae]
gi|1420322|emb|CAA99322.1| LEO1 [Saccharomyces cerevisiae]
gi|151945744|gb|EDN63985.1| conserved protein [Saccharomyces cerevisiae YJM789]
gi|190407451|gb|EDV10718.1| protein LEO1 [Saccharomyces cerevisiae RM11-1a]
gi|285815004|tpg|DAA10897.1| TPA: Leo1p [Saccharomyces cerevisiae S288c]
gi|323331490|gb|EGA72905.1| Leo1p [Saccharomyces cerevisiae AWRI796]
gi|392296452|gb|EIW07554.1| Leo1p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 464
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 63/190 (33%), Positives = 103/190 (54%), Gaps = 17/190 (8%)
Query: 196 KVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKK-----RIRLENNIVRWRTVKNKDGMTS 250
++ N + IDP PFDP ++ + S +++ R+ ++ N VRWR ++KD
Sbjct: 172 RIPNFLTIDPIPFDPPSFEAKVNERASNSASREDQLDDRL-IDENTVRWRYSRDKDQHVF 230
Query: 251 YESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKG-ILQSQ--GRILKKM 307
ESN + V+WSDG+ L++G E DI V D + L + H + ++Q G I K +
Sbjct: 231 KESNTQIVQWSDGTYSLKVGEECTDILVNDTSN--TFLTVSHDQQELIQCYEGGEIKKTL 288
Query: 308 RFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEKERAESQNIRANVLL 366
FIP+S +S H+ L+ V R+++ K I +DPE EK+E ER +SQ +L
Sbjct: 289 MFIPTSTNSKIHQKLSKAVIRRNQRQSKGPGTYIVSMDPEVEKKELERKQSQ-----ILR 343
Query: 367 NRKREKINRK 376
+R+R ++ K
Sbjct: 344 DRRRRQLKEK 353
>gi|441616938|ref|XP_003266743.2| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase-associated protein
LEO1 [Nomascus leucogenys]
Length = 663
Score = 92.4 bits (228), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 62/206 (30%), Positives = 102/206 (49%), Gaps = 11/206 (5%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIV---RWRTVKNKDGM 248
+ +K+ N + ++P+PFDP+ Y E++ + + R RL+ ++ RWR + G
Sbjct: 370 LYFVKLPNFLSVEPRPFDPQ-YYEDEFEDEEMLDEEGRTRLKLKVLFHQRWRMPNKEPGN 428
Query: 249 TSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMR 308
+ S S+ L +GNEV D+ Q D HLF+R G G LQ Q K+
Sbjct: 429 QXKLYDFSIXNVSLSSMSLHLGNEVFDVYKAPLQGDHNHLFIRQGTG-LQGQAVFKTKLT 487
Query: 309 FIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEKERAESQNIRANVLLN 367
F P S S +HR +T + R K K++ + DPE ++ E + E + +RA++
Sbjct: 488 FRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTEMIKKEEERLRASI--- 544
Query: 368 RKREKINRKYTQTVERRRQLSTGYLE 393
+RE R+ + + +R LS YLE
Sbjct: 545 -RRESQQRRMREK-QHQRGLSASYLE 568
>gi|268536400|ref|XP_002633335.1| Hypothetical protein CBG06074 [Caenorhabditis briggsae]
Length = 424
Score = 92.4 bits (228), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 55/177 (31%), Positives = 101/177 (57%), Gaps = 10/177 (5%)
Query: 195 IKVSNIMGIDPKPFDPKTYVEE---DTFVTDESG-AKKRIRLENNIVRWRTVKNKDGMTS 250
+++ N + + PFDP+ Y E+ D DE G + ++R+EN + RWR K+++G
Sbjct: 165 VRMPNFLSVATHPFDPQHYEEDEDDDQAKMDEEGRTRLKLRVENTL-RWRIRKDENGEEI 223
Query: 251 YESNARFVRWSDGSLQLQIGNEVLDITVQDAQ-HDQAHLFLRHGKGILQSQGRILKKMRF 309
ESNA+ V+W DG++ L +GNE+ D+T+ ++ HL++ ++ +Q + +M F
Sbjct: 224 RESNAKIVKWEDGTMSLYLGNEIFDVTLMPLNVNNLPHLYVSQ-PNMMSAQHVLTHRMNF 282
Query: 310 IPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDI--DPEREKEEKERAESQNIRANV 364
P S S +HR +T + R +K +VK + + +PE + E R E +++RA++
Sbjct: 283 RPHSTDSQTHRKVTLNMADRSRKNAQVK-VMDSVGQNPEIVRRENARKEEESLRAHI 338
>gi|299745029|ref|XP_001831421.2| RNA polymerase-associated protein LEO1 [Coprinopsis cinerea
okayama7#130]
gi|298406400|gb|EAU90584.2| RNA polymerase-associated protein LEO1 [Coprinopsis cinerea
okayama7#130]
Length = 611
Score = 92.4 bits (228), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 65/212 (30%), Positives = 102/212 (48%), Gaps = 41/212 (19%)
Query: 177 ELEIPLRPPP--ADPTKMNMIKVSNIMGIDPKPFDPKTYV-------EEDTFVTDESGAK 227
E+ P P P +D K +I+ N + +D KPF P TY+ + T +
Sbjct: 237 EVTFPNIPVPRSSDGDKW-VIRTPNFVQVDSKPFHPDTYIGPEHDEDDAGTETLKQRSMS 295
Query: 228 KRIRLENNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDI----------- 276
++R+E+ I RWR K+++G +SN+R +RWSDGSL L++G E+ DI
Sbjct: 296 IKLRVESTI-RWRWTKDENGRDVQQSNSRIIRWSDGSLSLRLGKELFDISKSIDTSAGLP 354
Query: 277 -------------TVQDAQHDQAHLFLRHGKG-ILQSQGRILKKMRFIPSSLSSNSHRLL 322
T +L +H + +LQ++ I M P+S+ S +HR+L
Sbjct: 355 RQLGGSQPSTQSQTPLGKNQGLTYLVAQHKRSQVLQAEAVITGNMTLRPTSMQSETHRML 414
Query: 323 TALVDSRHKKVYKVKNCITD---IDPEREKEE 351
V +H KV ++K + D +DPEREK E
Sbjct: 415 VRAVGQKHNKVARLK--MADDPTMDPEREKME 444
>gi|256272988|gb|EEU07952.1| Leo1p [Saccharomyces cerevisiae JAY291]
Length = 464
Score = 92.0 bits (227), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 63/190 (33%), Positives = 102/190 (53%), Gaps = 17/190 (8%)
Query: 196 KVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKK-----RIRLENNIVRWRTVKNKDGMTS 250
++ N + IDP PFDP ++ + S +++ R+ ++ N VRWR ++KD
Sbjct: 172 RIPNFLTIDPIPFDPPSFEAKVNERASNSASREDQLDDRL-IDENTVRWRYSRDKDQHVF 230
Query: 251 YESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKG-ILQSQ--GRILKKM 307
ESN + V+WSDG+ L++G E DI V D + L + H + ++Q G I K +
Sbjct: 231 KESNTQIVQWSDGTYSLKVGEECTDILVNDTSN--TFLTVSHDQQELIQCYEGGEIKKTL 288
Query: 308 RFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEKERAESQNIRANVLL 366
FIP+S +S H+ L+ V R+++ K I DPE EK+E ER +SQ +L
Sbjct: 289 MFIPTSTNSKIHQKLSKAVIRRNQRQSKGPGTYIVSTDPEVEKKELERKQSQ-----ILR 343
Query: 367 NRKREKINRK 376
+R+R ++ K
Sbjct: 344 DRRRRQLKEK 353
>gi|323335522|gb|EGA76807.1| Leo1p [Saccharomyces cerevisiae Vin13]
Length = 464
Score = 91.7 bits (226), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 63/190 (33%), Positives = 103/190 (54%), Gaps = 17/190 (8%)
Query: 196 KVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKK-----RIRLENNIVRWRTVKNKDGMTS 250
++ N + IDP PFDP ++ + S +++ R+ ++ N VRWR ++KD
Sbjct: 172 RIPNFLTIDPIPFDPPSFEAKVNERASNSASREDQLDDRL-IDENTVRWRYSRDKDQHVF 230
Query: 251 YESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKG-ILQSQ--GRILKKM 307
ESN + V+WSDG+ L++G E DI V D + L + H + ++Q G I K +
Sbjct: 231 KESNTQIVQWSDGTYSLKVGEECTDILVNDTSN--TFLTVSHDQQELIQCYEGGEIKKTL 288
Query: 308 RFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEKERAESQNIRANVLL 366
FIP+S +S H+ L+ V R+++ K I +DPE EK+E ER +SQ +L
Sbjct: 289 MFIPTSTNSIIHQKLSKAVIRRNQRQSKGPGTYIVSMDPEVEKKELERKQSQ-----ILR 343
Query: 367 NRKREKINRK 376
+R+R ++ K
Sbjct: 344 DRRRRQLKEK 353
>gi|366996204|ref|XP_003677865.1| hypothetical protein NCAS_0H02080 [Naumovozyma castellii CBS 4309]
gi|342303735|emb|CCC71518.1| hypothetical protein NCAS_0H02080 [Naumovozyma castellii CBS 4309]
Length = 435
Score = 91.7 bits (226), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 83/277 (29%), Positives = 135/277 (48%), Gaps = 34/277 (12%)
Query: 105 DVFGDSDEED----VGEYAIRNDIDQDSNRSPMEEEGSYEKSIRPEDIVPDEDMH-YESE 159
D+FGD +EED V E + +DI R+ ++EE +++ D+D + Y S+
Sbjct: 47 DLFGDEEEEDENKSVDEQSEDDDIGSRPRRTDIDEE----EAMYTRKFYGDDDYNRYSSD 102
Query: 160 EEHVEPKHKEKPVGPPLELEIPLRPPP--ADPTKMNMIKVSNIMGIDPKPFDPKTYVEED 217
+++ KE+ V + +P + D T + KV + I+P PFDP + E D
Sbjct: 103 QDNALHHFKEENVELIRHM-VPYKTSSEKTDETTVYYAKVPPFLTINPVPFDPIAF-ESD 160
Query: 218 T------FVTDESGAKKRIRLENNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGN 271
+ + E R+ ++ N VRWR ++ + ESNA+ V+WSDGS L++G+
Sbjct: 161 VRDRLSNYSSKEDQLGDRL-IDENTVRWRYSRDANQRVFKESNAQIVQWSDGSFSLKLGD 219
Query: 272 EVLDITVQDAQ-------HDQAHLFLRHGKGILQSQGRILKKMRFIPSSLSSNSHRLLTA 324
E DI V + HDQ L + + G I K + FIP+S +S H+ L+
Sbjct: 220 EYTDILVNETNNTFLAVSHDQQELMQCY------NGGEITKTLMFIPTSTNSKIHQQLSK 273
Query: 325 LVDSRHKKVYKV-KNCITDIDPEREKEEKERAESQNI 360
V R+++ + I + DPE EK E E+ + Q I
Sbjct: 274 AVTRRNQREHLGPGTMIINRDPEVEKRELEKKQGQII 310
>gi|254570413|ref|XP_002492316.1| hypothetical protein [Komagataella pastoris GS115]
gi|238032114|emb|CAY70036.1| hypothetical protein PAS_chr3_1154 [Komagataella pastoris GS115]
gi|328353675|emb|CCA40073.1| Midasin [Komagataella pastoris CBS 7435]
Length = 429
Score = 91.7 bits (226), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 53/192 (27%), Positives = 102/192 (53%), Gaps = 13/192 (6%)
Query: 173 GPPLELEIPLRPPPADPTKMN---MIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKR 229
G L++ +P+ P P ++K+ + + I+ +PFDP+ + E T E ++
Sbjct: 83 GETLDISLPIHPKSHIPQGKQDRWVVKLPDFLDINAEPFDPRPF--EMNVKTHEDKNQEL 140
Query: 230 I--RLENNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAH 287
+ + N VRWR K++ G E+N++ ++W DG+ L++G+E+ D+ + D +
Sbjct: 141 LDKLIAVNTVRWRYAKSETGGIFKETNSQIIQWEDGTYSLRVGSEIFDMFTTNT--DDNY 198
Query: 288 LFLRHG-KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPE 346
L H +GIL ++ + K ++ +P+S S +H+ L + ++ KK ++ +T DPE
Sbjct: 199 LVSEHNEEGILMTESTLSKSVKLVPASFQSTTHQKLAKALSAKQKKESYARSVVTKEDPE 258
Query: 347 REKEEKERAESQ 358
E + R ESQ
Sbjct: 259 ---ERQRRLESQ 267
>gi|367013112|ref|XP_003681056.1| hypothetical protein TDEL_0D02610 [Torulaspora delbrueckii]
gi|359748716|emb|CCE91845.1| hypothetical protein TDEL_0D02610 [Torulaspora delbrueckii]
Length = 440
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 59/182 (32%), Positives = 90/182 (49%), Gaps = 27/182 (14%)
Query: 196 KVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIR------------LENNIVRWRTVK 243
KV + IDP PFDP ++ ES K+R+ ++ N +RWR +
Sbjct: 142 KVPEFLTIDPVPFDPPSF---------ESKVKERLNSSSREDQLGDRLIDENTIRWRYSR 192
Query: 244 NKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG-KGILQSQ-- 300
+++ ESNA+ V+WSDG L L++G E DI V D D + H + ++Q
Sbjct: 193 DENQQVYKESNAQIVQWSDGGLSLKLGTEYTDILVNDT--DNTFFAVSHDEQELMQCSQG 250
Query: 301 GRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEKERAESQN 359
G + K + FIP+S +S H+ LT V R ++ K + DPE EK E E+ + Q
Sbjct: 251 GEVTKSLMFIPTSTNSKMHQKLTKAVIRRDQRQTKGPGVYFVNKDPELEKNELEKKQQQV 310
Query: 360 IR 361
+R
Sbjct: 311 VR 312
>gi|444314605|ref|XP_004177960.1| hypothetical protein TBLA_0A06490 [Tetrapisispora blattae CBS 6284]
gi|387510999|emb|CCH58441.1| hypothetical protein TBLA_0A06490 [Tetrapisispora blattae CBS 6284]
Length = 428
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 89/305 (29%), Positives = 134/305 (43%), Gaps = 68/305 (22%)
Query: 105 DVFGDSDEEDVGEYAIRNDI----------------------DQDSNRSPMEEEGSYEKS 142
D+FGD ++E+ G+ +N+I + D N +++ YEK
Sbjct: 3 DLFGDDEDEEAGDKNTQNEIDDDDDDDDDDDDEGFVRKNRRSEYDENSDDEQDKAMYEKK 62
Query: 143 IRPEDIVP---DEDMHYESEEE------HVEPKHKEKPVGPPLELEIPLRPPPADPTKMN 193
ED D+D YE +E H+ P +K +P L+ L P M
Sbjct: 63 FYGEDGFNRSDDDDNEYEYKESDIELVRHIVP-YKTRPSNKELDKHRELYYAKVPPFLM- 120
Query: 194 MIKVSNIMGIDPKPFDPKTY-----------VEEDTFVTDESGAKKRIRLENNIVRWRTV 242
+DP PFD + + EDT +TD R+R EN I RWR
Sbjct: 121 ---------VDPVPFDSTNFESAVKERIKNSLNEDTQITD------RLRDENTI-RWRYS 164
Query: 243 K--NKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKG-ILQS 299
+ N + +SNA+ ++WSDGS L++GN+ DI + D L + H + ++Q
Sbjct: 165 RDSNDNDKVFKQSNAQIIQWSDGSYSLKLGNDYTDILTNET--DNIFLTVSHDQQELMQC 222
Query: 300 Q--GRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKN-CITDIDPEREKEEKERAE 356
G I K + FIPSS S H+ LTA V +R KK + + DPE EK+ E+ +
Sbjct: 223 YNGGEITKTLMFIPSSTKSKIHQRLTAAVQNRDKKTTSGPGMMLINTDPEIEKKLLEKKQ 282
Query: 357 SQNIR 361
Q +R
Sbjct: 283 WQMMR 287
>gi|440902163|gb|ELR52988.1| RNA polymerase-associated protein LEO1 [Bos grunniens mutus]
Length = 407
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 47/121 (38%), Positives = 68/121 (56%), Gaps = 5/121 (4%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTY---VEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGM 248
+ +K+ + I+ KPFDP+ Y E + +E + ++++EN I RWR ++KDG
Sbjct: 63 LYFVKLPKFLSIESKPFDPQYYEDEFENKKMLDEEERTRLKLKIENTI-RWRIRRDKDGN 121
Query: 249 TSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMR 308
ESNAR V+WSDGSL L +GN+V D+ Q HLF+R G LQ Q K+
Sbjct: 122 KIKESNARIVKWSDGSLSLHLGNKVFDVYKAPMQATHNHLFVRDDTG-LQGQAVFKSKLT 180
Query: 309 F 309
F
Sbjct: 181 F 181
>gi|328860981|gb|EGG10085.1| hypothetical protein MELLADRAFT_94430 [Melampsora larici-populina
98AG31]
Length = 454
Score = 90.9 bits (224), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 68/246 (27%), Positives = 116/246 (47%), Gaps = 38/246 (15%)
Query: 152 EDMHYESEEEHVEPKHKEKPVGPPLELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPK 211
E+M Y + ++ + +++ + L +PL P A K ++ N + +D + F +
Sbjct: 89 EEMEYLEDPSQLQHQARDRLIAEATLLNLPL--PYASDEKYWDFRLPNFLELDSQAFTDQ 146
Query: 212 TYVEE-DTFVTDESGAKKR---IRLENNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQL 267
++ E T+E G + + ++N +RWR V +DG +SN+R V WSDGS L
Sbjct: 147 KFLNEVKRSTTEEEGNPSKSSSVFPDHNTIRWRWVMGRDGKPVKQSNSRVVSWSDGSKSL 206
Query: 268 QIGNEVLDITV-------QDAQHDQ-------------------AHLFLRHGKG-ILQSQ 300
Q+G+E+ D+ QHD +LF+R G+ +L+SQ
Sbjct: 207 QVGSELFDMVFIHDHQNQNTPQHDSKPGSLSQPKPDLTPNTQGLTYLFVRQGESQVLESQ 266
Query: 301 GRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVY---KVKNCITDIDPEREKEEKERAES 357
I ++ P SL+S +HR L A + K Y + ++ + +DPE EK KE ES
Sbjct: 267 TSITGQISLRPYSLNSITHRNLVA--NRSMNKTYSQRQTQSVVVTVDPEMEKLGKELEES 324
Query: 358 QNIRAN 363
+ + N
Sbjct: 325 KQFKKN 330
>gi|349581283|dbj|GAA26441.1| K7_Leo1p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 464
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 61/190 (32%), Positives = 102/190 (53%), Gaps = 17/190 (8%)
Query: 196 KVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKK-----RIRLENNIVRWRTVKNKDGMTS 250
++ N + IDP PFDP ++ + S +++ R+ ++ N VRWR ++KD
Sbjct: 172 RIPNFLTIDPIPFDPPSFEAKVNERASNSASREDQLDDRL-IDENTVRWRYSRDKDQHVF 230
Query: 251 YESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKG-ILQSQ--GRILKKM 307
ESN + V+WSDG+ L++G E DI V D + L + H + ++Q G I K +
Sbjct: 231 KESNTQIVQWSDGTYSLKVGEECTDILVNDTSN--TFLTVSHDQQELIQCYEGGEIKKTL 288
Query: 308 RFIPSSLSSNSHRLLT-ALVDSRHKKVYKVKNCITDIDPEREKEEKERAESQNIRANVLL 366
FIP+S +S H+ L+ A++ ++ I +DPE EK+E ER +SQ +L
Sbjct: 289 MFIPTSTNSKIHQKLSKAVIRINQRQSKGPGTYIVSMDPEVEKKELERKQSQ-----ILR 343
Query: 367 NRKREKINRK 376
+R+R ++ K
Sbjct: 344 DRRRRQLKEK 353
>gi|50292463|ref|XP_448664.1| hypothetical protein [Candida glabrata CBS 138]
gi|49527976|emb|CAG61627.1| unnamed protein product [Candida glabrata]
Length = 446
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 63/175 (36%), Positives = 95/175 (54%), Gaps = 12/175 (6%)
Query: 196 KVSNIMGIDPKPFDPKTY---VEE--DTFVTDESGAKKRIRLENNIVRWRTVKNKDGMTS 250
KV + IDP PFDP+T+ V+E + E R+ ++ N +RWR ++ +
Sbjct: 163 KVPAFLTIDPVPFDPQTFEATVDERLSKSASKEDQLGDRL-IDENTIRWRYSRDAEQHVF 221
Query: 251 YESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKG-ILQSQ--GRILKKM 307
ESNA+ V+WSDGS L++G+E DI V D D L + H + ++Q G I K M
Sbjct: 222 KESNAQIVQWSDGSFSLKLGDEYTDIIVNDT--DNTFLTVSHDQQELMQCYDGGEITKTM 279
Query: 308 RFIPSSLSSNSHRLLTALVDSRH-KKVYKVKNCITDIDPEREKEEKERAESQNIR 361
FIP+S +S H+ L+ V R+ ++ + I DPE EK+E E+ + Q IR
Sbjct: 280 MFIPTSTNSKIHQKLSKAVSRRNARQNLGPGSYIIKQDPEIEKKELEKLQGQIIR 334
>gi|194206648|ref|XP_001499428.2| PREDICTED: RNA polymerase-associated protein LEO1-like [Equus
caballus]
Length = 319
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 47/123 (38%), Positives = 72/123 (58%), Gaps = 9/123 (7%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTYVEEDTF-----VTDESGAKKRIRLENNIVRWRTVKNKD 246
+ +K+ + I+PKPFDP+ Y ED F + +E + ++++EN I RWR ++++
Sbjct: 62 LYFVKLPKFLSIEPKPFDPQYY--EDEFADEKVLDEEDRTRLKLKVENTI-RWRIRQDEE 118
Query: 247 GMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKK 306
G ESN R V+WSDGSL L +G+EV D+ Q + +HLF+R G LQ Q K
Sbjct: 119 GNKIKESNTRMVKWSDGSLSLHLGSEVFDVYKAPLQDNHSHLFVREDTG-LQGQAVFKSK 177
Query: 307 MRF 309
+ F
Sbjct: 178 LTF 180
>gi|254584142|ref|XP_002497639.1| ZYRO0F10164p [Zygosaccharomyces rouxii]
gi|238940532|emb|CAR28706.1| ZYRO0F10164p [Zygosaccharomyces rouxii]
Length = 423
Score = 90.1 bits (222), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 90/304 (29%), Positives = 132/304 (43%), Gaps = 64/304 (21%)
Query: 86 PGGEEKDQTHISHSAAEIRDVF--------------GDSDEEDVGEYAIRNDIDQDSNRS 131
P G+EK S S E+ D+F G DE D GE +R+ + D +
Sbjct: 32 PSGDEK-----SGSQDEMEDLFGEDEGEDEEDEKRKGSEDENDEGEVTLRHRQEMDDEEA 86
Query: 132 PMEEEGSYEKSIRPEDI--VPDEDMHYESEEEHVE-PKHKEKPVGPPLELEIPLRPPPA- 187
EE Y + ED+ V DE+ E EE VE +H +P R
Sbjct: 87 --EEHAMYTRKFYGEDVDRVSDEEEAQEFREEDVELVRHI-----------VPYRATQGT 133
Query: 188 DPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRI-------------RLEN 234
D + K+ + IDP PFDP ++ E+ K+R+ ++
Sbjct: 134 DKPVLYYAKIPEFLTIDPVPFDPPSF---------EASVKERLGNKASKEDQLGDRLIDE 184
Query: 235 NIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGK 294
N VRWR ++ + ESNA+ V+WSDG+ L++G+E DI D D + H +
Sbjct: 185 NTVRWRYSRDANQQVFKESNAQIVQWSDGTFSLKLGDEYTDILSNDT--DNTFFAVSHDQ 242
Query: 295 G-ILQSQ--GRILKKMRFIPSSLSSNSHRLLTALVDSR-HKKVYKVKNCITDIDPEREKE 350
++Q G + K + FIP+S SS H+ LT V R HK+ I DPE E+
Sbjct: 243 QELMQCYEGGEVTKTLMFIPTSTSSRMHQKLTKAVMRRDHKQAAGPGTYIVQKDPELERG 302
Query: 351 EKER 354
E E+
Sbjct: 303 ELEK 306
>gi|308491606|ref|XP_003107994.1| hypothetical protein CRE_12785 [Caenorhabditis remanei]
gi|308249941|gb|EFO93893.1| hypothetical protein CRE_12785 [Caenorhabditis remanei]
Length = 425
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 52/176 (29%), Positives = 95/176 (53%), Gaps = 8/176 (4%)
Query: 195 IKVSNIMGIDPKPFDPKTYVEED----TFVTDESGAKKRIRLENNIVRWRTVKNKDGMTS 250
+++ N + + PFDP+ Y E++ + DE + ++R+EN + RWR K+ +G
Sbjct: 166 VRMPNFLSVATHPFDPQHYEEDEDDDQAKLDDEGRTRLKLRVENTL-RWRIRKDAEGRDI 224
Query: 251 YESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQA-HLFLRHGKGILQSQGRILKKMRF 309
ESNA+ V+W DG+ L +G E+ D+T+ + HL++ ++ +Q + +M F
Sbjct: 225 RESNAKIVKWEDGTSSLYLGTEIFDVTLMPLNANNLPHLYVTQ-PNLMSAQSVLTHRMNF 283
Query: 310 IPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEKERAESQNIRANV 364
P S S +HR +T + R +K +VK +PE + E R E +++RA++
Sbjct: 284 RPHSTDSQTHRKVTLNMADRSRKNAQVKVMDAVGQNPEITRRENARKEEESLRAHI 339
>gi|392342153|ref|XP_003754517.1| PREDICTED: RNA polymerase-associated protein LEO1-like [Rattus
norvegicus]
gi|392350188|ref|XP_003750589.1| PREDICTED: RNA polymerase-associated protein LEO1-like [Rattus
norvegicus]
Length = 396
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 49/147 (33%), Positives = 79/147 (53%), Gaps = 9/147 (6%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTYVEEDTF-----VTDESGAKKRIRLENNIVRWRTVKNKD 246
+ +K+ + I+PKPFDP+ Y ED F + +E + ++++EN I RWR ++++
Sbjct: 65 LYFVKLPRFLRIEPKPFDPQHY--EDEFDDEKELYEEDKTRLKLKVENTI-RWRMCRDQE 121
Query: 247 GMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKK 306
G + ESN R V+WSDGS+ L +G EV DI Q + LF+R G L+ Q K
Sbjct: 122 GSKTKESNTRIVKWSDGSMSLHLGKEVFDIYKAPLQDNLNQLFIREDTG-LRGQAIFKSK 180
Query: 307 MRFIPSSLSSNSHRLLTALVDSRHKKV 333
+ F P +++ +T +R K
Sbjct: 181 LTFRPHCTDGATYKKMTLSFGTRSSKT 207
>gi|392565569|gb|EIW58746.1| Leo1-domain-containing protein [Trametes versicolor FP-101664 SS1]
Length = 572
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 60/217 (27%), Positives = 97/217 (44%), Gaps = 49/217 (22%)
Query: 194 MIKVSNIMGIDPKPFDPKTYV---EEDTFVTDESGAKKRIRLE---NNIVRWRTVKNKDG 247
+I++ N + +D KPF P TY E+D ES +K + ++ N VRWR K++ G
Sbjct: 142 VIRMPNFVKLDSKPFHPDTYAGPEEDDDAQQAESLREKSMSIKLKVENTVRWRWAKDEFG 201
Query: 248 MTSYESNARFVRWSDGSLQLQIGNEVLDIT------------------------------ 277
+SN+R VRWSDGSL LQ+G E+ D++
Sbjct: 202 QDRRQSNSRIVRWSDGSLSLQLGKELFDVSQSIDTSGAVPRQALGGASLSQASFSQSQSQ 261
Query: 278 -----------VQDAQHDQAHLFLRHGKG-ILQSQGRILKKMRFIPSSLSSNSHRLLTAL 325
A +L +H + ILQ + + + P+ + S +HR+L
Sbjct: 262 SQSLAQSQSAPGTTAGQGLTYLVAQHKRAEILQCEAVVTGYLSLRPTGMQSETHRMLVRA 321
Query: 326 VDSRHKKVYKVKNCI-TDIDPEREKEEKERAESQNIR 361
V +H +V +++ DPERE+ E E+A ++ R
Sbjct: 322 VGQKHNRVARLRMAPEPTTDPERERLELEKAAARKPR 358
>gi|326435992|gb|EGD81562.1| hypothetical protein PTSG_02277 [Salpingoeca sp. ATCC 50818]
Length = 490
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 56/178 (31%), Positives = 99/178 (55%), Gaps = 12/178 (6%)
Query: 190 TKMNMIKVSNIMGIDPKPFDPKTYV---EEDTFVTDESGAKK-RIRLENNIVRWRTVKNK 245
+K+ + K+ N + I+PKPFD +T+ E++ V D+ G ++ ++RLEN I RWR +N
Sbjct: 226 SKIYLAKMPNFISINPKPFDAETFTAEENEESHVMDDLGRRRVKLRLENTI-RWR--RND 282
Query: 246 DGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHD---QAHLFLRHGKGILQSQGR 302
DG + ESNAR VRW+D S+ L +G+EV D+ + D Q L R +++
Sbjct: 283 DG--AMESNARLVRWNDNSMSLILGDEVHDVVLTDLGAGGPVQQLLQTREDGSGYEARAP 340
Query: 303 ILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAESQNI 360
+++ F ++LS + + L V + + V + K + +PE+ K++ + E + I
Sbjct: 341 FEQRLTFKAATLSGRTQQRLMHSVKTHSRAVRRTKTTVVSENPEKAKQKAIKEEQEAI 398
>gi|219115607|ref|XP_002178599.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217410334|gb|EEC50264.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 439
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 80/282 (28%), Positives = 128/282 (45%), Gaps = 63/282 (22%)
Query: 177 ELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTY---VEEDTFVTDESGAKKRIRLE 233
+L + P ++ ++M K+ N++GI + FDP TY +EED F +
Sbjct: 120 QLYLQQEPKISENIALHMTKLPNLVGIQTQAFDPDTYHPAMEEDDFG----------QSV 169
Query: 234 NNIVRWRTVKNKDGMTSY----------ESNARFVRWSDGSLQLQIGNEVLDITVQDAQH 283
N+VRWR K+ G +Y ESN+R V+W DGS L IGNE +I DA
Sbjct: 170 YNLVRWRYRKDDTG--NYVRDEQDRLLRESNSRLVQWEDGSYTLHIGNEAFEI---DALK 224
Query: 284 DQAHLFL---------------------RHGKGILQSQGRILKKMRFIPSSLSSNSHRLL 322
A+ F R +L+ G + +M PSSL S +H+ L
Sbjct: 225 TAANGFPGLNGFIYRTQKAVLKTADDNERAAGTVLECVGAVATRMTIRPSSLQSEAHKSL 284
Query: 323 TALVDSRHKKVYKVKNCITDIDPEREKEEKERAESQNIRANVLLNRKREKINRKYTQTVE 382
T + + K ++ +T DPE+ K+++ I+ L + + Y+Q
Sbjct: 285 TVGIRQKTMKKARIAEYVTQEDPEKAKQDR-------IKVKQDLEKVSARKRSGYSQAGG 337
Query: 383 R--RRQLSTGYLEDALEEDDETDYHDSRRSRRRF---EEELE 419
R R ++S YLE+ EED + D + + ++RR +EEL+
Sbjct: 338 RTARPRMSRSYLEE--EEDGDYDTFNIKDTKRRIRDADEELK 377
>gi|402590943|gb|EJW84873.1| hypothetical protein WUBG_04219 [Wuchereria bancrofti]
Length = 330
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 43/108 (39%), Positives = 62/108 (57%), Gaps = 5/108 (4%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG---AKKRIRLENNIVRWRTVKNKDGM 248
+ +K N + I+P+PFD Y +E A+ ++R+EN I RWR V + G
Sbjct: 211 LYFVKFPNFLSIEPRPFDQDHYEDEFDEDDLLDEEGRARLKLRVENTI-RWRYVLDDAGN 269
Query: 249 TSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGI 296
+ N++ VRWSDG++ L +G E+ DITVQ Q D HLFLR G G+
Sbjct: 270 IQKQGNSKVVRWSDGTMSLYLGGEIFDITVQPMQLDN-HLFLRQGAGL 316
>gi|325185227|emb|CCA19716.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 450
Score = 86.3 bits (212), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 78/308 (25%), Positives = 136/308 (44%), Gaps = 74/308 (24%)
Query: 102 EIRDVFG---DSDEEDVGEYAIRNDIDQDSNRSPMEEEGSYEKSIRPEDIVPDEDMHYES 158
E++D+FG DS+EE+ I+ + RS + + P +D H S
Sbjct: 63 EMKDLFGSDYDSEEEEFKASGIKESPVRAQLRSGLSND-------------PSDDFHSHS 109
Query: 159 EEEHVEPKHKEKPVGPPLELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDT 218
+ L +P P P+ K + K+ NI+ + P+P+ ++ EE
Sbjct: 110 NQ-----------------LWLPRTPKPSSTAKCFVTKMPNILRLIPEPYTEESIQEEMI 152
Query: 219 FVTDESGAKKRIRLENNIVRWRTVKNKDGMTSY---------ESNARFVRWSDGSLQLQI 269
TDE+ L N VRWR +++ G E+N++ VRW DG+L L +
Sbjct: 153 NPTDET-------LYRNYVRWRYRRDEKGALVMDPITHHPLRETNSKVVRWEDGTLTLFV 205
Query: 270 GNEVLDITVQDAQHDQAHLFLR-------------------HGKGILQSQGRILKKMRFI 310
G E L + Q + + LF+ K +L+ GR+ +K+
Sbjct: 206 GREALYLARQKIAN--SFLFVNEVSLNEKAFQNNKVEDNATGQKTVLECHGRLDEKLTIR 263
Query: 311 PSSLSSNSHRLLTALVDSRHKK-VYKVKNCITDIDPEREKEEKERAESQNIRANVLLNRK 369
P + SS SH+ LT + ++H K V K+K ++++D RE+E++ + +R L +RK
Sbjct: 264 PMTTSSKSHKSLTMSMRAKHNKGVQKLKEYVSELDASREQEQRAKINEDRLR---LQHRK 320
Query: 370 REKINRKY 377
+ + +Y
Sbjct: 321 KARQTFEY 328
>gi|365758348|gb|EHN00196.1| Leo1p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 484
Score = 85.9 bits (211), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 69/206 (33%), Positives = 108/206 (52%), Gaps = 20/206 (9%)
Query: 180 IPLRPPPADPTKMNMI---KVSNIMGIDPKPFDPKTY---VEEDTF--VTDESGAKKRIR 231
IP + + N I ++ N + ID PFDP ++ V E T + E R+
Sbjct: 172 IPSKANVNETASHNEIFYARIPNFLTIDAVPFDPPSFEAKVNERTINSTSKEDQLDDRL- 230
Query: 232 LENNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLR 291
++ N VRWR ++KD ESN + V+WSDGS L++G E DI V D + L +
Sbjct: 231 IDENTVRWRYSRDKDQHVFKESNTQIVQWSDGSYSLKVGEECTDILVNDTSN--TFLTVS 288
Query: 292 HGKG-ILQSQ--GRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPER 347
H + ++Q G I K + FIP+S +S H+ L+ V R+++ K I ++DPE
Sbjct: 289 HDQQELIQCYEGGEIKKTLMFIPTSTNSKIHQKLSKAVIRRNQRQNKGPGTYIVNMDPEV 348
Query: 348 EKEEKERAESQNIRANVLLNRKREKI 373
EK+E E+ +SQ VL +R+R ++
Sbjct: 349 EKKELEKKQSQ-----VLRDRRRRQL 369
>gi|390596735|gb|EIN06136.1| hypothetical protein PUNSTDRAFT_106334 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 511
Score = 85.9 bits (211), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 60/221 (27%), Positives = 96/221 (43%), Gaps = 54/221 (24%)
Query: 177 ELEIPLRPPP-ADPTKMNMIKVSNIMGIDPKPFDPKTYV------EEDTFVTDESGAKKR 229
+++IP P P + + ++++ + + ID KPF P TY E + E R
Sbjct: 83 DVQIPNVPAPRSSDGQQWVLRIPSYVRIDSKPFHPDTYYGPEDEDSEQASLMREKSMSIR 142
Query: 230 IRLENNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLF 289
+ +EN VRWR K+K+G+ +SNAR +RWSDG+L L++G E+ DI + D +
Sbjct: 143 LAVENT-VRWRWTKDKNGVDRRQSNARVIRWSDGTLSLKLGRELFDI---NQTLDTSGAV 198
Query: 290 LRHGKG----------------------------------------ILQSQGRILKKMRF 309
R G G ILQ++ I M
Sbjct: 199 ARQGLGASQSQQGLSQSQSQSQSQIAGANGVYTGLTYLVAQHKRAEILQAEAVITGTMSL 258
Query: 310 IPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKE 350
P+ + S +HR+L V +H K +++ DP R+ E
Sbjct: 259 QPTDMLSETHRMLVKAVGQKHVKTARLRMA---PDPTRDPE 296
>gi|156844225|ref|XP_001645176.1| hypothetical protein Kpol_1062p26 [Vanderwaltozyma polyspora DSM
70294]
gi|156115834|gb|EDO17318.1| hypothetical protein Kpol_1062p26 [Vanderwaltozyma polyspora DSM
70294]
Length = 457
Score = 85.5 bits (210), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 86/259 (33%), Positives = 124/259 (47%), Gaps = 38/259 (14%)
Query: 121 RNDIDQDSNRSPMEEEGSYEKSIRPEDIV----PDEDMHYESEEEHVEPKHKEKPVGPPL 176
RN +D+D + EE+ Y + ED +ED H EE+ V +H V P
Sbjct: 82 RNYVDEDED----EEQAMYTRKFYGEDDYDKSDQEEDKHEFREEDVVLTRH----VVPT- 132
Query: 177 ELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTY---VEE--DTFVTDESGAKKRIR 231
+I AD T + KV + IDP PFDP T+ V E ++ E + R+
Sbjct: 133 --KITSEENDADTT-IYYAKVPPFLTIDPIPFDPPTFENVVRERLTNGLSKEEQLEDRL- 188
Query: 232 LENNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQD-------AQHD 284
++ N +RWR ++ + ESNA V+WSDGS L++G+E DI + D HD
Sbjct: 189 IDQNTIRWRYSRDDNQRVFKESNATIVQWSDGSYSLKLGDEYTDILLNDTDNTFLTVSHD 248
Query: 285 QAHLF-LRHGKGILQSQGRILKKMRFIPSSLSSNSH-RLLTALVDSRHKKVYKVKNCITD 342
Q L HG G I K + F+P S +S H RL A+V + ++ I +
Sbjct: 249 QQELMQCVHG-------GEISKSLLFVPISTTSKLHQRLSKAIVRRNATETMGPRSMIVN 301
Query: 343 IDPEREKEEKERAESQNIR 361
IDPE EK+E E+ +SQ R
Sbjct: 302 IDPELEKKELEKKQSQVFR 320
>gi|365984461|ref|XP_003669063.1| hypothetical protein NDAI_0C01590 [Naumovozyma dairenensis CBS 421]
gi|343767831|emb|CCD23820.1| hypothetical protein NDAI_0C01590 [Naumovozyma dairenensis CBS 421]
Length = 470
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 50/150 (33%), Positives = 81/150 (54%), Gaps = 9/150 (6%)
Query: 190 TKMNMIKVSNIMGIDPKPFDPKTY-VEEDTFVTDESGAKKRI---RLENNIVRWRTVKNK 245
T + KV + IDP PFDP ++ + + D S + ++ ++ N +RWR +++
Sbjct: 166 TTIYYAKVPPFLTIDPIPFDPVSFETKVKDRLADYSSREDQLGDRLIDENTIRWRYSRDQ 225
Query: 246 DGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQ---SQGR 302
+ ESNA+ V+WSDGS L++G+E DI V D D L + H + L + G
Sbjct: 226 NQRVFKESNAQIVQWSDGSFSLKLGDEYTDILVNDT--DNTFLTVSHDQQELMQCYNGGE 283
Query: 303 ILKKMRFIPSSLSSNSHRLLTALVDSRHKK 332
I K M FIP+S +S H+ L+ V R+++
Sbjct: 284 ITKTMMFIPTSTNSKIHQQLSKAVTRRNQR 313
>gi|388582080|gb|EIM22386.1| hypothetical protein WALSEDRAFT_63588 [Wallemia sebi CBS 633.66]
Length = 358
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 52/184 (28%), Positives = 94/184 (51%), Gaps = 20/184 (10%)
Query: 178 LEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIV 237
++I PP K + K+ N + I FD T+ ++D + D + + N +
Sbjct: 63 MDINNLPPIRSVDKNWLAKIPNFLNIQSAGFDRNTFEDDDKNLPDNANSL-------NTI 115
Query: 238 RWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDI-------TVQDAQ--HDQAHL 288
RWR ++ G T +SN+R ++WSDGSL LQ+G+E+ D+ TV ++ + + +
Sbjct: 116 RWRWHQDNSGQTIAQSNSRVIKWSDGSLSLQVGSEIYDMNTNEDVKTVSSSEPNNPRQYA 175
Query: 289 FLRHGKG-ILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDI---D 344
+ H K ++ I ++ F+P SL+SN H+ + V+ +H K ++K + D
Sbjct: 176 YTNHEKSEVVMGDTVINGQLNFVPPSLTSNVHKKYASTVNDKHFKAARLKMVDKETEAKD 235
Query: 345 PERE 348
PE+E
Sbjct: 236 PEKE 239
>gi|432113985|gb|ELK36042.1| RNA polymerase-associated protein LEO1 [Myotis davidii]
Length = 220
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 46/131 (35%), Positives = 73/131 (55%), Gaps = 2/131 (1%)
Query: 235 NIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGK 294
N +RWR ++ +G ESN R V+WSDGS+ L +G+EV D+ Q + +HLF+R
Sbjct: 37 NTMRWRVRRDTEGNEIKESNTRMVKWSDGSMSLHLGSEVYDVYKAPLQGNHSHLFVREDT 96
Query: 295 GILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEKE 353
G LQ Q K+ F P S+ +H+ +T + R + K++ I DPE ++ E+
Sbjct: 97 G-LQGQAIFKSKLTFRPHSMDYATHKKMTLPLADRCSRTQKIRIIPIAGSDPECQRTEEI 155
Query: 354 RAESQNIRANV 364
+ E Q +RA+
Sbjct: 156 KKEKQRLRAST 166
>gi|308814230|ref|XP_003084420.1| unnamed protein product [Ostreococcus tauri]
gi|116056305|emb|CAL56688.1| unnamed protein product [Ostreococcus tauri]
Length = 162
Score = 83.6 bits (205), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 51/143 (35%), Positives = 80/143 (55%), Gaps = 6/143 (4%)
Query: 235 NIVRWRTVKNKDGMTS-YESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG 293
N++R+R D +T E+NARFVRW DG+ L IG+E L +T + +
Sbjct: 12 NVIRYRI----DPVTGEVETNARFVRWGDGTTHLVIGDEHLRVTERATAEGADSVLYARK 67
Query: 294 KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRH-KKVYKVKNCITDIDPEREKEEK 352
G ++++ R+ KM F P+SL S +HR L+ VD H + + + ++ +DPEREKE
Sbjct: 68 PGAMEARKRLRTKMTFAPASLESKTHRALSRRVDKAHGSRATRTRQHVSRVDPEREKEAV 127
Query: 353 ERAESQNIRANVLLNRKREKINR 375
+ A + R + L RK+EK+ R
Sbjct: 128 DAALVKAEREQLALKRKQEKMMR 150
>gi|58264526|ref|XP_569419.1| hypothetical protein [Cryptococcus neoformans var. neoformans
JEC21]
gi|134110049|ref|XP_776235.1| hypothetical protein CNBC6260 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50258907|gb|EAL21588.1| hypothetical protein CNBC6260 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57225651|gb|AAW42112.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
Length = 479
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 77/296 (26%), Positives = 120/296 (40%), Gaps = 67/296 (22%)
Query: 85 SPGGEEKDQTHISHSAAE--------------IRDVFGDSDEEDVGEYAIRN-DIDQDSN 129
SPG E+ +SH + + + D+FGD EED + R+ D +
Sbjct: 21 SPGYEKSPSPQLSHKSDDEEQDDEEQDDEEQDVGDLFGDDAEEDEAQIGRRSTDTPASGS 80
Query: 130 RSPMEEEGSYEKSIRPEDIVPDEDMHYESEEEHVEPKHKEKPVGPPLELEIPLRP-PPAD 188
RSP + Y E+E P+ + + L IP P A
Sbjct: 81 RSP-------------------HPLEYVEEDEEAVPQKQNV-----VTLPIPQWPHMTAT 116
Query: 189 PTKMNMIKVSNIMGIDPKPFDPKTY---VEEDTF--VTDESGAKKRIRLENNIVRWRTVK 243
K+ +K + +DPKPFDP Y EE+ D AK + N +RWR V
Sbjct: 117 DGKVWQMKFPAYVNLDPKPFDPDLYRATQEEEPIDGAADPIAAKSMMIGVKNTIRWRWVT 176
Query: 244 NKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDA-------------------QHD 284
DG +SNAR +RWSDGS+ LQ+G++ D+ +
Sbjct: 177 GPDGEPVRQSNARMLRWSDGSVTLQLGDDFYDVAPSQGATLARPSDPQPVRKRDDKPATN 236
Query: 285 QAHLFLRHGKG---ILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVK 337
+ FL G +L ++ I ++ +P+S++S ++ L V +H K K+K
Sbjct: 237 SSTTFLCVGAAAERVLVTERPIAGQLSLLPTSMTSKTYLELVKHVGKQHTKHSKMK 292
>gi|331247712|ref|XP_003336483.1| hypothetical protein PGTG_18154 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309315473|gb|EFP92064.1| hypothetical protein PGTG_18154 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 449
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 62/240 (25%), Positives = 117/240 (48%), Gaps = 32/240 (13%)
Query: 184 PPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKK--RIRLENNIVRWRT 241
P P+D +M ++ N + +P PF + ++ E+ E G + ++ + N++RWR
Sbjct: 110 PKPSDGKHWDM-RIPNFLSFNPLPFSDEEFLNENESSEKEGGESEAAKVLTDQNVIRWRW 168
Query: 242 VKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLD-ITVQDAQ---------------HDQ 285
K+ +G SNAR + W DGS LQ+G+E+ D I+ D +
Sbjct: 169 QKDSEGNPVKRSNARVICWEDGSQSLQVGSELFDMISTVDGRQNNNTTTQQQPGATTQGL 228
Query: 286 AHLFLRHGK-GILQSQGRILKKMRFIPSSLSSNSHRLLTA---LVDSRHKKVYKVKNCIT 341
+LF +H + +L++Q I ++ P SL+S +HR + A L+ + ++ ++ +
Sbjct: 229 TYLFAQHSELKLLEAQASITGQVTLRPYSLNSMTHRNIVANRSLLKANSQRQTSLR--VI 286
Query: 342 DIDPEREKEEKERAESQNIRANVLLNRKREKINRKYTQTVE-------RRRQLSTGYLED 394
+DPE+EK +KE E + ++ + K + N + T +E R+R+ + L D
Sbjct: 287 TLDPEKEKIDKELEEERKLKDLKKQDAKLARQNSRQTNLLEGGSRGGVRKRRFNAPLLSD 346
>gi|448115124|ref|XP_004202754.1| Piso0_001610 [Millerozyma farinosa CBS 7064]
gi|359383622|emb|CCE79538.1| Piso0_001610 [Millerozyma farinosa CBS 7064]
Length = 409
Score = 83.2 bits (204), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 76/260 (29%), Positives = 115/260 (44%), Gaps = 29/260 (11%)
Query: 102 EIRDVFG-DSDEEDVGEYAIRNDIDQDSNR---SPMEEEGSYEKSIRPEDIVPDEDMHYE 157
E+ D+FG DS+ E + + I D+D N+ + E G ++ ED +ED +
Sbjct: 16 EVDDLFGEDSEGEKIDDEKIEQSEDEDENKQGSAVQETRGGHDDDEENED-SAEEDFDFG 74
Query: 158 SEEEHVEPKHKEKPVGPPLELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTY---- 213
SE K K + P +P P + KM + + +D PFD +
Sbjct: 75 SER-----KLKNLDISIPRHAILPNLEPSSYVMKMPVF-----LNVDAHPFDANEFKETV 124
Query: 214 ----VEEDTFVTDESGAKKRI----RLENNIVRWRTVKNKDGMTSYESNARFVRWSDGSL 265
V+ DE + + L N +RWR D +SNA FV+W DGSL
Sbjct: 125 KQNAVDRSNRYLDEKQMQNDLVAEKLLNENTIRWRYSNRGDDEIIKQSNAHFVQWDDGSL 184
Query: 266 QLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRFIPSSLSSNSHRLLTAL 325
L+IGNE+ D + +D + ILQ I ++ IPSS +++HR LT
Sbjct: 185 SLKIGNELFDFK-ELPLYDNLLVKSHDDHEILQVDSTIKSQVNLIPSSTFTDTHRKLTQA 243
Query: 326 VDSRHKKVYKVKNCITDIDP 345
V + +K K+ N +TD DP
Sbjct: 244 VKNIQRK-DKILNTVTDTDP 262
>gi|313242983|emb|CBY39704.1| unnamed protein product [Oikopleura dioica]
Length = 285
Score = 82.0 bits (201), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 46/112 (41%), Positives = 64/112 (57%), Gaps = 3/112 (2%)
Query: 191 KMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE-NNIVRWRTVKNKDGM 248
++N ++ N + ++PKPFD + Y +E D + R++L+ N +RWR V DG
Sbjct: 175 QINFVRFPNFLSVEPKPFDAEHYEDEHEEDDDLDEEGRTRLKLKVENTIRWRKVTQPDGT 234
Query: 249 TSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQ 300
ESNA+ VRWSDGS L +GNE DI + D HLF+R G G LQ Q
Sbjct: 235 EKIESNAKIVRWSDGSQSLILGNEKYDIQSMKLKGDFNHLFIRQGTG-LQGQ 285
>gi|405123027|gb|AFR97792.1| hypothetical protein CNAG_01587 [Cryptococcus neoformans var.
grubii H99]
Length = 468
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 56/190 (29%), Positives = 86/190 (45%), Gaps = 28/190 (14%)
Query: 176 LELEIPLRP-PPADPTKMNMIKVSNIMGIDPKPFDPKTY---VEEDTF--VTDESGAKKR 229
+ L IP P A K+ +K + +DPKPFDP Y EE+ TD AK
Sbjct: 92 VTLPIPQWPHMTATDGKVWQMKFPAYVNLDPKPFDPDLYRATQEEEPIDGATDPIAAKSM 151
Query: 230 IRLENNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDA-------- 281
+ N +RWR V DG +SNAR +RWSDGS+ LQ+G++ D+
Sbjct: 152 MIGVKNTIRWRWVTGPDGEPVRQSNARMLRWSDGSVTLQLGDDFYDVAPSQGATLARPSD 211
Query: 282 -----------QHDQAHLFLRHGKG---ILQSQGRILKKMRFIPSSLSSNSHRLLTALVD 327
+ + FL G +L ++ I ++ +P+S++S ++ L V
Sbjct: 212 PKPAPKRDDRPATNSSTTFLCVGAAAERVLVTERPIAGQLSLLPTSMTSKTYLELVKHVG 271
Query: 328 SRHKKVYKVK 337
+H K K+K
Sbjct: 272 QQHTKHSKMK 281
>gi|392575823|gb|EIW68955.1| hypothetical protein TREMEDRAFT_68884 [Tremella mesenterica DSM
1558]
Length = 446
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 87/331 (26%), Positives = 140/331 (42%), Gaps = 68/331 (20%)
Query: 85 SPGGEEKDQTHISHSAAEIRDVFGDSDEEDVGEYAIRNDIDQDSNRSPMEEEGSYEKSIR 144
+P EE+D + AA++ D+FGD D D+ A DS R+P E YE+ +
Sbjct: 35 TPKQEEED---VKTEAAQLDDLFGDEDA-DLPSTATPPSASPDSPRTPHPLE--YEELDQ 88
Query: 145 PEDIVPDEDMHYESEEEHVEPKHKEKPVGPPLELEIPLRPPPADPT--KMNMIKVSNIMG 202
P + EP E P L DPT K+ +++ +
Sbjct: 89 PGPVTE-------------EPAKAEIPAWKRL-----------DPTDGKVWHLRLPAYVN 124
Query: 203 IDPKPFDPKTY---VEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGMTSYESNARFVR 259
D KP+ P + E TD GAK R+ N +RWR V DG + ++N+R +R
Sbjct: 125 YDSKPYHPDLFRATAEVTDGKTDPLGAKGRMIGVRNTMRWRWVTGADGEPTRQTNSRMLR 184
Query: 260 WSDGSLQLQIGNEVLDIT---------------------VQDAQHDQAHLFLRHGKG-IL 297
WSDGS+ LQIG E+ D++ + ++ L G +L
Sbjct: 185 WSDGSVSLQIGLEMFDVSPSHNTLSRPSDPIPPTTSTLPTSSKESTNTYILLIDGTNQLL 244
Query: 298 QSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDI-DPEREKEEKERAE 356
+Q + + IP+S++S +H + V +H K ++K + D+ D +E ++A
Sbjct: 245 VTQNILAGTLSLIPTSMTSKTHLEILGHVGQQHVKHSRMK-ILDDLEDLSLVQELLQKAS 303
Query: 357 SQNIRANVLLNRKREKINRKYTQTVERRRQL 387
+ N + KI RK + ER R+
Sbjct: 304 TSN---------GKTKIVRKKSTGGERGRRF 325
>gi|336368233|gb|EGN96576.1| hypothetical protein SERLA73DRAFT_111176 [Serpula lacrymans var.
lacrymans S7.3]
Length = 538
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 66/235 (28%), Positives = 108/235 (45%), Gaps = 45/235 (19%)
Query: 161 EHVEPKHKEKPVGPPLE--LEIPLRPPP-ADPTKMNMIKVSNIMGIDPKPFDPKTYV--- 214
E+ EP+ V P E + IP P P +I++ N + +D KPF P TY+
Sbjct: 115 EYAEPEEPATLVEHPQEANVSIPNIPVPRTSDGDHWVIRMPNFVKVDSKPFHPDTYIGPE 174
Query: 215 EED-----TFVTDESGAKKRIRLENNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQI 269
ED E ++++EN VRW+ K++ +SN+R +RWSDG+L L++
Sbjct: 175 HEDEEAHHAETIREKSMTIKLKVENT-VRWKWDKDEFEQDKKQSNSRVIRWSDGTLSLRL 233
Query: 270 GNEVLDI-------------TVQDAQHDQA------------------HLFLRHGKG-IL 297
G E+ DI ++ +Q Q+ +L +H + +L
Sbjct: 234 GKELFDINQTVDTSAGVARTSLGGSQASQSSLAPSQQSASGSKSQGLTYLVAQHKRSELL 293
Query: 298 QSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCI-TDIDPEREKEE 351
Q++ I M P+ + S +HR+L V +H KV +++ +DPEREK E
Sbjct: 294 QAEAVITGYMSLRPTGMQSETHRMLVRAVGQKHNKVARLRMAPDPTMDPEREKME 348
>gi|302306853|ref|NP_983237.2| ACL167Cp [Ashbya gossypii ATCC 10895]
gi|299788718|gb|AAS51061.2| ACL167Cp [Ashbya gossypii ATCC 10895]
gi|374106442|gb|AEY95351.1| FACL167Cp [Ashbya gossypii FDAG1]
Length = 438
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 56/175 (32%), Positives = 89/175 (50%), Gaps = 12/175 (6%)
Query: 196 KVSNIMGIDPKPFDPKTY---VEEDT--FVTDESGAKKRIRLENNIVRWRTVKNKDGMTS 250
KV + IDP PFDP ++ +E+ F + E + ++ N +RWR + D
Sbjct: 138 KVPQFLTIDPVPFDPPSFQGQIEKRMARFSSKEDQLGDSL-IDENTIRWRYSRGSDQRVF 196
Query: 251 YESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRH-GKGILQS--QGRILKKM 307
ESNA+ + WSDG+L L++G+E DI + ++ +L + H + I+Q G I M
Sbjct: 197 KESNAQVIEWSDGTLSLKLGDEYTDILTNEIEN--TYLTVSHEQQEIMQCVEGGTISNSM 254
Query: 308 RFIPSSLSSNSHRLLTALVDSRHKKVYKV-KNCITDIDPEREKEEKERAESQNIR 361
FIP+S +S H+ LT + R + + I DPE EK E ++ Q +R
Sbjct: 255 LFIPTSTNSKIHKRLTKAIARREAREHSGPSTYIVRKDPELEKRELQKKHDQVLR 309
>gi|385304512|gb|EIF48526.1| member of the rna polymerase ii-associated paf1 complex [Dekkera
bruxellensis AWRI1499]
Length = 430
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 45/182 (24%), Positives = 87/182 (47%), Gaps = 23/182 (12%)
Query: 194 MIKVSNIMGIDPKPFDPKTYVEE-DTFVTDE-------------SGAKKRIRLEN----N 235
M + + +DP+PF P + E+ F+ ++ +G K I+ N
Sbjct: 86 MFNLPRFISVDPEPFSPTNFEEQLKEFIKEKEQEGEGSKDXATTNGIKDSIQFRKLQLLN 145
Query: 236 IVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKG 295
+RWR K G +SNA+ + W DG++ L++GNE +I ++ ++ G
Sbjct: 146 TIRWRYAKTPSGELYKQSNAKIIEWEDGTMSLKVGNEFFEIKF---NRNEDNILAFQGGQ 202
Query: 296 ILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHK--KVYKVKNCITDIDPEREKEEKE 353
+L + K ++ +P S +S +H++L + + + K + K+ +T +DPE + E E
Sbjct: 203 VLIGAFNVNKSIKVLPPSTNSIAHKILASTIQNNMKLRRTRKINTIVTKVDPELKARENE 262
Query: 354 RA 355
+A
Sbjct: 263 KA 264
>gi|363752297|ref|XP_003646365.1| hypothetical protein Ecym_4511 [Eremothecium cymbalariae
DBVPG#7215]
gi|356890000|gb|AET39548.1| hypothetical protein Ecym_4511 [Eremothecium cymbalariae
DBVPG#7215]
Length = 461
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 57/181 (31%), Positives = 97/181 (53%), Gaps = 12/181 (6%)
Query: 190 TKMNMIKVSNIMGIDPKPFDPKTY---VEEDT--FVTDESGAKKRIRLENNIVRWRTVKN 244
+ + KV + IDP PFDP + +E+ F + E R+ ++ N +RWR +
Sbjct: 138 SSLYYAKVPQFLTIDPIPFDPPGFQSQIEQRLKRFSSKEDQLGDRL-IDENTIRWRYSRG 196
Query: 245 KDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGK-GILQS--QG 301
+ ESNA+ ++WSDG+ L++G+E DI + D ++ ++ + H + ++Q G
Sbjct: 197 ANQQVFKESNAQIIQWSDGTFSLKLGDEFSDILINDIEN--TYMTVSHEQEELMQCVEGG 254
Query: 302 RILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKV-KNCITDIDPEREKEEKERAESQNI 360
I K M FIP+S +S +H+ L+A + R + + I +DPE EK+E E+ Q I
Sbjct: 255 EIKKSMLFIPTSTNSKTHQRLSAAIARREAREHSGPSTYIVRMDPELEKKELEKKHDQVI 314
Query: 361 R 361
R
Sbjct: 315 R 315
>gi|348682464|gb|EGZ22280.1| hypothetical protein PHYSODRAFT_542915 [Phytophthora sojae]
Length = 391
Score = 79.3 bits (194), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 66/253 (26%), Positives = 121/253 (47%), Gaps = 41/253 (16%)
Query: 178 LEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIV 237
+ +P P K + K+ NI+ + P+P+ + E +DE+ L N V
Sbjct: 60 MWLPKTPRAPKTAKYFISKMPNILRLVPEPYTKEAIRAEMENPSDET-------LYRNYV 112
Query: 238 RWR----------TVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDIT---------- 277
RWR + +K + ESNA+ V+W DG+ + +G E L ++
Sbjct: 113 RWRYKRDNATGRVLLDDKTKLPLRESNAKLVQWEDGTFSMFVGKEALTLSRQKLANSFLF 172
Query: 278 VQDAQHDQAHL-----FLRHGKGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKK 332
V + D H + + +L++ R+ +K P + +S SH+ LT + ++H K
Sbjct: 173 VNEMASDSPHFQDSDDVVPGQESVLEAHARLKEKFTIRPMTTASKSHKSLTMSMRAKHNK 232
Query: 333 -VYKVKNCITDIDPEREKEEKERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGY 391
V K+K I+++D ERE+E++ + + +R L NRK+ + +Y + ER ++ +
Sbjct: 233 SVQKLKEYISELDGEREQEQRVKITDEKLR---LQNRKKARQGYEYDR--ERSSRMDAQF 287
Query: 392 LE---DALEEDDE 401
LE DA++ DD+
Sbjct: 288 LEEGYDAMDYDDD 300
>gi|448112568|ref|XP_004202130.1| Piso0_001610 [Millerozyma farinosa CBS 7064]
gi|359465119|emb|CCE88824.1| Piso0_001610 [Millerozyma farinosa CBS 7064]
Length = 403
Score = 79.3 bits (194), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 77/260 (29%), Positives = 115/260 (44%), Gaps = 29/260 (11%)
Query: 102 EIRDVFG-DSDEEDVGEYAIRNDIDQDSNR---SPMEEEGSYEKSIRPEDIVPDEDMHYE 157
E+ D+FG DS+ E V E I ++D N+ + E G ++ ED +ED +
Sbjct: 16 EVDDLFGEDSEGEKVDEEKIEQSEEEDENKQGSTAQETRGGHDDDEENED-SAEEDFDFS 74
Query: 158 SEEEHVEPKHKEKPVGPPLELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTY---- 213
SE K K + P +P P + KM + + +D PFD +
Sbjct: 75 SER-----KLKNLDISIPRHAVLPNLEPSSYVMKMPVF-----LNVDAHPFDASEFKETV 124
Query: 214 ----VEEDTFVTDESGAKKRI----RLENNIVRWRTVKNKDGMTSYESNARFVRWSDGSL 265
V+ DE + + L N +RWR D +SNA FV+W DGSL
Sbjct: 125 KQNAVDRSNRDLDEKQMQNDLVAEKLLNENTIRWRYSNRGDDEIIKQSNAHFVQWDDGSL 184
Query: 266 QLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRFIPSSLSSNSHRLLTAL 325
L+IGNE+ D + +D + ILQ I ++ IPSS +++HR LT
Sbjct: 185 SLKIGNELFDFK-ELPLYDNLLVKSHDDHEILQVDSTIKSQVNLIPSSTFTDTHRKLTQA 243
Query: 326 VDSRHKKVYKVKNCITDIDP 345
V + +K K+ N +TD DP
Sbjct: 244 VKNIQRK-DKILNTVTDTDP 262
>gi|255731472|ref|XP_002550660.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
gi|240131669|gb|EER31228.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
Length = 374
Score = 79.0 bits (193), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 62/194 (31%), Positives = 99/194 (51%), Gaps = 27/194 (13%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTY---VEEDTFVTDESG-AKKRIR--------LENNIVRW 239
++++K + + ++ PFDP + +EE+ G K I L N +RW
Sbjct: 80 VSLLKTPDFLNLEAHPFDPTAFKEQIEENQNARKSKGLTAKEIHNEQITEKLLNENTIRW 139
Query: 240 RTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKG--IL 297
R + +SNA FV W+DGS+ L++GNE+ D+ + +D H+ +R + IL
Sbjct: 140 RYHNAGNDDIIKQSNAHFVAWNDGSVSLKVGNEIYDVR-ELPLYD--HMLVRSYETAEIL 196
Query: 298 QSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAES 357
QS I K M +P +S++HR + LV + KK K+ N ITD+DP EK+R
Sbjct: 197 QSDSIISKTMNLLP---ASSAHRKI--LVKNITKK-EKILNTITDVDP----LEKQRIAD 246
Query: 358 QNIRANVLLNRKRE 371
+N R + L R+ E
Sbjct: 247 ENARKAMKLKRQLE 260
>gi|159482508|ref|XP_001699311.1| Paf1 complex component [Chlamydomonas reinhardtii]
gi|158272947|gb|EDO98741.1| Paf1 complex component [Chlamydomonas reinhardtii]
Length = 106
Score = 79.0 bits (193), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 45/107 (42%), Positives = 59/107 (55%), Gaps = 6/107 (5%)
Query: 207 PFDPKTYVEEDTFVTDESGAKKRIRLENNIVRWRTV--KNKDGMTSY--ESNARFVRWSD 262
PFDP+T+ E+ TD+ G K + +RWR V K DG ESNARFVRWSD
Sbjct: 1 PFDPETFEAEEEIFTDDKGFTKIKPADRTKIRWRYVQRKTADGRDEIVPESNARFVRWSD 60
Query: 263 GSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRF 309
GSLQL +G+EV D+ D H ++ G++Q Q + KM F
Sbjct: 61 GSLQLLLGDEVFDVDTADMSAQ--HAYMVAYSGVVQGQAALTTKMGF 105
>gi|68471259|ref|XP_720406.1| hypothetical protein CaO19.7116 [Candida albicans SC5314]
gi|77022394|ref|XP_888641.1| hypothetical protein CaO19_7116 [Candida albicans SC5314]
gi|46442272|gb|EAL01563.1| hypothetical protein CaO19.7116 [Candida albicans SC5314]
gi|76573454|dbj|BAE44538.1| hypothetical protein [Candida albicans]
Length = 317
Score = 79.0 bits (193), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 61/192 (31%), Positives = 99/192 (51%), Gaps = 27/192 (14%)
Query: 194 MIKVSNIMGIDPKPFDPKTY---VEEDTFVTDESG-AKKRIR--------LENNIVRWRT 241
++K + + I+ PFDP ++ +E + V E G K I L N +RWR
Sbjct: 38 LLKTPDFLNIEAHPFDPSSFKEQIESNNKVRRERGLTAKEIHNEQMTEKLLNENTIRWRY 97
Query: 242 VKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKG--ILQS 299
+ +SNA FV WSDGS+ L++GNEV D+ + D HL ++ + +LQ+
Sbjct: 98 HNAGNDEIVKQSNAHFVEWSDGSVSLKVGNEVYDVR-ELPMFD--HLLVKSHQAAEVLQA 154
Query: 300 QGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAESQN 359
+ K + +P +S+SHR + LV + KK K+ N ITD+DP EK+R ++
Sbjct: 155 DSILSKSINLLP---ASSSHRKI--LVKNITKK-EKILNTITDVDP----LEKQRLADED 204
Query: 360 IRANVLLNRKRE 371
+R + + R+ E
Sbjct: 205 VRKAMKMKRQME 216
>gi|190347856|gb|EDK40207.2| hypothetical protein PGUG_04305 [Meyerozyma guilliermondii ATCC
6260]
Length = 373
Score = 78.6 bits (192), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 59/195 (30%), Positives = 97/195 (49%), Gaps = 28/195 (14%)
Query: 194 MIKVSNIMGIDPKPFDPKTYVE--EDTF------------VTDESGAKKRIRLENNIVRW 239
++K+ + +D PFDP + + + F + ++ A+K L +N VRW
Sbjct: 77 LLKMPVFLNVDAHPFDPSEFKQKVQQNFEQRKKSDMSSKQIHNDLVAEKL--LNDNTVRW 134
Query: 240 RTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLD---ITVQDAQHDQAHLFLRHGKGI 296
R D +SNA F++W DGSL L+IGNE+ D + + D +AH L I
Sbjct: 135 RYSNRGDDEIIKQSNAHFIQWDDGSLSLKIGNELFDYKTLPITDNFLVRAHDDLE----I 190
Query: 297 LQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAE 356
LQ+ + K + +P+S ++++HR LT V + +K + N IT+ DP K+R
Sbjct: 191 LQNDAILTKSVNLLPTSTTASTHRKLTQAVKNIQRKD-TILNTITENDPMM----KQRLA 245
Query: 357 SQNIRANVLLNRKRE 371
+N R + L R+ E
Sbjct: 246 DENDRKALKLKRQLE 260
>gi|344299880|gb|EGW30233.1| hypothetical protein SPAPADRAFT_63850 [Spathaspora passalidarum
NRRL Y-27907]
Length = 387
Score = 78.6 bits (192), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 69/232 (29%), Positives = 113/232 (48%), Gaps = 31/232 (13%)
Query: 194 MIKVSNIMGIDPKPFDPKTY---------VEEDTFVTDESGAKKRI--RLEN-NIVRWRT 241
M+K+ + ++ PFDP + + + T +T++ + +L N N +RWR
Sbjct: 84 MLKMPVFLNVEAHPFDPNEFKARIKENAELRQKTDLTEKQKHNDLVSEKLTNENTIRWRY 143
Query: 242 VKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDIT---VQDAQHDQAHLFLRHGKGILQ 298
D + +SNA F++W DGSL L+IG E+ D+ + D ++H L ILQ
Sbjct: 144 SNLNDEIIK-QSNAHFIQWDDGSLSLKIGQELFDLKELPIFDNYLVRSHDSLE----ILQ 198
Query: 299 SQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAESQ 358
+ I K + +P+S S+++HR LT V + KK K+ ITD DP +++ + E +
Sbjct: 199 NDSIITKTINLLPASTSTSTHRKLTQAVKNIQKKA-KILGTITDDDPLKKQRLADEYERK 257
Query: 359 NIRANVLLNRKR-------EKINRKYTQTVER---RRQLSTGYLEDALEEDD 400
++ L KR E+ N +T ER + + Y ED EEDD
Sbjct: 258 TLKMKRQLESKRRLQEERLERSNSPSLRTNERPSAYERYARTYGEDEYEEDD 309
>gi|19113055|ref|NP_596263.1| RNA polymerase II associated Paf1 complex (predicted)
[Schizosaccharomyces pombe 972h-]
gi|74625294|sp|Q9P6R2.1|YOH8_SCHPO RecName: Full=Uncharacterized protein C13E7.08c
gi|7688328|emb|CAB89883.1| RNA polymerase II associated Paf1 complex (predicted)
[Schizosaccharomyces pombe]
Length = 429
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 63/223 (28%), Positives = 102/223 (45%), Gaps = 37/223 (16%)
Query: 152 EDMHYESEEEHVEPKHKEKPVGPPLELEIP-LRPPPADPTKMNMIKVSNIMGIDPKPFDP 210
+D + + EE E + PV LE+ +P + P + + + N + ++ P+DP
Sbjct: 47 DDGLFSNTEEATEAPEADVPVKKVLEVAVPNFKSPASASNDVFHAHIPNFLSVEQTPYDP 106
Query: 211 KTYVEEDTFVTD----ESGAKKRIRLE-NNIVRWRTVKNKDGMTSYESNARFVRWSDGSL 265
+ Y E ++ +RI+ + +N VRWR + SY+SNA+ V+WSDGS
Sbjct: 107 EQYAAEAEADAALLEHDAHWGQRIKHKVDNTVRWRLGPSG----SYQSNAQIVQWSDGSY 162
Query: 266 QLQIGNEVLDI--------TVQDAQHDQAHLFLRHGKGILQSQGRILKKMRFIPSSLSSN 317
L+IGN++ D T A H+ HL L+ Q F+PS++++
Sbjct: 163 SLRIGNDIYDTQNKLISQPTFVTASHEAQHL--------LRVQTSFKSSFTFLPSAINTA 214
Query: 318 ------SHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKER 354
S RL T V SR V+ I + DPE K + E+
Sbjct: 215 TRSKLPSMRLTTVQVPSR-----SVQEIIIEKDPELLKRQAEK 252
>gi|320583256|gb|EFW97471.1| member of the RNA polymerase II-associated Paf1 complex [Ogataea
parapolymorpha DL-1]
Length = 376
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 56/195 (28%), Positives = 97/195 (49%), Gaps = 19/195 (9%)
Query: 177 ELEIPLRPPPADPTKMNMI-KVSNIMGIDPKPFDPKTYVEE-DTFVTD---ESGAK---- 227
++ IP P PT M+ + + +DP+PF P T+ + F+ D +S +K
Sbjct: 65 DISIPRHPRSHVPTGDAMVFSLPRYLFVDPEPFAPTTFESNINEFLKDSLKDSTSKELQD 124
Query: 228 ----KRIRLENNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQH 283
K++ ++N I RWR K +SNA V W DGS+ L+IG+E +I + +
Sbjct: 125 SLEFKKLEVQNTI-RWRYAKTASDELYKQSNASIVEWEDGSMSLKIGDEFFNIKL---NN 180
Query: 284 DQAHLFLRHGKGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDS--RHKKVYKVKNCIT 341
++ + + + + K ++ +PSS SS +H++L + S R KK K+ +T
Sbjct: 181 NEDEILVAESGDLYLPVTELKKSIQVLPSSTSSRAHKILANTLQSNLRLKKSKKINTIVT 240
Query: 342 DIDPEREKEEKERAE 356
DPE + E E+A+
Sbjct: 241 TEDPELKAREMEKAQ 255
>gi|241957057|ref|XP_002421248.1| RNA polymerase-associated protein, putative [Candida dubliniensis
CD36]
gi|223644592|emb|CAX40580.1| RNA polymerase-associated protein, putative [Candida dubliniensis
CD36]
Length = 350
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 61/195 (31%), Positives = 98/195 (50%), Gaps = 33/195 (16%)
Query: 194 MIKVSNIMGIDPKPFDPKTY---VEEDTFVTDESG-AKKRIR--------LENNIVRWRT 241
++K + + I+ PFDP ++ +E + E G K I L N +RWR
Sbjct: 65 LLKTPDFLNIEAHPFDPSSFKEQIESNNKARAERGLTAKEIHNEQMTEKLLNENTIRWRY 124
Query: 242 VKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDIT---VQDAQHDQAHLFLRHGKG--I 296
+ +SNA FV W+DGS+ L++GNE+ D+ + D HL ++ + I
Sbjct: 125 HNAGNDEIVKQSNAHFVEWNDGSVSLKVGNEIYDVRELPLLD------HLLVKSHQAAEI 178
Query: 297 LQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAE 356
LQS I K + +P +S+SHR + LV + KK K+ N ITD+DP EK+R
Sbjct: 179 LQSDSIISKAINLLP---ASSSHRKI--LVKNITKK-EKILNTITDVDP----LEKQRLA 228
Query: 357 SQNIRANVLLNRKRE 371
+++R + + R+ E
Sbjct: 229 DEDVRKAMKMKRQME 243
>gi|146415212|ref|XP_001483576.1| hypothetical protein PGUG_04305 [Meyerozyma guilliermondii ATCC
6260]
Length = 373
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 59/195 (30%), Positives = 97/195 (49%), Gaps = 28/195 (14%)
Query: 194 MIKVSNIMGIDPKPFDPKTYVE--EDTF------------VTDESGAKKRIRLENNIVRW 239
++K+ + +D PFDP + + + F + ++ A+K L +N VRW
Sbjct: 77 LLKMPVFLNVDAHPFDPSEFKQKVQQNFEQRKKSDMSLKQIHNDLVAEKL--LNDNTVRW 134
Query: 240 RTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLD---ITVQDAQHDQAHLFLRHGKGI 296
R D +SNA F++W DGSL L+IGNE+ D + + D +AH L I
Sbjct: 135 RYSNRGDDEIIKQSNAHFIQWDDGSLSLKIGNELFDYKTLPITDNFLVRAHDDLE----I 190
Query: 297 LQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAE 356
LQ+ + K + +P+S ++++HR LT V + +K + N IT+ DP K+R
Sbjct: 191 LQNDAILTKLVNLLPTSTTASTHRKLTQAVKNIQRK-DTILNTITENDPMM----KQRLA 245
Query: 357 SQNIRANVLLNRKRE 371
+N R + L R+ E
Sbjct: 246 DENDRKALKLKRQLE 260
>gi|353234693|emb|CCA66715.1| hypothetical protein PIIN_00395 [Piriformospora indica DSM 11827]
Length = 449
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 53/209 (25%), Positives = 89/209 (42%), Gaps = 43/209 (20%)
Query: 182 LRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLE-NNIVRWR 240
L P + K+ M+++ + +D PF P T+ + + I+LE N +RW+
Sbjct: 77 LGVPRSSDGKVWMLRLPQYLKVDSHPFHPDTF---QGYSDKDKSQASAIKLELTNTIRWK 133
Query: 241 TVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDA------------------- 281
++ DG + +SNAR ++W DG++ LQ+GNE+ DI+ D
Sbjct: 134 WIRKDDGTMAKQSNARVIKWDDGTMSLQLGNEMFDISTTDETTTTHRGTETPPLSQSQSQ 193
Query: 282 ------------------QHDQAHLFLRH-GKGILQSQGRILKKMRFIPS-SLSSNSHRL 321
QH ++L +H + +LQ + I M P+ S HR
Sbjct: 194 SQSQSQSQSKSQQTNSMRQHGVSYLVTQHSAQQLLQVEATITGSMSLRPAVGTQSTLHRQ 253
Query: 322 LTALVDSRHKKVYKVKNCITDIDPEREKE 350
L V +H KV K++ + E E+E
Sbjct: 254 LVQAVAKKHSKVSKLRFASDAMMREAERE 282
>gi|238883181|gb|EEQ46819.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 317
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 60/192 (31%), Positives = 98/192 (51%), Gaps = 27/192 (14%)
Query: 194 MIKVSNIMGIDPKPFDPKTY---VEEDTFVTDESG-AKKRIR--------LENNIVRWRT 241
++K + + I+ PFDP ++ +E + E G K I L N +RWR
Sbjct: 38 LLKTPDFLNIEAHPFDPSSFKEQIESNNKARRERGLTAKEIHNEQMTEKLLNENTIRWRY 97
Query: 242 VKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKG--ILQS 299
+ +SNA FV WSDGS+ L++GNEV D+ + D HL ++ + +LQ+
Sbjct: 98 HNAGNDEIVKQSNAHFVEWSDGSVSLKVGNEVYDVR-ELPMFD--HLLVKSHQAAEVLQA 154
Query: 300 QGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAESQN 359
+ K + +P +S+SHR + LV + KK K+ N ITD+DP EK+R ++
Sbjct: 155 DSILSKSINLLP---ASSSHRKI--LVKNITKK-EKILNTITDVDP----LEKQRLADED 204
Query: 360 IRANVLLNRKRE 371
+R + + R+ E
Sbjct: 205 VRKAMKMKRQME 216
>gi|260941796|ref|XP_002615064.1| hypothetical protein CLUG_05079 [Clavispora lusitaniae ATCC 42720]
gi|238851487|gb|EEQ40951.1| hypothetical protein CLUG_05079 [Clavispora lusitaniae ATCC 42720]
Length = 333
Score = 75.9 bits (185), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 58/194 (29%), Positives = 93/194 (47%), Gaps = 24/194 (12%)
Query: 194 MIKVSNIMGIDPKPFDP---KTYVEEDTFVTDESGAKKRIR---------LENNIVRWRT 241
+IKV + +D PFDP K V E+ S + L N +RWR
Sbjct: 51 IIKVPVFLQVDAHPFDPNEFKAKVAENARERHASAMDAKAMENDVIAEKLLNQNTLRWRY 110
Query: 242 VKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLD---ITVQDAQHDQAHLFLRHGKGILQ 298
+ +SNA FV+W DGSL L++G+E+ D + + D ++H+ +LQ
Sbjct: 111 TNAGNDEIVKQSNAHFVQWDDGSLSLKVGSELFDFRALPLADNFLARSHV----DHEVLQ 166
Query: 299 SQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAESQ 358
+ + K +P+S + +HR LT V + KK K+ N +TD DP K+RA +
Sbjct: 167 NDSILTKTASVLPASTGTATHRQLTQAVKNIQKK-DKILNTLTDADP----MSKQRAADE 221
Query: 359 NIRANVLLNRKREK 372
+ R ++ L R+ E+
Sbjct: 222 HERRSLKLRRQMEQ 235
>gi|321253026|ref|XP_003192602.1| hypothetical protein CGB_C1520W [Cryptococcus gattii WM276]
gi|317459071|gb|ADV20815.1| conserved hypothetical protein [Cryptococcus gattii WM276]
Length = 468
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 58/212 (27%), Positives = 93/212 (43%), Gaps = 33/212 (15%)
Query: 154 MHYESEEEHVEPKHKEKPVGPPLELEIPLRP-PPADPTKMNMIKVSNIMGIDPKPFDPKT 212
+ Y E+E P+ + + L IP P A K+ +K + +DPKPFD
Sbjct: 76 LEYVEEDEEAVPQRQNV-----VTLPIPQWPHMTATDGKVWQMKFPAYINLDPKPFDSDL 130
Query: 213 Y---VEEDTF--VTDESGAKKRIRLENNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQL 267
Y EE+ D AK + N +RWR V DG +SNAR +RWSDGS+ L
Sbjct: 131 YRATQEEEPIDGAADPIAAKSMMIGVKNTIRWRWVTGPDGEPVRQSNARMLRWSDGSVTL 190
Query: 268 QIGNEVLDITVQDAQH-------------------DQAHLFLRHGKG---ILQSQGRILK 305
Q+G++ D+ + + + FL G +L ++ I
Sbjct: 191 QLGDDFYDVAPSQSATLARPSDPQPVPKRDDRPAVNSSTTFLCVGAAAERVLVTERPIAG 250
Query: 306 KMRFIPSSLSSNSHRLLTALVDSRHKKVYKVK 337
++ +P+S++S ++ L V +H K K+K
Sbjct: 251 QLSLLPTSMTSKTYLELVKHVGQQHTKHSKMK 282
>gi|170086810|ref|XP_001874628.1| RNA polymerase II-associated protein [Laccaria bicolor S238N-H82]
gi|164649828|gb|EDR14069.1| RNA polymerase II-associated protein [Laccaria bicolor S238N-H82]
Length = 504
Score = 75.5 bits (184), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 56/210 (26%), Positives = 93/210 (44%), Gaps = 41/210 (19%)
Query: 177 ELEIPLRPPPADPTKMN-MIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENN 235
E+ P P P N +I++ N + +D KPF P DT++ E ++ +++
Sbjct: 113 EVSFPNLPVPRSSDGDNWVIRMPNFVKVDSKPFHP------DTYIGPEQDDEETQHIDST 166
Query: 236 IVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDI------------------- 276
R T+K K +SN+R ++WSDGSL L++G E+ DI
Sbjct: 167 RERSMTIKLKVENMQRQSNSRIIQWSDGSLSLRLGKELFDIKQDRDTSAGVARQYIGAAS 226
Query: 277 -------------TVQDAQHDQAHLFLRHGKG-ILQSQGRILKKMRFIPSSLSSNSHRLL 322
+ A +L +H + +LQS+ I M P+ + S +HR+L
Sbjct: 227 QQSQSSSQLPQSQSQPPAAKGLTYLVAQHKRSQVLQSEAVISGYMSLRPTGMQSETHRML 286
Query: 323 TALVDSRHKKVYKVKNCI-TDIDPEREKEE 351
V +H +V +++ +DPEREK E
Sbjct: 287 VRAVGQKHSRVARLRMAPEPTVDPEREKME 316
>gi|342319404|gb|EGU11353.1| RNA polymerase-associated protein LEO1 [Rhodotorula glutinis ATCC
204091]
Length = 552
Score = 75.1 bits (183), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 52/168 (30%), Positives = 77/168 (45%), Gaps = 41/168 (24%)
Query: 235 NIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDIT-------VQDAQ----- 282
N +RWR K++ G +SNAR VRWSDGSL LQ+G E+ D++ V A
Sbjct: 242 NTIRWRWTKDELGQVIKQSNARIVRWSDGSLSLQLGAELFDMSLSLDHSAVMTASAAAGL 301
Query: 283 -----------------------HDQAHLFLRHGKG--ILQSQGRILKKMRFIPSSLSSN 317
H +L RH + ++Q + M F P++L+SN
Sbjct: 302 PVPPAINPVTAGLTASSFDTSRGHGLTYLTARHTYNGQLSEAQASVHGSMSFRPATLTSN 361
Query: 318 SHRLLTALVDSRHKKV---YKVKNC-ITDIDPEREKEEKERAESQNIR 361
+H+ L + R V KVK + +DPER++ EKE+A + R
Sbjct: 362 THKRLAGSIRGRLNDVDAKRKVKMQELPAMDPERQRLEKEKAAQEKAR 409
>gi|354544297|emb|CCE41020.1| hypothetical protein CPAR2_300090 [Candida parapsilosis]
Length = 398
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 66/227 (29%), Positives = 106/227 (46%), Gaps = 36/227 (15%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEE--------------DTFVT 221
LEL +P + ++K + IDP PFDP T+ EE +
Sbjct: 76 LELSLPRHAVSSMTEGTQLLKTPMHLNIDPHPFDPITFKEEVETNELERVEKGLTSKQIY 135
Query: 222 DESGAKKRIRLENNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDA 281
+E A+K I VRWR + + +SNA FV+W DG++ L+IGNE+ D+
Sbjct: 136 NEKLAEKLIN--ETTVRWRYHNSGNDEIVKQSNAHFVQWDDGTISLKIGNEMFDVKTLPV 193
Query: 282 QHDQAHLFLRHGKG-ILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCI 340
+ +L + ILQ+ + + +P+S S + H+ LT ++ + H K K+ N
Sbjct: 194 ADN--YLVKSYKTAEILQTDSIVENTINLLPAS-SDSIHKRLTQVMKNIHFKD-KILNTT 249
Query: 341 TDIDPEREKEEKERAESQNIRANVLLNRKREKINRKYTQTVERRRQL 387
T+ DP + K+R +N RKR K+ R+ +E RR+L
Sbjct: 250 TEDDPLK----KQRIADEN-------ERKRMKMKRQ----LESRRRL 281
>gi|443925408|gb|ELU44249.1| Leo1 domain-containing protein [Rhizoctonia solani AG-1 IA]
Length = 505
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 43/156 (27%), Positives = 76/156 (48%), Gaps = 29/156 (18%)
Query: 234 NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQ-------- 285
N +RWR ++ +D +SNAR V+WSDGS+ LQ+G E+ DI +A+ +
Sbjct: 180 TNTIRWRWIQGEDVPQKKQSNARIVKWSDGSMSLQLGTEIFDINA-NAEGSRPVPAASQP 238
Query: 286 ------------------AHLFLRHG-KGILQSQGRILKKMRFIPSSLSSNSHRLLTALV 326
++L+ +H G+LQ + I + P+ + S +HR L V
Sbjct: 239 QTQSQTQSSQTPKRAGGLSYLYTQHKHAGVLQCEVPITGTLTLQPTGMFSATHRQLVRAV 298
Query: 327 DSRHKKVYKVKNCI-TDIDPEREKEEKERAESQNIR 361
RH + +++ +DPERE+ E ++ +++ R
Sbjct: 299 GQRHSRTARLRLAPEPTMDPEREQRELQKTANRSAR 334
>gi|331242705|ref|XP_003333998.1| hypothetical protein PGTG_15728 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309312988|gb|EFP89579.1| hypothetical protein PGTG_15728 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 433
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 80/326 (24%), Positives = 148/326 (45%), Gaps = 64/326 (19%)
Query: 184 PPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKK--RIRLENNIVRWRT 241
P P+D +M ++ N + +P PF + ++ E+ E G + ++ + N++RWR
Sbjct: 110 PKPSDGKHWDM-RIPNFLSFNPLPFSDEEFLNENESSEKEGGESEAAKVLTDQNVIRWRW 168
Query: 242 VKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLD-ITVQDAQ---------------HDQ 285
K+ +G SNAR + W DGS LQ+G+E+ D I+ D +
Sbjct: 169 QKDSEGNPVKRSNARVICWEDGSQSLQVGSELFDMISTVDGRQNNNTTTQQQPGATTQGL 228
Query: 286 AHLFLRHGK-GILQSQGRILKKMRFIPSSLSSNSHRLLTA---LVDSRHKKVYKVKNCIT 341
+LF +H + +L++Q I ++ P SL+S +HR + A L+ + ++ ++ +
Sbjct: 229 TYLFAQHSELKLLEAQASITGQVTLRPYSLNSMTHRNIVANRSLLKANSQRQTSLR--VI 286
Query: 342 DIDPEREKEEKERAESQNIRANVLLNRKREKINRKYTQTVE-------RRRQLSTGYLED 394
+DPE+EK +KE E + ++ + K + N + T +E R+R+ + L D
Sbjct: 287 TLDPEKEKIDKELEEERKLKDLKKQDAKLARQNSRQTNLLEGGSRGGVRKRRFNAPLLSD 346
Query: 395 ALEED--------------------DETDYHDSRRSRRRFEE------ELE-AEVRAEKR 427
E D ++ + DS R+ R+ ELE A+ + E++
Sbjct: 347 EGEFDQVDEEEEEEEEEYEEEMEEDEDENLDDSGRNHRQPSHHEPQLTELELADQKLEEQ 406
Query: 428 IINAKKP-----QGHRDIPRKSSTLP 448
++ KK +GH P K +P
Sbjct: 407 ELDRKKKSSDRDRGHVQKPSKPEPVP 432
>gi|448537088|ref|XP_003871259.1| hypothetical protein CORT_0H00160 [Candida orthopsilosis Co 90-125]
gi|380355616|emb|CCG25134.1| hypothetical protein CORT_0H00160 [Candida orthopsilosis]
Length = 389
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 83/304 (27%), Positives = 138/304 (45%), Gaps = 32/304 (10%)
Query: 110 SDEEDVGEYAIRNDIDQ---DSNRSPMEEEGSYEKSIRPEDIVPDEDMHYESEEEHVEPK 166
S+ ED ++ +ID D P + E +Y KS + D S+ E +
Sbjct: 2 SEVEDNSPGGLKEEIDDLFGDEGEEPQQNE-AYLKSETEDAPSGRIDFDQSSDGEDYDDA 60
Query: 167 HKEKPVGPPLELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTY---VEEDTF---- 219
EK + LEL +P A ++K + IDP PFDP T+ VE +
Sbjct: 61 DSEKNI---LELSLPRHAVTALTEGTQILKTPMHLNIDPHPFDPTTFKEEVENNALERVQ 117
Query: 220 -------VTDESGAKKRIRLENNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNE 272
+ +E A+K I + +RWR + + +SNA FV+W DG++ L+IGNE
Sbjct: 118 KGLTSKQIYNEQLAEKLINA--STIRWRYHNSGNDEIVKQSNAHFVQWDDGTVSLKIGNE 175
Query: 273 VLDITVQDAQHDQAHLFLRHGKG-ILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHK 331
+ D V+ +L + ILQ+ I K + +P+S + + H+ LT ++ + H
Sbjct: 176 MFD--VKSLPTTDNYLVKSYKTAEILQTDSLIEKTINLLPAS-NDSIHKRLTQVMKNIHF 232
Query: 332 KVYKVKNCITDIDPEREKEEKERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGY 391
K K+ N T+ DP + K+R +N R + + R+ E R + ER ++G+
Sbjct: 233 KD-KILNTTTEDDPLK----KQRIADENERKKMKMKRQLESRRRLEEEKFERGDSPASGF 287
Query: 392 LEDA 395
E +
Sbjct: 288 KESS 291
>gi|150866861|ref|XP_001386594.2| member of the RNA polymerase II-associated Paf1 complex
[Scheffersomyces stipitis CBS 6054]
gi|149388117|gb|ABN68565.2| member of the RNA polymerase II-associated Paf1 complex
[Scheffersomyces stipitis CBS 6054]
Length = 347
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 92/191 (48%), Gaps = 21/191 (10%)
Query: 195 IKVSNIMGIDPKPFDPKTYVE----------EDTFVTDESGAKKRI---RLENNIVRWRT 241
+K+ + ++ PFDP + E T +TD+ + + L N +RWR
Sbjct: 52 LKMPVFLNVEAHPFDPTEFKERVQQTAAERSTRTSLTDKQKQNELVAEKLLNENTLRWRY 111
Query: 242 VKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRH-GKGILQSQ 300
+ + +SN+ FV+W+DGSL L+IGNE+ D V++ L H ILQ
Sbjct: 112 SNSGNDEIIKQSNSHFVQWNDGSLSLKIGNELFD--VRELPLTDNFLVKTHDSYEILQHT 169
Query: 301 GRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAESQNI 360
+ K + +PSS +++HR LT V S K K+ N IT+ DP K+R +N
Sbjct: 170 SILNKTINLLPSSTLTSTHRKLTQAVKSIQTK-DKILNTITNDDPMM----KQRLADENE 224
Query: 361 RANVLLNRKRE 371
R + L R+ E
Sbjct: 225 RKTLRLRRQLE 235
>gi|213407026|ref|XP_002174284.1| RNA polymerase-associated protein LEO1 [Schizosaccharomyces
japonicus yFS275]
gi|212002331|gb|EEB07991.1| RNA polymerase-associated protein LEO1 [Schizosaccharomyces
japonicus yFS275]
Length = 424
Score = 71.6 bits (174), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 73/272 (26%), Positives = 123/272 (45%), Gaps = 34/272 (12%)
Query: 103 IRDVFGDSDEEDVGEYAIR---NDIDQDSNRSPMEEEGSYEKSIRPEDIVPDEDMHYESE 159
+ D+FGD + V E + D SN S +++ S + + ED V DED +
Sbjct: 13 LDDLFGDESDGHVSENESKPQEGTGDASSNVSK-KQQNSTDNGGKNEDAVDDEDEKWLFS 71
Query: 160 EEHVEPKHKE-KPVGPPLELEIP-LRPPPADPTKMNMIKVSNIMGIDPKPFDPKTY---V 214
+E V H+E K E ++P PP + + N + ++ +PFD + Y
Sbjct: 72 DEEVAEGHQEQKAQVKVFEAKVPNYHPPGKQEDETIHAHMPNFLSMEQRPFDHEVYRTEA 131
Query: 215 EEDTFVTD---ESGAKKRIRLENNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGN 271
E D + + E G + +++N I RWR +N + SNA+ V+WSDG+ L+IGN
Sbjct: 132 EADAEMIEHNMEWGQMIKHKVDNTI-RWRFDENGKKV----SNAQLVQWSDGTYSLRIGN 186
Query: 272 EVLDI--------TVQDAQHDQAHLFLRHGKGILQSQGRILKKMRFIPSSL-SSNSHRLL 322
++ D T H+ HL L+ Q + + F+PS++ ++ + R
Sbjct: 187 DIFDAQSKPVSQPTFLTVSHEAQHL--------LRVQASFKQSLTFLPSAINTAKTARAP 238
Query: 323 TALVDSRHKKVYKVKNCITDIDPEREKEEKER 354
+ + V+ IT+ DPE K E E+
Sbjct: 239 AMRYFNTPPRTRGVQEIITEKDPELLKRETEK 270
>gi|119597838|gb|EAW77432.1| Leo1, Paf1/RNA polymerase II complex component, homolog (S.
cerevisiae), isoform CRA_a [Homo sapiens]
Length = 606
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 68/131 (51%), Gaps = 7/131 (5%)
Query: 264 SLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRFIPSSLSSNSHRLLT 323
S+ L +GNEV D+ Q D HLF+R G G LQ Q K+ F P S S +HR +T
Sbjct: 387 SMSLHLGNEVFDVYKAPLQGDHNHLFIRQGTG-LQGQAVFKTKLTFRPHSTDSATHRKMT 445
Query: 324 ALVDSRHKKVYKVKNC-ITDIDPEREKEEKERAESQNIRANVLLNRKREKINRKYTQTVE 382
+ R K K++ + DPE ++ E + E + +RA++ +RE R+ + +
Sbjct: 446 LSLADRCSKTQKIRILPMAGRDPECQRTEMIKKEEERLRASI----RRESQQRRMREK-Q 500
Query: 383 RRRQLSTGYLE 393
+R LS YLE
Sbjct: 501 HQRGLSASYLE 511
>gi|444514635|gb|ELV10620.1| RNA polymerase-associated protein LEO1 [Tupaia chinensis]
Length = 606
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 68/131 (51%), Gaps = 7/131 (5%)
Query: 264 SLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRFIPSSLSSNSHRLLT 323
S+ L +GNEV D+ Q D HLF+R G G LQ Q K+ F P S S +HR +T
Sbjct: 387 SMSLHLGNEVFDVYKAPLQGDHNHLFIRQGTG-LQGQAVFKTKLTFRPHSTDSATHRKMT 445
Query: 324 ALVDSRHKKVYKVKNC-ITDIDPEREKEEKERAESQNIRANVLLNRKREKINRKYTQTVE 382
+ R K K++ + DPE ++ E + E + +RA++ +RE R+ + +
Sbjct: 446 LSLADRCSKTQKIRILPMAGRDPECQRTEMIKKEEERLRASI----RRESQQRRMREK-Q 500
Query: 383 RRRQLSTGYLE 393
+R LS YLE
Sbjct: 501 HQRGLSASYLE 511
>gi|345329262|ref|XP_001512640.2| PREDICTED: RNA polymerase-associated protein LEO1-like, partial
[Ornithorhynchus anatinus]
Length = 363
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 59/218 (27%), Positives = 104/218 (47%), Gaps = 14/218 (6%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTY-VEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGMTS 250
+ +K+ N + ++P+PFDP+ Y E + + + R++L+ + + V +
Sbjct: 66 LYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKVSSLLAFNVFSFLYDLF 125
Query: 251 YESNARFVRWSDGSL-----QLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILK 305
+ + + +GS Q N V D+ Q D HLF+R G G LQ Q
Sbjct: 126 FVMKPQLLITLEGSFCPTQWQFWAHNVVFDVYKAPLQGDHNHLFIRQGTG-LQGQAVFKT 184
Query: 306 KMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEKERAESQNIRANV 364
K+ F P S S +HR +T + R K K++ + DPE ++ E + E + +RA++
Sbjct: 185 KLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTEMIKKEEERLRASI 244
Query: 365 LLNRKREKINRKYTQTVERRRQLSTGYLE-DALEEDDE 401
+RE R+ + + +R LS YLE D +++DE
Sbjct: 245 ----RRESQQRRMREK-QHQRGLSANYLEPDRYDDEDE 277
>gi|426379103|ref|XP_004056244.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase-associated protein
LEO1-like [Gorilla gorilla gorilla]
Length = 618
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 45/135 (33%), Positives = 69/135 (51%), Gaps = 7/135 (5%)
Query: 260 WSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRFIPSSLSSNSH 319
+ D + L +GNEV D+ Q D HLF+R G G LQ Q K+ F P S S +H
Sbjct: 395 FQDVDMSLHLGNEVFDVYKAPLQGDHNHLFIRQGTG-LQGQAVFKTKLTFRPHSTDSATH 453
Query: 320 RLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEKERAESQNIRANVLLNRKREKINRKYT 378
R +T + R K K++ + DPE ++ E + E + +RA++ +RE R+
Sbjct: 454 RKMTLSLADRCSKTQKIRILPMAGRDPECQRTEMIKKEEERLRASI----RRESQQRRMR 509
Query: 379 QTVERRRQLSTGYLE 393
+ + +R LS YLE
Sbjct: 510 EK-QHQRGLSASYLE 523
>gi|164657031|ref|XP_001729642.1| hypothetical protein MGL_3186 [Malassezia globosa CBS 7966]
gi|159103535|gb|EDP42428.1| hypothetical protein MGL_3186 [Malassezia globosa CBS 7966]
Length = 481
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 45/140 (32%), Positives = 71/140 (50%), Gaps = 6/140 (4%)
Query: 220 VTDESGAKKRIRLENNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQ 279
VT + A+K +R + VRWR + + +SNAR VRW DG+ LQIG+E D+T
Sbjct: 187 VTGDQKARKFLRTMST-VRWRWKEAGPSRVTPQSNARIVRWDDGTESLQIGSEFFDMTRH 245
Query: 280 DAQHDQA----HLFLRHG-KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVY 334
Q ++++ H +G+L+++ + + F P S R+ +A+ R +V
Sbjct: 246 AEPSAQGTPLTYIYVPHAEEGVLEAECPVRTSLSFKPGLHSDTHSRIASAIRHQRGMRVV 305
Query: 335 KVKNCITDIDPEREKEEKER 354
+DPEREKE ER
Sbjct: 306 ASSEMFGTMDPEREKERIER 325
>gi|430814169|emb|CCJ28561.1| unnamed protein product, partial [Pneumocystis jirovecii]
Length = 665
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 61/195 (31%), Positives = 98/195 (50%), Gaps = 20/195 (10%)
Query: 191 KMNMIKVSNIMGIDPKPFDPKTYVEE-------DTFVTDESGAKKRIRLENNIVRWRTVK 243
++ +K+ N + I KPFD ++Y+EE T ++ + R+++EN I RWR VK
Sbjct: 375 QLYYVKMPNFLTIVQKPFDRESYLEEAQSEREETTIHQYDTNQRIRLKVENTI-RWRYVK 433
Query: 244 NKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRH-GKGILQSQGR 302
NKDG + DGSL L +G+E+ +++ + +L L H + +L S+ R
Sbjct: 434 NKDGT-----------YVDGSLSLLLGSELFSAVTKNSFSEHTYLCLSHESQNLLMSRKR 482
Query: 303 ILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAESQNIRA 362
K M F+P S++H+ LT + + K + + DPE+ K E ER E + IR
Sbjct: 483 FTKNMTFLPIDTGSSTHKRLTEAILRGNMKKCSIVEFVNVEDPEKVKREAERIEEEKIRF 542
Query: 363 NVLLNRKREKINRKY 377
L KR + KY
Sbjct: 543 RRRLETKRRAQDAKY 557
>gi|149239759|ref|XP_001525755.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|146449878|gb|EDK44134.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 407
Score = 68.9 bits (167), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 49/171 (28%), Positives = 84/171 (49%), Gaps = 29/171 (16%)
Query: 194 MIKVSNIMGIDPKPFDPKTYVEEDTF--------------VTDESGAKKRIRLENNIVRW 239
++K + I+P+PFDP + +E + + +E A+K I +RW
Sbjct: 98 ILKTPAHLKIEPRPFDPNKFKDEVEYKMQERKEQGLTSKQIYNEQLAEKLIN--ETTIRW 155
Query: 240 RTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDIT---VQDAQHDQAHLFLRHGKG- 295
R + + +SNA FV+W+DGS+ L+IGNE+ D+ V D +L +R K
Sbjct: 156 RYHNSGNDEIIKQSNAHFVQWNDGSVSLKIGNEMFDVKSLPVVD------NLLVRSHKTA 209
Query: 296 -ILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDP 345
LQ+ I + +P+S +S RLL + + + K K+ + T++DP
Sbjct: 210 EFLQTDSIIELNINLLPASTNSIHKRLLQVMKNVQFKD--KILSTTTEVDP 258
>gi|344228420|gb|EGV60306.1| Leo1-like protein [Candida tenuis ATCC 10573]
Length = 367
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 54/182 (29%), Positives = 81/182 (44%), Gaps = 32/182 (17%)
Query: 195 IKVSNIMGIDPKPFDPKTYVEE--------DTFVTDESGAKK---RIRLEN-NIVRWRTV 242
+K+ + +D PFDP + E D + D K R +L N N +RW+
Sbjct: 80 LKMPLFVNVDAHPFDPVDFKENIELNARNRDINIKDSKLKKNDLIREKLTNENTIRWKYS 139
Query: 243 KNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDI-------TVQDAQHDQAHLFLRHGKG 295
D + +SN FV W DGSL L+IGNE+ D + HD
Sbjct: 140 NENDEIIK-QSNCHFVEWDDGSLSLKIGNEMFDFKSLPLYDNILVRSHDDLE-------- 190
Query: 296 ILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDP---EREKEEK 352
ILQ+ + K +P+S +++H+ L + +K K+ N ITD DP +R +E
Sbjct: 191 ILQASTILTKTSNLLPTSTGTSTHKQLANAIKFLQQK-DKILNTITDSDPLEVQRNADEN 249
Query: 353 ER 354
ER
Sbjct: 250 ER 251
>gi|388857081|emb|CCF49296.1| uncharacterized protein [Ustilago hordei]
Length = 633
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 66/260 (25%), Positives = 104/260 (40%), Gaps = 69/260 (26%)
Query: 194 MIKVSNIMGIDPKPFDPKTYVEE-----------DTFVTDESGAKKRIRLENNIVRWRTV 242
+ ++ + + PFD T+ EE ++ ++ GA +R N I RWR
Sbjct: 208 LARIPHFLRYVTTPFDADTWDEEQEEKALLEEGFNSQFGNDVGAASLLRTSNTI-RWRYT 266
Query: 243 K--NKDGMTSYESNARFVRWSDGSLQLQIGNEVLDI------------------TVQDAQ 282
++DG ESNAR VRWSDG++ LQ+G+E+ DI T Q +
Sbjct: 267 DQVDQDGARIPESNARIVRWSDGTMSLQVGSELFDITQQTEKGRVSTATNTGTNTPQMSS 326
Query: 283 HDQA-------------------------------HLFLRHGK-GILQSQGRILKKMRFI 310
QA +L + H + ++++QG I + F
Sbjct: 327 SSQAAGPSSQLLSQATQSMPMGSRGSSNRPAQSLSYLVVPHEREQVMEAQGPIAGSLAFT 386
Query: 311 PSSLSSNSH-RLLTALVDSRHKKVYKVKNCITDIDPEREKEEKER----AESQNIRANVL 365
P+ + S +H R+ AL + +V +DPE EK E+ AE + IR
Sbjct: 387 PADIQSETHRRIAKALRFQKTARVVATAAGEGALDPEVEKARIEKELKDAEKKRIRERQK 446
Query: 366 LNRKREKINRKYTQTVERRR 385
+RK K + RRR
Sbjct: 447 ADRKSGKFDDDMFDIESRRR 466
>gi|367001931|ref|XP_003685700.1| hypothetical protein TPHA_0E01730 [Tetrapisispora phaffii CBS 4417]
gi|357523999|emb|CCE63266.1| hypothetical protein TPHA_0E01730 [Tetrapisispora phaffii CBS 4417]
Length = 469
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 90/324 (27%), Positives = 143/324 (44%), Gaps = 67/324 (20%)
Query: 52 RRREVIESGSERSEENHYPDHEDEEVDQARSPRSPGGEEKDQTHISHSAAEIRDVFGDSD 111
++R V ESG + E N + D AR R E++D+ H ++ R +GD
Sbjct: 76 KKRGVAESGDDDDEHN-------DHDDIARRRRGYMDEDEDEEHAMYT----RKFYGD-- 122
Query: 112 EEDVGEYAIRNDIDQDSNRSPMEEEGSYEKSIRPEDIVPDEDMHYESEEEHVEPKHKEKP 171
EDV +Y DE+MH E +E VE +
Sbjct: 123 -EDVNKYD------------------------------EDENMH-EFKEADVELV---RH 147
Query: 172 VGPPLELEIPLRPPPADPTKMNMIKVSNI---MGIDPKPFDPKTY-------VEEDTFVT 221
+ P LE D T N + I + IDP PFDP + +E T V
Sbjct: 148 IVPYKILEQKNDNSKDDATSKNSTYYARIPPFLTIDPVPFDPPQFESEVKERLENTTNVK 207
Query: 222 DESGAKKRIRLENNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDA 281
D G + ++ N +RWR ++ + ESNA+ ++WSDGS L++G++ D+ V D
Sbjct: 208 DRIGDR---LIDENTIRWRYSRDANQTVFKESNAQIIQWSDGSYSLKLGDDYTDLLVNDI 264
Query: 282 QHDQAHLFLRHGKG-ILQS--QGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKV-K 337
+ L + H + ++Q G I K F+P S +S H+ L+ + R + + +
Sbjct: 265 SN--TFLTVSHDQQELMQCVDGGEISKSAMFVPISTNSRLHQRLSKAIARRDQNSTESPR 322
Query: 338 NCITDIDPEREKEEKERAESQNIR 361
+ I ++DPE EK+E E+ +SQ R
Sbjct: 323 SMIINVDPEIEKKELEKKQSQIFR 346
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 46/200 (23%), Positives = 80/200 (40%), Gaps = 70/200 (35%)
Query: 200 IMGIDPKPFDPKTY-------VEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGMTSYE 252
+ IDP PFDP + +E T V D G + ++ N +RWR ++ + E
Sbjct: 179 FLTIDPVPFDPPQFESEVKERLENTTNVKDRIGDR---LIDENTIRWRYSRDANQTVFKE 235
Query: 253 SNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKG-ILQS--QGRILKKMRF 309
SNA+ ++WSDGS L++G++ D+ V D + L + H + ++Q G I K F
Sbjct: 236 SNAQIIQWSDGSYSLKLGDDYTDLLVNDISN--TFLTVSHDQQELMQCVDGGEISKSAMF 293
Query: 310 IPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAESQNIRANVLLNRK 369
+P +S+NS
Sbjct: 294 VP--ISTNS--------------------------------------------------- 300
Query: 370 REKINRKYTQTVERRRQLST 389
+++++ ++ + RR Q ST
Sbjct: 301 --RLHQRLSKAIARRDQNST 318
>gi|452821898|gb|EME28923.1| hypothetical protein Gasu_36600 [Galdieria sulphuraria]
Length = 380
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 59/206 (28%), Positives = 98/206 (47%), Gaps = 18/206 (8%)
Query: 191 KMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGMTS 250
++ + K+ + +G++P F +E + +ES + + ++ +R+R V N
Sbjct: 95 QIYLAKLPSTVGVEPFAFQE---WKEKSREQNESDSGRAALVK---IRYRKVGN-----D 143
Query: 251 YESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRFI 310
+ESN R V+W DGS L IGNE D+ QD + R ++ RI +K++
Sbjct: 144 WESNTRLVKWEDGSSSLFIGNECFDVIEQDISEESQLFLFRRQPTAQAAECRISRKLKIQ 203
Query: 311 PSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAESQNIRANVLLNRKR 370
P+S+SS R L + K KVK D PE E+++ + E + RA L KR
Sbjct: 204 PASISSGIQRKLGTY--EKFAKERKVKMAEMDRAPELEEQKLAQLEHERARAQARLEAKR 261
Query: 371 EKINRKYTQTVERRRQ--LSTGYLED 394
RK +E R +ST +L++
Sbjct: 262 ---RRKEQLVMENWRSSGVSTEFLDN 284
>gi|343426257|emb|CBQ69788.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
Length = 628
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 67/260 (25%), Positives = 100/260 (38%), Gaps = 69/260 (26%)
Query: 194 MIKVSNIMGIDPKPFDPKTY--VEEDTFVTDES---------GAKKRIRLENNIVRWRTV 242
+ ++ + + PFD T+ +E+ + DE GA +R N I RWR
Sbjct: 202 LARIPHFLRYVTTPFDADTWDDEQEEKMLLDEGFNSGFGNDVGAASLLRTSNTI-RWRYT 260
Query: 243 K--NKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQD-------------------- 280
++DG ESNAR VRWSDG++ LQ+G+E+ DIT Q
Sbjct: 261 DQVDEDGAKVPESNARIVRWSDGTMSLQVGSELFDITQQTEKGRVSTVNTGTNTPQQIGS 320
Query: 281 ------------AQHDQAHLFLRHGKG------------------ILQSQGRILKKMRFI 310
+Q Q+ G +++SQG I + F
Sbjct: 321 SSQAAGPSSQLLSQATQSMPMGSSGSANRPAQSLSYLVVPHEREQVMESQGAIAGSLAFT 380
Query: 311 PSSLSSNSH-RLLTALVDSRHKKVYKVKNCITDIDPEREKEEKER----AESQNIRANVL 365
P+ S +H R+ AL + +V +DPE EK E+ AE + IR
Sbjct: 381 PADTQSETHRRIAKALRFQKTARVVATAAGAGALDPEVEKARIEKELKDAEKKRIRERQK 440
Query: 366 LNRKREKINRKYTQTVERRR 385
+RK K + RRR
Sbjct: 441 EDRKSGKFDDDMFDVDARRR 460
>gi|71021493|ref|XP_760977.1| hypothetical protein UM04830.1 [Ustilago maydis 521]
gi|46101052|gb|EAK86285.1| hypothetical protein UM04830.1 [Ustilago maydis 521]
Length = 617
Score = 65.9 bits (159), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 67/259 (25%), Positives = 104/259 (40%), Gaps = 68/259 (26%)
Query: 194 MIKVSNIMGIDPKPFDPKTYVEE-----------DTFVTDESGAKKRIRLENNIVRWRTV 242
+ ++ + + PFD ++ EE ++ ++ GA +R N I RWR
Sbjct: 191 LARIPHFLRYVTTPFDADSWDEEQEEKMLLEEGFNSGFGNDVGAASLLRTSNTI-RWRYT 249
Query: 243 K--NKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQD-------------------- 280
++DG ESNAR VRWSDG++ LQ+G+E+ DIT Q
Sbjct: 250 DQVDEDGARVPESNARIVRWSDGTMSLQVGSELFDITQQTEKGRVSTANTGTSTPQPGTS 309
Query: 281 -----------AQHDQ-----------------AHLFLRHGK-GILQSQGRILKKMRFIP 311
+Q Q ++L + H + I++SQG I + F P
Sbjct: 310 SQAAGSSSRLLSQATQSMPMGSSGSSDRPAQSLSYLVVPHEREQIMESQGPIAGTLAFTP 369
Query: 312 SSLSSNSH-RLLTALVDSRHKKVYKVKNCITDIDPEREKEEKER----AESQNIRANVLL 366
+ S +H R+ AL + +V +DPE EK E+ AE + IR
Sbjct: 370 ADTQSETHRRIAKALRFQKTARVVATAAGAGALDPEVEKARIEKELKDAEKKRIRERQKA 429
Query: 367 NRKREKINRKYTQTVERRR 385
+RK K + RRR
Sbjct: 430 DRKSGKFDDDTFDVDARRR 448
>gi|24648355|ref|NP_650866.1| CG10887 [Drosophila melanogaster]
gi|23171779|gb|AAF55743.2| CG10887 [Drosophila melanogaster]
gi|327478456|gb|AEA77317.1| MIP29958p [Drosophila melanogaster]
gi|332000067|gb|AED98582.1| MIP29558p [Drosophila melanogaster]
Length = 676
Score = 65.1 bits (157), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 41/120 (34%), Positives = 61/120 (50%), Gaps = 7/120 (5%)
Query: 194 MIKVSNIMGIDPKPFDPKTYVEEDTF--VTDESGAKKRIRLENNIVRWRTVKNKDGMTSY 251
+K+ + ++ K + P+T+ + T + DE + I VRWR +NK +
Sbjct: 396 FLKMPYFIPVESKAYVPETFQDRMTKNDLKDEQSREDFINRLKATVRWRENENK----TK 451
Query: 252 ESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRFIP 311
ESNA+ VRWSDGS +GNEV D+ +Q HL++R G Q QG I K+ P
Sbjct: 452 ESNAKIVRWSDGSETFHVGNEVFDMMHHPVTVNQNHLYVRLGS-FYQPQGLIQNKLTVRP 510
>gi|428182196|gb|EKX51057.1| hypothetical protein GUITHDRAFT_102981 [Guillardia theta CCMP2712]
Length = 611
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 73/264 (27%), Positives = 117/264 (44%), Gaps = 34/264 (12%)
Query: 176 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENN 235
L +P P P D + ++VS + +P F VE + E ++ + N
Sbjct: 303 LHATMPELPRPDDGAVLYYMRVSKHLQFEPYAFGS---VESEEVRIRE----RKDNVSEN 355
Query: 236 IVRWRTVKNKDG----MTSYESNARFVRWSDGSLQLQIGNEVLDIT------VQDAQHDQ 285
+VRWR+V + DG + ESNARFV+WSDGSL L +G VL + V ++
Sbjct: 356 MVRWRSVAS-DGDEKREAAKESNARFVQWSDGSLTLHVGETVLQVKKEPLKQVGLVRNIF 414
Query: 286 AHLFLRHG---KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCI-T 341
AHL I+++ G + ++ +P + S+ + ++ +L + KV K I
Sbjct: 415 AHLTTTETDERLAIVEAHGVMQNRLVCVPFTKSAATKQIFRSLSKMANVKVDKSDLLIKK 474
Query: 342 DIDPEREKEEKERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLEDALEEDDE 401
ID +K + E+ IR +RE +N RRR + E+ LE +D
Sbjct: 475 QIDENEQKALLGQKETMRIRQ----QSRREGLN--------RRRGANAEMTEEFLESEDV 522
Query: 402 TDYHDSRRSRRRFEEELEAEVRAE 425
+ RS +R EA RA+
Sbjct: 523 GGMVRAFRSNKRKATNPEALQRAK 546
>gi|195389188|ref|XP_002053259.1| GJ23439 [Drosophila virilis]
gi|194151345|gb|EDW66779.1| GJ23439 [Drosophila virilis]
Length = 436
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 45/153 (29%), Positives = 67/153 (43%), Gaps = 12/153 (7%)
Query: 194 MIKVSNIMGIDPKPFDPKTYVEEDTF----VTDESGAKKRIRLENNIVRWRTVKNKDGMT 249
++++ + M ++PK ++ T+ +D + D I VRWR ++K
Sbjct: 126 VLRMPHFMPVEPKAYEAHTF--QDALRPEDLNDREARDAFITKLMTTVRWRECQDKTTGA 183
Query: 250 SY-ESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMR 308
Y ESNAR VRWSDGS +G E D+ Q L++R G QG I K+
Sbjct: 184 LYKESNARIVRWSDGSETFHVGGEAFDVVNHPMPAGQNQLYVRQG-SYYHMQGHIKDKLS 242
Query: 309 FIPSSLSSNSHRLLTALVDSRHKKVYK-VKNCI 340
P SS + L R YK V +C+
Sbjct: 243 LRPKLESSFGQSHVQGL---RKLAFYKPVNSCV 272
>gi|443900071|dbj|GAC77398.1| uncharacterized conserved protein [Pseudozyma antarctica T-34]
Length = 614
Score = 63.5 bits (153), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 53/100 (53%), Gaps = 16/100 (16%)
Query: 194 MIKVSNIMGIDPKPFDPKTYVEE------------DTFVTDESGAKKRIRLENNIVRWRT 241
+ ++ + + PF+P ++ EE F DE GA +R N I RWR
Sbjct: 198 LARIPHFLRYVTTPFEPDSWDEEQEEDMLRDEGFNSGFAHDE-GATSLLRTSNTI-RWRY 255
Query: 242 VK--NKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQ 279
++DG ESNAR VRWSDG++ LQ+G+E+ DIT Q
Sbjct: 256 TDELDQDGARVPESNARIVRWSDGTMSLQVGSELFDITQQ 295
>gi|294659658|ref|XP_462057.2| DEHA2G12100p [Debaryomyces hansenii CBS767]
gi|199434131|emb|CAG90543.2| DEHA2G12100p [Debaryomyces hansenii CBS767]
Length = 392
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 67/264 (25%), Positives = 114/264 (43%), Gaps = 58/264 (21%)
Query: 102 EIRDVFGDSDEEDV-GEYAIRNDIDQDSNRSPMEEEGSYEKSIRPEDIVPDEDMHYESEE 160
E+ D+FG+ DE+ + D + + + +++ E ++ DI
Sbjct: 17 EVDDLFGEDDEQQTETQMENGEDNEDEDDTDGNDDDDKEEHELKTLDIAL---------- 66
Query: 161 EHVEPKH--KEKPVGPPLELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTY---VE 215
P+H KP L++P+ + ++ PFDP + V
Sbjct: 67 ----PRHAVAHKPEEDTYTLKMPV-----------------FLNVEAHPFDPTEFKEKVG 105
Query: 216 EDTF-----------VTDESGAKKRIRLENNIVRWRTVKNKDGMTSYESNARFVRWSDGS 264
++ V ++ A+K L N +RWR + + +SNA FV+W+DGS
Sbjct: 106 QNAIERKNSSMNAKQVQNDLNAEK--LLNENTIRWRYTNSGNDEIIKQSNAHFVQWNDGS 163
Query: 265 LQLQIGNEVLD---ITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRFIPSSLSSNSHRL 321
+ L+IG+E+ D + V D + H ILQ+ I K + +PSS +++H+
Sbjct: 164 ISLKIGSEMFDYKELPVYDNFLVKTH----DDYEILQNDSIINKSVNLLPSSAFTSTHKK 219
Query: 322 LTALVDSRHKKVYKVKNCITDIDP 345
LT V + KK K+ N ITD DP
Sbjct: 220 LTEAVKNIQKKD-KILNTITDNDP 242
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 66/280 (23%), Positives = 113/280 (40%), Gaps = 87/280 (31%)
Query: 195 IKVSNIMGIDPKPFDPKTY---VEEDTF-----------VTDESGAKKRIRLENNIVRWR 240
+K+ + ++ PFDP + V ++ V ++ A+K L N +RWR
Sbjct: 82 LKMPVFLNVEAHPFDPTEFKEKVGQNAIERKNSSMNAKQVQNDLNAEKL--LNENTIRWR 139
Query: 241 TVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLD---ITVQDAQHDQAHLFLRHGKGIL 297
+ + +SNA FV+W+DGS+ L+IG+E+ D + V D + H IL
Sbjct: 140 YTNSGNDEIIKQSNAHFVQWNDGSISLKIGSEMFDYKELPVYDNFLVKTH----DDYEIL 195
Query: 298 QSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAES 357
Q+ I K + +PSS +++H+ LT V + KK K+ N ITD DP
Sbjct: 196 QNDSIINKSVNLLPSSAFTSTHKKLTEAVKNIQKKD-KILNTITDNDP------------ 242
Query: 358 QNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLEDALEEDDETDYHDSRRSRRRFEEE 417
LL ++ N K + ++R+ ++ +RR +EE
Sbjct: 243 -------LLKQRMADENEKKSLKMKRQMEM-----------------------KRRLQEE 272
Query: 418 LEAEVRAEKRIINAKKPQGHRDIPRKSSTLPAAKSSRRPV 457
+ R +S +P ++S P
Sbjct: 273 ---------------------RLERTNSPVPGMRNSYEPA 291
>gi|195498138|ref|XP_002096397.1| GE25650 [Drosophila yakuba]
gi|194182498|gb|EDW96109.1| GE25650 [Drosophila yakuba]
Length = 373
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 50/173 (28%), Positives = 82/173 (47%), Gaps = 10/173 (5%)
Query: 194 MIKVSNIMGIDPKPFDPKTYVEEDTF--VTDESGAKKRIRLENNIVRWRTVKNKDGMTSY 251
+K+ + + ++ KP+ P+T+ T + DE + + VRWR +NK
Sbjct: 96 FLKMPSFIPVESKPYVPQTFENRMTKDDLKDEQAREDFLNRLRATVRWREDENK----VK 151
Query: 252 ESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRFIP 311
ESN++ VRWSDGS +G+EV D+ +Q HL++R Q QG I KM P
Sbjct: 152 ESNSKIVRWSDGSETFHVGDEVFDVMHHPVTDNQNHLYVRLAS-CYQPQGTIKDKMTLRP 210
Query: 312 SSLSSNSHRLLTALVD-SRHKKVYKVKNCITDI--DPEREKEEKERAESQNIR 361
SS + L + + +K I D+ +PE+++E + E +R
Sbjct: 211 MLDSSFGQSHVQGLRNRTTNKPQQGCVKVIMDLGSNPEQDQEWLMKQELAKLR 263
>gi|195354052|ref|XP_002043515.1| GM23097 [Drosophila sechellia]
gi|194127656|gb|EDW49699.1| GM23097 [Drosophila sechellia]
Length = 566
Score = 62.8 bits (151), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 40/120 (33%), Positives = 60/120 (50%), Gaps = 7/120 (5%)
Query: 194 MIKVSNIMGIDPKPFDPKTYVEEDT--FVTDESGAKKRIRLENNIVRWRTVKNKDGMTSY 251
+K+ + ++ + F P+T+ ++ T + DE I VRWR +NK +
Sbjct: 296 FLKMPYFIPVESEAFVPETFQDQMTEDDLKDEQSRNYFINRLKATVRWRENENK----TK 351
Query: 252 ESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRFIP 311
ESNA+ VRWSDGS +GNEV D+ +Q HL++R Q QG I K+ P
Sbjct: 352 ESNAKIVRWSDGSQTFHVGNEVFDMMQHPVTVNQNHLYVRL-DSCYQPQGLIKNKLTVRP 410
>gi|195569514|ref|XP_002102754.1| GD19338 [Drosophila simulans]
gi|194198681|gb|EDX12257.1| GD19338 [Drosophila simulans]
Length = 662
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 39/120 (32%), Positives = 60/120 (50%), Gaps = 7/120 (5%)
Query: 194 MIKVSNIMGIDPKPFDPKTYVEEDT--FVTDESGAKKRIRLENNIVRWRTVKNKDGMTSY 251
+K+ + ++ + + P+T+ ++ T + DE I VRWR +NK +
Sbjct: 385 FLKMPYFIPVESEAYVPETFQDQMTEDDLKDEQSRNDFINRLKATVRWRENENK----TK 440
Query: 252 ESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRFIP 311
ESNA+ VRWSDGS +GNEV D+ +Q HL++R Q QG I K+ P
Sbjct: 441 ESNAKIVRWSDGSQTFHVGNEVFDMMQHPVTVNQNHLYVRL-DSCYQPQGLIKNKLTVRP 499
>gi|195450052|ref|XP_002072343.1| GK22791 [Drosophila willistoni]
gi|194168428|gb|EDW83329.1| GK22791 [Drosophila willistoni]
Length = 537
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 39/120 (32%), Positives = 55/120 (45%), Gaps = 3/120 (2%)
Query: 194 MIKVSNIMGIDPKPFDPKTYVEEDTF--VTDESGAKKRIRLENNIVRWRTVKNKDGMTSY 251
+++ + I+P ++ KTY + + + E + + VRWR +N G
Sbjct: 187 FLRMPIFIPIEPIAYNDKTYTSDMSAEDLKSEQSRENFVSRLKTTVRWRENQNDLGKLCK 246
Query: 252 ESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRFIP 311
ESNAR VRWSDGS +G EV DI DQ L++R Q Q I KM P
Sbjct: 247 ESNARIVRWSDGSETFHVGAEVFDIMHHPVPDDQNQLYVRL-DDFYQVQTSIKDKMTLRP 305
>gi|27374395|gb|AAO01131.1| CG10887-PA [Drosophila willistoni]
Length = 555
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 39/120 (32%), Positives = 55/120 (45%), Gaps = 3/120 (2%)
Query: 194 MIKVSNIMGIDPKPFDPKTYVEEDTF--VTDESGAKKRIRLENNIVRWRTVKNKDGMTSY 251
+++ + I+P ++ KTY + + + E + + VRWR +N G
Sbjct: 205 FLRMPIFIPIEPIAYNDKTYTSDMSAEDLKSEQSRENFVSRLKTTVRWRENQNDLGKLCK 264
Query: 252 ESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRFIP 311
ESNAR VRWSDGS +G EV DI DQ L++R Q Q I KM P
Sbjct: 265 ESNARIVRWSDGSETFHVGAEVFDIMHHPVPDDQNQLYVRL-DDFYQVQTSIKDKMTLRP 323
>gi|401887301|gb|EJT51291.1| hypothetical protein A1Q1_07472 [Trichosporon asahii var. asahii
CBS 2479]
gi|406696369|gb|EKC99660.1| hypothetical protein A1Q2_06079 [Trichosporon asahii var. asahii
CBS 8904]
Length = 402
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 42/153 (27%), Positives = 68/153 (44%), Gaps = 22/153 (14%)
Query: 226 AKKRIRLENNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDIT-------- 277
AK ++ N +RW+ V G SNAR +RWSDGS+ LQ+GN++ D+
Sbjct: 108 AKSKMIGVRNTIRWKWVNGPHG-PERRSNARMLRWSDGSVSLQLGNDLFDLAPSYGSTLA 166
Query: 278 -----------VQDAQHDQAHLFLRHGKGILQSQGRILKKMRFIPSSLSSNSHRLLTALV 326
+ H + +L ++ I ++ +P+S+ S +HR L V
Sbjct: 167 RPQDAAPPAAAEAGNAETSFLCYTSHHEQVLVAETAIAGQVNLVPTSMDSKTHRELVNHV 226
Query: 327 DSRHKKVYKVKNCITDI-DPEREKEEKERAESQ 358
+H K ++K + DI D + E RA Q
Sbjct: 227 GQQHTKHSRMK-ILDDIGDSSKVAELLARAAPQ 258
>gi|76155512|gb|AAX26803.2| SJCHGC09013 protein [Schistosoma japonicum]
Length = 193
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 41/111 (36%), Positives = 58/111 (52%), Gaps = 6/111 (5%)
Query: 267 LQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRFIPSSLSSNSHRLLTALV 326
L +G+E+ DI D Q D +LF+R G G LQ Q + K+ F P S S +HR +T +
Sbjct: 2 LHLGDEIFDIHKVDIQSDYNYLFIREGSG-LQGQSSLKTKLTFRPHSTDSFTHRKITLSL 60
Query: 327 DSRHKKVYKVKNC-ITDIDPEREKEEKERAESQNIRANV----LLNRKREK 372
+ K+ KVK +T DPE + R E + +RA + L R REK
Sbjct: 61 ADKTNKLQKVKILPVTGADPESNRNMLVRKEEEKLRATLRRESQLRRMREK 111
>gi|223999609|ref|XP_002289477.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220974685|gb|EED93014.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 586
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 40/122 (32%), Positives = 69/122 (56%), Gaps = 22/122 (18%)
Query: 169 EKPV-GPPLELEIPLRPPPADP---TKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDES 224
+KPV PP ++E+ L D TK+++ K+ N++GI+ ++P+T+ ED +E
Sbjct: 186 DKPVYAPPRKMEL-LNLATTDGNHQTKIHITKLPNLVGINTAAYNPETHNLED----EEE 240
Query: 225 GAKKRIRLENNIVRWR----------TVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVL 274
+ + +N++RWR V++++G ESN R V+WSDGS L +G EVL
Sbjct: 241 YYRGYV---HNMIRWRYKNKPEDETELVRDENGHPIRESNTRLVKWSDGSYTLHVGQEVL 297
Query: 275 DI 276
++
Sbjct: 298 EV 299
Score = 45.4 bits (106), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 33/54 (61%)
Query: 296 ILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREK 349
+L+ G I+ + PSSL+S +HR LT V R+ K ++ +TD DPE+EK
Sbjct: 380 VLECLGPIVSRFAPRPSSLASEAHRNLTLAVRQRNVKRARIAEIVTDFDPEKEK 433
>gi|294890835|ref|XP_002773338.1| RNA polymerase-associated protein LEO1, putative [Perkinsus marinus
ATCC 50983]
gi|239878390|gb|EER05154.1| RNA polymerase-associated protein LEO1, putative [Perkinsus marinus
ATCC 50983]
Length = 487
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 62/235 (26%), Positives = 97/235 (41%), Gaps = 52/235 (22%)
Query: 191 KMNMIKVS-NIMGIDPKPFDPKTYVEEDTFVTDESGAKKRI-----------RLENNIVR 238
+ +++K+ + GIDP PF DTF S A ++ R EN+I R
Sbjct: 231 RCHIVKLMPTVTGIDPIPF------SRDTFPMLHSAALEKCPDEKSRAAMTSRAENSI-R 283
Query: 239 WRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG----- 293
WR + D ESNAR V W+DGSL L +G E + +D ++ ++F + G
Sbjct: 284 WRFKEGTDN-KEIESNARLVEWADGSLTLMVGKESFKVLKRD---ERVYIFDKGGSNPPG 339
Query: 294 ---KGI--------LQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHK---KVYKVKNC 339
G+ + G ++ P SL S +H+ L V H K K K
Sbjct: 340 FTANGVPVLGSITDMSCHGLCTSQLTVTPGSLESRTHKNLARWVIRPHGDSLKGQKKKTS 399
Query: 340 ITDIDPEREKEEKERAESQNIR-ANVLLNRKREKINRKYTQTVERRRQLSTGYLE 393
+T D + K ++ R A +R+R+ + RR +S +LE
Sbjct: 400 LTTTDEVLAAQAKAEQSLEDSRFALSAGSRRRDMLG---------RRGMSEAFLE 445
>gi|403374095|gb|EJY86984.1| PAF1 complex protein Leo1 [Oxytricha trifallax]
Length = 350
Score = 58.5 bits (140), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 36/142 (25%), Positives = 68/142 (47%), Gaps = 28/142 (19%)
Query: 195 IKVSNIMGIDPKPFDPKTYVEED--TFVTDESGAKKRIRLENNIVRWRTVKNKDGMTSY- 251
+K+ + + I +DP +Y ED + D+ ++++ +I+RWR V+++D + +
Sbjct: 32 LKLPSCLRIQSVAYDPSSYQVEDPIVYTNDKGIVQEKVCAVEDIIRWRYVEDEDNQSQFG 91
Query: 252 ----------------------ESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLF 289
ESNA+FV+WSDG+ L IG+E +I +++ + Q +
Sbjct: 92 KDITIPQLDKKLEIDQSDHRRIESNAKFVQWSDGTYSLAIGDEFFEINMENLTNRQCY-- 149
Query: 290 LRHGKGILQSQGRILKKMRFIP 311
+ L +G + KKM P
Sbjct: 150 -SQFEKFLLYKGDVNKKMIVKP 170
>gi|402584892|gb|EJW78833.1| hypothetical protein WUBG_10258 [Wuchereria bancrofti]
Length = 189
Score = 58.5 bits (140), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 41/100 (41%), Positives = 55/100 (55%), Gaps = 5/100 (5%)
Query: 265 LQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRFIPSSLSSNSHRLLTA 324
+ L +G E+ DITVQ Q D HLFLR G G LQ +K+ F P S S +HR +T
Sbjct: 1 MSLYLGGEIFDITVQPMQLDN-HLFLRQGAG-LQGHAVFKEKLVFRPHSTDSITHRKVTL 58
Query: 325 LVDSRHKKVYKVKNCITDI--DPEREKEEKERAESQNIRA 362
+ R K KVK +T + +PE +K E R E + +RA
Sbjct: 59 SMADRSNKSQKVK-VLTAVGSNPESKKAEIVRKEEERMRA 97
>gi|194741416|ref|XP_001953185.1| GF17638 [Drosophila ananassae]
gi|190626244|gb|EDV41768.1| GF17638 [Drosophila ananassae]
Length = 481
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 39/140 (27%), Positives = 65/140 (46%), Gaps = 9/140 (6%)
Query: 176 LELEIPLRPPPADPTK--MNMIKVSNIMGIDPKPFDPKTYVEEDTF--VTDESGAKKRIR 231
++ ++ L P D K +++ + ++ P+D + Y ++ T + DE +
Sbjct: 160 VKFKVTLPSPANDLLKGPQRFLRMPQFIPVESGPYDAQKYEDQMTPGDLKDEQARGDFVN 219
Query: 232 LENNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLR 291
VRWR K++ ESNAR +RWSDGS +G E D+ DQ +++R
Sbjct: 220 RMKTTVRWRIGKDQ----VKESNARLIRWSDGSETFHVGEEAFDVMHHPVTDDQNQMYVR 275
Query: 292 HGKGILQSQGRILKKMRFIP 311
+ Q+QG I KM P
Sbjct: 276 L-ESCYQTQGAITDKMTLRP 294
>gi|146101344|ref|XP_001469092.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|134073461|emb|CAM72192.1| conserved hypothetical protein [Leishmania infantum JPCM5]
Length = 554
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 57/221 (25%), Positives = 97/221 (43%), Gaps = 48/221 (21%)
Query: 70 PDHEDEEVDQARSPRSPGGEEKDQTHISHSAAEIRDVFGDSD-EEDVGEY--------AI 120
P + EE+D+A + + PG D I H D+FG + E+DV + A+
Sbjct: 49 PIFQVEELDRA-AAKDPG--LIDTRSILH------DLFGGAIREDDVHLFQEERHSAEAL 99
Query: 121 RNDIDQDSNRSPMEEEGSYEKSIRPEDIVPDEDMHYESEEEHVEPKHKEK---PVGPPLE 177
++ + + EE Y S + + D ++ + E + + EK P
Sbjct: 100 AREMKMNKDVLVYEEGARYFGSAGADVLRADHNVPFTLLESSLPERWGEKRERAATPSWL 159
Query: 178 LEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKT--YVEEDTFVTDESGAKKRIRLENN 235
L++P+ P + DP+P DP T Y+E + F+ + N
Sbjct: 160 LDMPVIFPN-----------RQTLHADPRPCDPSTCAYLESNKFM---------LYTPTN 199
Query: 236 IVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDI 276
++RW +V +K SN+R VRWSDGS+ + +GN+VL +
Sbjct: 200 VLRW-SVSDK----GVSSNSRIVRWSDGSMTMHVGNDVLTL 235
>gi|149019154|gb|EDL77795.1| rCG25492 [Rattus norvegicus]
Length = 149
Score = 55.8 bits (133), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 47/92 (51%), Gaps = 1/92 (1%)
Query: 242 VKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQG 301
++++G + ESN R V+WSDGS+ L +G EV DI Q + LF+R G L+ Q
Sbjct: 2 CRDQEGSKTKESNTRIVKWSDGSMSLHLGKEVFDIYKAPLQDNLNQLFIREDTG-LRGQA 60
Query: 302 RILKKMRFIPSSLSSNSHRLLTALVDSRHKKV 333
K+ F P +++ +T +R K
Sbjct: 61 IFKSKLTFRPHCTDGATYKKMTLSFGTRSSKT 92
>gi|398023415|ref|XP_003864869.1| hypothetical protein, conserved [Leishmania donovani]
gi|322503105|emb|CBZ38189.1| hypothetical protein, conserved [Leishmania donovani]
Length = 554
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 43/75 (57%), Gaps = 16/75 (21%)
Query: 204 DPKPFDPKT--YVEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGMTSYESNARFVRWS 261
DP+P DP T Y+E + F+ + N++RW +V +K SN+R VRWS
Sbjct: 175 DPRPCDPSTCAYLESNKFM---------LYTPTNVLRW-SVSDK----GVSSNSRIVRWS 220
Query: 262 DGSLQLQIGNEVLDI 276
DGS+ + +GN+VL +
Sbjct: 221 DGSMTMHVGNDVLTL 235
>gi|401429598|ref|XP_003879281.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322495531|emb|CBZ30836.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 555
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 55/221 (24%), Positives = 95/221 (42%), Gaps = 48/221 (21%)
Query: 70 PDHEDEEVDQARSPRSPGGEEKDQTHISHSAAEIRDVFGDSDEED---------VGEYAI 120
P + EE+D+A + + PG D I H D+FG + ED G +
Sbjct: 50 PIFQVEELDRA-TAKDPG--LIDTRSILH------DLFGGAIREDDVHLFQEERYGVEML 100
Query: 121 RNDIDQDSNRSPMEEEGSYEKSIRPEDIVPDEDMHYESEEEHVEPKH---KEKPVGPPLE 177
++ + + EE Y S + + D+++ + E + + +E+ P
Sbjct: 101 AREMKMNKDVLVYEEGARYFGSAGADVLRADQNVPFTLLESSLPERWDEKRERTATPSWL 160
Query: 178 LEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKT--YVEEDTFVTDESGAKKRIRLENN 235
L++P+ P + DP+P DP T Y+E + F+ + N
Sbjct: 161 LDMPVIFPN-----------RQTLHADPRPCDPSTCAYLESNKFM---------LYTPTN 200
Query: 236 IVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDI 276
++RW V +K SN+R VRWSDGS+ + +GN+VL +
Sbjct: 201 VLRW-NVSDK----GVSSNSRIVRWSDGSMTMHVGNDVLTL 236
>gi|27374250|gb|AAO01008.1| CG10887-PA [Drosophila erecta]
Length = 693
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 50/179 (27%), Positives = 85/179 (47%), Gaps = 14/179 (7%)
Query: 189 PTKMNMIKVSNIMGIDPKPFDPKTY---VEEDTFVTDESGAKKRIRLENNIVRWRTVKNK 245
PT+ +K+ + ++ K ++P+ + +D D++ RL+ VRWR +NK
Sbjct: 412 PTRF--LKMPYFIPVESKAYEPQNLENRMTKDDLKDDQARVDFINRLKA-TVRWREDENK 468
Query: 246 DGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILK 305
ESN++ VRWSDGS +G+EV D+ Q H ++R Q +G I
Sbjct: 469 ----IKESNSKIVRWSDGSETFHVGDEVFDVMHHPVNDSQNHFYVRLA-SCYQHKGSIKD 523
Query: 306 KMRFIPSSLSSNSHRLLTALVD-SRHKKVYKVKNCITDID--PEREKEEKERAESQNIR 361
KM P SS + L + + +K I D+D PE+++E+ ++ E +R
Sbjct: 524 KMTLRPMLDSSFGQSHVQGLRNRATNKPQTGFVKVIMDMDSNPEQDQEKLKKQELAKLR 582
>gi|312065099|ref|XP_003135625.1| hypothetical protein LOAG_00036 [Loa loa]
Length = 510
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 48/176 (27%), Positives = 74/176 (42%), Gaps = 47/176 (26%)
Query: 192 MNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG---AKKRIRLENNIVRWRTVKNKDGM 248
+ +K N + I+P+PFD Y +E A+ ++R+EN I RWR V + G
Sbjct: 275 LYFVKFPNFLSIEPRPFDQDHYEDEFDEDDLLDEEGRARLKLRVENTI-RWRYVLDDAGN 333
Query: 249 TSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMR 308
+ N++ VRWSDG+ +K+
Sbjct: 334 IQKQGNSKVVRWSDGT----------------------------------------EKLI 353
Query: 309 FIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDI--DPEREKEEKERAESQNIRA 362
F P S S +HR +T + R K KVK +T + +PE +K E R E + +RA
Sbjct: 354 FRPHSTDSITHRKVTLSMADRSNKSQKVK-VLTAVGSNPESKKAEIVRKEEERMRA 408
>gi|296803665|ref|XP_002842685.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
gi|238846035|gb|EEQ35697.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
Length = 451
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 63/236 (26%), Positives = 101/236 (42%), Gaps = 55/236 (23%)
Query: 184 PPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVRWRTVK 243
P P D N + N + I+ + F+P+TYV + + WRT
Sbjct: 101 PEPTDGQVFN-CAIPNFLSIESEDFNPETYV------------APPFSSASTSLCWRT-- 145
Query: 244 NKDGMTSYESNARFVRWSDGSLQLQIGN---EVLDITVQD-AQHDQ------------AH 287
+ DG + +SNAR +RWSDGSL LQ+ + E I+ + AQ + AH
Sbjct: 146 SPDG-SELQSNARIIRWSDGSLTLQLASNPTEQYRISSKPLAQASRPGKREEYDPDLDAH 204
Query: 288 LFL---RHGKGILQSQGRILKKMRFIPSSLSSNS--HRLLTALVDS--RHKKVYKVKNCI 340
+L +++ RI + +PS++ ++ RL +L + KK I
Sbjct: 205 TYLGVASEASSVIRLTSRITSSLSILPSTVETDDAVQRLQESLAAAARSGKKTADGSVAI 264
Query: 341 TDI--DPEREKEEKERAESQNIRANVLLNRKREKINRKYTQTVER---RRQLSTGY 391
++ DPE K++ E+AE + +R RK VER R + STG+
Sbjct: 265 IEVKQDPELAKKQAEQAEREKLRE-----------ARKRQAAVEREVDRGRRSTGF 309
>gi|396462992|ref|XP_003836107.1| hypothetical protein LEMA_P054480.1 [Leptosphaeria maculans JN3]
gi|312212659|emb|CBX92742.1| hypothetical protein LEMA_P054480.1 [Leptosphaeria maculans JN3]
Length = 460
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 33/99 (33%), Positives = 50/99 (50%), Gaps = 9/99 (9%)
Query: 184 PPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENN---IVRWR 240
P P D ++ ++KV + I+P F+ KT+ T + A + N +RWR
Sbjct: 109 PEPTD-NELYLLKVPRFLSIEPNAFNHKTFQAPTTDHHSKVAASETFSAFNTAMTTIRWR 167
Query: 241 TVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQ 279
++ +SNAR +RWSDGSL LQ+ N D T+Q
Sbjct: 168 --RSPSNNAQLQSNARILRWSDGSLTLQMAN---DPTIQ 201
>gi|167519627|ref|XP_001744153.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163777239|gb|EDQ90856.1| predicted protein [Monosiga brevicollis MX1]
Length = 403
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 38/137 (27%), Positives = 69/137 (50%), Gaps = 4/137 (2%)
Query: 221 TDESGAKKRIRL-ENNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQ 279
D SG + R++L + + +RWR ++ G ESNARFV+W DGS+ L +G+++ D T
Sbjct: 227 VDASG-RIRVKLGDEHTLRWRQIE-MGGQLYNESNARFVQWDDGSVSLVLGDQIYDCTRT 284
Query: 280 DAQHDQAHLFLRHGKGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC 339
+ L+ +G++ ++ +++ F P+ L + +H L V R + K
Sbjct: 285 ALTDNNMQLYEIDRRGVM-ARHVFRERVGFKPAGLRNAAHAQLAREVSKRTHQAQATKLL 343
Query: 340 ITDIDPEREKEEKERAE 356
+D + K E + E
Sbjct: 344 SVQVDHDAAKAEAMKRE 360
>gi|226509152|ref|NP_001142331.1| uncharacterized protein LOC100274501 [Zea mays]
gi|194708248|gb|ACF88208.1| unknown [Zea mays]
Length = 406
Score = 52.8 bits (125), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 60/228 (26%), Positives = 97/228 (42%), Gaps = 34/228 (14%)
Query: 184 PPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVRWRTVK 243
P P D + ++++ M I P+ +DP+T+ T+ A + ++ R R
Sbjct: 80 PKPKD-GALRILRIPKFMKIMPEVYDPETFEP-----TEFDLANAKAEHPKHVARVR--- 130
Query: 244 NKDGMTS-YESNARFVRWSDGSLQLQIGNEVLDIT--------------VQDAQHDQAHL 288
KD T ++SN WSDGS+ + +G E +I V D + A
Sbjct: 131 -KDHSTGEFKSNTNIFHWSDGSVTISVGGEHYEIQRKALAPPPDKPYNEVNDGHYYAAAA 189
Query: 289 FLRHGKGILQSQGRILKKMRFIPS-SLSSNSHRLLTALVDSRHKKVYKVKNCI-TDIDPE 346
L IL + G I ++ P+ ++ ++ LL + K V I T +DPE
Sbjct: 190 ELH--SNILMTVGHITEQYNVRPNKAVGDDALTLLAERMAVASKPVNATDMIIQTTVDPE 247
Query: 347 REKEEKERAESQNIRANVLLNRKREKINRKYTQTVERRR-QLSTGYLE 393
+K++ E AE + ++A R+RE K R R LS G LE
Sbjct: 248 LQKKQAELAEKERMKA----QRRRENAAAKMDGGYGRSRGGLSIGDLE 291
>gi|451848412|gb|EMD61718.1| hypothetical protein COCSADRAFT_192590 [Cochliobolus sativus
ND90Pr]
Length = 470
Score = 52.0 bits (123), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 33/99 (33%), Positives = 51/99 (51%), Gaps = 9/99 (9%)
Query: 184 PPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENN---IVRWR 240
P P D ++ ++KV + + K FD +T+ T + GA + N +RWR
Sbjct: 114 PEPTD-NELYLLKVPRFLSFEDKAFDHRTFQPPTTDHHSKVGASEHFSAYNTAMTTIRWR 172
Query: 241 TVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQ 279
+ + + +SNAR +RWSDGSL LQ+ + D TVQ
Sbjct: 173 RSPSSNAV--LQSNARILRWSDGSLTLQLAS---DPTVQ 206
>gi|295674549|ref|XP_002797820.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
gi|226280470|gb|EEH36036.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
Length = 466
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 70/294 (23%), Positives = 121/294 (41%), Gaps = 64/294 (21%)
Query: 105 DVFGDSDEEDVGEYAIRNDIDQDSNRSPMEEEGSYEKSIRPEDIVPDEDMHYESEEEHVE 164
D+FG EE+ E+ R D++ + ++EG Y++ P D ED Y E +V
Sbjct: 46 DLFGSGSEEEPKEHRRRGLDDEELDSG--DDEGRYDRQGSPMD----EDGIYYPESLNV- 98
Query: 165 PKHKEKPVGPPLELEIPLRPPP-ADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDE 223
++L + P P + ++ + + N + I+ + F+P+TYV
Sbjct: 99 -----------MDLSLSRVPEPESTDGEVYTLAMPNFLAIESEDFNPETYV--------- 138
Query: 224 SGAKKRIRLENNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIG------------- 270
+ + WR N DG T +SNAR +RWSDGSL LQ+
Sbjct: 139 ---APPFNSASTSLCWRYDPN-DGET-LQSNARVIRWSDGSLTLQLASNPKEQYRMPSKR 193
Query: 271 ----NEVLDITVQDAQHDQAHLFL---RHGKGILQSQGRILKKMRFIPSSLSSNS--HRL 321
N V D++ D +H +L + + + + +P+++ ++ RL
Sbjct: 194 LARPNTVRKTADYDSELD-SHAYLGAAAEASSVFRITSHLTSSLSILPTTVETDDAVKRL 252
Query: 322 LTALVDSRHKKVYKVKNCITDI----DPEREKEEKERAESQNIRANVLLNRKRE 371
+L + +T DPE K+ E AE + +RA +RKR+
Sbjct: 253 KESLEAASRGAKRNADGTVTVFDVAEDPELAKKRAELAEREKLRA----DRKRQ 302
>gi|325095493|gb|EGC48803.1| conserved hypothetical protein [Ajellomyces capsulatus H88]
Length = 473
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 51/203 (25%), Positives = 85/203 (41%), Gaps = 45/203 (22%)
Query: 195 IKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGMTSYESN 254
+ + N + I+ + F+P+TYV + + WR N +G T +SN
Sbjct: 117 LTMPNFLAIESEDFNPETYV------------APPFNSASTSLCWRYDPN-NGET-LQSN 162
Query: 255 ARFVRWSDGSLQLQIG-----------------NEVLDITVQDAQHDQAHLFL---RHGK 294
AR VRWSDGSL LQ+ N+ T D++ D AH +L
Sbjct: 163 ARIVRWSDGSLTLQLASNPKEQYRMPSKRLARSNKACKATDYDSELD-AHAYLGAAAEAS 221
Query: 295 GILQSQGRILKKMRFIPSSLSSNS--HRLLTALVDSRHKKVYKVKNCITDI----DPERE 348
+ + + + +P+++ ++ RL +L + V +T DPE
Sbjct: 222 SVFRITSHLTSSLSILPTTVETDDAVQRLKESLEAASRGPKKNVDGTVTMFDVAEDPELA 281
Query: 349 KEEKERAESQNIRANVLLNRKRE 371
K+ E AE + +RA +RKR+
Sbjct: 282 KKRAELAEREKLRA----DRKRQ 300
>gi|342873913|gb|EGU76007.1| hypothetical protein FOXB_13479 [Fusarium oxysporum Fo5176]
Length = 407
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/228 (25%), Positives = 96/228 (42%), Gaps = 34/228 (14%)
Query: 184 PPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVRWRTVK 243
P P D + ++++ M I P+ +DP+T+ T+ A + ++ R R
Sbjct: 80 PKPKD-GALRILRIPKFMKIMPEVYDPETFEP-----TEFDLANAKAEHPKHVARVR--- 130
Query: 244 NKDGMTS-YESNARFVRWSDGSLQLQIGNEVLDIT--------------VQDAQHDQAHL 288
KD T ++SN WSDGS+ + +G E +I V D + A
Sbjct: 131 -KDHSTGEFKSNTNIFHWSDGSVTISVGGEHYEIQRKALAPPPDKPYNEVNDGHYYAAAA 189
Query: 289 FLRHGKGILQSQGRILKKMRFIPS-SLSSNSHRLLTALVDSRHKKVYKVKNCI-TDIDPE 346
L IL + G I ++ P+ ++ ++ LL + K I T +DPE
Sbjct: 190 ELH--SNILMTVGHITEQYNVRPNKAVGDDALTLLAERMAVASKPANATDMIIQTTVDPE 247
Query: 347 REKEEKERAESQNIRANVLLNRKREKINRKYTQTVERRR-QLSTGYLE 393
+K++ E AE + ++A R+RE K R R LS G LE
Sbjct: 248 LQKKQAELAEKERMKA----QRRRENAAAKMDGGYGRSRGGLSIGDLE 291
>gi|407918021|gb|EKG11319.1| Leo1-like protein [Macrophomina phaseolina MS6]
Length = 505
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 48/92 (52%), Gaps = 6/92 (6%)
Query: 184 PPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDT---FVTDESGAKKRIRLENNIVRWR 240
P P+D ++ ++KV + IDPK F + + T T+ S + +RWR
Sbjct: 137 PEPSD-NELYLMKVPRFLAIDPKNFVLEKWQPPTTDHHSKTEPSSTFSPFKTAQTTLRWR 195
Query: 241 TVKNKDGMTSYESNARFVRWSDGSLQLQIGNE 272
+ + +SNAR +RWSDGSL LQ+G++
Sbjct: 196 --HSPSDPSKLQSNARILRWSDGSLTLQLGSD 225
>gi|154288256|ref|XP_001544923.1| predicted protein [Ajellomyces capsulatus NAm1]
gi|150408564|gb|EDN04105.1| predicted protein [Ajellomyces capsulatus NAm1]
Length = 334
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 56/203 (27%), Positives = 90/203 (44%), Gaps = 45/203 (22%)
Query: 195 IKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGMTSYESN 254
+ + N + I+ + F+P+TYV + + WR N +G T +SN
Sbjct: 120 LTMPNFLAIESEDFNPETYV------------APPFNSASTSLCWRYDPN-NGET-LQSN 165
Query: 255 ARFVRWSDGSLQLQIG-----------------NEVLDITVQDAQHDQAHLFL---RHGK 294
AR VRWSDGSL LQ+ N+V T D++ D AH +L
Sbjct: 166 ARIVRWSDGSLTLQLASNPKEQYRMPSKRLARSNKVRKATDYDSELD-AHAYLGAAAEAS 224
Query: 295 GILQSQGRILKKMRFIPSSLSSNS--HRLLTAL-VDSR-HKKVYKVKNCITDI--DPERE 348
+ + + + +P+++ ++ RL +L SR HKK + D+ DPE
Sbjct: 225 SVFRITSHLTSSLSILPTTVETDDAVQRLKESLEAASRGHKKNVDGTVTMFDVAEDPELA 284
Query: 349 KEEKERAESQNIRANVLLNRKRE 371
K+ E AE + +RA +RKR+
Sbjct: 285 KKRAELAEREKLRA----DRKRQ 303
>gi|315039487|ref|XP_003169119.1| hypothetical protein MGYG_08667 [Arthroderma gypseum CBS 118893]
gi|311337540|gb|EFQ96742.1| hypothetical protein MGYG_08667 [Arthroderma gypseum CBS 118893]
Length = 448
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 60/253 (23%), Positives = 108/253 (42%), Gaps = 50/253 (19%)
Query: 191 KMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGMTS 250
++ + N + I+ + F+P+TYV S + WRT + DG
Sbjct: 105 QLFNCAIPNFLSIESEDFNPETYVAPPFSSASTS------------LCWRT--SPDG-GE 149
Query: 251 YESNARFVRWSDGSLQLQ----------IGNEVLDITVQDAQHD------QAHLFL---R 291
+SNAR +RWSDGS+ LQ I ++ L + + + + ++H +L
Sbjct: 150 LQSNARIIRWSDGSMTLQLASNPTEQYRISSKALAQSSRPGKREEYDPDLESHTYLGVAA 209
Query: 292 HGKGILQSQGRILKKMRFIPSSLSSNS--HRLLTALVDSRH--KKVYKVKNCITDI--DP 345
+++ RI + +PS++ ++ RL +L + KK I ++ DP
Sbjct: 210 EASSVIRLTSRITSSLSILPSTVETDDAVQRLQESLAAASRSGKKTADGSVAIIEVKQDP 269
Query: 346 EREKEEKERAESQNIRANVLLNRKREK-----INRKYTQTVERRRQLSTGYLEDALEEDD 400
E K++ E+AE + +R RKR+ ++R T +R + G LE D+
Sbjct: 270 ELAKKQAEQAEREKLRE----ARKRQAAAEREVDRGRRSTTFQRSG-AAGLTIAGLEGDE 324
Query: 401 ETDYHDSRRSRRR 413
+ SR +R
Sbjct: 325 DMGTTKSRAGAKR 337
>gi|327302240|ref|XP_003235812.1| hypothetical protein TERG_02864 [Trichophyton rubrum CBS 118892]
gi|326461154|gb|EGD86607.1| hypothetical protein TERG_02864 [Trichophyton rubrum CBS 118892]
Length = 449
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 47/196 (23%), Positives = 86/196 (43%), Gaps = 40/196 (20%)
Query: 191 KMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGMTS 250
++ + N + I+ + F+P+TYV S + WRT + DG +
Sbjct: 105 QLFNCAIPNFLSIESEDFNPETYVAPPFSSASTS------------LCWRT--SPDG-SE 149
Query: 251 YESNARFVRWSDGSLQLQIGN---EVLDITVQDAQHD-------------QAHLFL---R 291
+SNAR +RWSDGS+ LQ+ + E I+ + ++H +L
Sbjct: 150 LQSNARIIRWSDGSMTLQLASNPTEQYRISSKPLAQSSRPGKREEYDPDLESHTYLGVAA 209
Query: 292 HGKGILQSQGRILKKMRFIPSSLSSNS--HRLLTALVDSRH--KKVYKVKNCITDI--DP 345
+++ RI + +PS++ ++ RL +L + KK I ++ DP
Sbjct: 210 EASSVIRLTSRITASLSILPSTVETDDAVQRLQESLAAASRSGKKTADGSVAIIEVKQDP 269
Query: 346 EREKEEKERAESQNIR 361
E K++ E+AE + +R
Sbjct: 270 ELAKKQAEQAEREKLR 285
>gi|226290681|gb|EEH46165.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb18]
Length = 464
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 70/294 (23%), Positives = 119/294 (40%), Gaps = 66/294 (22%)
Query: 105 DVFGDSDEEDVGEYAIRNDIDQDSNRSPMEEEGSYEKSIRPEDIVPDEDMHYESEEEHVE 164
D+FG EE+ E +D + DS ++EG Y++ P D ED Y E +V
Sbjct: 46 DLFGSGSEEEPKERRGLDDEELDSG----DDEGRYDRQGSPMD----EDGIYYPESLNV- 96
Query: 165 PKHKEKPVGPPLELEIPLRPPP-ADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDE 223
++L + P P + ++ + + N + I+ + F+P+TYV
Sbjct: 97 -----------MDLSLSRVPEPESTDGEVYTLAMPNFLAIESEDFNPETYV--------- 136
Query: 224 SGAKKRIRLENNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIG------------- 270
+ + WR N DG T +SNAR +RWSDGSL LQ+
Sbjct: 137 ---APPFNSASTSLCWRYDPN-DGET-LQSNARIIRWSDGSLTLQLASNPKEQYRMPSKR 191
Query: 271 ----NEVLDITVQDAQHDQAHLFL---RHGKGILQSQGRILKKMRFIPSSLSSNS--HRL 321
N D++ D +H +L + + + + +P+++ ++ RL
Sbjct: 192 LARPNTARKTADYDSELD-SHAYLGAAAEASSVFRITSHLTSSLSILPTTVETDDAVKRL 250
Query: 322 LTALVDSRHKKVYKVKNCITDI----DPEREKEEKERAESQNIRANVLLNRKRE 371
+L + +T DPE K+ E AE + +RA +RKR+
Sbjct: 251 KESLEAASRGAKRNADGTVTVFDVAEDPELAKKRAELAEREKLRA----DRKRQ 300
>gi|225557761|gb|EEH06046.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
Length = 471
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 51/203 (25%), Positives = 85/203 (41%), Gaps = 45/203 (22%)
Query: 195 IKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGMTSYESN 254
+ + N + I+ + F+P+TYV + + WR N +G T +SN
Sbjct: 117 LTMPNFLAIESEDFNPETYV------------APPFNSASTSLCWRYDPN-NGET-LQSN 162
Query: 255 ARFVRWSDGSLQLQIG-----------------NEVLDITVQDAQHDQAHLFL---RHGK 294
AR VRWSDGSL LQ+ N+ T D++ D AH +L
Sbjct: 163 ARIVRWSDGSLTLQLASNPKEQYRMPSKRLARSNKARKATDYDSELD-AHAYLGAAAEAS 221
Query: 295 GILQSQGRILKKMRFIPSSLSSNS--HRLLTALVDSRHKKVYKVKNCITDI----DPERE 348
+ + + + +P+++ ++ RL +L + V +T DPE
Sbjct: 222 SVFRITSHLTSSLSILPTTVETDDAVQRLKESLEAASRGAKKNVDGTVTMFDVAEDPELA 281
Query: 349 KEEKERAESQNIRANVLLNRKRE 371
K+ E AE + +RA +RKR+
Sbjct: 282 KKRAELAEREKLRA----DRKRQ 300
>gi|451998938|gb|EMD91401.1| hypothetical protein COCHEDRAFT_1135891 [Cochliobolus
heterostrophus C5]
Length = 469
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 51/99 (51%), Gaps = 9/99 (9%)
Query: 184 PPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENN---IVRWR 240
P P D ++ ++KV + + K FD +T+ T + GA + + +RWR
Sbjct: 114 PEPTD-NELYLLKVPRFLSFEDKAFDHRTFQPPTTDHHSKVGASEHFSAYHTAMTTIRWR 172
Query: 241 TVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQ 279
+ + + +SNAR +RWSDGSL LQ+ + D TVQ
Sbjct: 173 RSPSNNAV--LQSNARILRWSDGSLTLQLAS---DPTVQ 206
>gi|119496147|ref|XP_001264847.1| hypothetical protein NFIA_016460 [Neosartorya fischeri NRRL 181]
gi|119413009|gb|EAW22950.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
Length = 470
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 55/232 (23%), Positives = 98/232 (42%), Gaps = 44/232 (18%)
Query: 195 IKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGMTSYESN 254
+ V N + I+ + FDP+TYV F T + + WR N + + +SN
Sbjct: 133 MPVPNFLSIETEEFDPETYVAP-PFTTAATS-----------LCWRHDPNDESL--LQSN 178
Query: 255 ARFVRWSDGSLQLQ----------IGNEVLDITVQDAQHDQ---AHLFLRHGK---GILQ 298
AR +RW DGSL LQ I + L T + ++D +H++L + +
Sbjct: 179 ARIIRWEDGSLTLQLASAPKEQYRISTKPLAPTNKAGEYDTKLDSHVYLGAAAETSSVFR 238
Query: 299 SQGRILKKMRFIPSSLSSNS--HRLLTALVDSRH--KKVYKVKNCITDI--DPEREKEEK 352
+ + +P+++ ++ RL +L + KK + ++ DPE K +
Sbjct: 239 LTSHLTHGLTVLPTTVETDDAVQRLQESLAAAARGSKKTADGSAPVIEVKEDPELAKRQA 298
Query: 353 ERAESQNIRANVLLNRKREKINRKYTQTVER----RRQLSTGYLEDALEEDD 400
E AE + ++A R+R+++ + R R G LE+DD
Sbjct: 299 EMAEREKLKA----ARRRQQLAERELDRGRRVGFSHRSGGAGLTVGGLEDDD 346
>gi|239609882|gb|EEQ86869.1| conserved hypothetical protein [Ajellomyces dermatitidis ER-3]
gi|327350809|gb|EGE79666.1| hypothetical protein BDDG_02607 [Ajellomyces dermatitidis ATCC
18188]
Length = 464
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 49/203 (24%), Positives = 82/203 (40%), Gaps = 45/203 (22%)
Query: 195 IKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGMTSYESN 254
+ + N + I+ + F+P+TYV + + WR N M +SN
Sbjct: 117 LAIPNFLAIESEDFNPETYV------------APPFNSASTSLCWRYDPNDGEM--LQSN 162
Query: 255 ARFVRWSDGSLQLQIG-----------------NEVLDITVQDAQHDQAHLFL---RHGK 294
AR VRWSDGSL LQ+ N+ T D++ D AH +L
Sbjct: 163 ARIVRWSDGSLTLQLASNPKEQYRMPSKRLARSNKARKATDYDSELD-AHAYLGAAAEAS 221
Query: 295 GILQSQGRILKKMRFIPSSLSSNS--HRLLTALVDSRHKKVYKVKNCITDI----DPERE 348
+ + + + +P+++ ++ RL +L + +T DPE
Sbjct: 222 SVFRITSHLTSSLSVLPTTVETDDAVQRLKESLEAAARGAKKNPDGTVTMFDVAEDPELA 281
Query: 349 KEEKERAESQNIRANVLLNRKRE 371
K+ E AE + +RA +RKR+
Sbjct: 282 KKRAELAEREKLRA----DRKRQ 300
>gi|261198907|ref|XP_002625855.1| conserved hypothetical protein [Ajellomyces dermatitidis SLH14081]
gi|239595007|gb|EEQ77588.1| conserved hypothetical protein [Ajellomyces dermatitidis SLH14081]
Length = 464
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 49/203 (24%), Positives = 82/203 (40%), Gaps = 45/203 (22%)
Query: 195 IKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGMTSYESN 254
+ + N + I+ + F+P+TYV + + WR N M +SN
Sbjct: 117 LAIPNFLAIESEDFNPETYV------------APPFNSASTSLCWRYDPNDGEM--LQSN 162
Query: 255 ARFVRWSDGSLQLQIG-----------------NEVLDITVQDAQHDQAHLFL---RHGK 294
AR VRWSDGSL LQ+ N+ T D++ D AH +L
Sbjct: 163 ARIVRWSDGSLTLQLASNPKEQYRMPSKRLARSNKARKATDYDSELD-AHAYLGAAAEAS 221
Query: 295 GILQSQGRILKKMRFIPSSLSSNS--HRLLTALVDSRHKKVYKVKNCITDI----DPERE 348
+ + + + +P+++ ++ RL +L + +T DPE
Sbjct: 222 SVFRITSHLTSSLSVLPTTVETDDAVQRLKESLEAAARGAKKNPDGTVTMFDVAEDPELA 281
Query: 349 KEEKERAESQNIRANVLLNRKRE 371
K+ E AE + +RA +RKR+
Sbjct: 282 KKRAELAEREKLRA----DRKRQ 300
>gi|326470016|gb|EGD94025.1| hypothetical protein TESG_01554 [Trichophyton tonsurans CBS 112818]
gi|326482768|gb|EGE06778.1| hypothetical protein TEQG_05773 [Trichophyton equinum CBS 127.97]
Length = 449
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 86/192 (44%), Gaps = 40/192 (20%)
Query: 195 IKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGMTSYESN 254
+ N + I+ + F+P+TYV F + + + WRT + DG + +SN
Sbjct: 109 CAIPNFLSIESEDFNPETYVAP-PFSS-----------ASTSLCWRT--SPDG-SELQSN 153
Query: 255 ARFVRWSDGSLQLQIGN---EVLDITVQDAQHD-------------QAHLFL---RHGKG 295
AR +RWSDGS+ LQ+ + E I+ + ++H +L
Sbjct: 154 ARIIRWSDGSMTLQLASNPTEQYRISSKPLAQSSRPGKREEYDPDLESHTYLGVAAEASS 213
Query: 296 ILQSQGRILKKMRFIPSSLSSNS--HRLLTALVDSRH--KKVYKVKNCITDI--DPEREK 349
+++ RI + +PS++ ++ +L +L + KK I ++ DPE K
Sbjct: 214 VIRLTSRITASLSILPSTVETDDAVQKLQESLAAAARSGKKTADGSVAIIEVKQDPELAK 273
Query: 350 EEKERAESQNIR 361
++ E+AE + +R
Sbjct: 274 KQAEQAEREKLR 285
>gi|330941957|ref|XP_003306105.1| hypothetical protein PTT_19139 [Pyrenophora teres f. teres 0-1]
gi|311316545|gb|EFQ85782.1| hypothetical protein PTT_19139 [Pyrenophora teres f. teres 0-1]
Length = 458
Score = 48.9 bits (115), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 33/99 (33%), Positives = 50/99 (50%), Gaps = 9/99 (9%)
Query: 184 PPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENN---IVRWR 240
P P D ++ ++KV + + K FD KT+ T + A + + VRWR
Sbjct: 106 PEPTD-NELYLLKVPRFLSFEDKAFDHKTFQPPMTDHHSKVSASEHFSAYHTAMTTVRWR 164
Query: 241 TVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQ 279
+ + + +SNAR +RWSDGSL LQ+ + D TVQ
Sbjct: 165 RSPSNNAL--LQSNARILRWSDGSLTLQLAS---DPTVQ 198
>gi|154318381|ref|XP_001558509.1| hypothetical protein BC1G_03358 [Botryotinia fuckeliana B05.10]
gi|347837618|emb|CCD52190.1| hypothetical protein [Botryotinia fuckeliana]
Length = 437
Score = 48.9 bits (115), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 55/233 (23%), Positives = 107/233 (45%), Gaps = 28/233 (12%)
Query: 184 PPPADPTKMNMIKVSNIMGIDPKPFDPKTY--VEEDTFVTDESGAKKRIRLENNIVRWRT 241
P P+D + + ++ GI+P+ ++P+T+ D V+ +S + + +R+R
Sbjct: 88 PRPSD-GEFHTFRLPKFFGIEPREYNPETFEIPVSDHHVSTQSANFSARNVAESTIRYR- 145
Query: 242 VKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDIT------VQDAQHDQ---AHLFLRH 292
K ESN WSDGS L IG++ ++ +D+++++ +H ++
Sbjct: 146 ---KSDSGKLESNTNIYTWSDGSATLAIGDQHYELQSKPLAPAKDSKYNEVLDSHSYMAS 202
Query: 293 ---GKGILQSQGRILKKMRFIPS-SLSSNS-HRL---LTALVDSRHKKVYKVKNCITDID 344
IL G + + P+ S+ ++ +RL LTA + ++ + + D
Sbjct: 203 PSVTSQILVVVGHMTNEYTVRPNKSIEDDALNRLKKNLTAATGLEGGEKKRLNMIVKNED 262
Query: 345 PEREKEEKERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLEDALE 397
PE +K+ E AE + +RA R+RE + +Q+ RR + G D L+
Sbjct: 263 PELQKKRAEMAEKERMRA----QRRREAQLDRASQSTGRRVGVGAGLTLDDLD 311
>gi|429857493|gb|ELA32357.1| lipid acyl hydrolase [Colletotrichum gloeosporioides Nara gc5]
Length = 402
Score = 48.5 bits (114), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 61/236 (25%), Positives = 96/236 (40%), Gaps = 47/236 (19%)
Query: 183 RPPPADPTKMNMIKVSNIMGIDPKPFDPKTY--VEEDTFVTDESGAKKRIRLENNIVRWR 240
R P A ++ ++V P +DP+T+ EED K IR+ R
Sbjct: 76 RIPKAKEQRLGSLRVPKFFRFIPTIYDPETFEPTEEDIENAKADTPKHDIRV-------R 128
Query: 241 TVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDIT--------------VQDAQHDQA 286
NK +SNA RWSDGS+ + +G+E L++T +QD + A
Sbjct: 129 RQGNK-----LQSNAMIHRWSDGSVTITVGDEHLEVTTKALAPGPEQPYSDLQDGHYYAA 183
Query: 287 HLFLRHGKGILQSQGRILKKMRFIPS------SLSSNSHRLLTALVDSRHKKVYKVKNCI 340
+ L G + + P +L++ + R+ T + K + + N I
Sbjct: 184 A--AEYSSNFLMFVGHVTDQFIVRPGKEVQDDALAALAERMATVTQKPQEKDM--IINTI 239
Query: 341 TDIDPEREKEEKERAESQNIRANVLLNRKREKINRKYT---QTVERRRQLSTGYLE 393
DPE K + E+AE + ++A R+RE K T + R LS G LE
Sbjct: 240 Q--DPELRKRQAEQAEKERMKA----QRRRETAAAKLTDSSRGYGRGGGLSIGDLE 289
>gi|156053049|ref|XP_001592451.1| hypothetical protein SS1G_06692 [Sclerotinia sclerotiorum 1980]
gi|154704470|gb|EDO04209.1| hypothetical protein SS1G_06692 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 435
Score = 47.0 bits (110), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 54/235 (22%), Positives = 103/235 (43%), Gaps = 32/235 (13%)
Query: 184 PPPADPTKMNMIKVSNIMGIDPKPFDPKTY--VEEDTFVTDESGAKKRIRLENNIVRWRT 241
P P+D + + ++ GI+P+ ++P+T+ D VT +S + + +R+R
Sbjct: 86 PRPSD-GEFHTFRLPKFFGIEPREYNPETFEIPISDHHVTTQSANFSARSVAESTIRYR- 143
Query: 242 VKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDIT------VQDAQHDQ---AHLFLRH 292
K ESN WSDGS L IG++ ++ +D ++++ +H ++
Sbjct: 144 ---KTDSGKLESNTNIYTWSDGSTTLAIGDQHYELQSKPLAPAKDNKYNEVLDSHYYMAS 200
Query: 293 ---GKGILQSQGRILKKMRFIPS------SLSSNSHRLLTAL-VDSRHKKVYKVKNCITD 342
IL G + + P+ +L+ A ++ KK ++ + +
Sbjct: 201 PSVTSQILVVVGHMTNEYTVRPNKDIEDDALNRLKKNFAAATGLEGGEKK--RLNMIVKN 258
Query: 343 IDPEREKEEKERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLEDALE 397
DPE +K+ E AE + +RA R+RE + +Q+ RR + G D L+
Sbjct: 259 EDPELQKKRAETAEKERMRA----QRRREAQLERASQSTGRRPGIGAGLTLDDLD 309
>gi|380493767|emb|CCF33640.1| hypothetical protein CH063_05803 [Colletotrichum higginsianum]
Length = 402
Score = 47.0 bits (110), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 60/233 (25%), Positives = 99/233 (42%), Gaps = 42/233 (18%)
Query: 183 RPPPADPTKMNMIKVSNIMGIDPKPFDPKTY--VEEDTFVTDESGAKKRIRLENNIVRWR 240
R P A ++ ++V M I P FDP+T+ E+D AK + + +R R
Sbjct: 76 RIPKAKDQRLRSLRVPKFMRIVPTLFDPETFEPTEQDI-----ENAKAEVPKAD--IRVR 128
Query: 241 TVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQ--HDQAHLFLRHG----- 293
NK +SNA RWSDGS+ + +G+E ++T + DQ + L+ G
Sbjct: 129 RQGNK-----LQSNAMIHRWSDGSVTMTVGDEHFEVTTKALAPGPDQPYSELQDGHYYAA 183
Query: 294 -----KGILQSQGRILKKMRFIPS------SLSSNSHRLLTALVDSRHKKVYKVKNCITD 342
L G + + P +L + + R+ T + K + + N I
Sbjct: 184 AAEYSSNFLMFVGHVTDQYIVRPGKEVADDALIALAERMATISQKPQEKDM--IINTIQ- 240
Query: 343 IDPEREKEEKERAESQNIRANVLLNRKREKINRKY--TQTVERRRQLSTGYLE 393
DPE K + E+AE + ++ + R+RE + ++ R LS G LE
Sbjct: 241 -DPELRKRQAEQAEKERMK----MQRRRETQTARMDSSRGFGRSGGLSIGDLE 288
>gi|255939666|ref|XP_002560602.1| Pc16g02300 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211585225|emb|CAP92900.1| Pc16g02300 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 435
Score = 46.6 bits (109), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 51/230 (22%), Positives = 93/230 (40%), Gaps = 47/230 (20%)
Query: 195 IKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGMTSYESN 254
++V + + I+ + F+P+TYV S + WR K+ + +SN
Sbjct: 109 MRVPDFLSIEGEEFNPETYVPPPYQTAATS------------LCWR--KDPADESLLQSN 154
Query: 255 ARFVRWSDGSLQLQ----------IGNEVLDITVQDAQHDQ---AHLFLRHG---KGILQ 298
AR ++W DGS+ LQ I ++ L + +D +H++L G + +
Sbjct: 155 ARIIKWEDGSITLQLASAPLEQYRIASKPLAPLTKSGDYDHKLDSHVYLAAGLETAQVFR 214
Query: 299 SQGRILKKMRFIPSSLSSN------SHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEK 352
I + + +P++L ++ +L A S+ DPE +
Sbjct: 215 LTSHITQGLTVLPTTLETDDAVQRLQEQLAAAARGSKQTAAGTAPRYELTEDPELAGKRA 274
Query: 353 ERAESQNIRANVLLNRKREKI-------NRKYTQTVERRRQLSTGYLEDA 395
E E + I+A R+R+++ R++ T LS G LEDA
Sbjct: 275 EMMEKEAIKA----ERRRQQLADREADRGRRHGATRPGPSGLSVGGLEDA 320
>gi|225678274|gb|EEH16558.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
Length = 533
Score = 46.6 bits (109), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 48/168 (28%), Positives = 75/168 (44%), Gaps = 35/168 (20%)
Query: 105 DVFGDSDEEDVGEYAIRNDIDQDSNRSPMEEEGSYEKSIRPEDIVPDEDMHYESEEEHVE 164
D+FG EE+ E +D + DS ++EG Y++ P D ED Y E +V
Sbjct: 115 DLFGSGSEEEPKERRGLDDEELDSG----DDEGRYDRQGSPMD----EDGIYYPESLNV- 165
Query: 165 PKHKEKPVGPPLELEIPLRPPP-ADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDE 223
++L + P P + ++ + + N + I+ + F+P+TYV
Sbjct: 166 -----------MDLSLSRVPEPESTDGEVYTLAMPNFLAIESEDFNPETYVAPPFNSAST 214
Query: 224 SGAKKRIRLENNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGN 271
S + WR N DG +SNAR +RWSDGSL LQ+ +
Sbjct: 215 S------------LCWRYDPN-DGEI-LQSNARIIRWSDGSLTLQLAS 248
>gi|70995038|ref|XP_752285.1| conserved hypothetical protein [Aspergillus fumigatus Af293]
gi|66849920|gb|EAL90247.1| conserved hypothetical protein [Aspergillus fumigatus Af293]
gi|159131041|gb|EDP56154.1| conserved hypothetical protein [Aspergillus fumigatus A1163]
Length = 470
Score = 46.2 bits (108), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 54/232 (23%), Positives = 97/232 (41%), Gaps = 44/232 (18%)
Query: 195 IKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGMTSYESN 254
+ V + I+ + FDP+TYV F T + + WR N + + +SN
Sbjct: 133 MPVPTFLSIETEEFDPETYVAP-PFNTAATS-----------LCWRHDPNDESL--LQSN 178
Query: 255 ARFVRWSDGSLQLQ----------IGNEVLDITVQDAQHDQ---AHLFLRHG---KGILQ 298
AR +RW DGSL LQ I + L T + ++D +H++L + +
Sbjct: 179 ARIIRWEDGSLTLQLASAPKEQYRISTKPLAPTNKAGEYDTKLDSHVYLGAAAETSSVFR 238
Query: 299 SQGRILKKMRFIPSSLSSNS--HRLLTALVDSR--HKKVYKVKNCITDI--DPEREKEEK 352
+ + +P+++ ++ RL +L + KK + ++ DPE K +
Sbjct: 239 LTSHLTHGLTVLPTTVETDDAVQRLQESLAAAARGSKKTADGSAPVIEVKEDPELAKRQA 298
Query: 353 ERAESQNIRANVLLNRKREKINRKYTQTVER----RRQLSTGYLEDALEEDD 400
E AE + ++A R+R+++ + R R G LE+DD
Sbjct: 299 EMAEREKLKA----ARRRQQLAERELDRGRRVGFSNRSGGAGLTVGGLEDDD 346
>gi|310790679|gb|EFQ26212.1| hypothetical protein GLRG_01356 [Glomerella graminicola M1.001]
Length = 402
Score = 45.8 bits (107), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 58/231 (25%), Positives = 99/231 (42%), Gaps = 38/231 (16%)
Query: 183 RPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVRWRTV 242
R P A ++ ++V M I P FDP T+ + + D AK + + +R R
Sbjct: 76 RIPKAKDQRLRSLRVPKFMRIVPTLFDPDTFEPTEQDIED---AKAEVPRAD--IRVRRQ 130
Query: 243 KNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQ--HDQAHLFLRHG------- 293
NK +SNA +WSDGS+ + +G+E ++T + DQ + L+ G
Sbjct: 131 GNK-----LQSNAMIHKWSDGSVTMTVGDEHFEVTTKALAPGSDQPYSELQDGHYYAAAA 185
Query: 294 ---KGILQSQGRILKKMRFIPS------SLSSNSHRLLTALVDSRHKKVYKVKNCITDID 344
L G + + P +L + + R+ T + K + + N I D
Sbjct: 186 EYSSNFLMFVGHVTDQYIVRPGKEVADDALIALAERMATISQKPQEKDM--IINTIQ--D 241
Query: 345 PEREKEEKERAESQNIRANVLLNRKREKINRKY--TQTVERRRQLSTGYLE 393
PE K + E+AE + ++ + R+RE + + ++ R LS G LE
Sbjct: 242 PELRKRQAEQAEKERMK----MQRRRETQSARMDASRGFGRSGGLSIGDLE 288
>gi|406862841|gb|EKD15890.1| hypothetical protein MBM_05901 [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 434
Score = 45.1 bits (105), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 43/91 (47%), Gaps = 7/91 (7%)
Query: 184 PPPADPTKMNMIKVSNIMGIDPKPFDPKTY--VEEDTFVTDESGAKKRIRLENNIVRWRT 241
P PAD + N +++ +GI+P FD T+ D T S + ++ +R+R
Sbjct: 79 PKPAD-GEYNALRLPAFLGIEPHKFDSNTFQPPTSDHHSTVRSAKFSASAVSSSTMRYRV 137
Query: 242 VKNKDGMTSYESNARFVRWSDGSLQLQIGNE 272
N ESN +WSDGS L IG++
Sbjct: 138 GSN----GKLESNTVVYKWSDGSTTLSIGDQ 164
>gi|345561977|gb|EGX45049.1| hypothetical protein AOL_s00173g150 [Arthrobotrys oligospora ATCC
24927]
Length = 531
Score = 45.1 bits (105), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 23/82 (28%), Positives = 41/82 (50%), Gaps = 13/82 (15%)
Query: 194 MIKVSNIMGIDPKPFDPKTY----VEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGMT 249
++K+ + + P+ F P+T+ + D +T + A +RWR +
Sbjct: 163 LLKIPQFISLAPQVFKPETFQAPKLPPDAAITPYTFA-------TTAIRWRHSPYDS--S 213
Query: 250 SYESNARFVRWSDGSLQLQIGN 271
ESN+R ++WSDG+ LQIG+
Sbjct: 214 KLESNSRIIQWSDGTFSLQIGS 235
>gi|400595287|gb|EJP63092.1| RNA polymerase-associated protein LEO1 [Beauveria bassiana ARSEF
2860]
Length = 407
Score = 44.7 bits (104), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 68/260 (26%), Positives = 105/260 (40%), Gaps = 43/260 (16%)
Query: 156 YESEEEHVEPKHKEKPVGPPLELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFD--PKTY 213
Y+ E+ V+ + KEK V + P P D NM++V + PK P+ Y
Sbjct: 52 YDDEDNQVQHETKEKVVMAVQAYRHRI-PKPKD----NMLRVMRV----PKFIKIMPEEY 102
Query: 214 VEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEV 273
V E T+ A + ++ R V+ D SNA RWSDGS+ + IG E
Sbjct: 103 VSEKFEPTEFDTANAKQEHPAHVAR---VRRNDSTGLLHSNANVYRWSDGSITIGIGGEH 159
Query: 274 LDIT--------------VQDAQHDQAHLFLRHGKGILQSQGRILKKMRFIPSSLSSNSH 319
+I ++D + A L +L + G + ++ P+ +
Sbjct: 160 FEIQKKALAPAPGQPYDELKDGHYYAAAAEL--SSNLLMTVGHVAEQFNVRPNKGGGDD- 216
Query: 320 RLLTALVDSRHKKVYKVKN-----CITDIDPEREKEEKERAESQNIRANVLLNRKREKIN 374
AL+ +R ++ K N T DPE +K+E E AE + ++A R+RE
Sbjct: 217 --ALALLATRMQQASKTVNEDDMIIRTTRDPELQKKEAELAEKERMKA----QRRRENAA 270
Query: 375 RKYTQTVERRR-QLSTGYLE 393
K R LS G LE
Sbjct: 271 AKMDGLGRYNRGGLSIGDLE 290
>gi|389624643|ref|XP_003709975.1| hypothetical protein MGG_16287 [Magnaporthe oryzae 70-15]
gi|351649504|gb|EHA57363.1| hypothetical protein MGG_16287 [Magnaporthe oryzae 70-15]
gi|440467433|gb|ELQ36656.1| hypothetical protein OOU_Y34scaffold00648g3 [Magnaporthe oryzae
Y34]
Length = 414
Score = 44.7 bits (104), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 67/260 (25%), Positives = 113/260 (43%), Gaps = 56/260 (21%)
Query: 165 PKHKEKPVGPPLELEIPLR---PPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVT 221
P++KEK + + L+I R P P D + ++KV M I P+ ++ +Y +
Sbjct: 60 PQYKEKRI---MGLKI-FRHRIPKPKD-GNLRILKVPQFMKIVPEQYNEDSY---EPTAW 111
Query: 222 DESGAKKRIRLENNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQD- 280
D S KK+ +++RWR KN + ESNA RWSDGS+ + +G++ D+ +
Sbjct: 112 DLSEDKKK--YPASVIRWR--KNPE-TGKLESNANIYRWSDGSISVAVGDQHYDMDAKSL 166
Query: 281 ------------------AQHDQAHLFLRHGKGILQSQGRILKKMRFIPS-SLSSNSHRL 321
A H FL + G I ++++ P+ +L + +
Sbjct: 167 APTGKGYDEKRDANYYAAAGHLSTEAFL--------TVGHITEELKVRPNEALEAAAVDR 218
Query: 322 LTALVDSRHKKVYKVKNCITDI--DPEREKEEKERAESQNIRANVLLNRKREKINRKY-- 377
L A +D K + + I DPE ++++ E AE + R L RK+E +
Sbjct: 219 LMAKMDEATKHRHTQDDVIFRATEDPELQRKQAEMAEKERER----LRRKKENAAARLEM 274
Query: 378 ----TQTVERRRQLSTGYLE 393
++ R LS G LE
Sbjct: 275 GFGSSRFSGRSGGLSVGGLE 294
>gi|440480456|gb|ELQ61118.1| hypothetical protein OOW_P131scaffold01201g48 [Magnaporthe oryzae
P131]
Length = 414
Score = 44.7 bits (104), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 37/115 (32%), Positives = 61/115 (53%), Gaps = 16/115 (13%)
Query: 165 PKHKEKPVGPPLELEIPLR---PPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVT 221
P++KEK + + L+I R P P D + ++KV M I P+ ++ +Y +
Sbjct: 60 PQYKEKRI---MGLKI-FRHRIPKPKD-GNLRILKVPQFMKIVPEQYNEDSY---EPTAW 111
Query: 222 DESGAKKRIRLENNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDI 276
D S KK+ +++RWR KN + ESNA RWSDGS+ + +G++ D+
Sbjct: 112 DLSEDKKK--YPASVIRWR--KNPE-TGKLESNANIYRWSDGSISVAVGDQHYDM 161
>gi|171687387|ref|XP_001908634.1| hypothetical protein [Podospora anserina S mat+]
gi|170943655|emb|CAP69307.1| unnamed protein product [Podospora anserina S mat+]
Length = 406
Score = 42.7 bits (99), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 53/198 (26%), Positives = 82/198 (41%), Gaps = 34/198 (17%)
Query: 180 IPL---RPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNI 236
IPL R P + ++ +KV N + +P + P + + + + EN I
Sbjct: 70 IPLYRHRIPKSKDNTLHNLKVPNFLKFNPVEYKPDEWQPSKWELNNANS-------ENPI 122
Query: 237 --VRWRTVKNKDGMTS-YESNARFVRWSDGSLQLQIGNEVLDIT--------------VQ 279
+ WR +D T +SNA RWSDGS LQIG++ +IT V+
Sbjct: 123 PSIMWR----RDHKTGKLKSNANIHRWSDGSTTLQIGDDHYEITSNSLVGPPNEPYQDVK 178
Query: 280 DAQHDQAHLFLRHGKGILQSQGRILKKMRFIPS-SLSSNSHRLLTALVDSRHKKVYKVKN 338
DA A L ++ G ++ P L + L A ++ K+ +
Sbjct: 179 DAHEYAAAAHLTTNSLLI--VGHYTEQYSIKPPEELQDAAFDRLVAGLNMNKKESTTSRL 236
Query: 339 CITDIDPEREKEEKERAE 356
T+ DPER+K E E AE
Sbjct: 237 IATNEDPERKKREAELAE 254
>gi|296422158|ref|XP_002840629.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295636849|emb|CAZ84820.1| unnamed protein product [Tuber melanosporum]
Length = 471
Score = 42.7 bits (99), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 73/311 (23%), Positives = 138/311 (44%), Gaps = 46/311 (14%)
Query: 80 ARSPRSPGGEEKDQTHIS-HSAAEIRDVFGD-SDEEDVG---EYAIRNDIDQDSNRSPME 134
A SP P G + D+ S + A RD + SD+ED+G + +++D ++ E
Sbjct: 17 AASPERPTGNKYDRLFGSDNEGAVARDEEAEASDDEDIGRKRRSRVTDNLDGVNDEEEEE 76
Query: 135 EEG--SYEKSIRPEDIVPDEDMHYESEEEHVEPKHKEKPVGPPLELEIPLRPPPADPT-K 191
E+G ++ + ES H +P + E+ +++ +P P
Sbjct: 77 EDGDRGLFGDDDDDEPEKKDLSDVESHASH-DPGNDEQQTFHSIDVSLPRHACPDQGNDD 135
Query: 192 MNMIKVSNIMGIDPKPFDPKTY----VEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDG 247
+ ++++ + + + F P+ + ED+ V+ + A IR+ K+
Sbjct: 136 LYLMRIPSTVNFQQQIFYPENFTMPKATEDSTVSPYTMATTTIRVR---------KSPAN 186
Query: 248 MTSYESNARFVRWSDGSLQLQIGNE--VLDITVQ------------DAQHDQAHLFLRH- 292
+ +SNAR V+WSDGSL LQI + + D+T + AQ +L H
Sbjct: 187 PSVLQSNARVVKWSDGSLSLQIASSPNLYDLTSKPLAPPQNSKAYDAAQDSHTYLLTTHE 246
Query: 293 GKGILQSQGRILKKMRFIPSSLS-SNS---HRLLTALVDSRHKKVYKVKNC----ITDI- 343
G+L+ G + + +P+S S +N+ +RL L ++ + + + I+D+
Sbjct: 247 SAGMLRFVGHVNTHLNLLPTSTSQANAEAVNRLQERLYQAQSTRGARGRGAQALEISDMR 306
Query: 344 DPEREKEEKER 354
DPE E++E E+
Sbjct: 307 DPEAERKEAEK 317
>gi|302926473|ref|XP_003054302.1| hypothetical protein NECHADRAFT_74883 [Nectria haematococca mpVI
77-13-4]
gi|256735243|gb|EEU48589.1| hypothetical protein NECHADRAFT_74883 [Nectria haematococca mpVI
77-13-4]
Length = 409
Score = 42.4 bits (98), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 55/229 (24%), Positives = 95/229 (41%), Gaps = 35/229 (15%)
Query: 184 PPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVRWRTVK 243
P P D + ++++ + I P+ +DP T+ T+ A + + ++ R R
Sbjct: 80 PKPKDGA-LRVMRIPKFVKIMPEEYDPDTFQP-----TEFDLANAKAQHPKHVARVR--- 130
Query: 244 NKDGMTS-YESNARFVRWSDGSLQLQIGNEVLDIT--------------VQDAQHDQAHL 288
+D T +SN RWSDGS+ + IG E ++ + D + A
Sbjct: 131 -RDHSTGELKSNTNIFRWSDGSVTISIGGEHYEVQKKALAPSADKPYNEIHDGHYYAAAA 189
Query: 289 FLRHGKGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNCI--TDIDPE 346
L +L + G I ++ P+ + + A ++ K + + I T DPE
Sbjct: 190 ELH--SNLLMTVGHITEQYNVRPNKAVGDDALSILAERMAQASKPSQGGDMIIRTTRDPE 247
Query: 347 REKEEKERAESQNIRANVLLNRKREKINRKYTQTVER--RRQLSTGYLE 393
+K++ E AE + ++A R+RE K R R LS G LE
Sbjct: 248 LQKKQAELAEKERMKA----QRRRENAAAKMDGGYGRSGRGNLSIGELE 292
>gi|46107272|ref|XP_380695.1| hypothetical protein FG00519.1 [Gibberella zeae PH-1]
Length = 405
Score = 41.2 bits (95), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 60/237 (25%), Positives = 97/237 (40%), Gaps = 49/237 (20%)
Query: 184 PPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVRWRTVK 243
P P D + +++V + I P+ ++P TY + + A R ++ R R
Sbjct: 76 PKPKDGA-LRVLRVPKFIKIMPEEYNPDTYQPSEFDI-----ANARAEHPKHVARVR--- 126
Query: 244 NKDGMTS-YESNARFVRWSDGSLQLQIGNEVLDIT--------------VQDAQHDQAHL 288
+D T +SN WSDGS+ + +G E +I VQD + A
Sbjct: 127 -RDHSTGELKSNTNVFHWSDGSVTISVGGEHYEINRKALAPPADKPYTEVQDGHYYAAAA 185
Query: 289 FLRHGKGILQSQGRILKKMRFIPS------SLSSNSHRLLTALVDSRHKKVYKVKNCITD 342
L IL + G I ++ P+ +L+ + R+ +A K N I +
Sbjct: 186 ELH--SNILMTVGHITEQYSIRPNKAVGDDALTRLAERMASA------SKPANATNMIIN 237
Query: 343 I--DPEREKEEKERAESQNIRANVLLNRKRE----KINRKYTQTVERRRQLSTGYLE 393
DPE +K++ E AE + +A RKRE K+ ++ R LS G LE
Sbjct: 238 TTQDPELQKKQAELAEKERNKA----QRKRENAEAKMVGGVGRSSGRGGGLSIGDLE 290
>gi|340924223|gb|EGS19126.1| RNA polymerase II-like protein [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 419
Score = 40.4 bits (93), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 70/279 (25%), Positives = 115/279 (41%), Gaps = 56/279 (20%)
Query: 105 DVFGDSDEEDVGEYAIRNDIDQDSNRSPMEEEGSYEKSIRPEDIVPD--EDMHYESEEEH 162
D+FGD DEE ND D EK + D+ D ED + +
Sbjct: 18 DLFGDGDEE-------VNDAVLDD-----------EKEVSERDVASDNDEDEYTRHRDGE 59
Query: 163 VEPK-HKEKPVGPPLELEIPL---RPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDT 218
EP+ ++EK V E+PL R P + ++ ++V + + +P + P E +
Sbjct: 60 SEPREYREKLVA-----EVPLYRHRIPRSSDGSLHSLRVPDFIKFNPMEYKPD---EWEP 111
Query: 219 FVTDESGAKKRIRLENNIVRWRTVKNKDGMTS-YESNARFVRWSDGSLQLQIGNEVLDIT 277
D AK + + + R +D T +SNA RWSDGS+ L IGNE ++
Sbjct: 112 SKWDLENAKAENPVPSILFR------RDPKTGKMQSNANIYRWSDGSVTLAIGNEHYELQ 165
Query: 278 --------------VQDAQHDQAHLFLRHGKGILQSQGRILKKMRFIPSS-LSSNSHRLL 322
VQDA + A L ++ G + ++ P+ L ++ L
Sbjct: 166 AKPLAPPRNKPYNEVQDAHYYVAAAHLTTNSLLI--VGHLTEQYTVRPNKELQDHALERL 223
Query: 323 TALVDSRHKKVYKVKNCITDIDPEREKEEKERAESQNIR 361
A + K +T+ DPE +K++ E AE + ++
Sbjct: 224 KANLAGARKDRSADMIIVTNEDPELQKKQAELAEKERMK 262
>gi|367052575|ref|XP_003656666.1| hypothetical protein THITE_2121623 [Thielavia terrestris NRRL 8126]
gi|347003931|gb|AEO70330.1| hypothetical protein THITE_2121623 [Thielavia terrestris NRRL 8126]
Length = 414
Score = 38.9 bits (89), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 34/125 (27%), Positives = 56/125 (44%), Gaps = 17/125 (13%)
Query: 252 ESNARFVRWSDGSLQLQIGNEVLDIT--------------VQDAQHDQAHLFLRHGKGIL 297
+SNA RWSDGS+ L IG+E +I VQDA + A L ++
Sbjct: 139 QSNANMYRWSDGSVTLAIGDEHFEIQSKPLAPPPDKPYREVQDAHYYVAAAHLTTNSLLI 198
Query: 298 QSQGRILKKMRFIPS-SLSSNSHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKERAE 356
G + ++ P+ L+ ++ L A + K +T+ DPE K++ E AE
Sbjct: 199 --VGHLTEQYTVRPNEELADHALERLKADLAGAKKDRAADMIIVTNEDPELLKKQAELAE 256
Query: 357 SQNIR 361
+ ++
Sbjct: 257 KERLK 261
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.306 0.127 0.348
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 9,547,788,994
Number of Sequences: 23463169
Number of extensions: 461276163
Number of successful extensions: 5554797
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 15189
Number of HSP's successfully gapped in prelim test: 46071
Number of HSP's that attempted gapping in prelim test: 3525086
Number of HSP's gapped (non-prelim): 917630
length of query: 564
length of database: 8,064,228,071
effective HSP length: 148
effective length of query: 416
effective length of database: 8,886,646,355
effective search space: 3696844883680
effective search space used: 3696844883680
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (21.9 bits)
S2: 80 (35.4 bits)