BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 005794
(677 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|356522920|ref|XP_003530090.1| PREDICTED: uncharacterized protein LOC100800148 [Glycine max]
Length = 659
Score = 765 bits (1975), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/515 (73%), Positives = 436/515 (84%), Gaps = 10/515 (1%)
Query: 166 RASKEVSRHDPGHSKQHR--PPVPPPGVKKVNGGS-GRVETEEERRIRKKREYEKHRQEE 222
R E S H H KQH+ PPVP VKK+N G GR ET+EE+R+RKKRE+EK RQEE
Sbjct: 152 RREYEHSNHGIAH-KQHKQQPPVP---VKKMNNGPPGRAETDEEKRLRKKREFEKQRQEE 207
Query: 223 KHRLQMKESQNVVMQKSQMVASGKGGHGSMVGSRMGDRRAAPLLSGERTENRLKKPTTFL 282
KHR Q+KESQN V+QK+ M++SGKG HG + GSRMG+RR+ PLL ER ENRLKKPTTFL
Sbjct: 208 KHRQQLKESQNTVLQKTHMLSSGKG-HGMIAGSRMGERRSTPLLGAERVENRLKKPTTFL 266
Query: 283 CKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLS 342
CKLKFRNELP+PSAQPKLMA KKDKD++ +YT +SLEK YKP+L VEPDLGIPLDLLDLS
Sbjct: 267 CKLKFRNELPDPSAQPKLMASKKDKDQYAKYTITSLEKMYKPKLFVEPDLGIPLDLLDLS 326
Query: 343 VYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSMES 402
VYNPPSVRPPL PED+ELLRDDE VTP+KKDGIKRKERPTDKGV+WLVKTQYISPLSMES
Sbjct: 327 VYNPPSVRPPLAPEDKELLRDDEAVTPIKKDGIKRKERPTDKGVAWLVKTQYISPLSMES 386
Query: 403 ARQSLTEKQAKELREMKGGRSILENLNDRERQIKEIEASFEACKLRPIHATNKNLQPVEI 462
+QSLTEKQAKELREMKGGR IL+NLN RERQI+EIEASFEA K P+HATNK+L PVE+
Sbjct: 387 TKQSLTEKQAKELREMKGGRGILDNLNSRERQIREIEASFEAAKSDPVHATNKDLYPVEV 446
Query: 463 LPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANP 522
+PLLPDF+RYDDQFV A FD APTADSE+++KMDKSVRDA ES+A+MKSYVAT SD ANP
Sbjct: 447 MPLLPDFDRYDDQFVVAAFDNAPTADSEMHAKMDKSVRDAFESKAVMKSYVATSSDPANP 506
Query: 523 EKFLAYMVPSVNELSKDMYDENEDVSFSWVREYHWDVRGDDADDPTTYLVSFDDDEARYV 582
EKFLAYMVP+ ELSKD+YDENEDVS+SW+REYHWDVRGDDADDP T+LV+FD+ EARY+
Sbjct: 507 EKFLAYMVPAPGELSKDIYDENEDVSYSWIREYHWDVRGDDADDPATFLVAFDESEARYL 566
Query: 583 PLPTKLNLRKKRAIEGRSNDEVEHFPIPSSIAVRRRANVTAIELKEQGAYSNSKGNSSSS 642
PLPTKL LRKKRA EGRS DEVE P+P+ + VRRR++V AIE K+ G Y++SKGN SS
Sbjct: 567 PLPTKLVLRKKRAKEGRSGDEVEQCPVPARVTVRRRSSVAAIERKDSGVYTSSKGN--SS 624
Query: 643 KMGRVDSQEDLERSHNGSRQQDPYQSSGAEDDMYD 677
K G ++ + LE H G+ QD YQSSGAED M D
Sbjct: 625 KRGGLEMDDGLEDQHRGAPHQDNYQSSGAEDYMSD 659
>gi|449480786|ref|XP_004155995.1| PREDICTED: uncharacterized LOC101203806 [Cucumis sativus]
Length = 706
Score = 764 bits (1973), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 446/724 (61%), Positives = 518/724 (71%), Gaps = 65/724 (8%)
Query: 1 MASYRPFPQPPQSSFPPPPPPNQNPSQPPPPPQQQQQQQR-----PNPYSQNWG------ 49
MASYRP+P PQSSF P N S PPP Q + Y+QNWG
Sbjct: 1 MASYRPYP--PQSSFGSAPAQN---SIPPPSAQSASVSSQQRGGATTQYNQNWGTYAGDA 55
Query: 50 ------------GYSNTGGGAQQHYHQPY----SYAQPPPPPPPESSYPPPPPPPPPPPP 93
Y+N ++HQ Y + PPPPPPP SYP PPPPPPP
Sbjct: 56 SAPSAPSSSYPQNYNNQLHQTSNYHHQQYGPPRTQHPPPPPPPPHQSYPYASQPPPPPPP 115
Query: 94 TQQQTQP----------SMYYSSNQYNQNSMYPPMQPPLPPPPPSSPPPSSSIPPPPPPG 143
P ++YY S+QY+Q + P PPP P S PPPP
Sbjct: 116 DSSYPPPPPPPATSQPPNLYYPSSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSP 175
Query: 144 SPPPPPPKDVEGRDTGTSDRDK---------RASKEVSRHDPGHSKQHRPPVPPPGVKKV 194
PP + EG + G +RDK R +E S HD H K PP+PP KK
Sbjct: 176 PPPSASQQKAEGTNMGAHERDKGAPKDPSYGRRDRENSNHDK-HQKHSGPPMPP---KKA 231
Query: 195 NGGSGRVETEEERRIRKKREYEKHRQEEKHRLQMKESQNVVMQKSQMVASGKGGHGSMVG 254
NG SGR+ET++E+R+RKKRE+EK RQ+E+HR +KESQN ++QK+QM+++GK HGS+VG
Sbjct: 232 NGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKV-HGSIVG 290
Query: 255 SRMGDRRAAPLLSGERTENRLKKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYT 314
SRMG+R+A P LSGER ENRLKKPTTFLCKLKFRNELP+ SAQPKLM+L+K+KD +TRYT
Sbjct: 291 SRMGERKATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYT 350
Query: 315 FSSLEKNYKPQLHVEPDLGIPLDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDG 374
+SLEK YKPQL+VEPDLGIPLDLLDLSVYNP SVR PL PEDEELLRDD + TPVKKDG
Sbjct: 351 ITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPSSVRMPLAPEDEELLRDDVLKTPVKKDG 410
Query: 375 -IKRKERPTDKGVSWLVKTQYISPLSMESARQSLTEKQAKELREMKGGRSILENLNDRER 433
IKRKERPTDKGV+WLVKTQYISPLS+ESA+QSLTEKQAKELREMKGGR+ILENLN+RER
Sbjct: 411 GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRER 470
Query: 434 QIKEIEASFEACKLRPIHATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYS 493
QIKEIE SFEACK RPIHATNKNL PVE+LPLLPDF+RYDD FV FD APTADSE ++
Sbjct: 471 QIKEIETSFEACKSRPIHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFN 530
Query: 494 KMDKSVRDAHESRAIMKSYVATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFSWVR 553
K+D+S+RDAHES+AIMKSY+ATGSD + PEKFLAYMVPS +ELSKD+YDE EDVS+SWVR
Sbjct: 531 KLDQSIRDAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVR 590
Query: 554 EYHWDVRGDDADDPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEGRSNDEVEHFPIPSSI 613
EYHWDVRGD+ DDPTTYLVSFDD EARYVPLPTKL LRKKRA EGRS+DEVEHFP P+ +
Sbjct: 591 EYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARV 650
Query: 614 AVRRRANVTAIELKEQGAYSNSKGNSSSSKMGRVDSQEDLERSHNGSRQQDPYQSSGAED 673
VRRR V +E+K+ G YSNSK S D ++ + RSH R QD Q SGAED
Sbjct: 651 TVRRRPTVATLEVKDPGIYSNSKRGS--------DIEDGIGRSHKHDRNQDMDQFSGAED 702
Query: 674 DMYD 677
+M D
Sbjct: 703 EMSD 706
>gi|449448058|ref|XP_004141783.1| PREDICTED: uncharacterized protein LOC101203806 [Cucumis sativus]
Length = 706
Score = 763 bits (1970), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 440/724 (60%), Positives = 512/724 (70%), Gaps = 65/724 (8%)
Query: 1 MASYRPFPQPPQSSFPPPPPPNQNPSQPPPPPQQQQQQQR-----PNPYSQNWG------ 49
MASYRP+P PQSSF P N S PPP Q + Y+QNWG
Sbjct: 1 MASYRPYP--PQSSFGSAPAQN---SIPPPSAQSASVSSQQRGGATTQYNQNWGTYAGDA 55
Query: 50 ------------GYSNTGGGAQQHYHQPYS--------------YAQPPPPPPPESSYPP 83
Y+N ++HQ Y + P P P PP
Sbjct: 56 SAPPAPSSSYPQNYNNQLHQTSNYHHQQYGPPRTQHPPPPPPPPHQSYPYAPQPPPPPPP 115
Query: 84 PPPPPPPPPPTQQQTQPSMYYSSNQYNQNSMYPPMQPPLPPPPPSSPPPSSSIPPPPPPG 143
PPPPPP P++YY S+QY+Q + P PPP P S PPPP
Sbjct: 116 DSSYPPPPPPPATSQPPNLYYPSSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSP 175
Query: 144 SPPPPPPKDVEGRDTGTSDRDK---------RASKEVSRHDPGHSKQHRPPVPPPGVKKV 194
PP + EG + G +RDK R +E S HD H K PP+PP KK
Sbjct: 176 PPPSASQQKAEGTNMGAHERDKGVPKDPSYGRRDRENSNHDK-HQKHSGPPMPP---KKA 231
Query: 195 NGGSGRVETEEERRIRKKREYEKHRQEEKHRLQMKESQNVVMQKSQMVASGKGGHGSMVG 254
NG SGR+ET++E+R+RKKRE+EK RQ+E+HR +KESQN ++QK+QM+++GK HGS+VG
Sbjct: 232 NGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKV-HGSIVG 290
Query: 255 SRMGDRRAAPLLSGERTENRLKKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYT 314
SRMG+R+A P LSGER ENRLKKPTTFLCKLKFRNELP+ SAQPKLM+L+K+KD +TRYT
Sbjct: 291 SRMGERKATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYT 350
Query: 315 FSSLEKNYKPQLHVEPDLGIPLDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDG 374
+SLEK YKPQL+VEPDLGIPLDLLDLSVYNP SVR PL PEDEELLRDD + TPVKKDG
Sbjct: 351 ITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPSSVRMPLAPEDEELLRDDVLKTPVKKDG 410
Query: 375 -IKRKERPTDKGVSWLVKTQYISPLSMESARQSLTEKQAKELREMKGGRSILENLNDRER 433
IKRKERPTDKGV+WLVKTQYISPLS+ESA+QSLTEKQAKELREMKGGR+ILENLN+RER
Sbjct: 411 GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRER 470
Query: 434 QIKEIEASFEACKLRPIHATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYS 493
QIKEIEASFEACK RPIHATNKNL PVE+LPLLPDF+RYDD FV FD APTADSE ++
Sbjct: 471 QIKEIEASFEACKSRPIHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFN 530
Query: 494 KMDKSVRDAHESRAIMKSYVATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFSWVR 553
K+D+S+RDAHES+AIMKSY+AT SD + PEKFLAYMVPS +ELSKD+YDE EDVS+SWVR
Sbjct: 531 KLDQSIRDAHESQAIMKSYMATSSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVR 590
Query: 554 EYHWDVRGDDADDPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEGRSNDEVEHFPIPSSI 613
EYHWDVRGD+ DDPTTYLVSFDD EARYVPLPTKL LRKKRA EGRS+DEVEHFP P+ +
Sbjct: 591 EYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARV 650
Query: 614 AVRRRANVTAIELKEQGAYSNSKGNSSSSKMGRVDSQEDLERSHNGSRQQDPYQSSGAED 673
VRRR V +E+K+ G YSNSK S D ++ + RSH R QD Q SGAED
Sbjct: 651 TVRRRPTVATLEVKDPGIYSNSKRGS--------DIEDGIGRSHKHDRHQDMDQFSGAED 702
Query: 674 DMYD 677
+M D
Sbjct: 703 EMSD 706
>gi|356526079|ref|XP_003531647.1| PREDICTED: uncharacterized protein LOC100797526 [Glycine max]
Length = 666
Score = 749 bits (1935), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 371/513 (72%), Positives = 431/513 (84%), Gaps = 6/513 (1%)
Query: 166 RASKEVSRHDPGHSKQHRPPVPPPGVKKVNGGS-GRVETEEERRIRKKREYEKHRQEEKH 224
R E S H H KQH+ PP VKK+N G GR ET+EE+R+RKKRE+EK RQEEKH
Sbjct: 159 RREYEHSNHGIAH-KQHKQQQPPLPVKKMNNGPPGRAETDEEKRLRKKREFEKQRQEEKH 217
Query: 225 RLQMKESQNVVMQKSQMVASGKGGHGSMVGSRMGDRRAAPLLSGERTENRLKKPTTFLCK 284
R Q+KESQN V+QK+ +++SGKG HG + GSRMG+RR+ PLL ER ENRLKKPTTFLCK
Sbjct: 218 RQQLKESQNTVLQKTHLLSSGKG-HGMIAGSRMGERRSTPLLGAERVENRLKKPTTFLCK 276
Query: 285 LKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLSVY 344
LKFRNELP+PSAQPKLM+ KKDKD++ +YT +SLEK YKP+L VEPDLGIPLDLLDLSVY
Sbjct: 277 LKFRNELPDPSAQPKLMSFKKDKDQYAKYTITSLEKMYKPKLFVEPDLGIPLDLLDLSVY 336
Query: 345 NPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSMESAR 404
NPP VRPPL PEDEELLRDDE TP+KKDGIKRKERPTDKGV+WLVKTQYISPLSMES +
Sbjct: 337 NPPRVRPPLAPEDEELLRDDEAATPIKKDGIKRKERPTDKGVAWLVKTQYISPLSMESTK 396
Query: 405 QSLTEKQAKELREMKGGRSILENLNDRERQIKEIEASFEACKLRPIHATNKNLQPVEILP 464
QSLTEKQAKELREMKG R IL+NLN RERQI+EI+ASFEA K P+HATNK+L PVE++P
Sbjct: 397 QSLTEKQAKELREMKG-RGILDNLNSRERQIREIQASFEAAKSDPVHATNKDLYPVEVMP 455
Query: 465 LLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEK 524
LLPDF+RYDDQFV A FD APTADSE+Y+KM+KSVRDA ES+A+MKSYVATG D ANPEK
Sbjct: 456 LLPDFDRYDDQFVVAAFDNAPTADSEMYAKMNKSVRDAFESKAVMKSYVATGLDPANPEK 515
Query: 525 FLAYMVPSVNELSKDMYDENEDVSFSWVREYHWDVRGDDADDPTTYLVSFDDDEARYVPL 584
FLAYM P+ ELSKD+YDENEDVS+SW+REYHWDVRGDDADDPTT+LV+FD+ EARY+PL
Sbjct: 516 FLAYMAPAPGELSKDIYDENEDVSYSWIREYHWDVRGDDADDPTTFLVAFDESEARYLPL 575
Query: 585 PTKLNLRKKRAIEGRSNDEVEHFPIPSSIAVRRRANVTAIELKEQGAYSNSKGNSSSSKM 644
PTKL LRKKRA EGRS DEVE P+P+ + VRRR++V AIE K+ G Y++SKGN S ++
Sbjct: 576 PTKLVLRKKRAKEGRSGDEVEQCPVPARVTVRRRSSVAAIERKDSGVYTSSKGN-SFKRV 634
Query: 645 GRVDSQEDLERSHNGSRQQDPYQSSGAEDDMYD 677
G ++ + LE H G+ QD YQSSGAED M D
Sbjct: 635 G-LEMDDGLEDQHRGAPHQDNYQSSGAEDYMSD 666
>gi|359482895|ref|XP_002278075.2| PREDICTED: RNA polymerase II-associated factor 1 homolog [Vitis
vinifera]
Length = 589
Score = 749 bits (1933), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 421/592 (71%), Positives = 486/592 (82%), Gaps = 19/592 (3%)
Query: 102 MYYSSNQYNQNSMYPPMQPPLPPPPPSSPPPSSSIPPPPPPGSPPPPPPKDVEGR----- 156
MYY S+QY+Q S + PMQPP PPPP S PP S PPPPP PPPP +G+
Sbjct: 1 MYYPSSQYSQFS-HQPMQPPPPPPPSSPPPSSLIPPPPPPASPPPPPSSVPPQGQNKEPA 59
Query: 157 -DTGTSDRDKRASKEV---SRHDPGHS------KQHRPPVPPPGVKKVNGGSGRVETEEE 206
D G+ RDK A K++ R +PGHS KQ +PPVPP VKK NG GRVETEEE
Sbjct: 60 PDGGSHGRDKGAPKDLRGAGRREPGHSNQGPSGKQQKPPVPPAPVKKSNGPPGRVETEEE 119
Query: 207 RRIRKKREYEKHRQEEKHRLQMKESQNVVMQKSQMVASGKGGHGSMVG-SRMGDRRAAPL 265
RR+RKKRE+EK RQEEK + Q+KESQN V+QK+QM++SGKG HGS+VG SRMG+RR P
Sbjct: 120 RRLRKKREFEKQRQEEKQKHQLKESQNTVLQKTQMLSSGKG-HGSVVGGSRMGERRTTPF 178
Query: 266 LSGERTENRLKKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQ 325
LSG+R ENRL+KPTTFLCKLKFRNELP+P+AQPKLMALK DKDRFT+YT +SLEK +KPQ
Sbjct: 179 LSGDRIENRLRKPTTFLCKLKFRNELPDPTAQPKLMALKTDKDRFTKYTITSLEKMHKPQ 238
Query: 326 LHVEPDLGIPLDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKG 385
L VEPDLGIPLDLLDLSVYNPPSVR PLDPEDEELLRDDE VTPVKK+GIK+KERPTDKG
Sbjct: 239 LFVEPDLGIPLDLLDLSVYNPPSVRRPLDPEDEELLRDDESVTPVKKEGIKKKERPTDKG 298
Query: 386 VSWLVKTQYISPLSMESARQSLTEKQAKELREMKGGRSILENLNDRERQIKEIEASFEAC 445
VSWLVKTQYISPLS ES +QSLTEKQAKELRE KGGR+ILEN N RER+I+ IEA+F A
Sbjct: 299 VSWLVKTQYISPLSTESTKQSLTEKQAKELRETKGGRNILENFNSRERKIQNIEAAFAAS 358
Query: 446 KLRPIHATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHES 505
K+ P+H+TNK+L+PVEILPLLPDF RYDD FV A+FD APTADSEIYSK+DK+VRD+HES
Sbjct: 359 KITPVHSTNKSLKPVEILPLLPDFARYDDSFVVASFDSAPTADSEIYSKLDKTVRDSHES 418
Query: 506 RAIMKSYVATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFSWVREYHWDVRGDDAD 565
+AI+KSY+ATGSD + PEKFLAYM PS +ELSKD+YDENED S+SWVREYHWDVRGDDAD
Sbjct: 419 QAILKSYMATGSDPSKPEKFLAYMAPSPDELSKDIYDENEDTSYSWVREYHWDVRGDDAD 478
Query: 566 DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEGRSNDEVEHFPIPSSIAVRRRANVTAIE 625
DPTTYLVSF+ +ARY+PLPTKL LRKKRA EGRS+DEVEHFP+PS + VR+R NV AIE
Sbjct: 479 DPTTYLVSFNKTDARYLPLPTKLLLRKKRAKEGRSSDEVEHFPVPSKVTVRQRPNVAAIE 538
Query: 626 LKEQGAYSNSKGNSSSSKMGRVDSQEDLERSHNGSRQQDPYQSSGAEDDMYD 677
LK++ YS+SK SSSK G VD ++ L RS+ G + Q QSSGAED+M D
Sbjct: 539 LKDEEVYSSSKRGVSSSKRG-VDMEDGLGRSYKGVQDQHMDQSSGAEDEMSD 589
>gi|255549826|ref|XP_002515964.1| conserved hypothetical protein [Ricinus communis]
gi|223544869|gb|EEF46384.1| conserved hypothetical protein [Ricinus communis]
Length = 672
Score = 745 bits (1924), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 381/527 (72%), Positives = 438/527 (83%), Gaps = 12/527 (2%)
Query: 156 RDTGTSDRDKRASKEVSRHDPG----HSKQHRPPVPPPGVKKVNGG-SGRVETEEERRIR 210
RD G S R +E+ + G H +QHRPP PPPG KKV+G SGRVETEEERR+R
Sbjct: 149 RDKGMS----RERRELGNSNHGDVSRHEQQHRPPAPPPGGKKVSGPPSGRVETEEERRLR 204
Query: 211 KKREYEKHRQEEKHRLQMKESQNVVMQKSQMVASGKGGHGSMVGSRMGDRRAAPLLSGER 270
KKRE+EKHRQEEKHR Q+KESQN ++QK+QM+++ KG HGS+VGSRMGDRRA PLL GER
Sbjct: 205 KKREFEKHRQEEKHRQQVKESQNSILQKTQMLSAQKG-HGSIVGSRMGDRRAPPLLGGER 263
Query: 271 TENRLKKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEP 330
ENRLKKPTTFLCKLKFRNELP+PSAQPKLM +K+DKDRFT+YT +SLEK YKPQL VEP
Sbjct: 264 IENRLKKPTTFLCKLKFRNELPDPSAQPKLMTMKRDKDRFTKYTITSLEKMYKPQLFVEP 323
Query: 331 DLGIPLDLLDLSVYN--PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSW 388
DLGIPLDLLDLSVYN P S RPPLDPEDEELLRDDE VTPVK++G+K KERPTDKGVSW
Sbjct: 324 DLGIPLDLLDLSVYNRPPASERPPLDPEDEELLRDDEAVTPVKREGLKIKERPTDKGVSW 383
Query: 389 LVKTQYISPLSMESARQSLTEKQAKELREMKGGRSILENLNDRERQIKEIEASFEACKLR 448
LVKTQYIS LS +S +QS+TEKQAKELRE KGG ++L+NLN+RE QIKEIEASFEACKL
Sbjct: 384 LVKTQYISSLSTDSTKQSMTEKQAKELRERKGGHNLLKNLNNRESQIKEIEASFEACKLT 443
Query: 449 PIHATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAI 508
P+HATNKNL+PVEILPL+PDF+RY+D+FV FD APTADSEIYSK+D SVR+A ESRA+
Sbjct: 444 PVHATNKNLKPVEILPLIPDFDRYEDKFVTVAFDNAPTADSEIYSKLDSSVREACESRAV 503
Query: 509 MKSYVATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFSWVREYHWDVRGDDADDPT 568
MK+ VATGSD ANPEKFLAYM PS NELSKDMYDENED+S++WVREYHWDV+GD +DPT
Sbjct: 504 MKACVATGSDPANPEKFLAYMAPSPNELSKDMYDENEDISYNWVREYHWDVQGDGGNDPT 563
Query: 569 TYLVSFDDDEARYVPLPTKLNLRKKRAIEGRSNDEVEHFPIPSSIAVRRRANVTAIELKE 628
T+LVSFD+D ARYVPLPTK+NLRKKRA EGRS DEVEHFP PSS+ VRRR A EL++
Sbjct: 564 TFLVSFDEDAARYVPLPTKINLRKKRAREGRSGDEVEHFPAPSSVTVRRRPTAAARELRD 623
Query: 629 QGAYSNSKGNSSSSKMGRVDSQEDLERSHNGSRQQDPYQSSGAEDDM 675
S+S+GN S+MG D + L R H +R D SS AEDD+
Sbjct: 624 SAGASSSRGNILDSRMGTGDDDDGLGRVHRVARDDDLDHSSEAEDDL 670
>gi|224070975|ref|XP_002303312.1| PAF1 complex component [Populus trichocarpa]
gi|222840744|gb|EEE78291.1| PAF1 complex component [Populus trichocarpa]
Length = 569
Score = 731 bits (1886), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/516 (71%), Positives = 426/516 (82%), Gaps = 5/516 (0%)
Query: 162 DRDKRASKEVSRHD-PGHSK-QHRPPVPPPGVKKVNGGSGRVETEEERRIRKKREYEKHR 219
+RDK S+E HD P H K Q + P VKK NG GRVETEEERR+RKKRE+EK R
Sbjct: 55 ERDKGVSQEKREHDHPNHGKHQQQQSQLPLVVKKANGHPGRVETEEERRLRKKREFEKQR 114
Query: 220 QEEKHRLQMKESQNVVMQKSQMVASGKGGHGSMVGSRMGDRRAAPLLSGERTENRLKKPT 279
QEE R Q+KESQN + K+ +++S KG HGS+VGSR+GDR A PLL GER ENRLKKPT
Sbjct: 115 QEENRRQQLKESQNSALLKNHVISSQKG-HGSIVGSRLGDRVATPLLGGERAENRLKKPT 173
Query: 280 TFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLL 339
TF+CKLKFRNELP+PSAQPKLM LK++KDRFT+YT +SLEK YKPQL+VEPDLGIPLDLL
Sbjct: 174 TFMCKLKFRNELPDPSAQPKLMPLKREKDRFTKYTITSLEKMYKPQLYVEPDLGIPLDLL 233
Query: 340 DLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLS 399
DLSVYNPPSVRP L PEDEELL DDE VTPVK+DGIKRKERPTDKGVSWLVKTQYISPLS
Sbjct: 234 DLSVYNPPSVRPLLAPEDEELLHDDESVTPVKRDGIKRKERPTDKGVSWLVKTQYISPLS 293
Query: 400 MESARQSLTEKQAKELREMKGGRSILENLNDRERQIKEIEASFEACKLRPIHATNKNLQP 459
MESA+ SLTEKQAKELREMKGG +L+NLN RERQIKEI+ASF + KL P+HATNKNL+P
Sbjct: 294 MESAKLSLTEKQAKELREMKGGCKLLDNLNKRERQIKEIQASFASNKLPPVHATNKNLKP 353
Query: 460 VEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVATGSDS 519
VEILPLLPDF+RY D+FV FDGAPTAD+E Y K D S RDA+ES AIMK+ VA+GSD
Sbjct: 354 VEILPLLPDFDRYGDKFVTVAFDGAPTADAENYRKFDPSDRDAYESWAIMKACVASGSDP 413
Query: 520 ANPEKFLAYMVPSVNELSKDMYDENEDVSFSWVREYHWDVRGDDADDPTTYLVSFDDDEA 579
ANPEKFLAY VPS +ELSKDMYDENED+ +SW+REYHWDVRGDD DDP+T+LVSFD+ EA
Sbjct: 414 ANPEKFLAYTVPSPDELSKDMYDENEDILYSWIREYHWDVRGDDVDDPSTFLVSFDEAEA 473
Query: 580 RYVPLPTKLNLRKKRAIEGRSNDEVEHFPIPSSIAVRRRANVTAIELKEQGAYSNSKGNS 639
RY+PLPTK++LRKKRA EGRS DE+EHFPIPS + VR+RA IE ++ GA SNS+GN
Sbjct: 474 RYLPLPTKISLRKKRAREGRSGDEIEHFPIPSRVTVRKRAVAATIEQRDSGAISNSRGN- 532
Query: 640 SSSKMGRVDSQEDLERSHNGSRQQDPYQSSGAEDDM 675
+S+M R + ++ L R + +D + SSGAED+M
Sbjct: 533 -NSRMERFEDEDGLGRLQRVALDEDLHHSSGAEDEM 567
>gi|297839959|ref|XP_002887861.1| hypothetical protein ARALYDRAFT_477298 [Arabidopsis lyrata subsp.
lyrata]
gi|297333702|gb|EFH64120.1| hypothetical protein ARALYDRAFT_477298 [Arabidopsis lyrata subsp.
lyrata]
Length = 576
Score = 671 bits (1732), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 344/520 (66%), Positives = 410/520 (78%), Gaps = 28/520 (5%)
Query: 156 RDTGTSDRDKRASKEVSRHDPGHSKQHRPPVPPPGVKKVNGGSGRVETEEERRIRKKREY 215
R+ G +DRDK AS+ R P SK HR +P + ++ETEEERR+RKK+E
Sbjct: 81 RNQGPNDRDKGASRR-ERAKPDPSKHHRSHLP---------HTKKIETEEERRLRKKKEL 130
Query: 216 EKHRQEEKHRLQMKESQNVVMQKSQMVASGKGGHGSMVGSRMGDRRAAPLLSGERTENRL 275
EK RQEEK R QMK S KSQM GH +++ PLL+ +R ENRL
Sbjct: 131 EKQRQEEKLRQQMKNSH-----KSQM----PKGHTE-------EKKPTPLLTTDRVENRL 174
Query: 276 KKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIP 335
KKPTTF+CKLKFRNELP+PSAQ KLM +K+DKD+FT+YT +SLEK +KP++ VEPDLGIP
Sbjct: 175 KKPTTFICKLKFRNELPDPSAQLKLMTIKRDKDQFTKYTITSLEKLWKPKIFVEPDLGIP 234
Query: 336 LDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYI 395
LDLLDLSVYNPP + PL PEDEELLRDD+ +TP+KKDGI+RKERPTDKGVSWLVKTQYI
Sbjct: 235 LDLLDLSVYNPPKFKAPLAPEDEELLRDDDAITPIKKDGIRRKERPTDKGVSWLVKTQYI 294
Query: 396 SPLSMESARQSLTEKQAKELREMKGGRSILENLNDRERQIKEIEASFEACKLRPIHATNK 455
S ++ ESARQSLTEKQAKELREMKGG +IL NLN+RERQIK+IEASFEACK RP+HATNK
Sbjct: 295 SSINNESARQSLTEKQAKELREMKGGINILHNLNNRERQIKDIEASFEACKSRPVHATNK 354
Query: 456 NLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVAT 515
+LQPVE+LPLLP F+RYD+QFV A FD APTADSE + K+D S+RD HESRAI+KSYV
Sbjct: 355 SLQPVEVLPLLPYFDRYDEQFVVANFDSAPTADSEFFGKLDPSIRDEHESRAILKSYVVA 414
Query: 516 GSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFSWVREYHWDVRGDDADDPTTYLVSFD 575
GSD+ANPEKFLAYMVPS++ELSKDM+DE+ED+S++WVREYHWDVRGDDA+D TYLVSFD
Sbjct: 415 GSDTANPEKFLAYMVPSLDELSKDMHDEDEDISYTWVREYHWDVRGDDANDIGTYLVSFD 474
Query: 576 DDEARYVPLPTKLNLRKKRAIEGRSNDEVEHFPIPSSIAVRRRANVTAIELKEQGAYSNS 635
D A Y+PLPTKLNLRKKR EGRS+DE+EHFP+PS + VRRR+ V+ IE K+ G YS+
Sbjct: 475 DGAASYLPLPTKLNLRKKRPREGRSSDEIEHFPVPSRVTVRRRSTVSVIEHKDSGVYSSR 534
Query: 636 KGNSSSSKMGRVDSQEDLERSHNGSRQQDPYQ-SSGAEDD 674
G +SSSKM R++ +E L RS +QD Q S G EDD
Sbjct: 535 VG-ASSSKMRRLEDEEGLGRSWKHEPEQDANQYSDGNEDD 573
>gi|224054406|ref|XP_002298244.1| PAF1 complex component [Populus trichocarpa]
gi|222845502|gb|EEE83049.1| PAF1 complex component [Populus trichocarpa]
Length = 691
Score = 667 bits (1721), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 337/520 (64%), Positives = 403/520 (77%), Gaps = 47/520 (9%)
Query: 184 PPVPPPGVKKVNGGSGRVETEEERRIRKKREYEKHRQEEKHRLQMKESQNVVMQKSQMVA 243
P +PP G KK NG SGR ET++ERR+RKKRE++K RQEEKHR +KESQN + K+QM++
Sbjct: 189 PQLPPVG-KKANGHSGRAETDQERRLRKKREFDKQRQEEKHRQLLKESQNPALPKNQMMS 247
Query: 244 SGKGGHGSMVGSRMGDRRAAPLLSGERTENRLKKPTTFLCKLKF---------------- 287
S KG HGS+ GSR+GDRRA PLL ERTENRLKKPTTFLCKLKF
Sbjct: 248 SQKG-HGSIAGSRLGDRRATPLLGAERTENRLKKPTTFLCKLKFSVMHVVISCACCMSKI 306
Query: 288 ------------RNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIP 335
RNELP+PSAQPKLM LK+DKDR+T+YT +SLEK YKPQL+VEPDLGIP
Sbjct: 307 FMYCMDVPVCRFRNELPDPSAQPKLMPLKRDKDRYTKYTITSLEKMYKPQLYVEPDLGIP 366
Query: 336 LDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYI 395
LDLLDLSVYNPPS+RPPL PEDEELLRDDE V PVK+DGI+RKERPTDKGVSWL
Sbjct: 367 LDLLDLSVYNPPSIRPPLAPEDEELLRDDETVAPVKRDGIRRKERPTDKGVSWL------ 420
Query: 396 SPLSMESARQSLTEKQAKELREMKGGRSILENLNDRERQIKEIEASFEACKLRPIHATNK 455
SL + QAKELREMKGGR++L+NLN+RERQIKEI+ASFEA KL P+HATNK
Sbjct: 421 ----------SLNKNQAKELREMKGGRNLLDNLNNRERQIKEIQASFEANKLPPVHATNK 470
Query: 456 NLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVAT 515
NL P+E+LPLLPDF+RY+D+FV ATFDGAPT D+E Y+K D S R+A+ESRAIMK+ VAT
Sbjct: 471 NLHPIEVLPLLPDFDRYEDKFVTATFDGAPTVDAENYNKFDSSDREAYESRAIMKACVAT 530
Query: 516 GSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFSWVREYHWDVRGDDADDPTTYLVSFD 575
GSD NPEKFLAYMVPS +E+SKDM+DE+ED+S+SW+R YHWD+RGDDA+DPTT+LVSFD
Sbjct: 531 GSDPTNPEKFLAYMVPSPDEMSKDMHDESEDISYSWIRGYHWDIRGDDANDPTTFLVSFD 590
Query: 576 DDEARYVPLPTKLNLRKKRAIEGRSNDEVEHFPIPSSIAVRRRANVTAIELKEQGAYSNS 635
+ EARY+PLPTK+NL K+RA EGRS DE+EHF +PS + VR+RA IE + GA S+S
Sbjct: 591 EAEARYLPLPTKINLTKRRAREGRSGDEIEHFSVPSRVTVRKRAIAATIEQRNLGAASSS 650
Query: 636 KGNSSSSKMGRVDSQEDLERSHNGSRQQDPYQSSGAEDDM 675
+GN S GR + + L R ++ +D QSS AED+M
Sbjct: 651 RGNDSRMG-GRFEDDDGLGRLQRVAQDEDLEQSSEAEDEM 689
>gi|297743192|emb|CBI36059.3| unnamed protein product [Vitis vinifera]
Length = 420
Score = 652 bits (1681), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 322/421 (76%), Positives = 368/421 (87%), Gaps = 1/421 (0%)
Query: 257 MGDRRAAPLLSGERTENRLKKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFS 316
MG+RR P LSG+R ENRL+KPTTFLCKLKFRNELP+P+AQPKLMALK DKDRFT+YT +
Sbjct: 1 MGERRTTPFLSGDRIENRLRKPTTFLCKLKFRNELPDPTAQPKLMALKTDKDRFTKYTIT 60
Query: 317 SLEKNYKPQLHVEPDLGIPLDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIK 376
SLEK +KPQL VEPDLGIPLDLLDLSVYNPPSVR PLDPEDEELLRDDE VTPVKK+GIK
Sbjct: 61 SLEKMHKPQLFVEPDLGIPLDLLDLSVYNPPSVRRPLDPEDEELLRDDESVTPVKKEGIK 120
Query: 377 RKERPTDKGVSWLVKTQYISPLSMESARQSLTEKQAKELREMKGGRSILENLNDRERQIK 436
+KERPTDKGVSWLVKTQYISPLS ES +QSLTEKQAKELRE KGGR+ILEN N RER+I+
Sbjct: 121 KKERPTDKGVSWLVKTQYISPLSTESTKQSLTEKQAKELRETKGGRNILENFNSRERKIQ 180
Query: 437 EIEASFEACKLRPIHATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMD 496
IEA+F A K+ P+H+TNK+L+PVEILPLLPDF RYDD FV A+FD APTADSEIYSK+D
Sbjct: 181 NIEAAFAASKITPVHSTNKSLKPVEILPLLPDFARYDDSFVVASFDSAPTADSEIYSKLD 240
Query: 497 KSVRDAHESRAIMKSYVATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFSWVREYH 556
K+VRD+HES+AI+KSY+ATGSD + PEKFLAYM PS +ELSKD+YDENED S+SWVREYH
Sbjct: 241 KTVRDSHESQAILKSYMATGSDPSKPEKFLAYMAPSPDELSKDIYDENEDTSYSWVREYH 300
Query: 557 WDVRGDDADDPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEGRSNDEVEHFPIPSSIAVR 616
WDVRGDDADDPTTYLVSF+ +ARY+PLPTKL LRKKRA EGRS+DEVEHFP+PS + VR
Sbjct: 301 WDVRGDDADDPTTYLVSFNKTDARYLPLPTKLLLRKKRAKEGRSSDEVEHFPVPSKVTVR 360
Query: 617 RRANVTAIELKEQGAYSNSKGNSSSSKMGRVDSQEDLERSHNGSRQQDPYQSSGAEDDMY 676
+R NV AIELK++ YS+SK SSSK G VD ++ L RS+ G + Q QSSGAED+M
Sbjct: 361 QRPNVAAIELKDEEVYSSSKRGVSSSKRG-VDMEDGLGRSYKGVQDQHMDQSSGAEDEMS 419
Query: 677 D 677
D
Sbjct: 420 D 420
>gi|218200498|gb|EEC82925.1| hypothetical protein OsI_27878 [Oryza sativa Indica Group]
Length = 633
Score = 553 bits (1426), Expect = e-155, Method: Compositional matrix adjust.
Identities = 296/501 (59%), Positives = 365/501 (72%), Gaps = 27/501 (5%)
Query: 187 PPPGVKKVNGGSGRVETEEERRIRKKREYEKHRQEEKHRLQM-KESQNVVMQKSQMVASG 245
PPP ++ R ETEEERR RKKREYEK R E++ QM ++SQ V+QK+Q V +
Sbjct: 142 PPPREQQSKSALPRAETEEERRARKKREYEKQRAEDRKNQQMMRQSQATVLQKTQQVRAA 201
Query: 246 K------------GGHGSMVGSRMGDRRAAPLLSGERTENRLKKPTTFLCKLKFRNELPE 293
+ G ++ +R AAP + ER ENRLKKPTTFLCK KFRNELP+
Sbjct: 202 QQPQSRHHQQPSGGSRPAVTATRPA---AAP--NAERFENRLKKPTTFLCKHKFRNELPD 256
Query: 294 PSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLSVYNPPSVRPPL 353
PS+Q K + L KDKDR+T+Y +SLEKNY P++ V DLGIPLDLLD+SVYN P V+PP+
Sbjct: 257 PSSQLKWLPLNKDKDRYTKYRITSLEKNYIPKMIVPEDLGIPLDLLDMSVYNTPPVQPPM 316
Query: 354 DPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSMESARQSLTEKQAK 413
PEDEELLRDDEV+TPVKKDGI++KERPTDKG+SWLVKTQYISPLS ++A+ S+TEKQAK
Sbjct: 317 APEDEELLRDDEVLTPVKKDGIRKKERPTDKGMSWLVKTQYISPLSTDAAKMSITEKQAK 376
Query: 414 ELREMKGGR-SILENLNDRERQIKEIEASFEACKLRPIHATNKNLQPVEILPLLPDFERY 472
E RE + GR + LEN+NDRE+QIK IE SF A K RP+H T + ++ +LPLLPDF+RY
Sbjct: 377 ERRESREGRNTFLENINDREKQIKAIEDSFRAAKSRPVHQTKRGMEAEWVLPLLPDFDRY 436
Query: 473 DDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLAYMVPS 532
DDQFV FDG PTADSE Y+K+++S RD ESRA+MKS++ GSD A EKFLAYMVPS
Sbjct: 437 DDQFVMVNFDGDPTADSEQYNKLERSERDECESRAVMKSFLVNGSDPAKQEKFLAYMVPS 496
Query: 533 VNELSKDMYDENEDVSFSWVREYHWDVRGDDADDPTTYLVSFDDDEARYVPLPTKLNLRK 592
+ELSKD+ DE ED+ +SW+REYHW+VRGDD DDPTTYLV+FDDD A+Y+PLPTKL L+K
Sbjct: 497 PHELSKDLDDETEDIQYSWLREYHWEVRGDDKDDPTTYLVTFDDDGAKYLPLPTKLVLQK 556
Query: 593 KRAIEGRSNDEVEHFPIPSSIAVRRRANVTAIELKEQGA-YSNSKGNSSSSKMGRVDSQE 651
K+A EGRS DE+EHFP+PS I V R A+ +E E + + N K SS VD +
Sbjct: 557 KKAKEGRSGDEIEHFPVPSRITVSRTAHGGMMEHGESSSMHENLKRQRSS-----VD--D 609
Query: 652 DLERSHNGSRQQDPYQSSGAE 672
DL SR +D Q SG E
Sbjct: 610 DLYDHPKHSRVEDMDQYSGDE 630
>gi|357144952|ref|XP_003573471.1| PREDICTED: uncharacterized protein LOC100838919 [Brachypodium
distachyon]
Length = 614
Score = 529 bits (1362), Expect = e-147, Method: Compositional matrix adjust.
Identities = 279/479 (58%), Positives = 355/479 (74%), Gaps = 15/479 (3%)
Query: 200 RVETEEERRIRKKREYEKHRQEEKHRLQM-KESQNVVMQKSQMVASGKGG------HGSM 252
RVETEEERR RKKREYEK + EE+ + QM ++SQ ++QK+Q V + + H
Sbjct: 137 RVETEEERRARKKREYEKQKVEERKQQQMMRQSQASILQKTQQVRAAQQQQPQSRHHQPS 196
Query: 253 VGSRMGDRRAAPLLS--GERTENRLKKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRF 310
G+R + P + ER ENRLKKPTTFLCK KFRNELP+PSAQ K + L KDKDR+
Sbjct: 197 GGTRAATTTSRPASAPNTERFENRLKKPTTFLCKHKFRNELPDPSAQLKWLPLNKDKDRY 256
Query: 311 TRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPV 370
T+Y +SLEKNY P++ V DLGIPLDLLD+SVYNPP+V+P + PEDEELLRDDEV+TPV
Sbjct: 257 TKYRITSLEKNYMPKMIVPEDLGIPLDLLDMSVYNPPAVQPRMAPEDEELLRDDEVLTPV 316
Query: 371 KKDGIKRKERPTDKGVSWLVKTQYISPLSMESARQSLTEKQAKELREMKGGRSILENLND 430
K +GI++KERPTDKG+SWLVKTQYISPLS ++A+ S+TEKQAKE RE + GR +LENLND
Sbjct: 317 KPEGIRKKERPTDKGMSWLVKTQYISPLSTDAAKMSMTEKQAKERRESREGRDVLENLND 376
Query: 431 RERQIKEIEASFEACKLRPIHATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSE 490
R+++IK I SF++ K RP+H T ++P +LPL+PDF+RY++ FV FDG PTADSE
Sbjct: 377 RQKRIKAIAESFKSAKSRPVHQTKPGMEPEFVLPLVPDFDRYNNPFVMVNFDGDPTADSE 436
Query: 491 IYSKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS 550
Y+K+++SVRD ESRA+MKS+ +GSD EKFLAYM P+ +EL KD+ DE EDV +S
Sbjct: 437 QYNKLERSVRDECESRALMKSFQVSGSDPTKQEKFLAYMAPAPHELVKDLDDEIEDVQYS 496
Query: 551 WVREYHWDVRGDDADDPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEGRSNDEVEHFPIP 610
W+REYHW+VRGDD +DPTTYLV+FDDD A+Y+PLPTKL L+KK+A EGRS DE+EHFP+P
Sbjct: 497 WIREYHWEVRGDDKNDPTTYLVAFDDDGAKYLPLPTKLVLQKKKAKEGRSGDEIEHFPVP 556
Query: 611 SSIAVRRRANVTAIELKEQGAYSNSKGNSSSSKMGRVDSQEDLERSHNGSRQQDPYQSS 669
S I V R + A+ E+G S+ GN K R +DL+ SR +D Q S
Sbjct: 557 SRITVDRTLHGDAM---ERGESSSMHGN---LKRQRSSLDDDLDEHPKHSRAEDMDQDS 609
>gi|326500378|dbj|BAK06278.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 617
Score = 523 bits (1348), Expect = e-146, Method: Compositional matrix adjust.
Identities = 272/452 (60%), Positives = 342/452 (75%), Gaps = 16/452 (3%)
Query: 200 RVETEEERRIRKKREYEKHRQEEKHRLQM-KESQNVVMQKSQMVASGKGGHGSMVGSRM- 257
RVETEEERR RKKREYEK + E++ + QM ++SQ ++QK+Q V + + SR
Sbjct: 135 RVETEEERRARKKREYEKQKVEDRKQQQMMRQSQATILQKTQQVRAAQQQQQPQPQSRHL 194
Query: 258 ---GDRRAAPLLS-------GERTENRLKKPTTFLCKLKFRNELPEPSAQPKLMALKKDK 307
G R A +S ER ENRLKKPTTFLCK KFRNELP+PSAQ K + L KDK
Sbjct: 195 QPSGGTRVATTVSRPVSAPNTERFENRLKKPTTFLCKHKFRNELPDPSAQLKWLPLNKDK 254
Query: 308 DRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLSVYNPPSVRPPLDPEDEELLRDDEVV 367
DR+T+Y SSLEKNY P++ V DLGIPLDLLD++VYNPP+V+ PL PEDEELLRDDEV+
Sbjct: 255 DRYTKYRISSLEKNYLPKMIVPEDLGIPLDLLDMAVYNPPNVQLPLAPEDEELLRDDEVL 314
Query: 368 TPVKKDGIKRKERPTDKGVSWLVKTQYISPLSMESARQSLTEKQAKELREMKGGR-SILE 426
TPVK +GI++KERPTDKG+SWLVKTQYISPLS ++A+ +TEKQAKE RE + GR ++LE
Sbjct: 315 TPVKPEGIRKKERPTDKGMSWLVKTQYISPLSTDAAKMWITEKQAKERRESREGRDNVLE 374
Query: 427 NLNDRERQIKEIEASFEACKLRPIHATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPT 486
NLNDR+++IK I SF+A K RP+H T + ++P +LPL+PDF+ Y+D FV FDG PT
Sbjct: 375 NLNDRQKRIKAIAESFKAAKSRPVHQTKRGMEPEFVLPLVPDFDSYNDPFVMVNFDGDPT 434
Query: 487 ADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLAYMVPSVNELSKDMYDENED 546
ADSE Y+K+++SV D ES+A+MKS+ +GSD A EKFLAYM P+ +EL KD+ DENED
Sbjct: 435 ADSEQYNKLERSVHDECESQALMKSFQVSGSDPAKQEKFLAYMAPAPHELVKDLDDENED 494
Query: 547 VSFSWVREYHWDVRGDDADDPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEGRSNDEVEH 606
+SW+REYHW+VRGDD DPTTYLVSFDDD+A+Y+PLPTKL L+KK+A EGRS DE+EH
Sbjct: 495 FQYSWIREYHWEVRGDDKQDPTTYLVSFDDDDAKYLPLPTKLVLQKKKAKEGRSGDEIEH 554
Query: 607 FPIPSSIAVRRRANVTAIELKEQGAYSNSKGN 638
FP+PS I V R A+ + E G S+ GN
Sbjct: 555 FPVPSRITVSRTAHGDEM---EHGESSSMPGN 583
>gi|115474865|ref|NP_001061029.1| Os08g0157100 [Oryza sativa Japonica Group]
gi|37805858|dbj|BAC99509.1| proline-rich protein-like [Oryza sativa Japonica Group]
gi|113622998|dbj|BAF22943.1| Os08g0157100 [Oryza sativa Japonica Group]
Length = 451
Score = 520 bits (1339), Expect = e-144, Method: Compositional matrix adjust.
Identities = 274/459 (59%), Positives = 339/459 (73%), Gaps = 26/459 (5%)
Query: 228 MKESQNVVMQKSQMVASGK------------GGHGSMVGSRMGDRRAAPLLSGERTENRL 275
M++SQ V+QK+Q V + + G ++ +R AAP + ER ENRL
Sbjct: 2 MRQSQATVLQKTQQVRAAQQPQSRHHQQPSGGSRPAVTATRPA---AAP--NAERFENRL 56
Query: 276 KKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIP 335
KKPTTFLCK KFRNELP+PS+Q K + L KDKDR+T+Y +SLEKNY P++ V DLGIP
Sbjct: 57 KKPTTFLCKHKFRNELPDPSSQLKWLPLNKDKDRYTKYRITSLEKNYIPKMIVPEDLGIP 116
Query: 336 LDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYI 395
LDLLD+SVYN P V+PP+ PEDEELLRDDEV+TPVKKDGI++KERPTDKG+SWLVKTQYI
Sbjct: 117 LDLLDMSVYNTPPVQPPMAPEDEELLRDDEVLTPVKKDGIRKKERPTDKGMSWLVKTQYI 176
Query: 396 SPLSMESARQSLTEKQAKELREMKGGR-SILENLNDRERQIKEIEASFEACKLRPIHATN 454
SPLS ++A+ S+TEKQAKE RE + GR + LEN+NDRE+QIK IE SF A K RP+H T
Sbjct: 177 SPLSTDAAKMSITEKQAKERRESREGRNTFLENINDREKQIKAIEDSFRAAKSRPVHQTK 236
Query: 455 KNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVA 514
+ ++ +LPLLPDF+RYDDQFV FDG PTADSE Y+K+++S RD ESRA+MKS++
Sbjct: 237 RGMEAEWVLPLLPDFDRYDDQFVMVNFDGDPTADSEQYNKLERSERDECESRAVMKSFLV 296
Query: 515 TGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFSWVREYHWDVRGDDADDPTTYLVSF 574
GSD A EKFLAYMVPS +ELSKD+ DE ED+ +SW+REYHW+VRGDD DDPTTYLV+F
Sbjct: 297 NGSDPAKQEKFLAYMVPSPHELSKDLDDETEDIQYSWLREYHWEVRGDDKDDPTTYLVTF 356
Query: 575 DDDEARYVPLPTKLNLRKKRAIEGRSNDEVEHFPIPSSIAVRRRANVTAIELKEQGA-YS 633
DDD A+Y+PLPTKL L+KK+A EGRS DE+EHFP+PS I V R A+ +E E + +
Sbjct: 357 DDDGAKYLPLPTKLVLQKKKAKEGRSGDEIEHFPVPSRITVSRTAHGGMMEHGESSSMHE 416
Query: 634 NSKGNSSSSKMGRVDSQEDLERSHNGSRQQDPYQSSGAE 672
N K SS VD +DL SR +D Q SG E
Sbjct: 417 NLKRQRSS-----VD--DDLYDHPKHSRVEDMDQYSGDE 448
>gi|242080621|ref|XP_002445079.1| hypothetical protein SORBIDRAFT_07g003820 [Sorghum bicolor]
gi|241941429|gb|EES14574.1| hypothetical protein SORBIDRAFT_07g003820 [Sorghum bicolor]
Length = 640
Score = 508 bits (1309), Expect = e-141, Method: Compositional matrix adjust.
Identities = 281/500 (56%), Positives = 360/500 (72%), Gaps = 36/500 (7%)
Query: 200 RVETEEERRIRKKREYEKHRQEEKHRLQM-KESQNVVMQK-------------------- 238
RVETEEERR RKKRE+EK R E++ + QM +++Q ++QK
Sbjct: 147 RVETEEERRARKKREFEKQRVEDRKQQQMMRQTQAAILQKTQQRAAQQQPQSRHHHHQPP 206
Query: 239 SQMVASGKGGHGSMVGSR---MGDR-RAAPLLSGERTENRLKKPTTFLCKLKFRNELPEP 294
S A G + GSR G R +AP + ER ENRLKKPTTFLCK KFRNELP+P
Sbjct: 207 SGSRAPATGSRAVVTGSRAVATGSRPASAP--NAERFENRLKKPTTFLCKHKFRNELPDP 264
Query: 295 SAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLSVYNPPSVRPPLD 354
SAQ K + L KDKDR+T+Y +SLEKNY P++ V DLGIPLDLLD+SVYNPP V+P +
Sbjct: 265 SAQLKWLPLNKDKDRYTKYRITSLEKNYIPKMIVPDDLGIPLDLLDMSVYNPPDVQPRMA 324
Query: 355 PEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSMESARQSLTEKQAKE 414
PEDEELLRDDEV+TP+K++GI+++ERPTD+GVSWLVKTQYISPLS ++A+ SLTEKQAKE
Sbjct: 325 PEDEELLRDDEVLTPIKQEGIRKRERPTDQGVSWLVKTQYISPLSTDAAKMSLTEKQAKE 384
Query: 415 LREMKGGR-SILENLNDRERQIKEIEASFEACKLRPIHATNKNLQPVEILPLLPDFERYD 473
RE + GR + L+NLNDRE+QIK IE SF+A K RP+H T + +Q ++PLLPDF+RY+
Sbjct: 385 RRESREGRNAFLDNLNDREKQIKAIEESFKAAKSRPVHQTKRGMQAEWVMPLLPDFDRYE 444
Query: 474 DQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLAYMVPSV 533
+ FV FDG PTADSE Y+K+++SVRD ESRA+MKS+ +GSD EKFLAYM P+
Sbjct: 445 EPFVMVNFDGDPTADSEQYNKLERSVRDECESRAVMKSFSVSGSDPTKQEKFLAYMAPAP 504
Query: 534 NELSKDMYDENEDVSFSWVREYHWDVRGDDADDPTTYLVSFDDDE-ARYVPLPTKLNLRK 592
+EL++D+ D+++D+ +SW+REYHWDVRGDD DDPTTYLV+FD +E A+Y+PLPTKL L+K
Sbjct: 505 HELTRDL-DDDDDIQYSWLREYHWDVRGDDKDDPTTYLVTFDKEEGAKYLPLPTKLVLQK 563
Query: 593 KRAIEGRSNDEVEHFPIPSSIAVRRRANVTAIELKEQGAYSNSKGNSSSSKMGRVDSQED 652
K+A EGRS DE+EHFP+PS I V + A+ +E E S G +SK R +D
Sbjct: 564 KKAKEGRSGDEIEHFPVPSRITVSKTAHGGTMERGE------SSGLHVNSKPRRSHVDDD 617
Query: 653 LERSHNGSRQQDPYQSSGAE 672
L+ SR +D Q SG E
Sbjct: 618 LDEHPKRSRVEDIDQYSGEE 637
>gi|413921265|gb|AFW61197.1| hypothetical protein ZEAMMB73_933462 [Zea mays]
Length = 453
Score = 503 bits (1296), Expect = e-140, Method: Compositional matrix adjust.
Identities = 251/421 (59%), Positives = 325/421 (77%), Gaps = 12/421 (2%)
Query: 228 MKESQNVVMQKSQMVASGKGGHGSMV-------GSRMG--DRRAAPLLSGERTENRLKKP 278
M+++Q ++QK+Q V + + S GSR G R A ++ ER ENRLKKP
Sbjct: 2 MRQTQATILQKTQQVRTAQQQPQSRHHHHHPPGGSRAGATGSRPATAVNAERFENRLKKP 61
Query: 279 TTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDL 338
TTFLCK KFRNELP+PSAQ K + L KDKDR+T+Y +SLEKNY P++ V D+GIPLDL
Sbjct: 62 TTFLCKHKFRNELPDPSAQLKWLPLNKDKDRYTKYRITSLEKNYIPKMIVPEDIGIPLDL 121
Query: 339 LDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPL 398
LD+SVYNPP V+PP+ PEDEELLRDD+V+TP+K++G +++ERPTDKGVSWLVKTQYISPL
Sbjct: 122 LDMSVYNPPDVQPPMAPEDEELLRDDQVLTPIKQEGTRKRERPTDKGVSWLVKTQYISPL 181
Query: 399 SMESARQSLTEKQAKELREMKGGR-SILENLNDRERQIKEIEASFEACKLRPIHATNKNL 457
S ++A+ SLTEKQAKE RE +GGR + L+NLNDRE+QIK IE SF A K RP+H T + +
Sbjct: 182 SADAAKTSLTEKQAKERRESRGGRNAFLDNLNDREKQIKAIEESFRAAKSRPVHQTKRGM 241
Query: 458 QPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVATGS 517
Q ++PLLPDF+RY++ FV FDG PTADSE Y+K+++ VRD ESRA+MKS+ +GS
Sbjct: 242 QAEWVMPLLPDFDRYEEPFVMVNFDGDPTADSEQYNKLERPVRDECESRAVMKSFSVSGS 301
Query: 518 DSANPEKFLAYMVPSVNELSKDMYDENEDVSFSWVREYHWDVRGDDADDPTTYLVSFD-D 576
D + EKFLAYM P+ +EL++D+ DEN+D+ +SW+REYHW+VRGDD DDPTTYLV+FD
Sbjct: 302 DPSKQEKFLAYMAPAPHELTRDLDDENDDIQYSWLREYHWEVRGDDKDDPTTYLVTFDKK 361
Query: 577 DEARYVPLPTKLNLRKKRAIEGRSNDEVEHFPIPSSIAVRRRANVTAIELKE-QGAYSNS 635
D A+Y+PLPTKL L+KK+A EGRS DE+EHFP+PS I V + A+ +E E G ++NS
Sbjct: 362 DGAKYLPLPTKLVLQKKKAKEGRSGDEIEHFPVPSRITVSKTAHGGTMERGESSGMHANS 421
Query: 636 K 636
K
Sbjct: 422 K 422
>gi|300681593|emb|CBI75547.1| Paf1 domain containing protein, expressed [Triticum aestivum]
Length = 453
Score = 499 bits (1286), Expect = e-138, Method: Compositional matrix adjust.
Identities = 263/459 (57%), Positives = 331/459 (72%), Gaps = 20/459 (4%)
Query: 228 MKESQNVVMQKSQMVASGKGGHGSMVGSRM----GDRRAAPLLS-------GERTENRLK 276
M++SQ ++QK+Q V + SR G R A S ER ENRLK
Sbjct: 1 MRQSQATILQKTQKVRAATQQQQPQQQSRHLQPSGGARPATTASRPAAAANAERFENRLK 60
Query: 277 KPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPL 336
KPTTFLCK KFRNELP+PSAQ K + L KDKDR+T+Y +SLEKNY P++ V DLGIPL
Sbjct: 61 KPTTFLCKHKFRNELPDPSAQLKWLPLNKDKDRYTKYRITSLEKNYMPKMIVPEDLGIPL 120
Query: 337 DLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYIS 396
DLLD+SVYNPPSV L PED+ELLR+DEV+TPVK +GI++KERPTDKG+SWLVKTQYIS
Sbjct: 121 DLLDMSVYNPPSVHRALAPEDQELLREDEVLTPVKPEGIRKKERPTDKGMSWLVKTQYIS 180
Query: 397 PLSMESARQSLTEKQAKELREMKGGR-SILENLNDRERQIKEIEASFEACKLRPIHATNK 455
PL+ ++A+ S+TEKQAKE RE + GR ++LENLNDR+++IK I SF+A K RP+H T +
Sbjct: 181 PLTTDAAKMSITEKQAKERRESREGRDNVLENLNDRQKRIKAIAESFKAAKSRPVHQTKR 240
Query: 456 NLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVAT 515
++ +LPL+PDF+RY+D FV FDG PTADSE Y+K+++ VRD ES+A+MKS+ +
Sbjct: 241 GMEAEFVLPLVPDFDRYNDPFVMVNFDGDPTADSEQYTKLERPVRDECESQALMKSFQVS 300
Query: 516 GSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFSWVREYHWDVRGDDADDPTTYLVSFD 575
GSD A E+FLAYM P+ +EL KD+ DENED +SW+REYHW+VRGDD DPTTYLVSFD
Sbjct: 301 GSDPAKQERFLAYMAPAPHELVKDLDDENEDFQYSWIREYHWEVRGDDKQDPTTYLVSFD 360
Query: 576 DDEARYVPLPTKLNLRKKRAIEGRSNDEVEHFPIPSSIAVRRRANVTAIELKEQGAYSNS 635
DD+A+Y+PLPTKL L+KK+A EGRS DE+EHFP+PS I V R A+ + E G S+
Sbjct: 361 DDDAKYLPLPTKLVLQKKKAKEGRSGDEIEHFPVPSRITVSRTAHGDEM---EHGESSSM 417
Query: 636 KGNSSSSKMGRVDSQEDLERSHNGSRQQDPYQSSGAEDD 674
GN K R +DLE SR +D Q S EDD
Sbjct: 418 HGN---LKRRRSSVDDDLEEHRRHSRVEDTEQYS--EDD 451
>gi|222639943|gb|EEE68075.1| hypothetical protein OsJ_26103 [Oryza sativa Japonica Group]
Length = 545
Score = 496 bits (1277), Expect = e-137, Method: Compositional matrix adjust.
Identities = 262/469 (55%), Positives = 325/469 (69%), Gaps = 65/469 (13%)
Query: 187 PPPGVKKVNGGSGRVETEEERRIRKKREYEKHRQEEKHRLQM-KESQNVVMQKSQMVASG 245
PPP ++ R ETEEERR RKKREYEK R E++ QM ++SQ V+QK+Q V +
Sbjct: 54 PPPREQQSKSALPRAETEEERRARKKREYEKQRAEDRKNQQMMRQSQATVLQKTQQVRAA 113
Query: 246 K------------GGHGSMVGSRMGDRRAAPLLSGERTENRLKKPTTFLCKLKFRNELPE 293
+ G ++ +R AAP + ER ENRLKKPTTFLCK KFRNELP+
Sbjct: 114 QQPQSRHHQQPSGGSRPAVTATRPA---AAP--NAERFENRLKKPTTFLCKHKFRNELPD 168
Query: 294 PSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLSVYNPPSVRPPL 353
PS+Q K + L KDKDR+T+Y +SLEKNY P++ V DLGIPLDLLD+SVYN P V+PP+
Sbjct: 169 PSSQLKWLPLNKDKDRYTKYRITSLEKNYIPKMIVPEDLGIPLDLLDMSVYNTPPVQPPM 228
Query: 354 DPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSMESARQSLTEKQAK 413
PEDEELLRDDEV+TPVKKDGI++KERPTDKG+SWLVKTQYISPLS ++A+ S+TEKQAK
Sbjct: 229 APEDEELLRDDEVLTPVKKDGIRKKERPTDKGMSWLVKTQYISPLSTDAAKMSITEKQAK 288
Query: 414 ELREMKGGR-SILENLNDRERQIKEIEASFEACKLRPIHATNKNLQPVEILPLLPDFERY 472
E RE + GR + LEN+NDRE+QIK IE SF A K RP+H T + ++ +LPLLPDF+RY
Sbjct: 289 ERRESREGRNTFLENINDREKQIKAIEDSFRAAKSRPVHQTKRGMEAEWVLPLLPDFDRY 348
Query: 473 DDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLAYMVPS 532
DDQFV FDG PTADSE Y+K+++S RD ESRA+MKS++ GSD A EKFLAYMVPS
Sbjct: 349 DDQFVMVNFDGDPTADSEQYNKLERSERDECESRAVMKSFLVNGSDPAKQEKFLAYMVPS 408
Query: 533 VNELSKDMYDENEDVSFSWVREYHWDVRGDDADDPTTYLVSFDDDEARYVPLPTKL---- 588
+ELSKD+ DE ED+ +SW+REYHW+VRGDD DDPTTYLV+FDDD A+Y+PLPTKL
Sbjct: 409 PHELSKDLDDETEDIQYSWLREYHWEVRGDDKDDPTTYLVTFDDDGAKYLPLPTKLVLQK 468
Query: 589 ------------------------------------------NLRKKRA 595
NL+++R+
Sbjct: 469 KKAKEGRSGDEIEHFPVPSRITVSRTAHGGMMEHGESSSMHENLKRQRS 517
>gi|7715600|gb|AAF68118.1|AC010793_13 F20B17.16 [Arabidopsis thaliana]
Length = 593
Score = 422 bits (1084), Expect = e-115, Method: Compositional matrix adjust.
Identities = 271/588 (46%), Positives = 331/588 (56%), Gaps = 156/588 (26%)
Query: 156 RDTGTSDRDKRASKEVSRHD---PGHSKQHRPPVPPPGVKKVNGGSGRVETEEERRIRKK 212
R G +D +K ASK+V R + P SK H P S ++ETEEERR+RKK
Sbjct: 90 RHQGPNDHEKGASKQVGRRERAKPDPSKHHHRSHLP--------HSKKIETEEERRLRKK 141
Query: 213 REYEKHRQEEKHRLQMKESQNVVMQKSQMVASGKGGHGSMVGSRMGDRRAAPLLSGERTE 272
RE EK RQ+EKHR QMK S KSQM GH +++ PLL+ +R E
Sbjct: 142 RELEKQRQDEKHRQQMKNSH-----KSQM----PKGHTE-------EKKPTPLLTTDRVE 185
Query: 273 NRLKKPTTFLCKLKFRNELPEPSAQPKLMALKKDKD------RFTRYTFSSLEKNYKPQL 326
NRLKKPTTF+CKLKFRNELP+PSAQ KLM +K+DKD RFT+YT +SLEK +KP++
Sbjct: 186 NRLKKPTTFICKLKFRNELPDPSAQLKLMTIKRDKDHYFDPTRFTKYTITSLEKLWKPKI 245
Query: 327 HVEPDLGIPLDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGV 386
VEPDLGIPLDLLDLSVYNPP V+ PL PEDEELLRDD+ VTP+KKDGI+RKERPTDKG+
Sbjct: 246 FVEPDLGIPLDLLDLSVYNPPKVKAPLAPEDEELLRDDDAVTPIKKDGIRRKERPTDKGM 305
Query: 387 SWLVKTQYISPLSMESARQSLTEKQAKELREMKGGRSILENLNDRERQIKEIEASFEACK 446
SWLVKTQYIS ++ ESARQSLTEKQAKELREMKGG +IL NLN+RERQIK+IEASFEACK
Sbjct: 306 SWLVKTQYISSINNESARQSLTEKQAKELREMKGGINILHNLNNRERQIKDIEASFEACK 365
Query: 447 LRPIHATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSK------------ 494
RP+HATNKNLQPVE+LPLLP F+RYD+QFV A FDGAP ADSE + K
Sbjct: 366 SRPVHATNKNLQPVEVLPLLPYFDRYDEQFVVANFDGAPIADSEFFGKLDPSIRDAHESR 425
Query: 495 ------------------------MDKSVRDAHESR-----------------------A 507
+D+ +D H+
Sbjct: 426 VSYELPISMNTANPEKFLAYMVPSLDELSKDIHDENEEISYTWVREYLWDVQPNANDPGT 485
Query: 508 IMKSYVATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFSWVREYHWDVRGDDADDP 567
+ S+ G+ S P +P L K E
Sbjct: 486 YLVSF-DNGTASYLP-------LPMRLNLRKKRAREGR---------------------- 515
Query: 568 TTYLVSFDDDEARYVPLPTKLNLRKKRAIEGRSNDEVEHFPIPSSIAVRRRANVTAIELK 627
DE + P+P+++ +R RR+ V+ IE K
Sbjct: 516 -------SSDEIEHFPVPSRVTVR-------------------------RRSTVSVIEHK 543
Query: 628 EQGAYSNSKGNSSSSKMGRVDSQEDLERSHNGSRQQDPYQ-SSGAEDD 674
+ G YS+ G +SSSKM R++ + L RS +QD Q S G EDD
Sbjct: 544 DSGVYSSRVG-ASSSKMRRLEDEGGLGRSWKHEPEQDANQYSDGNEDD 590
>gi|357513505|ref|XP_003627041.1| RNA polymerase II-associated factor-like protein [Medicago
truncatula]
gi|355521063|gb|AET01517.1| RNA polymerase II-associated factor-like protein [Medicago
truncatula]
Length = 428
Score = 385 bits (990), Expect = e-104, Method: Compositional matrix adjust.
Identities = 192/245 (78%), Positives = 216/245 (88%), Gaps = 13/245 (5%)
Query: 200 RVETEEERRIRKKREYEKHRQEEKHRLQ---MKESQNVVMQKSQMVASGKGG--HGSMVG 254
RVETEEE+R+RKK+EYEKHRQEEKHR Q +KESQN V+QK+QMV+SG G HGS+ G
Sbjct: 182 RVETEEEKRMRKKKEYEKHRQEEKHRHQQKQLKESQNSVLQKTQMVSSGGAGKVHGSIAG 241
Query: 255 SRMGDRRAAPLLSGERTENRLKKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYT 314
SRMGD+RA PLL GER ENRLKKPTTFLCKL+FRNELP+P+AQPKLMA KKDKD++ +YT
Sbjct: 242 SRMGDKRATPLLGGERVENRLKKPTTFLCKLRFRNELPDPTAQPKLMAFKKDKDQYAKYT 301
Query: 315 FSSLEKNYKPQLHVEPDLGIPLDLLDLSV--------YNPPSVRPPLDPEDEELLRDDEV 366
+SLEK YKP+L VEPDLGIPLDLLDLSV Y+PPSVRPPL PEDE+LLRDDE
Sbjct: 302 ITSLEKMYKPKLFVEPDLGIPLDLLDLSVYKSDELFLYSPPSVRPPLAPEDEDLLRDDEA 361
Query: 367 VTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSMESARQSLTEKQAKELREMKGGRSILE 426
VTP+KKDGIKRKERPTDKGV+WLVKTQYISPLSMESA+QSLTEKQAKELRE KGGRS+LE
Sbjct: 362 VTPLKKDGIKRKERPTDKGVAWLVKTQYISPLSMESAKQSLTEKQAKELREKKGGRSLLE 421
Query: 427 NLNDR 431
NLN+R
Sbjct: 422 NLNNR 426
>gi|168014146|ref|XP_001759615.1| Paf1 complex protein [Physcomitrella patens subsp. patens]
gi|162689154|gb|EDQ75527.1| Paf1 complex protein [Physcomitrella patens subsp. patens]
Length = 588
Score = 376 bits (966), Expect = e-101, Method: Compositional matrix adjust.
Identities = 222/439 (50%), Positives = 289/439 (65%), Gaps = 47/439 (10%)
Query: 190 GVKKVNGGSGRVE-TEEERRIRKKREY-EKHRQEEKHRLQMKESQNVVMQKSQMVASGKG 247
G +V+ GR E T+EER RKKREY EK RQEEK + K Q + G G
Sbjct: 129 GTSQVDSVPGRREETKEERNERKKREYYEKKRQEEKRQ----------RGKGQPMRPGAG 178
Query: 248 GHG-SMVGSR--MGDRRAAPLLSGERTENRLKKPTTFLCKLKFRNELPEPSAQPKLMALK 304
+ GSR D++ A + G++ ENRLK+P+TFLCK+KFRN+LP+ +AQPKLM +
Sbjct: 179 SSKPANNGSRDAYADKKVASSV-GDKIENRLKRPSTFLCKIKFRNDLPDSTAQPKLMHIN 237
Query: 305 KDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLSVYNPPSVRPPLDPEDEELLRDD 364
DKDR+TRY F+SLEKN+K +LHVEPDLGIPLDLLD+S Y PLDPED LL DD
Sbjct: 238 TDKDRYTRYQFTSLEKNWKHKLHVEPDLGIPLDLLDISNYKKIDASVPLDPEDAALLLDD 297
Query: 365 EVVTPVKKD--GIKRKERPTDKGVSWLVKTQYISPLSMESARQSLTEKQAKELREMKGGR 422
EV VKKD GI++K+RPTDKGV+WLVKTQY+SP++++ A+Q++TEKQAK+LRE + GR
Sbjct: 298 EV--KVKKDSSGIRKKDRPTDKGVAWLVKTQYVSPINLDPAKQAMTEKQAKDLREQREGR 355
Query: 423 SI-LENLNDRERQIKEIEASFEACKLRPIHATNKNLQPVEILPLLPDFERYDDQFVAATF 481
I +E NDR++QI+ IE SF A K P H T L+PVEILPLLPDFER DQ+V F
Sbjct: 356 RIDVEAGNDRQQQIEAIEESFRAAKQLPKHLTKPELEPVEILPLLPDFERMGDQYVHMVF 415
Query: 482 DGAPTADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLAYMVPSVNELSKDMY 541
D P D+ + +D+ R ESRAI+KS+ SD + PEKF+ YMVP +EL ++ Y
Sbjct: 416 DTDPLTDAPGLTDLDQDARLEIESRAIVKSFTIATSDESKPEKFIGYMVPKPDELDRNFY 475
Query: 542 DENEDVSFSWVREYHWDVRGDDADDPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEGRSN 601
D+ E++ ++WVREYHWD PL TKL L+KK+A +G+S
Sbjct: 476 DDEEEIQYTWVREYHWD------------------------PLGTKLVLQKKKAKDGKSK 511
Query: 602 DEVE--HFPIPSSIAVRRR 618
+ E FP+P+S+ V RR
Sbjct: 512 YDTEDRDFPVPASVTVIRR 530
>gi|297839961|ref|XP_002887862.1| hypothetical protein ARALYDRAFT_340237 [Arabidopsis lyrata subsp.
lyrata]
gi|297333703|gb|EFH64121.1| hypothetical protein ARALYDRAFT_340237 [Arabidopsis lyrata subsp.
lyrata]
Length = 330
Score = 348 bits (894), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 171/236 (72%), Positives = 201/236 (85%), Gaps = 3/236 (1%)
Query: 239 SQMVASGK--GGHGS-MVGSRMGDRRAAPLLSGERTENRLKKPTTFLCKLKFRNELPEPS 295
S MV G+ H S M +++ PLL+ +R ENRLKKPTTF+CKLKFRNELP+PS
Sbjct: 45 STMVCIGEMNNSHKSQMPKGHTEEKKPTPLLTTDRVENRLKKPTTFICKLKFRNELPDPS 104
Query: 296 AQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLSVYNPPSVRPPLDP 355
AQ KLM +K+DKD+FT+YT +SLEK +KP++ VEPDLGIPLDLLDLSVYNPP + PL P
Sbjct: 105 AQLKLMTIKRDKDQFTKYTITSLEKLWKPKIFVEPDLGIPLDLLDLSVYNPPKFKAPLAP 164
Query: 356 EDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSMESARQSLTEKQAKEL 415
EDEELLRDD+ +TP+KKDGI+RKERPTDKGVSWLVKTQYIS ++ ESARQSLTEKQAKEL
Sbjct: 165 EDEELLRDDDAITPIKKDGIRRKERPTDKGVSWLVKTQYISSINNESARQSLTEKQAKEL 224
Query: 416 REMKGGRSILENLNDRERQIKEIEASFEACKLRPIHATNKNLQPVEILPLLPDFER 471
REMKGG +IL NLN+RERQIK+IEASFEACK RP+HATNK+LQPVE+LPLLP F+R
Sbjct: 225 REMKGGINILHNLNNRERQIKDIEASFEACKSRPVHATNKSLQPVEVLPLLPYFDR 280
>gi|357513509|ref|XP_003627043.1| hypothetical protein MTR_8g014480 [Medicago truncatula]
gi|355521065|gb|AET01519.1| hypothetical protein MTR_8g014480 [Medicago truncatula]
Length = 215
Score = 281 bits (718), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 142/204 (69%), Positives = 161/204 (78%), Gaps = 4/204 (1%)
Query: 472 YDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLAYMVP 531
YDDQFV A FD APT DSE+Y+K+DKSVRD ESRA+MKSYVAT SD ANPEKFLAYMVP
Sbjct: 14 YDDQFVIAAFDNAPTVDSEVYNKLDKSVRDISESRAVMKSYVATSSDPANPEKFLAYMVP 73
Query: 532 SVNELSKDMYDENEDVSFSWVREYHWDVRGDDADDPTTYLVSFDDDEARYVPLPTKLNLR 591
ELSKD+YDE+EDVS+SWVREYHWDVRGDDADDPTT+LVSFD+ EARY+PLPTKL LR
Sbjct: 74 QPGELSKDIYDEDEDVSYSWVREYHWDVRGDDADDPTTFLVSFDESEARYLPLPTKLVLR 133
Query: 592 KKRAIEGRSNDEVEHFPIPSSIAVRRRANVTAIELKEQGAYSNSKGNSSSSKMGRVDSQE 651
KKRA EGRS DEVE FP PS + VRRR NV AIELK+ Y+ KGNSS + +
Sbjct: 134 KKRAKEGRSGDEVEQFPAPSRVTVRRRPNVAAIELKDSEVYTRLKGNSSKNLD----MDD 189
Query: 652 DLERSHNGSRQQDPYQSSGAEDDM 675
DL+ H + D +QSSGAED+M
Sbjct: 190 DLDDQHGDADHHDNFQSSGAEDEM 213
>gi|226503031|ref|NP_001145766.1| uncharacterized protein LOC100279273 [Zea mays]
gi|219884349|gb|ACL52549.1| unknown [Zea mays]
gi|413921266|gb|AFW61198.1| hypothetical protein ZEAMMB73_933462 [Zea mays]
Length = 202
Score = 232 bits (591), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 111/184 (60%), Positives = 145/184 (78%), Gaps = 2/184 (1%)
Query: 403 ARQSLTEKQAKELREMKGGR-SILENLNDRERQIKEIEASFEACKLRPIHATNKNLQPVE 461
A QSLTEKQAKE RE +GGR + L+NLNDRE+QIK IE SF A K RP+H T + +Q
Sbjct: 12 AFQSLTEKQAKERRESRGGRNAFLDNLNDREKQIKAIEESFRAAKSRPVHQTKRGMQAEW 71
Query: 462 ILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSAN 521
++PLLPDF+RY++ FV FDG PTADSE Y+K+++ VRD ESRA+MKS+ +GSD +
Sbjct: 72 VMPLLPDFDRYEEPFVMVNFDGDPTADSEQYNKLERPVRDECESRAVMKSFSVSGSDPSK 131
Query: 522 PEKFLAYMVPSVNELSKDMYDENEDVSFSWVREYHWDVRGDDADDPTTYLVSFD-DDEAR 580
EKFLAYM P+ +EL++D+ DEN+D+ +SW+REYHW+VRGDD DDPTTYLV+FD D A+
Sbjct: 132 QEKFLAYMAPAPHELTRDLDDENDDIQYSWLREYHWEVRGDDKDDPTTYLVTFDKKDGAK 191
Query: 581 YVPL 584
Y+ +
Sbjct: 192 YLVI 195
>gi|388495832|gb|AFK35982.1| unknown [Lotus japonicus]
Length = 163
Score = 226 bits (576), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 112/157 (71%), Positives = 129/157 (82%), Gaps = 3/157 (1%)
Query: 509 MKSYVATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFSWVREYHWDVRGDDADDPT 568
MKSYVATGSD ANPEKFLAYM P+ ELSKD+YDENEDVS+SWVREYHWDVRGDDADDPT
Sbjct: 1 MKSYVATGSDPANPEKFLAYMAPTPGELSKDIYDENEDVSYSWVREYHWDVRGDDADDPT 60
Query: 569 TYLVSFDDDEARYVPLPTKLNLRKKRAIEGRSNDEVEHFPIPSSIAVRRRANVTAIELKE 628
T+LV+FD+ EARYVPLPTKL LRKKRA EGRS DEVE FPIPS + VRRR++V AIE K+
Sbjct: 61 TFLVAFDELEARYVPLPTKLVLRKKRAAEGRSGDEVEQFPIPSKVTVRRRSSVAAIERKD 120
Query: 629 QGAYSNSKGNSSSSKMGRVDSQED-LERSHNGSRQQD 664
G Y++SKGN SS+ G ++ +D LE H + QD
Sbjct: 121 SGVYTSSKGN--SSRRGDLEMDDDALEHQHRVAAHQD 155
>gi|226497630|ref|NP_001141723.1| uncharacterized protein LOC100273854 [Zea mays]
gi|194705700|gb|ACF86934.1| unknown [Zea mays]
gi|413917369|gb|AFW57301.1| hypothetical protein ZEAMMB73_235592 [Zea mays]
Length = 214
Score = 226 bits (575), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 116/217 (53%), Positives = 155/217 (71%), Gaps = 7/217 (3%)
Query: 457 LQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVATG 516
+Q ++PLLPDF+RY++ FV FDG PTADSE Y+K+++ VRD ESRA+MKS+ +G
Sbjct: 1 MQAEWVMPLLPDFDRYEEPFVMVNFDGDPTADSEQYNKLERPVRDECESRAVMKSFSVSG 60
Query: 517 SDSANPEKFLAYMVPSVNELSKDMYDENEDVSFSWVREYHWDVRGDDADDPTTYLVSFDD 576
SD + EKFLAYM P+ +EL++D+ DEN+D+ +SW+REYHW+VRGDD DDPTTYLV+FD
Sbjct: 61 SDPSKQEKFLAYMAPAPHELARDLDDENDDIQYSWLREYHWEVRGDDKDDPTTYLVTFDK 120
Query: 577 DE-ARYVPLPTKLNLRKKRAIEGRSNDEVEHFPIPSSIAVRRRANVTAIELKEQGAYSNS 635
+E A+Y+PLPTKL L+KK+A EGRS DE+EHFP+PS I V + +V +E E S
Sbjct: 121 EEGAKYLPLPTKLVLQKKKAKEGRSGDEIEHFPVPSLITVNKTGHVGTMERGE------S 174
Query: 636 KGNSSSSKMGRVDSQEDLERSHNGSRQQDPYQSSGAE 672
G +SK R +DL+ SR +D Q SG E
Sbjct: 175 SGMHVNSKPRRPHVDDDLDEHSKRSRVEDIDQYSGEE 211
>gi|384252480|gb|EIE25956.1| hypothetical protein COCSUDRAFT_64915 [Coccomyxa subellipsoidea
C-169]
Length = 385
Score = 185 bits (469), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 128/398 (32%), Positives = 203/398 (51%), Gaps = 44/398 (11%)
Query: 269 ERTENRLKKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHV 328
E+ ENRLK+ T F+ ++FR LPE PK++ D+ +++ ++LE+ + L +
Sbjct: 18 EKPENRLKRNTPFVSNVRFRTSLPEVPCDPKMLVAVLQPDKLSQFNITTLEQEMRQDLIL 77
Query: 329 EPDLGIPLDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDG---IKRKERPTDKG 385
E +GIPL LLD Y P + PL PED L V+ P ++G KR+ER +G
Sbjct: 78 ESGVGIPLSLLDTQRYAVPDIAQPLAPEDAAL-----VMGPTTEEGPGATKRRERLRGEG 132
Query: 386 --VSWLVKTQYISPLSMESARQSLTEKQAKELREMKGGRSILENLNDRERQIKEIEASFE 443
+SWL++T YI+ ME RQ + +++ G + + DR+ QI IEA+FE
Sbjct: 133 TELSWLMRTTYIAN-DMEERRQQRSAEKS-------GVAATEDEAIDRDAQIAAIEATFE 184
Query: 444 ACKLRPIHATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKM--DKSVRD 501
+ P H + +L PVEI+P+LPD +R + +V FDG PTAD + + + D R
Sbjct: 185 EARAPPTHIRDPSLTPVEIMPVLPDLDRLGNIYVRMAFDGEPTADHDRLASLPPDSVKRL 244
Query: 502 AHESRAIMKSYVATGSDSANPEKFLAYMVP-------SVNELSKDMYDENEDVSFSWVRE 554
A AIMKS+ ++F+AY++P + +E S D + +S+ W+RE
Sbjct: 245 AEH--AIMKSFTLNSDQHGKEQRFVAYLLPEEDPVQRNADEPSSSRDDAEDHISYEWIRE 302
Query: 555 YHWDVRGDDADDPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEGRSNDEVEHFPIPSSIA 614
Y++ V D +D +T+ FD + L TKL+L K+ G+ + F PS +
Sbjct: 303 YNYKVHND--EDHSTFWFYFDKGRVTFTDLDTKLSLFKR----GKQA-TTQAFQRPSGVT 355
Query: 615 VRRRANVTAIELKEQGAYSNSKGNSSSSKMGRVDSQED 652
++RR KE+ + + K+ D QED
Sbjct: 356 LKRR--------KERSSEEDETARQRMQKLTADDVQED 385
>gi|307104472|gb|EFN52725.1| hypothetical protein CHLNCDRAFT_138263 [Chlorella variabilis]
Length = 470
Score = 142 bits (357), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 108/354 (30%), Positives = 169/354 (47%), Gaps = 26/354 (7%)
Query: 273 NRLKKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDL 332
NRL++ T FLC ++F+N+LPE PKL+ D+D+ + + LE++ + L E DL
Sbjct: 28 NRLRRDTAFLCNIRFKNDLPEIPCDPKLLLPPYDRDQLAAFKLTELERDLRKDLMFEDDL 87
Query: 333 GIPLDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKT 392
GIP+ ++ Y+ P V PPL P+D L+ DE P K + R +SWL++T
Sbjct: 88 GIPIHPWNIEQYSVPEVVPPLHPDDAALVESDEEAEPKKA-----RLRGDAAELSWLLRT 142
Query: 393 QYISPLSMESARQSLTEKQAKELREMKGGRSILENLNDRERQIKEIEASFEACKLRPIHA 452
+YI+ + ++ + EN + RE +I++IEA F K H
Sbjct: 143 KYIA--AEAGLKRGTGAPAKAAAAAATAAAAGAENEDPREERIRQIEAQFVGAKAPLAHG 200
Query: 453 TNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSY 512
+ ++ P+E++P+ PD V ATFD P AD ++ +M R +KS+
Sbjct: 201 KDPSVVPLEVMPVFPDSLMEGRSCVLATFDNDPLADVDVVERMPAEQRARVPQAMQLKSF 260
Query: 513 V-ATGSDSANPEKFLAYMVP-SVNELSKDMYDENEDV-------SFSWVREYHWDVRGDD 563
ATG +F+A +VP + E D + + + WVREY R D+
Sbjct: 261 KPATGP------QFVALLVPKKLPEAGDDELSASGGIPGPQLAREYHWVREYQPHPRLDE 314
Query: 564 ADDPTTYLVSF-DDDEARYVPLPTKLNLRKKRAIEGRSNDEVEHFPIPSSIAVR 616
TTYL F D ++ L T+L LRK++ D+ E F P + VR
Sbjct: 315 K--STTYLFRFAGDGTVQFHDLNTRLELRKRKRQAAAGEDD-EGFMQPEMVVVR 365
>gi|358333056|dbj|GAA51648.1| RNA polymerase II-associated factor 1 homolog [Clonorchis sinensis]
Length = 850
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 123/390 (31%), Positives = 186/390 (47%), Gaps = 57/390 (14%)
Query: 238 KSQMVASGKG---GHGSMVGSRMGDRRAAPLLSGERTENRLK----------KPTTFLCK 284
K ++ SG G GH + + L RT+ +++ + + LC+
Sbjct: 282 KMRIEKSGIGVWLGHSCSLSTSQCSLTMTLLQKNGRTDEKVEDYPQDRAKELRVESLLCR 341
Query: 285 LKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLSVY 344
LK+RN LPE PK +A + RF +Y +SLE+NYK +L E D+G+ +DL+D V+
Sbjct: 342 LKYRNTLPELPFDPKFLAYPLEPSRFLQYVATSLERNYKHELLTETDVGVDVDLIDPDVF 401
Query: 345 NP-PSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSMESA 403
SV+ L P+DE LL DD TP + RK R K VSWL +T+YIS
Sbjct: 402 KIDKSVK--LHPDDERLLEDD---TPAAINA--RKSRH-QKSVSWLRRTEYISTEMYNRW 453
Query: 404 RQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFEACKLRPI--HATNK 455
+S + E K G S+ +LN DRE QI IEA+F A K +PI H +
Sbjct: 454 NKS-------DKVESKLGYSVKRHLNEEIVYRDRESQIAAIEATFRAAK-KPITKHYSKP 505
Query: 456 NLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVAT 515
N+ VE+LP+LPDF + FD P ++ ++ + V ++A+++ V
Sbjct: 506 NVHAVEVLPVLPDFTLWRYPCAQVIFDDDPARKNKSNAEQKEEV-----NQAMIRGMVDE 560
Query: 516 GSDSANPEKFLAYMVP--SVNELSK-DM-----YDENEDVSFSWVREYHWDVRGDD-ADD 566
D F+AY +P S +L + D+ Y E + REY+W+V+ A+
Sbjct: 561 SGD-----HFVAYFLPTESTKQLRRLDIENHVPYTEGAAYEYELAREYNWNVKNKTMANY 615
Query: 567 PTTYLVSFDDDEARYVPLPTKLNLRKKRAI 596
Y F D Y L T++ L K+R +
Sbjct: 616 EENYFFVFRKDGVFYNELETRVRLSKRRKL 645
>gi|321456015|gb|EFX67133.1| hypothetical protein DAPPUDRAFT_302221 [Daphnia pulex]
Length = 507
Score = 140 bits (352), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 104/329 (31%), Positives = 163/329 (49%), Gaps = 31/329 (9%)
Query: 279 TTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDL 338
T +C++K+ N LP+ PK + D +RF +Y +SLEK+YK +L E DLGI +DL
Sbjct: 29 TDLVCRVKYCNTLPDIPFDPKFITYPFDSNRFIQYNATSLEKSYKYELLAEYDLGITIDL 88
Query: 339 LDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPL 398
++ Y LDP DE LL +D P +D R++ K VSWL +T+YIS
Sbjct: 89 INPEAYAKEK-NAHLDPADERLLEEDTHHAP--QDA--RRKAQHAKNVSWLRRTEYISTE 143
Query: 399 SMESARQSLTEKQAKELREMKGGRSILENLNDRERQIKEIEASFEACKLRPI--HATNKN 456
+ S+ + +AK MK D E QIK IE +F K +PI H+T K
Sbjct: 144 ATRFQPMSIEKVEAKVGYSMKKALKEENAYMDHESQIKAIEKTFADAK-KPILEHSTKKG 202
Query: 457 LQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDK--SVRDAHESRAIMKSYVA 514
+ VE+LP++PDF+ + FD P + +D+ +++ S+A+++
Sbjct: 203 VTAVEVLPVIPDFKLWKYPCAQVIFDANP-------APVDRPSNIQVEEMSQAMIR---- 251
Query: 515 TGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDADD 566
G E+F+AY +P+ L+K D ++ V + R+YHW+V+ +
Sbjct: 252 -GVMDETGEQFVAYFLPTSETLTKRARDASQSVDYDDNDEYDYKMARQYHWNVKSKASKG 310
Query: 567 -PTTYLVSFDDDEARYVPLPTKLNLRKKR 594
Y + F DD Y L T++ L K+R
Sbjct: 311 YEENYFIVFRDDAVYYNELETRVRLTKRR 339
>gi|156389561|ref|XP_001635059.1| predicted protein [Nematostella vectensis]
gi|156222149|gb|EDO42996.1| predicted protein [Nematostella vectensis]
Length = 351
Score = 139 bits (350), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 114/339 (33%), Positives = 177/339 (52%), Gaps = 48/339 (14%)
Query: 281 FLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLD 340
F+ ++K+ N LP+ K ++ + +RF +Y +SLEKNYK +L + DLG+ +DL+D
Sbjct: 7 FVTRIKYNNNLPDIPFDAKFISYPFEGNRFVQYKPTSLEKNYKHELLTDIDLGVNIDLID 66
Query: 341 LSVYNPPSVRP--PLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPL 398
+Y +V P LDP DE+LL +D VV P KK + R +K VSWL KT+YIS
Sbjct: 67 PQMY---AVDPNATLDPADEKLLEEDAVVMPDKK-----RTRHHNKNVSWLRKTEYIST- 117
Query: 399 SMESARQSLTEKQAKELREMKGGRSILENL------NDRERQIKEIEASFEACKLRPIHA 452
E R + + E+ E K G SI + DR+ Q+ IE +FEA K +PIH
Sbjct: 118 --EYNRA----QTSNEMAETKVGFSIKKKFKGTDLYKDRDSQLAAIENTFEAAK-KPIHH 170
Query: 453 --TNKNLQPVEILPLLPDFERYDDQFVAATFDGAPT-ADSEIYSKMDKSVRDAHESRAIM 509
+ NL PVE+LP+ PDF+ + FD P D + +++D+ S+A++
Sbjct: 171 HPSKPNLTPVEVLPVFPDFQMWGYPCAHVVFDTDPAPRDRKGPAQVDEM------SQAMI 224
Query: 510 KSYVATGSDSANPEKFLAYMVP---SVNELSKDM-----YDENEDVSFSWVREYHWDVRG 561
+ V A+ ++F+AY +P ++N+ +D Y + E+ + REY+W+V+
Sbjct: 225 RGMV-----DASGDQFVAYFLPDKETLNKRKRDHEEMADYQDEEEYVYKMAREYNWNVKN 279
Query: 562 DDADD-PTTYLVSFDDDE-ARYVPLPTKLNLRKKRAIEG 598
Y F D E Y L T++ L K+RA G
Sbjct: 280 KSTKGYEENYFFVFRDGEGVFYNELVTRVRLSKRRARGG 318
>gi|391347358|ref|XP_003747931.1| PREDICTED: RNA polymerase II-associated factor 1 homolog isoform 2
[Metaseiulus occidentalis]
Length = 525
Score = 139 bits (349), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 105/338 (31%), Positives = 167/338 (49%), Gaps = 44/338 (13%)
Query: 276 KKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIP 335
+K + LC++K+ N LP+ PK +A D RF Y +SLE NYK L E DLG+
Sbjct: 21 EKRSDLLCRVKYSNTLPDIPFDPKFLAFPFDNKRFVSYKSTSLETNYKHDLLTEHDLGVT 80
Query: 336 LDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYI 395
+DL+D + Y PP+ PL +DE+LL ++E +TP ++ R + + W+ KT+YI
Sbjct: 81 IDLIDPNTYAPPAEDVPLRADDEKLL-EEEPITPAD----SKRSRLHNMVIPWVKKTEYI 135
Query: 396 SPLSMESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFEACKLRP 449
+ E +R T E K G ++ + LN DR+ QI+ I +FE + +P
Sbjct: 136 ---ATEFSRYGQTGVNT----ETKVGYNVKKMLNERSLYMDRDSQIEAINKTFEEAQ-KP 187
Query: 450 I--HATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDA--HES 505
I H + N+ PVE+LP+ PDFE + F FD P + + ++VR S
Sbjct: 188 ITQHYSKPNVTPVEVLPVYPDFELWKYPFAQVLFDNEP-------APISQNVRAGILEMS 240
Query: 506 RAIMKSYVATGSDSANPEKFLAYMVPSVNELSKDM--------YDENEDVSFSWVREYHW 557
+A+++ G + E+F+AY +P+ L K Y + D + REY+W
Sbjct: 241 QAMIR-----GVMDESGEQFVAYFMPTEETLRKRQDDAENKVEYQDEVDYEYKMTREYNW 295
Query: 558 DVRGDDADD-PTTYLVSFDDDEARYVPLPTKLNLRKKR 594
+V+ + Y + D Y L T++ L K+R
Sbjct: 296 NVKNKTSSGYEENYFFAVRDGCIYYNELETRVRLSKRR 333
>gi|391347356|ref|XP_003747930.1| PREDICTED: RNA polymerase II-associated factor 1 homolog isoform 1
[Metaseiulus occidentalis]
Length = 520
Score = 138 bits (348), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 105/336 (31%), Positives = 161/336 (47%), Gaps = 45/336 (13%)
Query: 276 KKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIP 335
+K + LC++K+ N LP+ PK +A D RF Y +SLE NYK L E DLG+
Sbjct: 21 EKRSDLLCRVKYSNTLPDIPFDPKFLAFPFDNKRFVSYKSTSLETNYKHDLLTEHDLGVT 80
Query: 336 LDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYI 395
+DL+D + Y PP+ PL +DE+LL ++E +TP ++ R + + W+ KT+YI
Sbjct: 81 IDLIDPNTYAPPAEDVPLRADDEKLL-EEEPITPAD----SKRSRLHNMVIPWVKKTEYI 135
Query: 396 SPLSMESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFEACKLRP 449
+ E +R T E K G ++ + LN DR+ QI+ I +FE + +P
Sbjct: 136 ---ATEFSRYGQTGVNT----ETKVGYNVKKMLNERSLYMDRDSQIEAINKTFEEAQ-KP 187
Query: 450 I--HATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRA 507
I H + N+ PVE+LP+ PDFE + F FD P S+ A E
Sbjct: 188 ITQHYSKPNVTPVEVLPVYPDFELWKYPFAQVLFDNEPAPISQ-----------AEE--- 233
Query: 508 IMKSYVATGSDSANPEKFLAYMVPSVNELSKDM--------YDENEDVSFSWVREYHWDV 559
M + G + E+F+AY +P+ L K Y + D + REY+W+V
Sbjct: 234 -MSQAMIRGVMDESGEQFVAYFMPTEETLRKRQDDAENKVEYQDEVDYEYKMTREYNWNV 292
Query: 560 RGDDADD-PTTYLVSFDDDEARYVPLPTKLNLRKKR 594
+ + Y + D Y L T++ L K+R
Sbjct: 293 KNKTSSGYEENYFFAVRDGCIYYNELETRVRLSKRR 328
>gi|145341024|ref|XP_001415616.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144575839|gb|ABO93908.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 433
Score = 136 bits (343), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 114/358 (31%), Positives = 179/358 (50%), Gaps = 22/358 (6%)
Query: 273 NRLKKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDL 332
RLK+ T FLC +F+N+LP K++ + D+ T Y+ SL + + DL
Sbjct: 65 TRLKRETAFLCHAQFKNDLPAVPIDWKMLQTRVDRRALTEYSHLSLYDGLRKRGDFSEDL 124
Query: 333 GIPLDLLDLSVYNPPSVRPPLDPEDEELL-----RDDEVVTPVKKDGIKRKERPTDKGVS 387
GIPLD + Y P+ R +DPED EL R+ + + G + RP
Sbjct: 125 GIPLDPALMRAYRVPTRRVAMDPEDHELTMSSAEREGKRNGAIGTSG-RAVSRPDASDAL 183
Query: 388 WLVKTQYISPLSMESARQSLTEKQAKELR-EMKGGRSILE-NLNDRERQIKEIEASFEAC 445
WL+ TQYIS S++ AR L+EK+ K R E +GG + E L+ Q+++IEASF A
Sbjct: 184 WLMNTQYISAGSIK-ARTGLSEKEMKRRRLEAQGGAAEPEVELS----QVEQIEASFAAA 238
Query: 446 KLRPIHATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTAD-SEIYSKMDKSVRDAHE 504
K P+H TNK + VE+LP+LPDFER +V FD D + + K +++ +A
Sbjct: 239 KRPPVHPTNKKAKVVEVLPVLPDFERIAMDYVRLNFDEKQETDVASLSGKSAETIENALN 298
Query: 505 SRAIMKSYVATGSDSANPEKFLAYMVPS--VNELSKDMYD-ENEDVSFSWVREYHWDVRG 561
++K + + ++ E+FL+ M+P + +D + E + VR+Y + +
Sbjct: 299 C-GVVKPF-SIVNERNQTERFLSLMLPQDPDAAMEEDFLALDGEPRQYDHVRDYVYKIHQ 356
Query: 562 DDAD-DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEGRSNDEVEHFPIPSSIAVRRR 618
DD + F D YV L TKL L KR+ + + + + PS + ++RR
Sbjct: 357 DDPNLGGGNMCFFFKKDRVTYVDLHTKLTL-SKRSKHSKGKEATDSWK-PSEVTLQRR 412
>gi|390369554|ref|XP_785518.3| PREDICTED: RNA polymerase II-associated factor 1 homolog
[Strongylocentrotus purpuratus]
Length = 555
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 110/356 (30%), Positives = 171/356 (48%), Gaps = 48/356 (13%)
Query: 265 LLSGERTENRLKKPTT-------FLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSS 317
+ SGER + K ++ +C++K+ N LP+ PK + + +RF +Y +S
Sbjct: 5 IQSGERKDRERKSRSSTGPARSDIVCRVKYANSLPDIPFDPKFITYPFEANRFVQYNATS 64
Query: 318 LEKNYKPQLHVEPDLGIPLDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKR 377
LE+NYK +L E DLG+ +DL++ Y P LDP DE+LL ++E+ TP G +
Sbjct: 65 LERNYKHELLAEHDLGVTIDLINPDTYRIPEEHVELDPADEKLL-EEEIATP----GDSK 119
Query: 378 KERPTDKGVSWLVKTQYISPLSMESARQSLTEKQAKELREMKGGRSIL-----ENL-NDR 431
+ R K VSWL KT+YIS S + +K+ E K G + ENL DR
Sbjct: 120 RSRQHSKTVSWLRKTEYISSEYNRS-------QHSKDKIETKVGHHLKKQFTEENLYKDR 172
Query: 432 ERQIKEIEASFEACKLRPI--HATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADS 489
QI I+ +FE K +PI H + ++ PVEILP+ PDFE + FD P
Sbjct: 173 SNQISAIQKTFEDAK-KPIETHYSKPHVTPVEILPVFPDFETWIHPCAQVIFDSDPAL-- 229
Query: 490 EIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLAYMVPS-------VNELSKDM-Y 541
DKS +H+ M + G + E+F+ Y +P+ + +D+ Y
Sbjct: 230 -----RDKS---SHQQLEEMSQAMIRGMVDESEEQFVGYFLPTEETCRKRKRDFEEDIDY 281
Query: 542 DENEDVSFSWVREYHWDVRGDDAD--DPTTYLVSFDDDEARYVPLPTKLNLRKKRA 595
E E+ + REY+W+V+ + + + V Y L T++ L K+R
Sbjct: 282 VEGEEYEYKLTREYNWNVKNKASRGYEENYFFVFRKGKGVFYNELETRVRLSKRRV 337
>gi|380011783|ref|XP_003689974.1| PREDICTED: RNA polymerase II-associated factor 1 homolog [Apis
florea]
Length = 548
Score = 135 bits (340), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 113/348 (32%), Positives = 177/348 (50%), Gaps = 44/348 (12%)
Query: 270 RTENRLKKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVE 329
RT +K + +C++K+ N LP+ PK + + RF +Y +SLE+NYK ++ E
Sbjct: 17 RTARPAEKRSELICRVKYCNTLPDIPFDPKFITYPFESTRFIQYNPTSLERNYKYEVLTE 76
Query: 330 PDLGIPLDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWL 389
DLG+ +DL++ Y V LDP DE+LL +D V+TP +D ++ R + VSWL
Sbjct: 77 HDLGVEIDLINKDTY-AGDVNSQLDPADEKLLEED-VLTP--QDS--KRSRHHARSVSWL 130
Query: 390 VKTQYISPLSMESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE 443
+T+YI S Q+ + Q + E K G SI +NL DRE QIK IE +FE
Sbjct: 131 RRTEYI------STEQTRFQPQTVDKVEAKVGYSIKKNLKEETLYMDRESQIKAIEKTFE 184
Query: 444 ACKLRPI--HATNKNLQPVEILPLLPDFERYDDQFVAATFDG--APTADSEIYSKMDKSV 499
K +PI H + N+ PVEILP+ PDF+ + FD APT S + +++++
Sbjct: 185 DNK-KPIERHYSKPNVVPVEILPVFPDFKLWKYPCAQVIFDSDPAPTGRS-VPAQIEEM- 241
Query: 500 RDAHESRAIMKSYVATGSDSANPEKFLAYMVPSVNELSKDM--------YDENEDVSFSW 551
S+A+++ G + E+F+AY +P L K Y ++E+ +
Sbjct: 242 -----SQAMIR-----GVMDESGEQFVAYFLPLEETLEKRRRDFTAGIDYADDEEYEYKM 291
Query: 552 VREYHWDVRGDDADD-PTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
REY+W+V+ + Y + D Y L T++ L K+R G
Sbjct: 292 AREYNWNVKSKASKGYEENYFLVIRQDGVYYNELETRVRLSKRRQKVG 339
>gi|391347419|ref|XP_003747960.1| PREDICTED: RNA polymerase II-associated factor 1 homolog
[Metaseiulus occidentalis]
Length = 397
Score = 135 bits (340), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 103/331 (31%), Positives = 157/331 (47%), Gaps = 45/331 (13%)
Query: 281 FLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLD 340
LC++K+ N LP PK +A D RF Y +SLE NYK L E DLG+ +DL+D
Sbjct: 26 LLCRVKYSNTLPAIPFDPKFLAFPFDNKRFLSYKSTSLETNYKHDLLTEHDLGVTIDLID 85
Query: 341 LSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
+ Y PP+ PL ++EELL ++E +TP + ++ R + + W+ KT+YI+
Sbjct: 86 PNTYAPPAEDVPLHADNEELL-EEEPITP----AVSKRSRLHNMVIPWVKKTEYIAT--- 137
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFEACKLRPI--HA 452
E +R T E K G ++ + LN DR+ QI+ I +FE + +PI H
Sbjct: 138 EFSRYGQTGVNT----ETKVGYNVKKMLNERSLYMDRDSQIEAINKTFEEAQ-KPITQHY 192
Query: 453 TNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSY 512
N+ PVE+LP+ PDFE + F F+ P S+ A E M
Sbjct: 193 AKPNVTPVEVLPVYPDFELWKYPFAQVLFENEPAPISQ-----------AEE----MSQA 237
Query: 513 VATGSDSANPEKFLAYMVPSVNELSKDM--------YDENEDVSFSWVREYHWDVRGDDA 564
+ G + E+F+AY +P+ L K Y + D + REY+W+V+ +
Sbjct: 238 MIRGVMDESGEQFVAYFMPTEETLRKRQNDAENKVEYQDEVDYEYEMTREYNWNVKNKTS 297
Query: 565 DD-PTTYLVSFDDDEARYVPLPTKLNLRKKR 594
Y + D Y L T++ L K+R
Sbjct: 298 SGYDENYFFAVRDGCIYYNELETRVRLSKRR 328
>gi|242010931|ref|XP_002426211.1| RNA polymerase-associated protein, putative [Pediculus humanus
corporis]
gi|212510262|gb|EEB13473.1| RNA polymerase-associated protein, putative [Pediculus humanus
corporis]
Length = 472
Score = 135 bits (340), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 106/336 (31%), Positives = 171/336 (50%), Gaps = 40/336 (11%)
Query: 276 KKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIP 335
+K T +C++K+ N LP+ PK + D RF Y ++LE+NYK ++ E DLG+
Sbjct: 22 EKRTELVCRIKYCNTLPDIPFDPKFIQYPFDAARFISYKPTTLERNYKYEVLTEHDLGVT 81
Query: 336 LDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYI 395
+DL++ YN + LDP DE+LL +D V+TP +D I+ + + VSWL +T+YI
Sbjct: 82 IDLINKDAYN-AEIGATLDPADEKLLEED-VLTP--QDSIRSRHHA--RSVSWLRRTEYI 135
Query: 396 SPLSMESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFEACKLRP 449
S Q+ + Q + E K G SI +N DRE QI+ IE +FE L+P
Sbjct: 136 ------STEQTRFQPQTMDKVEAKVGYSIKKNFKEDPLIMDRESQIRAIEKTFEDS-LKP 188
Query: 450 I--HATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRA 507
I H + N+ P+EILP+ PDF+ + FD P + + +SV E
Sbjct: 189 IEKHYSKPNVYPIEILPVFPDFDGWKYPCAQVIFDSDP-------APVGRSVPAQLEE-- 239
Query: 508 IMKSYVATGSDSANPEKFLAYMVPSVNELSK---DM-----YDENEDVSFSWVREYHWDV 559
M + G + E+F+AY +P+ L+K D+ Y+++++ + REY+W+V
Sbjct: 240 -MSQAMIRGVMDESGEQFVAYFLPTEETLAKRKADIQNNIDYEDDQEYEYKMAREYNWNV 298
Query: 560 RGDDADD-PTTYLVSFDDDEARYVPLPTKLNLRKKR 594
+ + Y + + Y L T++ L K+R
Sbjct: 299 KNKASKGYEENYFLVVREGGMYYNELETRVRLSKRR 334
>gi|328787903|ref|XP_624998.3| PREDICTED: RNA polymerase II-associated factor 1 homolog isoform 1
[Apis mellifera]
Length = 549
Score = 135 bits (340), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 114/348 (32%), Positives = 178/348 (51%), Gaps = 44/348 (12%)
Query: 270 RTENRLKKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVE 329
RT +K + +C++K+ N LP+ PK + + RF +Y +SLE+NYK ++ E
Sbjct: 17 RTARPAEKRSELICRVKYCNTLPDIPFDPKFITYPFESTRFIQYNPTSLERNYKYEVLTE 76
Query: 330 PDLGIPLDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWL 389
DLG+ +DL++ Y V LDP DE+LL +D V+TP +D ++ R + VSWL
Sbjct: 77 HDLGVEIDLINKDTY-AGDVNSQLDPADEKLLEED-VLTP--QDS--KRSRHHARSVSWL 130
Query: 390 VKTQYISPLSMESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE 443
+T+YI S Q+ + Q + E K G SI +NL DRE QIK IE +FE
Sbjct: 131 RRTEYI------STEQTRFQPQTVDKVEAKVGYSIKKNLKEETLYMDRESQIKAIEKTFE 184
Query: 444 ACKLRPI--HATNKNLQPVEILPLLPDFERYDDQFVAATFDG--APTADSEIYSKMDKSV 499
K +PI H + N+ PVEILP+ PDF+ + FD APT S + +++++
Sbjct: 185 DNK-KPIERHYSKPNVVPVEILPVFPDFKLWKYPCAQVIFDSDPAPTGRS-VPAQIEEM- 241
Query: 500 RDAHESRAIMKSYVATGSDSANPEKFLAYMVPSVNELSK---DM-----YDENEDVSFSW 551
S+A+++ G + E+F+AY +P L K D Y ++E+ +
Sbjct: 242 -----SQAMIR-----GVMDESGEQFVAYFLPLEETLEKRRRDFTAGIDYADDEEYEYKM 291
Query: 552 VREYHWDVRGDDADD-PTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
REY+W+V+ + Y + D Y L T++ L K+R G
Sbjct: 292 AREYNWNVKSKASKGYEENYFLVIRQDGVYYNELETRVRLSKRRQKVG 339
>gi|308799519|ref|XP_003074540.1| proline-rich protein-like (ISS) [Ostreococcus tauri]
gi|116000711|emb|CAL50391.1| proline-rich protein-like (ISS) [Ostreococcus tauri]
Length = 592
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 117/388 (30%), Positives = 186/388 (47%), Gaps = 26/388 (6%)
Query: 246 KGGHGSMVGSRMGDRRAAPLLSGERTENRLKKPTTFLCKLKFRNELPEPSAQPKLMALKK 305
K G+ G R R AP ++ ENRL++ T FLC ++F+N+LP K++A +
Sbjct: 27 KTKRGARSGERGASARDAPSVT---EENRLRRDTAFLCHVQFKNDLPPVPVDWKMLATRV 83
Query: 306 DKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLSVYNPPSVRPPLDPEDEELL---- 361
D+ Y+ SL + + DLGIP+D + Y P+ R LDPED ELL
Sbjct: 84 DRGALAEYSPLSLYDGMRRRGDFSEDLGIPIDPALMRAYRVPTERGVLDPEDHELLLNAA 143
Query: 362 -RDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSMESARQSLTEKQAKELREMKG 420
R+ + T G + RP WL+ TQYIS ++ AR L+E KE++ K
Sbjct: 144 ERETKRNTANGASG-RAASRPDASDALWLMNTQYISAGKIK-ARTGLSE---KEMKRRKL 198
Query: 421 GRSILENLNDRERQIKE-IEASFEACKLRPIHATNKNLQPVEILPLLPDFERYDDQFVAA 479
S E + + E +E ++A+FEA K P+H T +++ VEILP+ PDFER ++V
Sbjct: 199 AASGAEGMPEDELSFEEQVDATFEAAKKPPVHPTMNDMEVVEILPVFPDFERMALEYVRL 258
Query: 480 TFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLAYMVPSVNELSKD 539
FD D + + ++ ++K + A +D E+FL+ M+P E +
Sbjct: 259 NFDENQAKDVPSLTGKSTEIVESALINGVVKPF-AIENDYGETERFLSLMLPQDPEAAAA 317
Query: 540 ---MYDENEDVSFSWVREYHWDV-----RGDDADDPTTYLVSFDDDEARYVPLPTKLNLR 591
+ + E ++ +R+Y R D A F D+ +V L TKL L
Sbjct: 318 PNFLEVDGEPRAYDHIRDYVXXXXXXXQRDDPAQGGGNVCFFFKKDKVTFVDLQTKLTL- 376
Query: 592 KKRAIEGRSNDEVEHFPIPSSIAVRRRA 619
KR+ + D ++ PS + ++RR+
Sbjct: 377 SKRSKHSKGKDALDW--KPSRVTLKRRS 402
>gi|383850476|ref|XP_003700821.1| PREDICTED: RNA polymerase II-associated factor 1 homolog [Megachile
rotundata]
Length = 554
Score = 134 bits (336), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 107/338 (31%), Positives = 176/338 (52%), Gaps = 44/338 (13%)
Query: 276 KKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIP 335
+K + +C++K+ N LP+ PK + + RF +Y +SLE+NYK ++ E DLG+
Sbjct: 23 EKRSELICRVKYCNTLPDIPFDPKFITYPFESTRFIQYNPTSLERNYKYEVLTEHDLGVE 82
Query: 336 LDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYI 395
+DL++ Y V LDP DE+LL +D V+TP +D ++ R + VSWL +T+YI
Sbjct: 83 IDLINKDTY-AGDVNAQLDPADEKLLEED-VLTP--QDS--KRSRHHARSVSWLRRTEYI 136
Query: 396 SPLSMESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFEACKLRP 449
S Q+ + Q + E K G SI +N DRE QIK IE +FE K +
Sbjct: 137 ------STEQTRFQPQTVDKVEAKVGYSIKKNFKEETLYMDRESQIKAIEKTFEDNK-KS 189
Query: 450 I--HATNKNLQPVEILPLLPDFERYDDQFVAATFDG--APTADSEIYSKMDKSVRDAHES 505
I H + N+ PVEILP+ PDF+ + FD APT S + +++++ S
Sbjct: 190 IERHYSKANVVPVEILPVFPDFKLWKYPCAQVIFDSDPAPTGRS-VPAQIEEM------S 242
Query: 506 RAIMKSYVATGSDSANPEKFLAYMVP---SVNELSKDM-----YDENEDVSFSWVREYHW 557
+A+++ G + E+F+AY +P ++++ +D Y ++E+ + REY+W
Sbjct: 243 QAMIR-----GVMDESGEQFVAYFLPLEETLDKRRRDFTAGIDYADDEEYEYKMAREYNW 297
Query: 558 DVRGDDADD-PTTYLVSFDDDEARYVPLPTKLNLRKKR 594
+V+ + Y + D Y L T++ L K+R
Sbjct: 298 NVKSKASKGYEENYFLVIRQDGVYYNELETRVRLSKRR 335
>gi|307213815|gb|EFN89122.1| RNA polymerase II-associated factor 1-like protein [Harpegnathos
saltator]
Length = 533
Score = 133 bits (335), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 117/372 (31%), Positives = 184/372 (49%), Gaps = 50/372 (13%)
Query: 276 KKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIP 335
+K + +C++K+ N LP+ PK + + RF RY +SLE+NYK ++ E DLG+
Sbjct: 21 EKRSELICRVKYCNTLPDIPFDPKFITYPFEPSRFIRYNPTSLERNYKYEVLTEHDLGVE 80
Query: 336 LDLLDLSVY-NPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQY 394
+DL++ Y P+ + LDP DE+LL +D V+TP +D ++ R + VSWL +T+Y
Sbjct: 81 IDLINKDTYAGDPNAQ--LDPADEKLLEED-VLTP--QDS--KRSRHHARSVSWLRRTEY 133
Query: 395 ISPLSMESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFEACKLR 448
IS + QS+ + +A K G SI +N DRE QIK IE +FE K +
Sbjct: 134 ISTETTRFQPQSVDKVEA------KVGYSIKKNFKEETLYMDRESQIKAIEKTFEDNK-K 186
Query: 449 PI--HATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESR 506
PI H + N+ PVEILP+ PDF+ + FD P +SV E
Sbjct: 187 PIERHYSKPNVIPVEILPVYPDFKLWKYPCAQVIFDSDPAPSG-------RSVPAQIEE- 238
Query: 507 AIMKSYVATGSDSANPEKFLAYMVP---SVNELSKDM-----YDENEDVSFSWVREYHWD 558
M + G + E+F+AY +P ++++ +D Y + E+ + REY+W+
Sbjct: 239 --MSQAMIRGVMDESGEQFVAYFLPLEETLDKRRRDFTAGIDYADEEEYEYKMAREYNWN 296
Query: 559 VRGDDADD-PTTYLVSFDDDEARYVPLPTKLNLRKKRAIEGRSNDEVEHFPIPSSIAVRR 617
V+ + Y + D Y L T++ L K+R G+ P + + VR
Sbjct: 297 VKSKASKGYEENYFLVIRQDGVYYNELETRVRLSKRRQKAGQQ-------PNNTRLIVRH 349
Query: 618 RANVTAIELKEQ 629
R + AIE + Q
Sbjct: 350 RP-LNAIEFRMQ 360
>gi|226479828|emb|CAX73210.1| Putative DNA helicase INO80 [Schistosoma japonicum]
Length = 497
Score = 133 bits (335), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 115/372 (30%), Positives = 176/372 (47%), Gaps = 47/372 (12%)
Query: 277 KPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPL 336
K + LC+LK++N LPE PK + + RF +Y +SLE+NYK +L E D+G+ +
Sbjct: 25 KTESLLCRLKYQNNLPELPFDPKFLVYPLEPSRFLQYVATSLERNYKHELLTETDVGVEV 84
Query: 337 DLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYIS 396
DL+D V+ + L P+DE LL +DE T V RK R K VSWL KT+YIS
Sbjct: 85 DLIDPDVFRIDK-KATLHPDDERLL-EDEAPTFVN----ARKSRH-QKSVSWLRKTEYIS 137
Query: 397 PLSMESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFEACKLRPI 450
+S E E + G ++ +LN DRE QI IE +F A + +PI
Sbjct: 138 TELYNRWNKS-------EKVESRLGYNVKRHLNEEIVYRDRESQINAIEETFNAAR-KPI 189
Query: 451 HATNK--NLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAI 508
H N+ +E+LP+LPDF + FD P+ ++ ++ + V ++A+
Sbjct: 190 HKHYSKPNVHALEVLPVLPDFTLWRYPCAQVIFDDDPSRKNKTTTEQKEEV-----NQAM 244
Query: 509 MKSYVATGSDSANPEKFLAYMVPSVNELSKDM----------YDENEDVSFSWVREYHWD 558
++ V D F+AY +P+ E +K + Y E + REY+W+
Sbjct: 245 IRGMVDESGDH-----FVAYFLPT--EQTKQLRRLDAENRLPYTEGAAYEYELTREYNWN 297
Query: 559 VRGDD-ADDPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEGRSNDEVEHFPIPSSIAVRR 617
V+ A+ Y F D Y L T++ L K+R + +S V P + +
Sbjct: 298 VKNKTMANYEENYFFCFRKDGVYYNELETRVRLSKRRKL-NQSGTNVGVLQAPKTRLIVH 356
Query: 618 RANVTAIELKEQ 629
+ T ELK Q
Sbjct: 357 HRDFTDEELKAQ 368
>gi|221109423|ref|XP_002154497.1| PREDICTED: RNA polymerase II-associated factor 1 homolog [Hydra
magnipapillata]
Length = 427
Score = 133 bits (334), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 113/331 (34%), Positives = 166/331 (50%), Gaps = 43/331 (12%)
Query: 281 FLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLD 340
F+C +K+ N LP+ K ++ + +RF Y +SLE+NYK +L EPDLG+ +DL++
Sbjct: 22 FVCNVKYTNVLPDIPFDGKFISYPFEANRFIEYKPTSLERNYKNELLTEPDLGVSIDLIN 81
Query: 341 LSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y+ + LDPEDE+LL +D + + KR ++ T K VSWL KT+YIS
Sbjct: 82 PETYD-IDMNVELDPEDEKLLEEDHIPQADQ----KRHQQHT-KTVSWLRKTEYIST--- 132
Query: 401 ESARQSLTEKQAKELREMKGGRSI-------LENLNDRERQIKEIEASFEACKLRPI--H 451
E R + + E+ E K G S ++ DRE QI+ IEASFEA K +PI H
Sbjct: 133 EYNRF----QTSSEMAETKVGYSSKKQSKLDIDLYKDRESQIEAIEASFEAAK-KPITCH 187
Query: 452 ATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKS 511
TN ++PVEILP+ PDF+ + FD PT K+V E M
Sbjct: 188 DTNARIKPVEILPVFPDFQFWHMPCAHVIFDTEPTPRG-------KTVPATVEQ---MSL 237
Query: 512 YVATGSDSANPEKFLAYMVPSVNELSKDM--YDEN------EDVSFSWVREYHWDVRGDD 563
+ G + + ++F+AY +PS L K D N E+ + REY+W+V+
Sbjct: 238 GMIRGMEDESGDQFVAYFLPSDLTLKKKKQELDNNEQPNPDEEYEYKLAREYNWNVKNKA 297
Query: 564 ADD-PTTYLVSFDDDE-ARYVPLPTKLNLRK 592
Y F ++E Y L T++ L K
Sbjct: 298 IKGFEENYFFVFRENEGVFYNELQTRVRLNK 328
>gi|332022364|gb|EGI62676.1| RNA polymerase II-associated factor 1-like protein [Acromyrmex
echinatior]
Length = 581
Score = 132 bits (331), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 112/344 (32%), Positives = 178/344 (51%), Gaps = 46/344 (13%)
Query: 276 KKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIP 335
+K + +C++K+ N LP+ PK +A + RF +Y +SLE+NYK ++ E DLG+
Sbjct: 23 EKRSELICRVKYCNTLPDIPFDPKFIAYPFEPTRFIQYNPTSLERNYKYEVLTEHDLGVE 82
Query: 336 LDLLDLSVY-NPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQY 394
+DL++ Y P+ + LDP DE+LL +D V+TP +D ++ R + VSWL +T+Y
Sbjct: 83 IDLINKDTYAGDPNAQ--LDPADEKLLEED-VLTP--QDS--KRSRHHARSVSWLRRTEY 135
Query: 395 ISPLSMESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFEACKLR 448
I S ES R + Q + E K G SI +N DRE QIK IE +FE K +
Sbjct: 136 I---STESTR---FQPQTADKVEAKVGYSIKKNFKEETLYMDRESQIKAIEKTFEDNK-K 188
Query: 449 PI--HATNKNLQPVEILPLLPDFERYDDQFVAATFDG--APTADSEIYSKMDKSVRDAHE 504
PI H + N+ P+EILP+ PDF+ + FD APT S + +++++
Sbjct: 189 PIERHYSKPNVVPIEILPVYPDFKLWKYPCAQVIFDSDPAPTGRS-VPAQIEEM------ 241
Query: 505 SRAIMKSYVATGSDSANPEKFLAYMVPSVNELSKDM--------YDENEDVSFSWVREYH 556
S+A+++ G + E+F+AY +P L K Y + E+ + REY+
Sbjct: 242 SQAMIR-----GVMDESGEQFVAYFLPLEETLEKRRRDFTAGIDYADEEEYEYKMAREYN 296
Query: 557 WDVRGDDADD-PTTYLVSFDDDEARYVPLPTKLNLRKKRAIEGR 599
W+V+ + Y + D Y L T++ L K+R G+
Sbjct: 297 WNVKSKASKGYEENYFLVIRQDGVYYNELETRVRLSKRRQKVGQ 340
>gi|307199453|gb|EFN80066.1| RNA polymerase II-associated factor 1-like protein [Harpegnathos
saltator]
Length = 530
Score = 131 bits (330), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 116/372 (31%), Positives = 184/372 (49%), Gaps = 50/372 (13%)
Query: 276 KKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIP 335
+K + +C++K+ N LP+ PK + + RF RY +SLE+NYK ++ E DLG+
Sbjct: 23 EKRSELICRVKYCNTLPDIPFDPKFITYPFEPSRFIRYNPTSLERNYKYEVLTEHDLGVE 82
Query: 336 LDLLDLSVY-NPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQY 394
+DL++ Y P+ + LDP DE+LL +D V+TP +D ++ R + VSWL +T+Y
Sbjct: 83 IDLINKDTYAGDPNAQ--LDPADEKLLEED-VLTP--QDS--KRSRHHARSVSWLRRTEY 135
Query: 395 ISPLSMESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFEACKLR 448
IS + QS+ + +A K G SI +N DRE QIK IE +FE K +
Sbjct: 136 ISTETTRFQPQSVDKVEA------KVGYSIKKNFKEETLYMDRESQIKAIEKTFEDNK-K 188
Query: 449 PI--HATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESR 506
PI H + N+ PVEILP+ PDF+ + FD P +SV E
Sbjct: 189 PIERHYSKPNVIPVEILPIYPDFKLWKYPCAQVIFDSDPAPSG-------RSVPAQIEE- 240
Query: 507 AIMKSYVATGSDSANPEKFLAYMVP---SVNELSKDM-----YDENEDVSFSWVREYHWD 558
M + G + E+F+AY +P ++++ +D Y + E+ + REY+W+
Sbjct: 241 --MSQAMIRGVMDESGEQFVAYFLPLEETLDKRRRDFTAGIDYADEEEYEYKMAREYNWN 298
Query: 559 VRGDDADD-PTTYLVSFDDDEARYVPLPTKLNLRKKRAIEGRSNDEVEHFPIPSSIAVRR 617
V+ + Y + D Y L T++ L K+R G+ P + + VR
Sbjct: 299 VKSKASKGYEENYFLVMRQDGVYYNELETRVRLSKRRQKAGQQ-------PNNTRLIVRH 351
Query: 618 RANVTAIELKEQ 629
R + A+E + Q
Sbjct: 352 RP-LNALEFRMQ 362
>gi|307172366|gb|EFN63837.1| RNA polymerase II-associated factor 1-like protein [Camponotus
floridanus]
Length = 428
Score = 131 bits (330), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 112/350 (32%), Positives = 180/350 (51%), Gaps = 44/350 (12%)
Query: 269 ERTENRLKKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHV 328
+RT +K + +C++K+ N LP+ PK + + RF +Y +SLE+NYK ++
Sbjct: 16 KRTTRPAEKRSELICRVKYCNTLPDIPFDPKFITYPFEPTRFIQYNPTSLERNYKYEVLT 75
Query: 329 EPDLGIPLDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSW 388
E DLG+ +DL++ Y + LDP DE+LL +D V+TP +D ++ R K VSW
Sbjct: 76 EHDLGVEIDLINKDTY-AGDINAQLDPADEKLLEED-VLTP--QDS--KRSRHHAKSVSW 129
Query: 389 LVKTQYISPLSMESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASF 442
L +T+YIS ES R + Q + E K G SI +N DR+ QIK IE +F
Sbjct: 130 LRRTEYIST---ESTR---FQPQTADKVEAKVGYSIKKNFKEETLYMDRDSQIKAIEKTF 183
Query: 443 EACKLRPI--HATNKNLQPVEILPLLPDFERYDDQFVAATFDG--APTADSEIYSKMDKS 498
E K +PI H + N+ P+E+LP+ PDF+ + FD APT S + +++++
Sbjct: 184 EDNK-KPIERHYSKANVVPIEVLPVYPDFKLWKYPCAQVIFDSDPAPTGRS-VPAQIEEM 241
Query: 499 VRDAHESRAIMKSYVATGSDSANPEKFLAYMVPSVNELSK---DM-----YDENEDVSFS 550
S+A+++ G + E+F+AY +P L K D Y ++E+ +
Sbjct: 242 ------SQAMIR-----GVMDESGEQFVAYFLPLEETLEKRRRDFTAGIDYADDEEYEYK 290
Query: 551 WVREYHWDVRGDDADD-PTTYLVSFDDDEARYVPLPTKLNLRKKRAIEGR 599
REY+W+V+ + Y + D Y L T++ L K+R G+
Sbjct: 291 MAREYNWNVKSKASKGYEENYFLVIRQDGVYYNELETRVRLSKRRQKVGQ 340
>gi|195454001|ref|XP_002074040.1| GK12821 [Drosophila willistoni]
gi|194170125|gb|EDW85026.1| GK12821 [Drosophila willistoni]
Length = 543
Score = 131 bits (330), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 104/349 (29%), Positives = 170/349 (48%), Gaps = 38/349 (10%)
Query: 267 SGERTENRLKKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQL 326
+ +R + + ++ + +C++K+ N LP+ K + D RF +Y +SLE+N+K +
Sbjct: 10 ADKRPQRQTERKSEIICRVKYGNSLPDIPFDLKFLQYPFDSHRFVQYNPTSLERNFKYDV 69
Query: 327 HVEPDLGIPLDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGV 386
E DLG+ +DL++ +Y SV LDP DE+LL ++E +TP D + + R + V
Sbjct: 70 LSEHDLGVTVDLINRELYQADSV-SMLDPADEKLL-EEETLTPT--DSV--RSRQHSRTV 123
Query: 387 SWLVKTQYISPLSMESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEA 440
SWL K++YI S Q+ + Q E E K G ++ ++L DR+ QIK IE
Sbjct: 124 SWLRKSEYI------STEQTRFQPQNLENIEAKVGYNVKKSLREETLYLDRDAQIKAIEK 177
Query: 441 SFEACKLRPI-HATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSV 499
+F K H + N+ PVE+LP+ PDF + FD P + M K+V
Sbjct: 178 TFVDTKTEITKHYSKPNVVPVEVLPIFPDFTNWKYPCAQVIFDSDP-------APMGKNV 230
Query: 500 RDAHESRAIMKSYVATGSDSANPEKFLAYMVPSVNELSK--------DMYDENEDVSFSW 551
E M + G + E+F+AY +P+ L K ++Y ++E+ F
Sbjct: 231 VAQLEE---MSQAMIRGVMDESGEQFVAYFLPTEQTLEKRRTDLVAGELYKDDEEYEFKI 287
Query: 552 VREYHWDVRGDDADD-PTTYLVSFDDDEARYVPLPTKLNLRKKRAIEGR 599
REY+W+V+ + Y D Y L T++ L K+R G+
Sbjct: 288 AREYNWNVKTKASKGYEENYFFVMRQDGIYYNELETRVRLNKRRVKIGQ 336
>gi|156550709|ref|XP_001605818.1| PREDICTED: RNA polymerase II-associated factor 1 homolog [Nasonia
vitripennis]
Length = 583
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 110/347 (31%), Positives = 171/347 (49%), Gaps = 45/347 (12%)
Query: 271 TENRLKKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEP 330
TE RL+ +CK+K+ N LP+ + PK ++ + RF +Y +SLE+NYK ++ E
Sbjct: 19 TEKRLE----LICKVKYCNTLPDIAFDPKFISYPFESTRFIQYNPTSLERNYKYEVLTEH 74
Query: 331 DLGIPLDLLDLSVYNP-PSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWL 389
DLG+P+DL++ Y P + +DP DE+LL +D V+T ++D + K K VSWL
Sbjct: 75 DLGVPIDLINRDTYACDPKIPYEMDPLDEKLLEED-VIT--QQDSKRSKHHA--KSVSWL 129
Query: 390 VKTQYISPLSMESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE 443
+T+YIS Q+ ++K E K G +I ++ DR+ QI+ IE +FE
Sbjct: 130 RRTEYISTEQTRFQPQTTSDKV-----EAKVGHNIKKHFKEETLYMDRDSQIRAIEKTFE 184
Query: 444 ACKLRPI--HATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRD 501
K +PI H + N+ PVE+ P+ PDF+ + FD P + M SV
Sbjct: 185 DNK-KPIETHYSKPNVTPVEVFPIYPDFKIWKYPCAQVIFDSDP-------APMGLSVPA 236
Query: 502 AHESRAIMKSYVATGSDSANPEKFLAYMVPSVNELSK---------DMYDENEDVSFSWV 552
E M + G + E+F+AY +P L K D DE+E +
Sbjct: 237 QIEE---MSQAMIRGVMDESGEQFVAYFLPLEETLEKRKRDFCSGIDYADEDE-YEYKMA 292
Query: 553 REYHWDVRGDDADD-PTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
REY+W+V+ + Y + +D Y L T++ L K+R G
Sbjct: 293 REYNWNVKSKASKGYEENYFLVMREDGVYYNELETRVRLSKRRQKVG 339
>gi|350403064|ref|XP_003486690.1| PREDICTED: RNA polymerase II-associated factor 1 homolog [Bombus
impatiens]
Length = 562
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 110/348 (31%), Positives = 174/348 (50%), Gaps = 44/348 (12%)
Query: 270 RTENRLKKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVE 329
RT +K + +C++K+ N LP+ PK + RF +Y +SLE+NYK ++ E
Sbjct: 17 RTTRPAEKRSELICRVKYCNTLPDIPFDPKFITYPFASTRFIQYNPTSLERNYKYEVLTE 76
Query: 330 PDLGIPLDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWL 389
DLG+ +DL++ Y V LDP DE+LL +D V+TP +D ++ R + VSWL
Sbjct: 77 HDLGVEIDLINKDTY-AGDVNAQLDPADEKLLEED-VLTP--QDS--KRSRHHARSVSWL 130
Query: 390 VKTQYISPLSMESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE 443
+T+YI S Q+ + Q + E K G SI +N DRE QIK IE +FE
Sbjct: 131 RRTEYI------STEQTRFQPQTVDKVEAKVGYSIKKNFKEETLYMDRESQIKAIEKTFE 184
Query: 444 ACKLRPI--HATNKNLQPVEILPLLPDFERYDDQFVAATFDG--APTADSEIYSKMDKSV 499
K + I H + N+ PVEILP+ PDF+ + FD APT S + +++++
Sbjct: 185 DNK-KSIERHYSKPNVVPVEILPVFPDFKLWKYPCAQVIFDSDPAPTGRS-VPAQIEEM- 241
Query: 500 RDAHESRAIMKSYVATGSDSANPEKFLAYMVPSVNELSKDM--------YDENEDVSFSW 551
S+A+++ G + E+F+AY +P L K Y ++++ +
Sbjct: 242 -----SQAMIR-----GVMDESGEQFVAYFLPLEETLEKRRRDFAAGIDYADDDEYEYKM 291
Query: 552 VREYHWDVRGDDADD-PTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
REY+W+V+ + Y + D Y L T++ L K+R G
Sbjct: 292 AREYNWNVKSKASKGYEENYFLVIRQDGVYYNELETRVRLSKRRQKVG 339
>gi|340728168|ref|XP_003402400.1| PREDICTED: RNA polymerase II-associated factor 1 homolog [Bombus
terrestris]
Length = 562
Score = 130 bits (326), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 109/348 (31%), Positives = 174/348 (50%), Gaps = 44/348 (12%)
Query: 270 RTENRLKKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVE 329
RT +K + +C++K+ N LP+ PK + RF +Y +SLE+NYK ++ E
Sbjct: 17 RTTRPAEKRSELICRVKYCNTLPDIPFDPKFITYPFASTRFIQYNPTSLERNYKYEVLTE 76
Query: 330 PDLGIPLDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWL 389
DLG+ +DL++ Y + LDP DE+LL +D V+TP +D ++ R + VSWL
Sbjct: 77 HDLGVEIDLINKDTY-AGDLNAQLDPADEKLLEED-VLTP--QDS--KRSRHHARSVSWL 130
Query: 390 VKTQYISPLSMESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE 443
+T+YI S Q+ + Q + E K G SI +N DRE QIK IE +FE
Sbjct: 131 RRTEYI------STEQTRFQPQTVDKVEAKVGYSIKKNFKEETLYMDRESQIKAIEKTFE 184
Query: 444 ACKLRPI--HATNKNLQPVEILPLLPDFERYDDQFVAATFDG--APTADSEIYSKMDKSV 499
K + I H + N+ PVEILP+ PDF+ + FD APT S + +++++
Sbjct: 185 DNK-KSIERHYSKPNVVPVEILPVFPDFKLWKYPCAQVIFDSDPAPTGRS-VPAQIEEM- 241
Query: 500 RDAHESRAIMKSYVATGSDSANPEKFLAYMVPSVNELSKDM--------YDENEDVSFSW 551
S+A+++ G + E+F+AY +P L K Y ++++ +
Sbjct: 242 -----SQAMIR-----GVMDESGEQFVAYFLPLEETLEKRRRDFAAGIDYADDDEYEYKM 291
Query: 552 VREYHWDVRGDDADD-PTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
REY+W+V+ + Y + D Y L T++ L K+R G
Sbjct: 292 AREYNWNVKSKASKGYEENYFLVIRQDGVYYNELETRVRLSKRRQKVG 339
>gi|443724677|gb|ELU12581.1| hypothetical protein CAPTEDRAFT_117955 [Capitella teleta]
Length = 443
Score = 129 bits (325), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 113/368 (30%), Positives = 186/368 (50%), Gaps = 46/368 (12%)
Query: 281 FLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLD 340
+C++K+ N LP+ PK + + +RF +Y +SLE++YK +L E DLG+ +DL++
Sbjct: 7 LVCRVKYNNSLPDIPFDPKFLIYPFEANRFVQYNPTSLERSYKHELLTENDLGVQIDLIN 66
Query: 341 LSVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLS 399
Y P+ + LD DE LL +D + TP +D ++ R ++ VSWL KT+YIS
Sbjct: 67 PDAYKIDPNGKKYLDEADERLLEED-LSTP--QDS--KRSRHHNRNVSWLRKTEYIST-- 119
Query: 400 MESARQSLTEKQAKELREMKGGRSIL-----ENL-NDRERQIKEIEASFEACKLRPI--H 451
E R + Q + E K G SI E+L DR+ QIK IE +F+A K+ PI H
Sbjct: 120 -EYNRFT----QLADKAEAKIGFSIKKYFKEEDLYKDRDSQIKAIENTFDAVKI-PIEKH 173
Query: 452 ATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKS 511
+ N+ PV++LP+LPDF+ + FD P + +++ S+A+++
Sbjct: 174 YSKPNVYPVDVLPVLPDFDLWKHPCAQVIFDSDPARHT---GTAANPLQNEEMSQAMIRG 230
Query: 512 YVATGSDSANPEKFLAYMVPSVNELSKDM--------YDENEDVSFSWVREYHWDVRGDD 563
V D +F+AY +P+ + K Y E ++ ++ REY+W+V+
Sbjct: 231 MVDESGD-----QFVAYFLPTEETIKKRKRDADNTLDYQEEDEYDYTLAREYNWNVKNKA 285
Query: 564 ADD-PTTYLVSFDDDEARYVPLPTKLNLRKKRAIEGRSNDEVEHFPIPSSIAVRRRANVT 622
+ TY + +D Y L T++ L K+R G++ P S V + +
Sbjct: 286 SKGYEETYFFTVKEDGVFYNELETRVRLSKRRKA-GQTG------PSAKSRLVVKHRPLN 338
Query: 623 AIELKEQG 630
A+EL+ QG
Sbjct: 339 AVELQAQG 346
>gi|195497149|ref|XP_002095980.1| GE25435 [Drosophila yakuba]
gi|194182081|gb|EDW95692.1| GE25435 [Drosophila yakuba]
Length = 536
Score = 128 bits (322), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 104/363 (28%), Positives = 179/363 (49%), Gaps = 44/363 (12%)
Query: 270 RTENRLKKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVE 329
RT+ + ++ + +C++K+ N LP+ K + D RF +Y +SLE+N+K + E
Sbjct: 17 RTQRQTERKSEIICRVKYGNNLPDIPFDLKFLQYPFDSHRFVQYNPTSLERNFKYDVLTE 76
Query: 330 PDLGIPLDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWL 389
DLG+ +DL++ +Y S+ LDP DE+LL ++E +TP D + + R + VSWL
Sbjct: 77 HDLGVTVDLINRELYQADSM-TMLDPADEKLL-EEETLTPT--DSV--RSRQHSRTVSWL 130
Query: 390 VKTQYISPLSMESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE 443
K++YI S Q+ + Q E E K G ++ ++L DR+ QIK IE +F
Sbjct: 131 RKSEYI------STEQTRFQPQNLENIEAKVGYNVKKSLREETLYLDRDAQIKAIEKTFS 184
Query: 444 ACKLRPI-HATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPT-ADSEIYSKMDKSVRD 501
K H + N+ PVE+LP+ PDF + FD P + +++++
Sbjct: 185 DTKSEITKHYSKPNVVPVEVLPIFPDFTNWKYPCAQVIFDSDPAPVGKNVPAQLEEM--- 241
Query: 502 AHESRAIMKSYVATGSDSANPEKFLAYMVPSVNELSK--------DMYDENEDVSFSWVR 553
S+A+++ G + E+F+AY +P+ L K ++Y + E+ + R
Sbjct: 242 ---SQAMIR-----GVMDESGEQFVAYFLPTEQTLEKRRTDFIAGELYKDEEEYEYKIAR 293
Query: 554 EYHWDVRGDDADD-PTTYLVSFDDDEARYVPLPTKLNLRKKRAIEGR----SNDEVEHFP 608
EY+W+V+ + Y D Y L T++ L K+R G+ + V+H P
Sbjct: 294 EYNWNVKTKASKGYEENYFFVMRQDGIYYNELETRVRLNKRRVKVGQQPNNTKLVVKHRP 353
Query: 609 IPS 611
+ S
Sbjct: 354 LDS 356
>gi|159477665|ref|XP_001696929.1| Paf1 complex component [Chlamydomonas reinhardtii]
gi|158274841|gb|EDP00621.1| Paf1 complex component [Chlamydomonas reinhardtii]
Length = 502
Score = 128 bits (321), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 104/360 (28%), Positives = 171/360 (47%), Gaps = 39/360 (10%)
Query: 271 TENRLKKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEP 330
ENRL++ T FL ++FRN+LPE + PK++ + + +R+ ++LE+ + L P
Sbjct: 96 AENRLRRDTPFLAHIRFRNDLPEIPSDPKMLVSQIQPEVLSRFGLTALERQARRDLLTPP 155
Query: 331 DLGIPLDLLDLSVYNPPSVRPPLDPEDEELLRDDE---VV----TPVKKDG-----IKRK 378
+ I + LD+ Y P P+DP D LL+DD + P DG R
Sbjct: 156 N--ITISPLDVQRYQVPDQAVPMDPADAALLKDDARPLAIGGSPLPAAADGRHKSFAARS 213
Query: 379 ERPTDKGVSWLVKTQYISPL-SMESARQSLTEKQAKELREMK--------GGRSILENLN 429
+ VSWL++T YIS + +A+Q L EKQ + R + G LN
Sbjct: 214 KDVDVTKVSWLMRTTYISASDNRGAAKQGLPEKQVLQARAAERRLAAVAAGTDLDDRELN 273
Query: 430 DRERQIKEIEASFEACKLRPIHATNKNLQPVEILPLLPDFERYDDQFVAATF---DGAPT 486
DRE Q++ IEASFEA + +P+H+ N L+PVE+LP+LPD + + + TF D
Sbjct: 274 DREAQVRAIEASFEAARAQPVHSRNPALKPVEVLPVLPDPTAWQHKLLLTTFHDSDPGEE 333
Query: 487 ADSEIYSKMDKSVRDAHESRAIMKSYVATG----SDSANPEKFLAYMVPSVNELSKDMYD 542
+ + + + A + ++ Y+ G A ++ L M L +
Sbjct: 334 LAAVVGQEAMAGLPSAKRPQLLVGHYLLKGFTHHISKAGQDRELKIMA-----LLPVLQP 388
Query: 543 ENEDVSFSWVREYHWDVRGDDADDPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEGRSND 602
E+ + + W ++Y+++++ D Y + D D AR+ + KL LR R E + D
Sbjct: 389 EDLEGDYQWTKDYNYELK----RDTVHYALRLDKDAARFYRMQGKLELRSWRDEEKLARD 444
>gi|156550159|ref|XP_001606181.1| PREDICTED: RNA polymerase II-associated factor 1 homolog [Nasonia
vitripennis]
Length = 572
Score = 128 bits (321), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 116/380 (30%), Positives = 186/380 (48%), Gaps = 57/380 (15%)
Query: 271 TENRLKKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEP 330
TE RL+ +CK+K+ N LP+ + PK ++ + RF +Y +SLE+N+K ++ E
Sbjct: 19 TEKRLE----LICKVKYCNTLPDIAFDPKFISYPFESTRFIQYNPTSLERNHKYEVLTEH 74
Query: 331 DLGIPLDLLDLSVYNP-PSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWL 389
DLG+P+DL++ Y P + +DP DE+LL +D V+T ++D + K K VSWL
Sbjct: 75 DLGVPIDLINRDTYACDPKIPYEMDPLDEKLLEED-VIT--QQDSKRSKHHA--KSVSWL 129
Query: 390 VKTQYISPLSMESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE 443
+T+YIS Q+ ++K E K G +I ++ DR+ QI+ IE +FE
Sbjct: 130 RRTEYISTEQTRFQPQTTSDKV-----EAKVGHNIKKHFKEETLYMDRDSQIRAIEKTFE 184
Query: 444 ACKLRPI--HATNKNLQPVEILPLLPDFERYDDQFVAATFDG--APTADSEIYSKMDKSV 499
K +PI H + N+ PVE+ P+ PDF+ + FD AP S M +
Sbjct: 185 DNK-KPIERHYSKPNVTPVEVFPVYPDFKIWKYPCAQVIFDSDPAPMGLS-----MPAQI 238
Query: 500 RDAHESRAIMKSYVATGSDSANPEKFLAYMVPSVNELSK---------DMYDENEDVSFS 550
+ S+A+++ G + E+F+AY +P L K D DE+E +
Sbjct: 239 EEM--SQAMIR-----GVMDESGEQFVAYFLPLEETLEKRKRDFCSGIDYADEDE-YEYK 290
Query: 551 WVREYHWDVRGDDADD-PTTYLVSFDDDEARYVPLPTKLNLRKKRAIEGRSNDEVEHFPI 609
REY+W+V+ + Y + +D Y L T++ L K+R G P+
Sbjct: 291 MAREYNWNVKSKASKGYEENYFLVMREDGVYYNELETRVRLSKRRQKVGA--------PV 342
Query: 610 PSSIAVRRRANVTAIELKEQ 629
++ V R + A E K Q
Sbjct: 343 NNTRLVVRHRPLNANEFKMQ 362
>gi|330792762|ref|XP_003284456.1| hypothetical protein DICPUDRAFT_52960 [Dictyostelium purpureum]
gi|325085599|gb|EGC39003.1| hypothetical protein DICPUDRAFT_52960 [Dictyostelium purpureum]
Length = 454
Score = 125 bits (314), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 111/348 (31%), Positives = 162/348 (46%), Gaps = 48/348 (13%)
Query: 277 KPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPL 336
KP+ F CKLKF+N LPE +PK + ++ D +RFT+Y +SLE+ YK QL EP LGIP+
Sbjct: 73 KPSEFQCKLKFQNSLPEIPFEPKFLKIQSDFNRFTQYKTTSLERQYKHQLLTEPQLGIPI 132
Query: 337 DLLDLSVYNPPS--VRPPLDPEDEELLRDDEVVTPVKKDGIKRKE--RPTDKGVSWLVKT 392
DL+D SVYN P +R P DE LL+ V +K K+K RP V WL +T
Sbjct: 133 DLIDPSVYNTPKTPIRVP--SRDEPLLKSLSVQDLEEKSAAKKKSEIRP---NVGWLRRT 187
Query: 393 QYISPLSMES------ARQSL--------TEKQAKELREMKGGRSILENLNDRERQIKEI 438
+Y+S + R SL T + ++E K S L +
Sbjct: 188 EYLSKADDNTFGRPPVKRASLPGDITNSPTNNKQLMIQEEKVSDSTL------------V 235
Query: 439 EASFEAC--KLRPIHATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKM- 495
E +F+ C +H TN +L+P ILP+ PDFE + + F FD P D I + +
Sbjct: 236 ENTFDICASNYVFVHPTNPSLKPTSILPVFPDFELWPNSFTEVAFDTDP-LDHYIPNNIR 294
Query: 496 -DKSVRDAHESRAIMKSYVATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFSWVRE 554
DK A I V G +KF+ ++ P++ E + + +E +D S +
Sbjct: 295 DDKVAEYASRHDEIRNKAVVKGI----TDKFVYFITPNIKENNSNNNNE-DDFSLDGEQY 349
Query: 555 YHWDVRGDDA---DDPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEGR 599
V D Y DD Y L ++NL+K ++ E +
Sbjct: 350 KMQKVLTSDIVPDQSQQNYFFMVKDDCVYYNQLKNRVNLKKVKSKEEK 397
>gi|198454011|ref|XP_001359430.2| GA15378 [Drosophila pseudoobscura pseudoobscura]
gi|198132611|gb|EAL28576.2| GA15378 [Drosophila pseudoobscura pseudoobscura]
Length = 559
Score = 125 bits (314), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 99/350 (28%), Positives = 172/350 (49%), Gaps = 40/350 (11%)
Query: 267 SGERTENRLKKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQL 326
+ +R + + ++ + +C++K+ N LP+ K + D RF +Y +SLE+N+K +
Sbjct: 14 ADKRPQRQTERKSEIICRVKYGNNLPDIPFDLKFLQYPFDSHRFVQYNPTSLERNFKYDV 73
Query: 327 HVEPDLGIPLDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGV 386
E DLG+ +DL++ +Y S+ LDP DE+LL ++E + P D + + R + V
Sbjct: 74 LTEHDLGVTVDLINRELYQADSM-TLLDPADEKLL-EEETLAPT--DSV--RSRQHSRTV 127
Query: 387 SWLVKTQYISPLSMESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEA 440
SWL K++YI S Q+ + Q E E K G ++ ++L DR+ QIK IE
Sbjct: 128 SWLRKSEYI------STEQTRFQPQNLENIEAKVGYNVKKSLREETLYLDRDAQIKAIEK 181
Query: 441 SFEACKLRPI-HATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPT-ADSEIYSKMDKS 498
+F K H + N+ PVE+LP+ PDF + FD P + S++++
Sbjct: 182 TFLDTKTDITKHYSKPNVVPVEVLPVFPDFINWKYPCAQVIFDSDPAPVGKNVPSQLEEM 241
Query: 499 VRDAHESRAIMKSYVATGSDSANPEKFLAYMVPSVNELSK--------DMYDENEDVSFS 550
S+A+++ G + E+F+AY +P+ L K ++Y+E + +
Sbjct: 242 ------SQAMIR-----GVMDESGEQFVAYFLPTEQTLEKRRADFIAGELYNEEAEYEYK 290
Query: 551 WVREYHWDVRGDDADD-PTTYLVSFDDDEARYVPLPTKLNLRKKRAIEGR 599
REY+W+V+ + Y D Y L T++ L K+R G+
Sbjct: 291 IAREYNWNVKTKASKGYEENYFFVMRQDGIYYNELETRVRLNKRRVKVGQ 340
>gi|195152704|ref|XP_002017276.1| GL21617 [Drosophila persimilis]
gi|194112333|gb|EDW34376.1| GL21617 [Drosophila persimilis]
Length = 559
Score = 125 bits (314), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 99/350 (28%), Positives = 172/350 (49%), Gaps = 40/350 (11%)
Query: 267 SGERTENRLKKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQL 326
+ +R + + ++ + +C++K+ N LP+ K + D RF +Y +SLE+N+K +
Sbjct: 14 ADKRPQRQTERKSEIICRVKYGNNLPDIPFDLKFLQYPFDSHRFVQYNPTSLERNFKYDV 73
Query: 327 HVEPDLGIPLDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGV 386
E DLG+ +DL++ +Y S+ LDP DE+LL ++E + P D + + R + V
Sbjct: 74 LTEHDLGVTVDLINRELYQADSM-TLLDPADEKLL-EEETLAPT--DSV--RSRQHSRTV 127
Query: 387 SWLVKTQYISPLSMESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEA 440
SWL K++YI S Q+ + Q E E K G ++ ++L DR+ QIK IE
Sbjct: 128 SWLRKSEYI------STEQTRFQPQNLENIEAKVGYNVKKSLREETLYLDRDAQIKAIEK 181
Query: 441 SFEACKLRPI-HATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPT-ADSEIYSKMDKS 498
+F K H + N+ PVE+LP+ PDF + FD P + S++++
Sbjct: 182 TFLDTKTDITKHYSKPNVVPVEVLPVFPDFINWKYPCAQVIFDSDPAPVGKNVPSQLEEM 241
Query: 499 VRDAHESRAIMKSYVATGSDSANPEKFLAYMVPSVNELSK--------DMYDENEDVSFS 550
S+A+++ G + E+F+AY +P+ L K ++Y+E + +
Sbjct: 242 ------SQAMIR-----GVMDESGEQFVAYFLPTEQTLEKRRADFIAGELYNEEAEYEYK 290
Query: 551 WVREYHWDVRGDDADD-PTTYLVSFDDDEARYVPLPTKLNLRKKRAIEGR 599
REY+W+V+ + Y D Y L T++ L K+R G+
Sbjct: 291 IAREYNWNVKTKASKGYEENYFFVMRQDGIYYNELETRVRLNKRRVKVGQ 340
>gi|412993998|emb|CCO14509.1| predicted protein [Bathycoccus prasinos]
Length = 389
Score = 125 bits (314), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 107/353 (30%), Positives = 172/353 (48%), Gaps = 33/353 (9%)
Query: 273 NRLKKPTTFLCKLKFRNELPEPSAQPKLMALKK----DKDRFTRYTFSSLEKNYKPQLHV 328
NRLK+ T FLC ++FRN LP P A + +K D D + + L + +
Sbjct: 10 NRLKRETAFLCPMQFRNNLPPPRALDWKLLRRKGSLVDDDMMAEHNVALLYDELRRSTRM 69
Query: 329 -EPDLGIPLDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVS 387
DLGI LD + + + + P+D LL +D K G RK P
Sbjct: 70 FSEDLGINLDPIGSDAFRAARTKGEIHPDDLILLANDSAKDKETKSGTGRK--PDVNKAM 127
Query: 388 WLVKTQYISPLSMESARQSLTEKQAKELREMKGG-----------RSILENLNDRERQIK 436
WL+ T+YIS + S + ++EK L+ + R ++L++RE+QI
Sbjct: 128 WLMNTKYISEGGL-SLKTGISEKSNLILKRQRELREEEEKLDPSLRDYNKSLSEREKQIL 186
Query: 437 EIEASFEAC-KLRPIHAT---NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIY 492
I+ SFEA KL AT NK+L+P+ + + PDF+ + FV TFD PT D E
Sbjct: 187 AIKKSFEAAEKLTLETATHPRNKSLKPISVTSVFPDFKVWPQNFVRLTFDEDPTLDVEGV 246
Query: 493 SKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLAYMVP--SVNELSKDMYDENED---V 547
S +++++ +A++K + ++ P+KF+A ++P + N + + DENE+
Sbjct: 247 SDAAENLKEKAMQKALVKPMMVE-DEAGRPDKFIALLLPKDAANAENVKILDENENENGT 305
Query: 548 SFSWVREYHWDVRGDDADDPTTYLVSFDDDEARYVPLPTKLNLRKK-RAIEGR 599
+ WVREY + V+ +D + Y F D Y L TK+ +KK ++ +GR
Sbjct: 306 EYDWVREYKYAVKTEDINTVCFY---FGKDRVTYADLNTKITCQKKAKSTKGR 355
>gi|405965042|gb|EKC30470.1| RNA polymerase II-associated factor 1-like protein [Crassostrea
gigas]
Length = 479
Score = 125 bits (314), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 109/357 (30%), Positives = 171/357 (47%), Gaps = 53/357 (14%)
Query: 267 SGERTENRLKKP----TTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNY 322
SG R + K+P + +C++K+ N LP+ PK + + +RF Y +SLE++Y
Sbjct: 7 SGPRDDRDRKRPGERRSELVCRVKYNNTLPDIPFDPKFITYPFETNRFVEYKPTSLERSY 66
Query: 323 KPQLHVEPDLGIPLDLLDLSVYNPPSVRPP----LDPEDEELLRDDEVVTPVKKDGIKRK 378
K L E DLG+ +DL+ NP + R LDPEDE LL +D + TP ++
Sbjct: 67 KYDLLTEHDLGVTIDLI-----NPDTYRIEPGAYLDPEDERLLEED-LNTPTD----SKR 116
Query: 379 ERPTDKGVSWLVKTQYISPLSMESARQSLTEKQAKELREMKGGRSILENL------NDRE 432
R + VSWL KT+YIS + T+K E K G I + + DR
Sbjct: 117 SRQHNTTVSWLRKTEYIS-----TEYNRFTQKSDNS--ERKVGFKIKQIMKDEDIYKDRA 169
Query: 433 RQIKEIEASFEACKLRPI-HATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEI 491
QI+ I+ +FE K + H + + PVE+LP+ PDF+ + F FD P
Sbjct: 170 SQIQAIDKTFEQAKEKITKHYSKPGVTPVEVLPVFPDFDLWKHPFAQVIFDSDPAPKGRP 229
Query: 492 Y-SKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLAYMVPSVNELSKDMYDENEDV--- 547
++M++ S+A+++ G+ N E+F+AY +P+ L+K D E V
Sbjct: 230 QPAQMEEM------SQAMIR-----GAVDENNEQFVAYFLPTEETLTKRKRDSEEGVDYN 278
Query: 548 -----SFSWVREYHWDVRGDDADD-PTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
++ REY+W+V+ + TY F +D Y L T++ L K+R G
Sbjct: 279 AEDEYDYTLAREYNWNVKNKLSRGYEETYFFVFREDGMFYNELETRVRLSKRRKTGG 335
>gi|195396015|ref|XP_002056628.1| GJ11046 [Drosophila virilis]
gi|194143337|gb|EDW59740.1| GJ11046 [Drosophila virilis]
Length = 567
Score = 124 bits (312), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 95/341 (27%), Positives = 171/341 (50%), Gaps = 40/341 (11%)
Query: 276 KKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIP 335
++ + +C++K+ N LP+ K + D +RF +Y +SLE+N+K + E DLG+
Sbjct: 23 ERKSEIICRVKYGNNLPDIPFDLKFLQYPFDGNRFVQYNPTSLERNFKYDVLTEHDLGVT 82
Query: 336 LDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYI 395
+DL++ +Y + LDP DE+LL ++E +TP D + + R + VSWL K++YI
Sbjct: 83 VDLINRELYQADPM-SQLDPADEKLL-EEETLTPT--DSV--RSRQHSRTVSWLRKSEYI 136
Query: 396 SPLSMESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFEACKLRP 449
S Q+ + Q E E K G ++ ++L DR+ QIK IE +F K
Sbjct: 137 ------STEQTRFQPQNLENIEAKVGYNVKKSLREETLYLDRDAQIKAIEKTFSDTKNDI 190
Query: 450 I-HATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPT-ADSEIYSKMDKSVRDAHESRA 507
H + N+ PVE+LP+ PDF + FD P + +++++ S+A
Sbjct: 191 TKHYSKPNVVPVEVLPIFPDFTNWKYPCAQVIFDSDPAPQGKNVPAQLEEM------SQA 244
Query: 508 IMKSYVATGSDSANPEKFLAYMVPS--------VNELSKDMYDENEDVSFSWVREYHWDV 559
+++ G + E+F+AY +P+ ++ ++ ++Y ++E+ + REY+W+V
Sbjct: 245 MIR-----GVMDESGEQFVAYFLPTEPTLEKRRIDFVAGELYKDDEEYEYKIAREYNWNV 299
Query: 560 RGDDADD-PTTYLVSFDDDEARYVPLPTKLNLRKKRAIEGR 599
+ + Y D Y L T++ L K+R G+
Sbjct: 300 KTKASKGYEENYFFVMRQDGIYYNELETRVRLNKRRVKVGQ 340
>gi|194898602|ref|XP_001978858.1| GG12571 [Drosophila erecta]
gi|190650561|gb|EDV47816.1| GG12571 [Drosophila erecta]
Length = 538
Score = 124 bits (311), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 102/357 (28%), Positives = 175/357 (49%), Gaps = 44/357 (12%)
Query: 276 KKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIP 335
++ + +C++K+ N LP+ K + D RF +Y +SLE+N+K + E DLG+
Sbjct: 23 ERKSEIICRVKYGNNLPDIPFDLKFLQYPFDSHRFVQYNPTSLERNFKYDVLTEHDLGVT 82
Query: 336 LDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYI 395
+DL++ +Y S+ LDP DE+LL ++E +TP D + + R + VSWL K++YI
Sbjct: 83 VDLINRELYQADSM-TLLDPADEKLL-EEETLTPT--DSV--RSRQHSRTVSWLRKSEYI 136
Query: 396 SPLSMESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFEACKLRP 449
S Q+ + Q E E K G ++ ++L DR+ QIK IE +F K
Sbjct: 137 ------STEQTRFQPQNLENIEAKVGYNVKKSLREETLYLDRDAQIKAIEKTFSDTKSEI 190
Query: 450 I-HATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPT-ADSEIYSKMDKSVRDAHESRA 507
H + N+ PVE+LP+ PDF + FD P + +++++ S+A
Sbjct: 191 TKHYSKPNVVPVEVLPIFPDFTNWKYPCAQVIFDSDPAPVGKNVPAQLEEM------SQA 244
Query: 508 IMKSYVATGSDSANPEKFLAYMVPSVNELSK--------DMYDENEDVSFSWVREYHWDV 559
+++ G + E+F+AY +P+ L K ++Y + E+ + REY+W+V
Sbjct: 245 LIR-----GVMDESGEQFVAYFLPTEQTLEKRRTDFIAGELYKDEEEYEYKIAREYNWNV 299
Query: 560 RGDDADD-PTTYLVSFDDDEARYVPLPTKLNLRKKRAIEGR----SNDEVEHFPIPS 611
+ + Y D Y L T++ L K+R G+ + V+H P+ S
Sbjct: 300 KTKASKGYEENYFFVMRQDGIYYNELETRVRLNKRRVKVGQQPNNTKLVVKHRPLDS 356
>gi|440794036|gb|ELR15207.1| hypothetical protein ACA1_218610 [Acanthamoeba castellanii str.
Neff]
Length = 356
Score = 123 bits (309), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 104/340 (30%), Positives = 159/340 (46%), Gaps = 54/340 (15%)
Query: 285 LKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLSVY 344
+KF N LP PKL+ D RF RY +SLE +++ LH EPDLGIP+DL+D S Y
Sbjct: 1 MKFSNTLPPIPFDPKLLTNPFDSMRFVRYRTTSLETSHQHTLHAEPDLGIPIDLIDPSTY 60
Query: 345 --NPPSVRPPLDPEDEELLRDDEVVTP-VKKDGIKRKERPTDKGVSWLVK-TQYISPLSM 400
P + P PED L++ TP VKK +E+P+ VSWL + T+YIS
Sbjct: 61 KATPGAALP---PEDAVLIQVGAAETPEVKKKKANVREKPS-TSVSWLRRHTEYISDYEQ 116
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFEACKLRPIHATN 454
+ + S + E + G + L+ RE + I+ FEA K P+H T+
Sbjct: 117 STKKVSRADSI-----EARVGHATLKRAEKDKRKRTREEVVDAIDQGFEAAKEIPVHPTD 171
Query: 455 KNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVA 514
+L+P+EILP+ PD + + + + TFD P R + +A++K +
Sbjct: 172 PSLKPLEILPVFPDHDLWANSYTLVTFDADPG-------------RPRAQKQALLKGFST 218
Query: 515 TGSDSANPEK--FLAYMVPSVN------ELSKDMYDENED-----------VSFSWVREY 555
K F+AY+VP + + K + DE+++ + WVREY
Sbjct: 219 EAEAEVGLAKQSFVAYLVPKNDGNDDEGKGKKKVGDEDDEGAAVEEVDGKTAEYEWVREY 278
Query: 556 HWDVRGDDADDPTTYLVSFDDDEARYVPLPTKLNLRKKRA 595
+ D TY +DD+ RY + K+ L K ++
Sbjct: 279 SF---RKDQQHSATYFFVWDDESVRYNEIQAKIVLDKIKS 315
>gi|195111787|ref|XP_002000458.1| GI10242 [Drosophila mojavensis]
gi|193917052|gb|EDW15919.1| GI10242 [Drosophila mojavensis]
Length = 575
Score = 123 bits (309), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 99/340 (29%), Positives = 162/340 (47%), Gaps = 38/340 (11%)
Query: 276 KKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIP 335
++ + +C++K+ N LP+ K + D +RF +Y +SLE+N+K + E DLG+
Sbjct: 23 ERKSEIICRVKYGNNLPDIPFDLKFLQYPFDSNRFVQYNPTSLERNFKYDVLTEHDLGVT 82
Query: 336 LDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYI 395
+DL++ +Y + LDP DE+LL ++E +TP D + + R + VSWL K++YI
Sbjct: 83 VDLINRELYQADPM-SQLDPADEKLL-EEETLTPT--DSV--RSRQHSRTVSWLRKSEYI 136
Query: 396 SPLSMESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFEACKLRP 449
S Q+ + Q E E K G ++ ++L DR+ QIK IE +F K
Sbjct: 137 ------STEQTRFQPQNLENIEAKVGYNVKKSLREETLYLDRDAQIKAIEKTFSDTKNEI 190
Query: 450 I-HATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAI 508
H + N+ PVE+LP+ PDF + FD P K+V E
Sbjct: 191 TKHYSKPNVVPVEVLPIFPDFINWKYPCAQVIFDSDPAPQG-------KNVPAQLEE--- 240
Query: 509 MKSYVATGSDSANPEKFLAYMVPSVNELSK--------DMYDENEDVSFSWVREYHWDVR 560
M + G + E+F+AY +P+ L K +Y ++E+ + REY+W+V+
Sbjct: 241 MSQAMIRGVMDESGEQFVAYFLPTETTLEKRRADFVAGQLYKDDEEYEYKIAREYNWNVK 300
Query: 561 GDDADD-PTTYLVSFDDDEARYVPLPTKLNLRKKRAIEGR 599
+ Y D Y L T++ L K+R G+
Sbjct: 301 TKASKGYEENYFFVMRQDGIYYNELETRVRLNKRRVKVGQ 340
>gi|363746155|ref|XP_003643545.1| PREDICTED: RNA polymerase II-associated factor 1 homolog [Gallus
gallus]
Length = 617
Score = 123 bits (308), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 110/386 (28%), Positives = 184/386 (47%), Gaps = 51/386 (13%)
Query: 233 NVVMQKSQMVASGKGGHGS--MVGSRMGDRRAAPLLSGERTENRLKKPTTFLCKLKFRNE 290
++ + + Q++ G+ GS +G R RR + L + + +C++K+ N
Sbjct: 86 DLCLDEGQVLGVGEALTGSPLALGGRWHQRRPG-------SHRTLPERSGVVCRVKYCNS 138
Query: 291 LPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLSVYNP-PSV 349
LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++ Y PSV
Sbjct: 139 LPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINPDTYRIDPSV 198
Query: 350 RPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSMESARQSLTE 409
LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS E R ++
Sbjct: 199 L--LDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST---EFNRYGVSN 248
Query: 410 KQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHATNKNLQPVEI 462
++ E+K G S+ + DR+ QI IE +FE A K H + + P+E+
Sbjct: 249 EKP----EVKIGVSVKQQFTEEEIYKDRDSQIAAIEKTFEDAQKAITQHYSKPRVTPLEV 304
Query: 463 LPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANP 522
+P+ PDF+ + + FD P + D S A E +M + G
Sbjct: 305 MPVFPDFKMWINPCAQVIFDSDP-------APKDTSGAAALE---MMSQAMIRGMMDEEG 354
Query: 523 EKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD--DPTTYLV 572
+F+AY +P L K D+ E++ ++ REY+W+V+ + + + +
Sbjct: 355 NQFVAYFLPVEETLRKRKRDQEEEMDYAPEDVYDYKIAREYNWNVKNKASKGYEENYFFI 414
Query: 573 SFDDDEARYVPLPTKLNLRKKRAIEG 598
+ D Y L T++ L K+RA G
Sbjct: 415 FREGDGVYYNELETRVRLSKRRARAG 440
>gi|195054373|ref|XP_001994099.1| GH22996 [Drosophila grimshawi]
gi|193895969|gb|EDV94835.1| GH22996 [Drosophila grimshawi]
Length = 566
Score = 121 bits (303), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 100/340 (29%), Positives = 164/340 (48%), Gaps = 38/340 (11%)
Query: 276 KKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIP 335
++ + +C++K+ N LP+ K + D +RF +Y +SLE+N+K + E DLG+
Sbjct: 23 ERKSEIICRVKYGNNLPDIPFDLKFLQYPFDSNRFVQYNPTSLERNFKYDVLTEHDLGVT 82
Query: 336 LDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYI 395
+DL++ +Y S+ LDP DE+LL ++E +TP D + + R + VSWL K++YI
Sbjct: 83 VDLINRELYQADSM-SQLDPADEKLL-EEETMTPT--DSV--RSRQHSRTVSWLRKSEYI 136
Query: 396 SPLSMESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFEACKLRP 449
S Q+ + Q E E K G ++ ++L DR+ QIK IE +F K
Sbjct: 137 ------STEQTRFQPQNLENIEAKVGYNVKKSLREETLYLDRDAQIKAIEKTFIDTKSDI 190
Query: 450 I-HATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAI 508
H + N+ PVE+LP+ PDF + FD P K+V E
Sbjct: 191 TKHYSKPNVVPVEVLPIFPDFTNWKYPCAQVIFDSDPAPQG-------KNVPAQLEE--- 240
Query: 509 MKSYVATGSDSANPEKFLAYMVPSVNELSK--------DMYDENEDVSFSWVREYHWDVR 560
M + G + E+F+AY +P+ L K ++Y ++E+ + REY+W+V+
Sbjct: 241 MSQAMIRGVMDESGEQFVAYFLPTEPTLEKRRNDFVAGELYKDDEEYEYKIAREYNWNVK 300
Query: 561 GDDADD-PTTYLVSFDDDEARYVPLPTKLNLRKKRAIEGR 599
+ Y D Y L T++ L K+R G+
Sbjct: 301 TKASKGYEENYFFVMRPDGIYYNELETRVRLNKRRVKLGQ 340
>gi|452820231|gb|EME27276.1| hypothetical protein Gasu_51340 [Galdieria sulphuraria]
Length = 406
Score = 120 bits (302), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 98/359 (27%), Positives = 166/359 (46%), Gaps = 47/359 (13%)
Query: 247 GGHGSMVGSRMGDRRAAPLLSGERTENRLKKPTTFLCKLKFRNELPEPSAQPKLMALKKD 306
G + +R+ RR A + E K P+ FLCK ++ N LPEP PK+ L+ +
Sbjct: 22 GTTKAHHSTRVSSRRTATSGTTE------KPPSVFLCKPRYMNNLPEPPFLPKMYNLEVE 75
Query: 307 KDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLSVYNPPSVRPPLDPEDEELLRDDEV 366
++ +Y SSLE +YKP+L P LGI +D + PL E+EE+L E
Sbjct: 76 TSQYAKYRISSLESSYKPKLETGPALGIWVDWVTPEEEQQLGEGEPLTAEEEEILAQVE- 134
Query: 367 VTPVKKDGIKRKERPTDKGV-SWLVKTQYISPLSMESARQSLTEK-QAKELREMKGGRSI 424
+E V SW+ +T Y S A +++++ + +K RS+
Sbjct: 135 ---------NSQESKRKFSVPSWMRRTGY-DEFSELRANKNISQYPNSDGSSSLKQQRSL 184
Query: 425 LENLNDRERQIKEIEASFEACKLRPIH--ATNKNLQPVEILPLLPDFERYDDQFVAATFD 482
E + D FEA K P+H ++L+PV++LP+ PDFE + D+FV FD
Sbjct: 185 SETIQD----------EFEAAKQVPVHPDPRKRHLKPVQVLPVFPDFENWPDKFVVLQFD 234
Query: 483 GAPTADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLAYMVPSVNEL------ 536
P+ + E + + V + +++ + + + E+FL+Y +P+ N +
Sbjct: 235 SNPSEEIESHQDTQRIVEE------FIENALTVAFTNQSGERFLSYYIPTENTIENRKSQ 288
Query: 537 --SKDMYDENEDVSFSWVREYHWDVRGDDADDPTTYLVSFDDDEARYVPLPTKLNLRKK 593
+ M N + +REY + R ++ + +L DD RY P+ +KL L ++
Sbjct: 289 HWEETMKKNNTIEDYELLREYTFVTRPEEGNRSFVFLQ--DDQVVRYFPVTSKLFLYRR 345
>gi|427789477|gb|JAA60190.1| Putative paf1 [Rhipicephalus pulchellus]
Length = 469
Score = 120 bits (302), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 108/368 (29%), Positives = 178/368 (48%), Gaps = 54/368 (14%)
Query: 281 FLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLD 340
+C++K+ N LP+ PK ++ + +RF Y +SLE+NYK L E DLG+ +DL+D
Sbjct: 26 LVCRVKYCNTLPDIPFDPKFISYPFEPNRFVSYKATSLERNYKHDLLTEHDLGVTIDLID 85
Query: 341 LSVY--NPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPL 398
Y +P +V L P+DE+LL +D +TP +D ++ R + V WL KT+YI+
Sbjct: 86 PKTYEIDPNAV---LHPDDEKLLEED-TLTP--QDS--KRSRHHNLVVPWLKKTEYIAT- 136
Query: 399 SMESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFEACKLRPI-- 450
E +R T E K G ++ + DRE QI I +FE + +PI
Sbjct: 137 --EFSRYGQTGVNT----ETKVGYNVKKLFKEEDLYMDRESQINAINKTFEEAQ-KPIEA 189
Query: 451 HATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMK 510
H + N++PVE+LPL PD + + F FD P +++ S+A+++
Sbjct: 190 HYSKPNVKPVEVLPLFPDSDLWKYPFAQVMFDSDPAPITQL----------EEMSQAMIR 239
Query: 511 SYVATGSDSANPEKFLAYMVPSVNELSKDM--------YDENEDVSFSWVREYHWDVRGD 562
G + E+F+AY +P+ + + K Y ++++ + REY+W+V+
Sbjct: 240 -----GVMDESGEQFVAYFLPTEDTIKKRKRDAEEGMDYMDDDEYEYRMAREYNWNVKNK 294
Query: 563 DADD-PTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG----RSNDEVEHFPIPSSIAVRR 617
+ Y F DD Y L T++ L K+R G S V H P+ +
Sbjct: 295 ASKGYEENYFFVFRDDGVYYNELETRVRLTKRRLKPGVQPNNSKLVVRHRPLNEMEHKTQ 354
Query: 618 RANVTAIE 625
+A +T +E
Sbjct: 355 QARLTQLE 362
>gi|213512601|ref|NP_001133500.1| RNA polymerase II-associated factor 1 homolog [Salmo salar]
gi|209154248|gb|ACI33356.1| RNA polymerase II-associated factor 1 homolog [Salmo salar]
Length = 532
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 100/336 (29%), Positives = 165/336 (49%), Gaps = 44/336 (13%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D+ RF +Y +SLEK +K +L EPDLG+ +DL++
Sbjct: 31 VCRVKYCNSLPDIPFDPKFITYPFDQHRFVQYKATSLEKQHKHELLTEPDLGVTIDLINP 90
Query: 342 SVYNP-PSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y PS+ LDP DE+LL +D + P KR ++ K V W+ KT+YIS
Sbjct: 91 DTYRVDPSI--LLDPADEKLLEED-ITAPSSS---KRSQQHA-KVVPWMRKTEYIST--- 140
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFEACKLRPI--HA 452
E R ++ ++ E+K G S+ + DR+ QI IE +FE + +PI H
Sbjct: 141 EFNRYGVSNEKV----EVKIGVSVKQQFTEEEIYKDRDSQIAAIEKTFEDAQ-KPIAQHY 195
Query: 453 TNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSY 512
+ + PVE++P+ PDF+ + + FD P + D S A + +M
Sbjct: 196 SKPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDMS---APQGVEMMSQA 245
Query: 513 VATGSDSANPEKFLAYMVPSVNELSKDMYDENEDV--------SFSWVREYHWDVRGDDA 564
+ G +F+AY +P+ + K D +E++ + REY+W+V+ +
Sbjct: 246 MIRGMMDEEGNQFVAYFLPNEETIRKRKRDVDEELDYMPDDLYDYKIAREYNWNVKNKAS 305
Query: 565 D--DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + D D Y L T++ L K+RA G
Sbjct: 306 KGYEENYFFIFRDGDGVYYNELETRVRLSKRRAKAG 341
>gi|223648724|gb|ACN11120.1| RNA polymerase II-associated factor 1 homolog [Salmo salar]
Length = 536
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 100/336 (29%), Positives = 165/336 (49%), Gaps = 44/336 (13%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D+ RF +Y +SLEK +K +L EPDLG+ +DL++
Sbjct: 31 VCRVKYCNSLPDIPFDPKFITYPFDQHRFVQYKATSLEKQHKHELLTEPDLGVTIDLINP 90
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y PS+ LDP DE+LL +D + P KR ++ K V W+ KT+YIS
Sbjct: 91 DTYRVDPSI--LLDPADEKLLEED-ITAPSSS---KRSQQHA-KVVPWMRKTEYIST--- 140
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFEACKLRPI--HA 452
E R ++ ++ E+K G S+ + DR+ QI IE +FE + +PI H
Sbjct: 141 EFNRYGVSNEKV----EVKIGVSVKQQFTEEEIYKDRDSQIAAIEKTFEDAQ-KPIAQHY 195
Query: 453 TNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSY 512
+ + PVE++P+ PDF+ + + FD P + D S A + +M
Sbjct: 196 SKPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDMS---APQGVEMMSQA 245
Query: 513 VATGSDSANPEKFLAYMVPSVNELSKDMYDENEDV--------SFSWVREYHWDVRGDDA 564
+ G +F+AY +P+ + K D +E++ + REY+W+V+ +
Sbjct: 246 MIRGMMDEEGNQFVAYFLPNEETIRKRKRDVDEELDYMPDDLYDYKIAREYNWNVKNKAS 305
Query: 565 D--DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + D D Y L T++ L K+RA G
Sbjct: 306 KGYEENYFFIFRDGDGVYYNELETRVRLSKRRAKAG 341
>gi|432889892|ref|XP_004075383.1| PREDICTED: RNA polymerase II-associated factor 1 homolog [Oryzias
latipes]
Length = 534
Score = 120 bits (301), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 100/336 (29%), Positives = 158/336 (47%), Gaps = 42/336 (12%)
Query: 281 FLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLD 340
+C++K+ N LP+ PK + D+ RF +Y +SLEK +K +L EPDLG+ +DL++
Sbjct: 30 IVCRVKYCNTLPDIPFDPKFITYPFDQHRFVQYKATSLEKQHKHELLTEPDLGVTIDLIN 89
Query: 341 LSVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLS 399
Y P++ LDP DE+LL +D ++ ++ + K V W+ KT+YIS
Sbjct: 90 PDTYRIDPTI--LLDPADEKLLEED-----IQAPSSSKRSQQHAKVVPWMRKTEYIST-- 140
Query: 400 MESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHA 452
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H
Sbjct: 141 -EFNRYGVSNEKV----EVKIGVSVKQQFTEEEIYKDRDSQISAIEKTFEDAQKSIAQHY 195
Query: 453 TNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSY 512
+ + PVEILP+ PDF+ + + FD P A +I A +M
Sbjct: 196 SKPRVTPVEILPVFPDFKMWINPCAQVIFDSDP-APKDI---------SAPAGVEMMSQA 245
Query: 513 VATGSDSANPEKFLAYMVPSVNELSKDMYDENEDV--------SFSWVREYHWDVRGDDA 564
+ G +F+AY +P L K D E V + REY+W+V+ +
Sbjct: 246 MIRGMMDEEGNQFVAYFLPHEETLRKRKRDSEEGVEYMADDVYDYKIAREYNWNVKNKAS 305
Query: 565 D--DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + D D Y L T++ L K+RA G
Sbjct: 306 KGYEENYFFIFRDGDGVYYNELETRVRLSKRRAKAG 341
>gi|397482135|ref|XP_003812288.1| PREDICTED: RNA polymerase II-associated factor 1 homolog [Pan
paniscus]
Length = 533
Score = 120 bits (300), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 102/335 (30%), Positives = 163/335 (48%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 30 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 89
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YI S
Sbjct: 90 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYI---ST 139
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ E E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 140 EFNRYGIS----NEKPEVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 195
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P + D S A E +M +
Sbjct: 196 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGAAALE---MMSQAM 245
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD 565
G +F+AY +P L K D+ E++ ++ REY+W+V+ +
Sbjct: 246 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKASK 305
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + D Y L T++ L K+RA G
Sbjct: 306 GYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 340
>gi|390478984|ref|XP_002762162.2| PREDICTED: RNA polymerase II-associated factor 1 homolog
[Callithrix jacchus]
Length = 525
Score = 120 bits (300), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 101/335 (30%), Positives = 164/335 (48%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 30 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 89
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 90 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 139
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 140 EFNRYGISNEKP----EVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 195
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P + D S A E +M +
Sbjct: 196 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGAAALE---MMSQAM 245
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD 565
G +F+AY +P L K D+ E++ ++ REY+W+V+ +
Sbjct: 246 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKASK 305
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + D Y L T++ L K+RA G
Sbjct: 306 GYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 340
>gi|395501648|ref|XP_003755203.1| PREDICTED: RNA polymerase II-associated factor 1 homolog
[Sarcophilus harrisii]
Length = 529
Score = 120 bits (300), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 105/378 (27%), Positives = 178/378 (47%), Gaps = 55/378 (14%)
Query: 275 LKKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGI 334
+++ + +C++K+ N LP+ PK + D+ RF +Y +SLEK ++ L EPDLG+
Sbjct: 23 VQERSGVVCRVKYCNSLPDIPFDPKFITYPFDQSRFVQYKATSLEKQHRHDLLTEPDLGV 82
Query: 335 PLDLLDLSVYNPPSVR----PPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLV 390
+DL+ NP + R PLDP DE+LL ++E+ P R+ + K V W+
Sbjct: 83 TIDLI-----NPDTYRVEAGAPLDPADEKLL-EEEIQAPTS----SRRSQQHAKVVPWMR 132
Query: 391 KTQYISPLSMESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE- 443
KT+YIS E R ++ ++ E+K G S+ + DR+ QI IE +FE
Sbjct: 133 KTEYIST---EFNRYGVSNEKP----EVKIGVSVKQQFTEEEIYKDRDSQISAIEKTFED 185
Query: 444 ACKLRPIHATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAH 503
A K H + + PVE++P+ PDF+ + + FD P + D S A
Sbjct: 186 AQKSISQHYSKPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGSAAL 238
Query: 504 ESRAIMKSYVATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREY 555
E +M + G +F+AY +P + K D+ E + ++ REY
Sbjct: 239 E---MMSQAMIRGMMDEEGNQFVAYFLPVEETMRKRKRDQEEKMDYAPDDIYDYKIAREY 295
Query: 556 HWDVRGDDAD--DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEGRSNDEVEHFPIPSSI 613
+W+V+ + + + + + D Y L T++ L K+RA G + +++
Sbjct: 296 NWNVKNKASKGYEENYFFIFREGDGVYYNELETRVRLSKRRAKAGVQSG-------TNAL 348
Query: 614 AVRRRANVTAIELKEQGA 631
V + N+ EL+ Q A
Sbjct: 349 LVVKHRNMNEKELEAQEA 366
>gi|197099574|ref|NP_001125634.1| RNA polymerase II-associated factor 1 homolog [Pongo abelii]
gi|75041948|sp|Q5RAX0.1|PAF1_PONAB RecName: Full=RNA polymerase II-associated factor 1 homolog
gi|55728701|emb|CAH91090.1| hypothetical protein [Pongo abelii]
Length = 533
Score = 120 bits (300), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 102/335 (30%), Positives = 163/335 (48%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 30 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 89
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 90 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 139
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ E E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 140 EFNRYGIS----NEKPEVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 195
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P + D S A E +M +
Sbjct: 196 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGAAALE---MMSQAM 245
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD 565
G +F+AY +P L K D+ E++ ++ REY+W+V+ +
Sbjct: 246 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKASK 305
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + D Y L T++ L K+RA G
Sbjct: 306 GYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 340
>gi|301784041|ref|XP_002927430.1| PREDICTED: RNA polymerase II-associated factor 1 homolog
[Ailuropoda melanoleuca]
Length = 528
Score = 119 bits (299), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 101/335 (30%), Positives = 164/335 (48%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 30 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 89
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 90 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 139
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 140 EFNRYGISNEKP----EVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 195
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P + D S A E +M +
Sbjct: 196 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGAAALE---MMSQAM 245
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD 565
G +F+AY +P L K D+ E++ ++ REY+W+V+ +
Sbjct: 246 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKASK 305
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + D Y L T++ L K+RA G
Sbjct: 306 GYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 340
>gi|402905498|ref|XP_003915556.1| PREDICTED: RNA polymerase II-associated factor 1 homolog [Papio
anubis]
Length = 530
Score = 119 bits (299), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 101/335 (30%), Positives = 164/335 (48%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 30 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 89
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 90 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 139
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 140 EFNRYGISNEKP----EVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 195
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P + D S A E +M +
Sbjct: 196 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGAAALE---MMSQAM 245
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD 565
G +F+AY +P L K D+ E++ ++ REY+W+V+ +
Sbjct: 246 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKASK 305
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + D Y L T++ L K+RA G
Sbjct: 306 GYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 340
>gi|42476169|ref|NP_061961.2| RNA polymerase II-associated factor 1 homolog isoform 1 [Homo
sapiens]
gi|182670295|sp|Q8N7H5.2|PAF1_HUMAN RecName: Full=RNA polymerase II-associated factor 1 homolog;
Short=hPAF1; AltName: Full=Pancreatic differentiation
protein 2
gi|12054502|emb|CAC20564.1| PD2 protein [Homo sapiens]
gi|12652555|gb|AAH00017.1| Paf1, RNA polymerase II associated factor, homolog (S. cerevisiae)
[Homo sapiens]
gi|15426567|gb|AAH13402.1| Paf1, RNA polymerase II associated factor, homolog (S. cerevisiae)
[Homo sapiens]
gi|119577284|gb|EAW56880.1| Paf1, RNA polymerase II associated factor, homolog (S. cerevisiae),
isoform CRA_a [Homo sapiens]
gi|119577286|gb|EAW56882.1| Paf1, RNA polymerase II associated factor, homolog (S. cerevisiae),
isoform CRA_a [Homo sapiens]
gi|123981872|gb|ABM82765.1| Paf1, RNA polymerase II associated factor, homolog (S. cerevisiae)
[synthetic construct]
gi|123996701|gb|ABM85952.1| Paf1, RNA polymerase II associated factor, homolog (S. cerevisiae)
[synthetic construct]
Length = 531
Score = 119 bits (299), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 102/335 (30%), Positives = 163/335 (48%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 30 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 89
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 90 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 139
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ E E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 140 EFNRYGIS----NEKPEVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 195
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P + D S A E +M +
Sbjct: 196 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGAAALE---MMSQAM 245
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD 565
G +F+AY +P L K D+ E++ ++ REY+W+V+ +
Sbjct: 246 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKASK 305
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + D Y L T++ L K+RA G
Sbjct: 306 GYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 340
>gi|343960825|dbj|BAK62002.1| hypothetical protein [Pan troglodytes]
gi|410208516|gb|JAA01477.1| Paf1, RNA polymerase II associated factor, homolog [Pan
troglodytes]
gi|410248688|gb|JAA12311.1| Paf1, RNA polymerase II associated factor, homolog [Pan
troglodytes]
gi|410292268|gb|JAA24734.1| Paf1, RNA polymerase II associated factor, homolog [Pan
troglodytes]
gi|410342695|gb|JAA40294.1| Paf1, RNA polymerase II associated factor, homolog [Pan
troglodytes]
Length = 533
Score = 119 bits (299), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 102/335 (30%), Positives = 163/335 (48%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 30 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 89
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 90 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 139
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ E E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 140 EFNRYGIS----NEKPEVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 195
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P + D S A E +M +
Sbjct: 196 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGAAALE---MMSQAM 245
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD 565
G +F+AY +P L K D+ E++ ++ REY+W+V+ +
Sbjct: 246 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKASK 305
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + D Y L T++ L K+RA G
Sbjct: 306 GYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 340
>gi|403305255|ref|XP_003943183.1| PREDICTED: RNA polymerase II-associated factor 1 homolog [Saimiri
boliviensis boliviensis]
Length = 529
Score = 119 bits (299), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 101/335 (30%), Positives = 164/335 (48%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 30 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 89
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 90 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 139
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 140 EFNRYGISNEKP----EVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 195
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P + D S A E +M +
Sbjct: 196 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGAAALE---MMSQAM 245
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD 565
G +F+AY +P L K D+ E++ ++ REY+W+V+ +
Sbjct: 246 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKASK 305
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + D Y L T++ L K+RA G
Sbjct: 306 GYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 340
>gi|355703537|gb|EHH30028.1| hypothetical protein EGK_10597 [Macaca mulatta]
gi|355755821|gb|EHH59568.1| hypothetical protein EGM_09708 [Macaca fascicularis]
gi|380787111|gb|AFE65431.1| RNA polymerase II-associated factor 1 homolog [Macaca mulatta]
gi|383411397|gb|AFH28912.1| RNA polymerase II-associated factor 1 homolog [Macaca mulatta]
gi|384940428|gb|AFI33819.1| RNA polymerase II-associated factor 1 homolog [Macaca mulatta]
Length = 531
Score = 119 bits (299), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 101/335 (30%), Positives = 164/335 (48%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 30 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 89
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 90 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 139
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 140 EFNRYGISNEKP----EVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 195
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P + D S A E +M +
Sbjct: 196 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGAAALE---MMSQAM 245
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD 565
G +F+AY +P L K D+ E++ ++ REY+W+V+ +
Sbjct: 246 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKASK 305
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + D Y L T++ L K+RA G
Sbjct: 306 GYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 340
>gi|332242502|ref|XP_003270424.1| PREDICTED: RNA polymerase II-associated factor 1 homolog [Nomascus
leucogenys]
Length = 535
Score = 119 bits (299), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 101/335 (30%), Positives = 164/335 (48%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 30 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 89
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 90 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 139
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 140 EFNRYGISNEKP----EVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 195
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P + D S A E +M +
Sbjct: 196 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGAAALE---MMSQAM 245
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD 565
G +F+AY +P L K D+ E++ ++ REY+W+V+ +
Sbjct: 246 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKASK 305
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + D Y L T++ L K+RA G
Sbjct: 306 GYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 340
>gi|426388666|ref|XP_004060754.1| PREDICTED: RNA polymerase II-associated factor 1 homolog isoform 1
[Gorilla gorilla gorilla]
Length = 533
Score = 119 bits (299), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 101/335 (30%), Positives = 164/335 (48%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 30 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 89
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 90 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 139
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 140 EFNRYGISNEKP----EVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 195
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P + D S A E +M +
Sbjct: 196 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGAAALE---MMSQAM 245
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD 565
G +F+AY +P L K D+ E++ ++ REY+W+V+ +
Sbjct: 246 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKASK 305
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + D Y L T++ L K+RA G
Sbjct: 306 GYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 340
>gi|73947660|ref|XP_533675.2| PREDICTED: RNA polymerase II-associated factor 1 homolog isoform 1
[Canis lupus familiaris]
gi|410983062|ref|XP_003997863.1| PREDICTED: RNA polymerase II-associated factor 1 homolog [Felis
catus]
Length = 528
Score = 119 bits (299), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 101/335 (30%), Positives = 164/335 (48%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 30 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 89
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 90 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 139
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 140 EFNRYGISNEKP----EVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 195
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P + D S A E +M +
Sbjct: 196 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGAAALE---MMSQAM 245
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD 565
G +F+AY +P L K D+ E++ ++ REY+W+V+ +
Sbjct: 246 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKASK 305
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + D Y L T++ L K+RA G
Sbjct: 306 GYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 340
>gi|348563022|ref|XP_003467307.1| PREDICTED: RNA polymerase II-associated factor 1 homolog [Cavia
porcellus]
Length = 535
Score = 119 bits (298), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 101/335 (30%), Positives = 164/335 (48%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 30 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 89
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 90 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 139
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 140 EFNRYGISNEKP----EVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 195
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P + D S A E +M +
Sbjct: 196 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGAAALE---MMSQAM 245
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD 565
G +F+AY +P L K D+ E++ ++ REY+W+V+ +
Sbjct: 246 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKASK 305
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + D Y L T++ L K+RA G
Sbjct: 306 GYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 340
>gi|291389992|ref|XP_002711498.1| PREDICTED: Paf1, RNA polymerase II associated factor, homolog
[Oryctolagus cuniculus]
Length = 527
Score = 119 bits (298), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 101/335 (30%), Positives = 164/335 (48%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 30 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 89
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 90 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 139
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 140 EFNRYGISNEKP----EVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 195
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P + D S A E +M +
Sbjct: 196 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGAAALE---MMSQAM 245
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD 565
G +F+AY +P L K D+ E++ ++ REY+W+V+ +
Sbjct: 246 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKASK 305
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + D Y L T++ L K+RA G
Sbjct: 306 GYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 340
>gi|335302328|ref|XP_003359435.1| PREDICTED: RNA polymerase II-associated factor 1 homolog [Sus
scrofa]
Length = 532
Score = 119 bits (298), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 101/335 (30%), Positives = 164/335 (48%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 30 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 89
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 90 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 139
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 140 EFNRYGISNEKP----EVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 195
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P + D S A E +M +
Sbjct: 196 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGAAALE---MMSQAM 245
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD 565
G +F+AY +P L K D+ E++ ++ REY+W+V+ +
Sbjct: 246 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKASK 305
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + D Y L T++ L K+RA G
Sbjct: 306 GYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 340
>gi|31980912|ref|NP_062331.2| RNA polymerase II-associated factor 1 homolog [Mus musculus]
gi|81901032|sp|Q8K2T8.1|PAF1_MOUSE RecName: Full=RNA polymerase II-associated factor 1 homolog
gi|20987627|gb|AAH29843.1| Paf1, RNA polymerase II associated factor, homolog (S. cerevisiae)
[Mus musculus]
gi|53733817|gb|AAH83337.1| Paf1, RNA polymerase II associated factor, homolog (S. cerevisiae)
[Mus musculus]
gi|74138249|dbj|BAE28608.1| unnamed protein product [Mus musculus]
gi|148692192|gb|EDL24139.1| Paf1, RNA polymerase II associated factor, homolog (S. cerevisiae),
isoform CRA_b [Mus musculus]
Length = 535
Score = 119 bits (298), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 101/335 (30%), Positives = 164/335 (48%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 30 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 89
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 90 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 139
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 140 EFNRYGISNEKP----EVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 195
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P + D S A E +M +
Sbjct: 196 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGAAALE---MMSQAM 245
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD 565
G +F+AY +P L K D+ E++ ++ REY+W+V+ +
Sbjct: 246 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKASK 305
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + D Y L T++ L K+RA G
Sbjct: 306 GYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 340
>gi|149722114|ref|XP_001498010.1| PREDICTED: RNA polymerase II-associated factor 1 homolog [Equus
caballus]
Length = 524
Score = 119 bits (297), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 101/335 (30%), Positives = 164/335 (48%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 30 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 89
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 90 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 139
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 140 EFNRYGISNEKP----EVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 195
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P + D S A E +M +
Sbjct: 196 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGAAALE---MMSQAM 245
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD 565
G +F+AY +P L K D+ E++ ++ REY+W+V+ +
Sbjct: 246 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKASK 305
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + D Y L T++ L K+RA G
Sbjct: 306 GYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 340
>gi|426242871|ref|XP_004015294.1| PREDICTED: RNA polymerase II-associated factor 1 homolog [Ovis
aries]
Length = 532
Score = 119 bits (297), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 101/335 (30%), Positives = 164/335 (48%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 30 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 89
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 90 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 139
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 140 EFNRYGISNEKP----EVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 195
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P + D S A E +M +
Sbjct: 196 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGAAALE---MMSQAM 245
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD 565
G +F+AY +P L K D+ E++ ++ REY+W+V+ +
Sbjct: 246 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKASK 305
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + D Y L T++ L K+RA G
Sbjct: 306 GYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 340
>gi|67846072|ref|NP_001020069.1| RNA polymerase II-associated factor 1 homolog [Rattus norvegicus]
gi|81908653|sp|Q4V886.1|PAF1_RAT RecName: Full=RNA polymerase II-associated factor 1 homolog
gi|66911741|gb|AAH97494.1| Paf1, RNA polymerase II associated factor, homolog (S. cerevisiae)
[Rattus norvegicus]
gi|149056464|gb|EDM07895.1| Paf1, RNA polymerase II associated factor, homolog (S. cerevisiae)
[Rattus norvegicus]
Length = 535
Score = 119 bits (297), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 101/335 (30%), Positives = 164/335 (48%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 30 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 89
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 90 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 139
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 140 EFNRYGISNEKP----EVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 195
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P + D S A E +M +
Sbjct: 196 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGAAALE---MMSQAM 245
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD 565
G +F+AY +P L K D+ E++ ++ REY+W+V+ +
Sbjct: 246 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKASK 305
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + D Y L T++ L K+RA G
Sbjct: 306 GYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 340
>gi|395859720|ref|XP_003802180.1| PREDICTED: RNA polymerase II-associated factor 1 homolog isoform 1
[Otolemur garnettii]
Length = 533
Score = 119 bits (297), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 101/335 (30%), Positives = 164/335 (48%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 30 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 89
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 90 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 139
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 140 EFNRYGISNEKP----EVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 195
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P + D S A E +M +
Sbjct: 196 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGAAALE---MMSQAM 245
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD 565
G +F+AY +P L K D+ E++ ++ REY+W+V+ +
Sbjct: 246 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKASK 305
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + D Y L T++ L K+RA G
Sbjct: 306 GYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 340
>gi|114052767|ref|NP_001039758.1| RNA polymerase II-associated factor 1 homolog [Bos taurus]
gi|122136137|sp|Q2KJ14.1|PAF1_BOVIN RecName: Full=RNA polymerase II-associated factor 1 homolog
gi|86827732|gb|AAI05571.1| Paf1, RNA polymerase II associated factor, homolog (S. cerevisiae)
[Bos taurus]
gi|440910331|gb|ELR60139.1| RNA polymerase II-associated factor 1-like protein [Bos grunniens
mutus]
Length = 532
Score = 119 bits (297), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 101/335 (30%), Positives = 164/335 (48%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 30 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 89
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 90 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 139
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 140 EFNRYGISNEKP----EVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 195
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P + D S A E +M +
Sbjct: 196 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGAAALE---MMSQAM 245
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD 565
G +F+AY +P L K D+ E++ ++ REY+W+V+ +
Sbjct: 246 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKASK 305
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + D Y L T++ L K+RA G
Sbjct: 306 GYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 340
>gi|281344083|gb|EFB19667.1| hypothetical protein PANDA_017209 [Ailuropoda melanoleuca]
Length = 513
Score = 119 bits (297), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 101/335 (30%), Positives = 164/335 (48%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 15 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 74
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 75 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 124
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 125 EFNRYGISNEKP----EVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 180
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P + D S A E +M +
Sbjct: 181 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGAAALE---MMSQAM 230
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD 565
G +F+AY +P L K D+ E++ ++ REY+W+V+ +
Sbjct: 231 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKASK 290
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + D Y L T++ L K+RA G
Sbjct: 291 GYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 325
>gi|343958600|dbj|BAK63155.1| hypothetical protein [Pan troglodytes]
Length = 523
Score = 119 bits (297), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 102/335 (30%), Positives = 163/335 (48%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 20 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 79
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 80 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 129
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ E E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 130 EFNRYGIS----NEKPEVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 185
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P + D S A E +M +
Sbjct: 186 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGAAALE---MMSQAM 235
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD 565
G +F+AY +P L K D+ E++ ++ REY+W+V+ +
Sbjct: 236 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKASK 295
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + D Y L T++ L K+RA G
Sbjct: 296 GYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 330
>gi|21355457|ref|NP_649493.1| antimeros, isoform A [Drosophila melanogaster]
gi|442617435|ref|NP_001262263.1| antimeros, isoform B [Drosophila melanogaster]
gi|7296819|gb|AAF52095.1| antimeros, isoform A [Drosophila melanogaster]
gi|17944277|gb|AAL48032.1| LD37523p [Drosophila melanogaster]
gi|220946232|gb|ACL85659.1| atms-PA [synthetic construct]
gi|220955924|gb|ACL90505.1| atms-PA [synthetic construct]
gi|440217065|gb|AGB95646.1| antimeros, isoform B [Drosophila melanogaster]
Length = 538
Score = 118 bits (296), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 105/357 (29%), Positives = 174/357 (48%), Gaps = 44/357 (12%)
Query: 276 KKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIP 335
++ + +C++K+ N LP+ K + D RF +Y +SLE+N+K + E DLG+
Sbjct: 23 ERKSEIICRVKYGNNLPDIPFDLKFLQYPFDSHRFVQYNPTSLERNFKYDVLTEHDLGVT 82
Query: 336 LDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYI 395
+DL++ +Y S+ LDP DE+LL ++E +TP D + + R + VSWL K++YI
Sbjct: 83 VDLINRELYQADSM-TLLDPADEKLL-EEETLTPT--DSV--RSRQHSRTVSWLRKSEYI 136
Query: 396 SPLSMESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFEACKLRP 449
S Q+ + Q E E K G ++ ++L DRE QIK IE +F K
Sbjct: 137 ------STEQTRFQPQNLENIEAKVGYNVKKSLREETLYLDREAQIKAIEKTFSDTKSEI 190
Query: 450 I-HATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPT-ADSEIYSKMDKSVRDAHESRA 507
H + N+ PVE+LP+ PDF + FD P A + +++++ S+A
Sbjct: 191 TKHYSKPNVVPVEVLPIFPDFTNWKFPCAQVIFDSDPAPAGKNVPAQLEEM------SQA 244
Query: 508 IMKSYVATGSDSANPEKFLAYMVPSVNELSKDMYD--------ENEDVSFSWVREYHWDV 559
+++ G + E+F+AY +P+ L K D E E+ + REY+W+V
Sbjct: 245 MIR-----GVMDESGEQFVAYFLPTEQTLEKRRTDFINGELYKEEEEYEYKIAREYNWNV 299
Query: 560 RGDDADD-PTTYLVSFDDDEARYVPLPTKLNLRKKRAIEGR----SNDEVEHFPIPS 611
+ + Y D Y L T++ L K+R G+ + V+H P+ S
Sbjct: 300 KTKASKGYEENYFFVMRQDGIYYNELETRVRLNKRRVKVGQQPNNTKLVVKHRPLDS 356
>gi|351706483|gb|EHB09402.1| RNA polymerase II-associated factor 1-like protein [Heterocephalus
glaber]
Length = 535
Score = 118 bits (296), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 101/335 (30%), Positives = 164/335 (48%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 30 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 89
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 90 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 139
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 140 EFNRYGISNEKP----EVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 195
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P + D S A E +M +
Sbjct: 196 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGAAALE---MMSQAM 245
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD 565
G +F+AY +P L K D+ E++ ++ REY+W+V+ +
Sbjct: 246 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKASK 305
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + D Y L T++ L K+RA G
Sbjct: 306 GYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 340
>gi|417402499|gb|JAA48096.1| Putative rna polymerase ii regulator [Desmodus rotundus]
Length = 539
Score = 118 bits (296), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 101/335 (30%), Positives = 164/335 (48%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 30 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 89
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 90 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 139
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 140 EFNRYGISNEKP----EVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 195
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P + D S A E +M +
Sbjct: 196 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGAAALE---MMSQAM 245
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD 565
G +F+AY +P L K D+ E++ ++ REY+W+V+ +
Sbjct: 246 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKASK 305
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + D Y L T++ L K+RA G
Sbjct: 306 GYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 340
>gi|357627369|gb|EHJ77088.1| hypothetical protein KGM_06557 [Danaus plexippus]
Length = 587
Score = 118 bits (295), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 103/346 (29%), Positives = 166/346 (47%), Gaps = 37/346 (10%)
Query: 259 DRRAAPLLSGERTENRLKKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSL 318
+R P +GER + + ++K+ N LP+ K + RF +Y +SL
Sbjct: 11 EREKRPQRTGERR-------SELVTRVKYCNTLPDIPFDLKFLTYPFSSTRFIQYNPTSL 63
Query: 319 EKNYKPQLHVEPDLGIPLDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRK 378
EKNY+ ++ E DLG+ +DL++ +Y LDP DE+LL DD V+TP +D ++
Sbjct: 64 EKNYRYEVLTEHDLGVHIDLINRDIYQGDG-NAQLDPADEKLLEDD-VLTP--QDS--KR 117
Query: 379 ERPTDKGVSWLVKTQYISPLSMESARQSLTEKQAKELREMKGGRSILENLNDRERQIKEI 438
R K VSWL +++YIS QS+ + +AK +K S DR+ QIK I
Sbjct: 118 SRHHAKSVSWLRRSEYISTEQTRFQPQSMEKVEAKVGYNVKKIFSEETLYMDRDSQIKAI 177
Query: 439 EASFEACKLRPI--HATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPT-ADSEIYSKM 495
E +FE K + I H + + PVEI+P+ PDFE + FD P AD I ++
Sbjct: 178 EKTFEDNK-KTIEKHYSKPGVTPVEIMPVFPDFEMWKYPCAQVIFDSDPAPADKNIAGQI 236
Query: 496 DKSVRDAHESRAIMKSYVATGSDSANPEKFLAYMVPSVNELSKDMYDENEDV-------- 547
+ S+A+++ G + E+F+AY +P+ + + K D E +
Sbjct: 237 EAM------SQAMIR-----GVMDESGEQFVAYCLPTEDTIQKRRRDITEGIPYMDGDTY 285
Query: 548 SFSWVREYHWDVRGDDADD-PTTYLVSFDDDEARYVPLPTKLNLRK 592
+ REY+W+V+ + Y + + Y L T++ L K
Sbjct: 286 EYKMAREYNWNVKSKASKGYEENYFLVVRNHCIYYNELETRVRLSK 331
>gi|346470299|gb|AEO34994.1| hypothetical protein [Amblyomma maculatum]
Length = 475
Score = 118 bits (295), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 108/358 (30%), Positives = 171/358 (47%), Gaps = 59/358 (16%)
Query: 281 FLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLD 340
+C++K+ N LP+ PK ++ + +RF Y +SLE+NYK L E DLG+ +DL+D
Sbjct: 26 LVCRVKYCNTLPDIPFDPKFISYPFEPNRFVSYKATSLERNYKHDLLTEHDLGVTIDLID 85
Query: 341 LSVY--NPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPL 398
Y +P +V L P+DE+LL +D +TP +D ++ R + V WL KT+YI+
Sbjct: 86 PKTYEIDPNAV---LHPDDEKLLEEDS-LTP--QDS--KRSRHHNLVVPWLKKTEYIAT- 136
Query: 399 SMESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFEACKLRPI-- 450
E +R T E K G ++ + DRE QI I +FE + +PI
Sbjct: 137 --EFSRYGQTGVNT----ETKVGYNVKKLFKEEDLYMDRESQINAINKTFEEAQ-KPIEA 189
Query: 451 HATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMK 510
H + N++ VE+LPL PD E + F FD P +++ S+A+++
Sbjct: 190 HYSKPNVKAVEVLPLFPDSELWKYPFAQVMFDSDPAPITQL----------EEMSQAMIR 239
Query: 511 SYVATGSDSANPEKFLAYMVPSVNELSK---------DMYDENEDVSFSWVREYHWDVRG 561
G + E+F+AY +P+ + + K D DE+E + REY+W+V+
Sbjct: 240 -----GVMDESGEQFVAYFLPTEDTIKKRKRDAEEGVDYMDEDE-YEYRMAREYNWNVKN 293
Query: 562 DDADD-PTTYLVSFDDDEARYVPLPTKLNLRKKRAIEGRSNDEVEHFPIPSSIAVRRR 618
+ Y F +D Y L T++ L K+R G P S + VR R
Sbjct: 294 KASKGYEENYFFVFREDGVYYNELETRVRLTKRRLKPGVQ-------PNNSKLVVRHR 344
>gi|126272991|ref|XP_001372133.1| PREDICTED: RNA polymerase II-associated factor 1 homolog
[Monodelphis domestica]
Length = 527
Score = 118 bits (295), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 106/375 (28%), Positives = 181/375 (48%), Gaps = 49/375 (13%)
Query: 275 LKKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGI 334
+++ + +C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+
Sbjct: 23 VQERSGVVCRVKYCNTLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGV 82
Query: 335 PLDLLDLSVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQ 393
+DL++ Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+
Sbjct: 83 TIDLINPDTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTE 135
Query: 394 YISPLSMESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACK 446
YIS E R ++ ++ E+K G S+ + DR+ QI IE +FE A K
Sbjct: 136 YIST---EFNRYGVSNEKP----EVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQK 188
Query: 447 LRPIHATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESR 506
H + + PVE++P+ PDF+ + + FD P + D S A E
Sbjct: 189 SISQHYSKPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGSAALE-- 239
Query: 507 AIMKSYVATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWD 558
+M + G +F+AY +P + K D+ E++ ++ REY+W+
Sbjct: 240 -MMSQAMIRGMMDEEGNQFVAYFLPVEETMRKRKRDQEEEMDYTPDDIYDYKIAREYNWN 298
Query: 559 VRGDDAD--DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEGRSNDEVEHFPIPSSIAVR 616
V+ + + + + + D Y L T++ L K+RA G + +++ V
Sbjct: 299 VKNKASKGYEENYFFIFREGDGVYYNELETRVRLSKRRAKAGAQSG-------TNALLVV 351
Query: 617 RRANVTAIELKEQGA 631
+ N+ EL+ Q A
Sbjct: 352 KHRNMNEKELEAQEA 366
>gi|431920159|gb|ELK18198.1| RNA polymerase II-associated factor 1 like protein [Pteropus
alecto]
Length = 537
Score = 118 bits (295), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 101/335 (30%), Positives = 164/335 (48%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 30 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 89
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 90 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 139
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 140 EFNRYGISNEKP----EVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 195
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P + D S A E +M +
Sbjct: 196 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGAAALE---MMSQAM 245
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD 565
G +F+AY +P L K D+ E++ ++ REY+W+V+ +
Sbjct: 246 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKASK 305
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + D Y L T++ L K+RA G
Sbjct: 306 GYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 340
>gi|354483435|ref|XP_003503898.1| PREDICTED: RNA polymerase II-associated factor 1 homolog
[Cricetulus griseus]
Length = 524
Score = 118 bits (295), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 101/335 (30%), Positives = 164/335 (48%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 21 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 80
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 81 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 130
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 131 EFNRYGISNEKP----EVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 186
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P + D S A E +M +
Sbjct: 187 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGAAALE---MMSQAM 236
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD 565
G +F+AY +P L K D+ E++ ++ REY+W+V+ +
Sbjct: 237 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKASK 296
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + D Y L T++ L K+RA G
Sbjct: 297 GYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 331
>gi|432090694|gb|ELK24034.1| RNA polymerase II-associated factor 1 like protein [Myotis davidii]
Length = 522
Score = 118 bits (295), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 102/335 (30%), Positives = 163/335 (48%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 20 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 79
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 80 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 129
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ E E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 130 EFNRYGIS----NEKPEVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 185
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P + D S A E +M +
Sbjct: 186 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGAAALE---MMSQAM 235
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD 565
G +F+AY +P L K D+ E++ ++ REY+W+V+ +
Sbjct: 236 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKASK 295
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + D Y L T++ L K+RA G
Sbjct: 296 GYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 330
>gi|444732083|gb|ELW72402.1| RNA polymerase II-associated factor 1 like protein [Tupaia
chinensis]
Length = 523
Score = 118 bits (295), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 101/335 (30%), Positives = 164/335 (48%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 20 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 79
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 80 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 129
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 130 EFNRYGISNEKP----EVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 185
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P + D S A E +M +
Sbjct: 186 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGAAALE---MMSQAM 235
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD 565
G +F+AY +P L K D+ E++ ++ REY+W+V+ +
Sbjct: 236 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKASK 295
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + D Y L T++ L K+RA G
Sbjct: 296 GYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 330
>gi|195343495|ref|XP_002038333.1| GM10775 [Drosophila sechellia]
gi|195568273|ref|XP_002102142.1| GD19748 [Drosophila simulans]
gi|194133354|gb|EDW54870.1| GM10775 [Drosophila sechellia]
gi|194198069|gb|EDX11645.1| GD19748 [Drosophila simulans]
Length = 538
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 105/357 (29%), Positives = 174/357 (48%), Gaps = 44/357 (12%)
Query: 276 KKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIP 335
++ + +C++K+ N LP+ K + D RF +Y +SLE+N+K + E DLG+
Sbjct: 23 ERKSEIICRVKYGNNLPDIPFDLKFLQYPFDSHRFVQYNPTSLERNFKYDVLTEHDLGVT 82
Query: 336 LDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYI 395
+DL++ +Y S+ LDP DE+LL ++E +TP D + + R + VSWL K++YI
Sbjct: 83 VDLINRELYQADSM-TLLDPADEKLL-EEETLTPT--DSV--RSRQHSRTVSWLRKSEYI 136
Query: 396 SPLSMESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFEACKLRP 449
S Q+ + Q E E K G ++ ++L DRE QIK IE +F K
Sbjct: 137 ------STEQTRFQPQNLENIEAKVGYNVKKSLREETLYLDREAQIKAIEKTFSDTKSEI 190
Query: 450 I-HATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPT-ADSEIYSKMDKSVRDAHESRA 507
H + N+ PVE+LP+ PDF + FD P A + +++++ S+A
Sbjct: 191 TKHYSKPNVVPVEVLPIFPDFTNWKFPCAQVIFDSDPAPAGKNVPAQLEEM------SQA 244
Query: 508 IMKSYVATGSDSANPEKFLAYMVPSVNELSKDMYD--------ENEDVSFSWVREYHWDV 559
+++ G + E+F+AY +P+ L K D E E+ + REY+W+V
Sbjct: 245 MIR-----GVMDESGEQFVAYFLPTEQTLEKRRTDFINGELYKEEEEYEYKIAREYNWNV 299
Query: 560 RGDDADD-PTTYLVSFDDDEARYVPLPTKLNLRKKRAIEGR----SNDEVEHFPIPS 611
+ + Y D Y L T++ L K+R G+ + V+H P+ S
Sbjct: 300 KTKASKGYEENYFFVMRQDGIYYNELETRVRLNKRRVKVGQQPNNTKLVVKHRPLDS 356
>gi|348505228|ref|XP_003440163.1| PREDICTED: RNA polymerase II-associated factor 1 homolog
[Oreochromis niloticus]
Length = 535
Score = 117 bits (294), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 99/335 (29%), Positives = 160/335 (47%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D+ RF +Y +SLEK +K +L EPDLG+ +DL++
Sbjct: 30 VCRVKYCNTLPDIPFDPKFITYPFDQHRFVQYKATSLEKQHKHELLTEPDLGVTIDLINP 89
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P++ LDP DE+LL +D ++ ++ + K V W+ KT+YIS
Sbjct: 90 DTYRIDPNI--LLDPADEKLLEED-----IQAPSSSKRSQQHAKVVPWMRKTEYIST--- 139
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 140 EFNRYGVSNEKV----EVKIGVSVKQQFTEEEIYKDRDSQISAIEKTFEDAQKSITQHYS 195
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE+LP+ PDF+ + + FD P + D S A E +M +
Sbjct: 196 KPRVTPVEVLPVFPDFKMWINPCAQVIFDSDP-------APKDMSGPAAVE---MMSQAM 245
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDV--------SFSWVREYHWDVRGDDAD 565
G +F+AY +P+ L K D E + + REY+W+V+ +
Sbjct: 246 IRGMMDEEGNQFVAYFLPNEETLRKRKRDCEEGIDYMPDDLYDYKIAREYNWNVKNKASK 305
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + D D Y L T++ L K+RA G
Sbjct: 306 GYEENYFFIFRDGDGVYYNELETRVRLSKRRAKAG 340
>gi|344236939|gb|EGV93042.1| RNA polymerase II-associated factor 1-like [Cricetulus griseus]
Length = 537
Score = 117 bits (294), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 101/335 (30%), Positives = 164/335 (48%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 34 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 93
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 94 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 143
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 144 EFNRYGISNEKP----EVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 199
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P + D S A E +M +
Sbjct: 200 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGAAALE---MMSQAM 249
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD 565
G +F+AY +P L K D+ E++ ++ REY+W+V+ +
Sbjct: 250 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKASK 309
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + D Y L T++ L K+RA G
Sbjct: 310 GYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 344
>gi|410910168|ref|XP_003968562.1| PREDICTED: RNA polymerase II-associated factor 1 homolog [Takifugu
rubripes]
Length = 529
Score = 117 bits (294), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 99/335 (29%), Positives = 159/335 (47%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D+ RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 30 VCRVKYCNSLPDIPFDPKFITYPFDQHRFVQYKATSLEKQHKHDLLSEPDLGVTIDLINP 89
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL +D ++ ++ + K V W+ KT+YIS
Sbjct: 90 DTYRIDPNV--LLDPADEKLLEED-----IQAPSSSKRSQQHAKVVPWMRKTEYIST--- 139
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 140 EFNRYGVSNEKV----EVKIGVSVKQQFTEEEIYKDRDSQISAIEKTFEDAQKSITQHYS 195
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE+LP+ PDF+ + + FD P A +I A +M +
Sbjct: 196 KPRVTPVEVLPVFPDFKMWINPCAQVIFDSDP-APKDI---------SAPAGVEMMSQAM 245
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDV--------SFSWVREYHWDVRGDDAD 565
G +F+AY +P+ + L K D E + + REY+W+V+ +
Sbjct: 246 IRGMMDEEGNQFVAYFLPNEDTLRKRKRDFEEGMDYMPEDLYDYKIAREYNWNVKNKASK 305
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + D D Y L T++ L K+RA G
Sbjct: 306 GYEENYFFIFRDGDGVYYNELETRVRLSKRRAKAG 340
>gi|126329111|ref|XP_001363052.1| PREDICTED: RNA polymerase II-associated factor 1 homolog
[Monodelphis domestica]
Length = 551
Score = 117 bits (293), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 100/335 (29%), Positives = 164/335 (48%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 30 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 89
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 90 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 139
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 140 EFNRYGISNEKP----EVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 195
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P + D S A E +M +
Sbjct: 196 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGSAALE---MMSQAM 245
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD 565
G +F+AY +P + K D+ E++ ++ REY+W+V+ +
Sbjct: 246 IRGMMDEEGNQFVAYFLPVEETMRKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKASK 305
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + D Y L T++ L K+RA G
Sbjct: 306 GYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 340
>gi|157117813|ref|XP_001653048.1| hypothetical protein AaeL_AAEL001339 [Aedes aegypti]
gi|108883312|gb|EAT47537.1| AAEL001339-PA [Aedes aegypti]
Length = 549
Score = 117 bits (293), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 105/353 (29%), Positives = 174/353 (49%), Gaps = 44/353 (12%)
Query: 281 FLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLD 340
+ ++K+ N LP+ K + + DRF +Y +SLE+NY+ ++ E DLG+ +DL++
Sbjct: 28 LISRVKYCNTLPDIPFDLKFITYPFENDRFIQYKPTSLERNYRYEVLTEHDLGVTIDLIN 87
Query: 341 LSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
+Y + LDP DE+LL +D + TP +D ++ K VSWL K++YI
Sbjct: 88 RDLYQIDHL-AQLDPADEKLLEED-IHTP--QDSLRSSRHA--KSVSWLRKSEYI----- 136
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASF-EACKLRPIHAT 453
S Q+ + Q E E K G ++ ++L DRE QIK IE +F + K H +
Sbjct: 137 -STEQTRFQPQTMEKVEAKVGFNVKQSLREETLYMDREAQIKAIEKTFDDNTKEITAHYS 195
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPT-ADSEIYSKMDKSVRDAHESRAIMKSY 512
+ PVEILP+ PDF + FD P A + ++M++ S+A+++
Sbjct: 196 KPGVTPVEILPVFPDFANWKYPCAQVIFDFDPAPAGKNVPAQMEEM------SQAMIR-- 247
Query: 513 VATGSDSANPEKFLAYMVPSVNELSK---DMYDEN-----EDVSFSWVREYHWDVRGDDA 564
G + E+F+AY +PS L K D+ +E E+ + REY+W+V+ +
Sbjct: 248 ---GVMDESGEQFVAYFLPSEETLEKRRRDLVNETLYEDEEEYEYKMAREYNWNVKSKAS 304
Query: 565 DD-PTTYLVSFDDDEARYVPLPTKLNLRKKRAIEGR----SNDEVEHFPIPSS 612
Y + D Y L T++ L K+R G+ + V+H P+ +S
Sbjct: 305 KGYEENYFLVLRQDGMYYNELETRVRLSKRRQKVGQQPNNTKLVVKHRPLNAS 357
>gi|12857167|dbj|BAB30913.1| unnamed protein product [Mus musculus]
Length = 377
Score = 117 bits (293), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 101/335 (30%), Positives = 164/335 (48%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 30 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 89
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 90 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 139
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 140 EFNRYGISNEKP----EVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 195
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P + D S A E +M +
Sbjct: 196 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGAAALE---MMSQAM 245
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD 565
G +F+AY +P L K D+ E++ ++ REY+W+V+ +
Sbjct: 246 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKASK 305
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + D Y L T++ L K+RA G
Sbjct: 306 GYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 340
>gi|145587688|ref|NP_001019624.1| RNA polymerase II-associated factor 1 homolog [Danio rerio]
gi|82277925|sp|Q4U0S5.1|PAF1_DANRE RecName: Full=RNA polymerase II-associated factor 1 homolog;
AltName: Full=PD2-like protein
gi|66277459|gb|AAY44602.1| PD2-like protein [Danio rerio]
gi|141795323|gb|AAI39614.1| Paf1l protein [Danio rerio]
Length = 503
Score = 117 bits (293), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 100/339 (29%), Positives = 162/339 (47%), Gaps = 50/339 (14%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D+ RF +Y +SLEK +K +L EPDLG+ +DL+
Sbjct: 30 VCRVKYGNSLPDIPFDPKFITYPFDQHRFVQYKATSLEKQHKHELLTEPDLGVTIDLI-- 87
Query: 342 SVYNPPSVRPP----LDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISP 397
NP + R LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 88 ---NPDTYRIDPNILLDPADEKLL-EEEIQAP---SSSKRSQQHA-KVVPWMRKTEYIST 139
Query: 398 LSMESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPI 450
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K
Sbjct: 140 ---EFNRYGVSNEKV----EVKIGVSVKQQFTEEEIYKDRDSQIAAIEKTFEDAQKSISQ 192
Query: 451 HATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPT-ADSEIYSKMDKSVRDAHESRAIM 509
H + + PVE+LP+ PDF+ + + FD P D + +D +M
Sbjct: 193 HYSKPRVTPVEVLPVFPDFKMWINPCAQVIFDSDPAPKDVSAPAGVD-----------MM 241
Query: 510 KSYVATGSDSANPEKFLAYMVPSVNELSKDMYDENEDV--------SFSWVREYHWDVRG 561
+ G +F+AY +P+ + + K D E++ + REY+W+V+
Sbjct: 242 SQAMIRGMMDEEGNQFVAYFLPNEDTMRKRKRDVEEELDYMPEEVYEYKIAREYNWNVKN 301
Query: 562 DDAD--DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + D D Y L T++ L K+RA G
Sbjct: 302 KASKGYEENYFFIFRDADGVYYNELETRVRLSKRRAKVG 340
>gi|74147922|dbj|BAE22315.1| unnamed protein product [Mus musculus]
Length = 534
Score = 117 bits (293), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 100/335 (29%), Positives = 164/335 (48%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 30 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 89
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 90 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 139
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 140 EFNRYGISNEKP----EVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 195
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P + D S A E +M +
Sbjct: 196 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGAAALE---MMSQAM 245
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD 565
G +F+AY +P L K D+ E++ ++ REY+W+V+ +
Sbjct: 246 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKASK 305
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + D Y L T++ L K++A G
Sbjct: 306 GYEENYFFIFREGDGVYYNELETRVRLSKRQAKAG 340
>gi|90084485|dbj|BAE91084.1| unnamed protein product [Macaca fascicularis]
Length = 531
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 100/335 (29%), Positives = 163/335 (48%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y + LEK +K L EPDLG+ +DL++
Sbjct: 30 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATFLEKQHKHDLLTEPDLGVTIDLINP 89
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 90 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 139
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 140 EFNRYGISNEKP----EVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 195
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P + D S A E +M +
Sbjct: 196 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGAAALE---MMSQAM 245
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD 565
G +F+AY +P L K D+ E++ ++ REY+W+V+ +
Sbjct: 246 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKASK 305
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + D Y L T++ L K+RA G
Sbjct: 306 GYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 340
>gi|7670494|dbj|BAA95098.1| unnamed protein product [Mus musculus]
Length = 535
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 99/337 (29%), Positives = 164/337 (48%), Gaps = 46/337 (13%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 30 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 89
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 90 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 139
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 140 EFNRYGISNEKP----EVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 195
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAI--MKS 511
+ PVE++P+ PDF+ + + FD D + +D + A+ M
Sbjct: 196 KPRVTPVEVMPVFPDFKMWINPCAQVIFDS------------DLAPKDTSGAAALEMMSQ 243
Query: 512 YVATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDD 563
+ G +F+AY +P L K D+ E++ ++ REY+W+V+
Sbjct: 244 AMIRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKA 303
Query: 564 AD--DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + + D Y L T++ L K+RA G
Sbjct: 304 SKGYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 340
>gi|296477828|tpg|DAA19943.1| TPA: RNA polymerase II-associated factor 1 homolog [Bos taurus]
Length = 386
Score = 117 bits (292), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 101/335 (30%), Positives = 164/335 (48%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 30 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 89
Query: 342 SVYNP-PSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 90 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 139
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 140 EFNRYGISNEKP----EVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 195
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P + D S A E +M +
Sbjct: 196 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGAAALE---MMSQAM 245
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD 565
G +F+AY +P L K D+ E++ ++ REY+W+V+ +
Sbjct: 246 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKASK 305
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + D Y L T++ L K+RA G
Sbjct: 306 GYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 340
>gi|47227757|emb|CAG08920.1| unnamed protein product [Tetraodon nigroviridis]
Length = 370
Score = 117 bits (292), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 100/335 (29%), Positives = 158/335 (47%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D+ RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 31 VCRVKYCNSLPDIPFDPKFITYPFDQHRFVQYKATSLEKQHKHDLLSEPDLGVTIDLINP 90
Query: 342 SVYNP-PSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y PSV LDP DE+LL +D ++ ++ + K V W+ KT+YIS
Sbjct: 91 DTYRIDPSVL--LDPADEKLLEED-----IQAPSSSKRSQQHAKVVPWMRKTEYIST--- 140
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 141 EFNRYGVSNEKV----EVKIGVSVKQQFTEEEIYKDRDSQISAIEKTFEDAQKSITQHYS 196
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE+LP+ PDF+ + + FD P A +I +M +
Sbjct: 197 KPRVTPVEVLPVFPDFKMWINPCAQVIFDSDP-APKDI---------SGPAGVEMMSQAM 246
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDV--------SFSWVREYHWDVRGDDAD 565
G +F+AY +P+ + L K D E V + REY+W+V+ +
Sbjct: 247 IRGMMDEEGNQFVAYFLPNEDTLRKRKRDFEEGVDYMPEDLYDYKIAREYNWNVKNKASK 306
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + D D Y L T++ L K+RA G
Sbjct: 307 GYEENYFFIFRDGDGVYYNELETRVRLSKRRAKAG 341
>gi|241825632|ref|XP_002416619.1| conserved hypothetical protein [Ixodes scapularis]
gi|215511083|gb|EEC20536.1| conserved hypothetical protein [Ixodes scapularis]
Length = 431
Score = 116 bits (291), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 98/331 (29%), Positives = 155/331 (46%), Gaps = 39/331 (11%)
Query: 281 FLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLD 340
+C++K+ N LP+ PK +A + +RF Y +SLE+NYK L E DLG+ +DL+D
Sbjct: 4 LVCRVKYCNTLPDIPFDPKFIAYPFEPNRFVSYKATSLERNYKHDLLTEHDLGVTIDLID 63
Query: 341 LSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y + L P+DE+LL +D TP +D ++ R + V WL KT+YI+
Sbjct: 64 PKTYEIDA-NAVLHPDDEKLLEED-AHTP--QDS--KRSRHHNLVVPWLKKTEYIAT-EF 116
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLNDRERQIKEIEASFE------AC--KLRPIHA 452
QS + K +K + DRE QI I +FE +C + H
Sbjct: 117 NRYGQSGVNTETKVGYNVKKLFKEEDLYMDRESQISAINKTFEEAQKQASCFHSFQTCHY 176
Query: 453 TNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSY 512
+ N++PVE+LPL PD E + F FD P +++ S+A+++
Sbjct: 177 SKPNVRPVEVLPLFPDSELWKYPFAQVMFDSDPAPITQL----------EEMSQAMIR-- 224
Query: 513 VATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDA 564
G + E+F+AY +P+ + + K D E V + REY+W+V+ +
Sbjct: 225 ---GVMDESGEQFVAYFLPTEDTIRKRKRDAEEKVEYMDEDEYEYRMAREYNWNVKNKAS 281
Query: 565 DD-PTTYLVSFDDDEARYVPLPTKLNLRKKR 594
Y F +D Y L T++ L K+R
Sbjct: 282 KGYEENYFFVFREDGVFYNELETRVRLSKRR 312
>gi|66825187|ref|XP_645948.1| RNA polymerase II-associated factor 1 [Dictyostelium discoideum
AX4]
gi|60474119|gb|EAL72056.1| RNA polymerase II-associated factor 1 [Dictyostelium discoideum
AX4]
Length = 499
Score = 116 bits (291), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 100/347 (28%), Positives = 165/347 (47%), Gaps = 47/347 (13%)
Query: 277 KPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPL 336
K + F CKLKF+N LPE +PK + L D RFT+Y +SLE+ YK L EP LGIP+
Sbjct: 108 KISEFQCKLKFQNSLPEIPFEPKFLKLSSDFQRFTQYKTTSLERQYKHPLLTEPQLGIPI 167
Query: 337 DLLDLSVYNPPSVRPPLDPEDEELLR--DDEVVTPVKKDGIKRKERPTDKGVSWLVKTQY 394
DL+D SVYN P + P DE LL+ + + I +K+ V WL +T+Y
Sbjct: 168 DLIDPSVYNTPKSPIQVPPGDEPLLKPLSQQDLEEKNSSAIFKKKSEIRPNVGWLRRTEY 227
Query: 395 ISPLSMESARQSLTEKQAKELREMKGGRSILENLNDRERQIKE--------IEASFEACK 446
+S S E+ T++ + + S N+++ + + +E +F+ C
Sbjct: 228 LSK-SDENTFGRPTKRISTSGELLTSSSSGSNTPNNKQLSLAQEQLSDSVLVENTFDICA 286
Query: 447 --LRPIHATNKNLQPVEILPLLPDFERYDDQFVAATFDGAP----------TADSEIYSK 494
+ +H TN +L+PV +L + PDF+ + + F TFD P + + Y+
Sbjct: 287 SDYQFVHPTNPSLKPVSVLQVFPDFDLWANSFTEVTFDSDPLDHFLPKDLRSDKVQEYAS 346
Query: 495 MDKSVRDAHESRAIMKSYVATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFSWVRE 554
VR+ +AI+K +KF+ ++ P + K+ YD N + F + +
Sbjct: 347 RHNEVRN----KAIVKGIT---------DKFVYFITPDL----KENYDVNNNNDFE-IDD 388
Query: 555 YHWDVR----GDDADDPTT--YLVSFDDDEARYVPLPTKLNLRKKRA 595
+ ++ D D T+ Y +D Y PL ++NL+K ++
Sbjct: 389 SQYKMQKVLTSDIITDATSQNYFFLVKNDAVYYNPLKNRVNLKKIKS 435
>gi|344298406|ref|XP_003420884.1| PREDICTED: RNA polymerase II-associated factor 1 homolog [Loxodonta
africana]
Length = 499
Score = 116 bits (290), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 101/335 (30%), Positives = 164/335 (48%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 30 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 89
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 90 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 139
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 140 EFNRYGISNEKP----EVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 195
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P + D S A E +M +
Sbjct: 196 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGAAALE---MMSQAM 245
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD 565
G +F+AY +P L K D+ E++ ++ REY+W+V+ +
Sbjct: 246 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKASK 305
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + D Y L T++ L K+RA G
Sbjct: 306 GYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 340
>gi|378744165|ref|NP_001243755.1| RNA polymerase II-associated factor 1 homolog isoform 2 [Homo
sapiens]
gi|119577287|gb|EAW56883.1| Paf1, RNA polymerase II associated factor, homolog (S. cerevisiae),
isoform CRA_c [Homo sapiens]
Length = 485
Score = 115 bits (289), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 101/335 (30%), Positives = 163/335 (48%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 20 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 79
Query: 342 SVYNP-PSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 80 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 129
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 130 EFNRYGISNEKP----EVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 185
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P D S A E +M +
Sbjct: 186 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDPAPK-------DTSGAAALE---MMSQAM 235
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD 565
G +F+AY +P L K D+ E++ ++ REY+W+V+ +
Sbjct: 236 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKASK 295
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + D Y L T++ L K+RA G
Sbjct: 296 GYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 330
>gi|355709097|gb|AES03478.1| Paf1, RNA polymerase II associated factor,-like protein [Mustela
putorius furo]
Length = 337
Score = 115 bits (289), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 100/332 (30%), Positives = 163/332 (49%), Gaps = 42/332 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 30 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 89
Query: 342 SVYNP-PSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 90 DTYRIDPNVL--LDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 139
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 140 EFNRYGISNEKP----EVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 195
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P + D S A E +M +
Sbjct: 196 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGAAALE---MMSQAM 245
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD 565
G +F+AY +P L K D+ E++ ++ REY+W+V+ +
Sbjct: 246 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKASK 305
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRA 595
+ + + + D Y L T++ L K+RA
Sbjct: 306 GYEENYFFIFREGDGVYYNELETRVRLSKRRA 337
>gi|395859722|ref|XP_003802181.1| PREDICTED: RNA polymerase II-associated factor 1 homolog isoform 2
[Otolemur garnettii]
Length = 484
Score = 115 bits (289), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 101/338 (29%), Positives = 162/338 (47%), Gaps = 48/338 (14%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL+
Sbjct: 20 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLI-- 77
Query: 342 SVYNPPSVR----PPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISP 397
NP + R LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 78 ---NPDTYRIDPNVLLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST 129
Query: 398 LSMESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPI 450
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K
Sbjct: 130 ---EFNRYGISNEKP----EVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQ 182
Query: 451 HATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMK 510
H + + PVE++P+ PDF+ + + FD P D S A E +M
Sbjct: 183 HYSKPRVTPVEVMPVFPDFKMWINPCAQVIFDSDPAPK-------DTSGAAALE---MMS 232
Query: 511 SYVATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGD 562
+ G +F+AY +P L K D+ E++ ++ REY+W+V+
Sbjct: 233 QAMIRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNK 292
Query: 563 DAD--DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + + D Y L T++ L K+RA G
Sbjct: 293 ASKGYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 330
>gi|256079279|ref|XP_002575916.1| procollagen-lysine2-oxoglutarate 5-dioxygenase [Schistosoma
mansoni]
gi|360044866|emb|CCD82414.1| putative procollagen-lysine,2-oxoglutarate 5-dioxygenase
[Schistosoma mansoni]
Length = 921
Score = 115 bits (288), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 105/372 (28%), Positives = 165/372 (44%), Gaps = 75/372 (20%)
Query: 277 KPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPL 336
K + LC+LK++N LPE PK + + RF +Y +SLE+NYK +L E D+G+ +
Sbjct: 25 KTESLLCRLKYQNNLPELPFDPKFLVYPLEPSRFLQYVATSLERNYKHELLTETDVGVEV 84
Query: 337 DLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYIS 396
DL+D V+ R L P+DE LL D E PT
Sbjct: 85 DLIDPDVFRIDK-RATLHPDDERLLED---------------EAPT-------------- 114
Query: 397 PLSMESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFEACKLRPI 450
+AR+S R K G ++ +LN DRE QI IE +F+A + +PI
Sbjct: 115 ---FVNARKS---------RHQKLGYNVKRHLNEEIVYRDRESQINAIEETFKAAE-KPI 161
Query: 451 HATNK--NLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAI 508
H N+ +E+LP+LPDF + FD P+ ++ ++ + V ++A+
Sbjct: 162 HKHYSKPNVHALEVLPVLPDFTLWRYPCAQVIFDDDPSRKNKTTTEQKEEV-----NQAM 216
Query: 509 MKSYVATGSDSANPEKFLAYMVPSVNELSKDM----------YDENEDVSFSWVREYHWD 558
++ V D F+AY +P+ E +K + Y E+ + REY+W+
Sbjct: 217 IRGMVDESGD-----HFVAYFLPT--EQTKQLRRLDAENQTPYTEDAAYEYELTREYNWN 269
Query: 559 VRGDD-ADDPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEGRSNDEVEHFPIPSSIAVRR 617
V+ A+ Y F D Y L T++ L K+R + +S V P + +
Sbjct: 270 VKNKTMANYEENYFFCFRKDGVYYNELETRVRLSKRRKL-NQSGTNVGALQAPKTRLIVH 328
Query: 618 RANVTAIELKEQ 629
+ T ELK Q
Sbjct: 329 HRDFTDEELKAQ 340
>gi|393905193|gb|EFO23108.2| Paf1 family protein [Loa loa]
Length = 569
Score = 115 bits (288), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 107/363 (29%), Positives = 158/363 (43%), Gaps = 57/363 (15%)
Query: 279 TTFLCKLKFRNELPEPSAQPKLMAL-KKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLD 337
T FLC++K+ N LP+ K +A RFT Y SSLEKN+K +L EPD G+ +D
Sbjct: 31 TDFLCRVKYSNALPDIPFDTKFLACPFVSLSRFTDYKSSSLEKNFKFELLCEPDCGVNID 90
Query: 338 LLDLSVY----NPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQ 393
L++ Y + P P+D E L +DE P R+ K V W+ KT+
Sbjct: 91 LINPETYYVDPDNPKKHHPIDLE----LLEDEQANPQN----LRRSLQHSKMVPWMRKTE 142
Query: 394 YISPLSMESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASF-EACK 446
YIS E R + + E +E + G S + DR QI I +F +A K
Sbjct: 143 YISS---EFTRFGV----SAERQETRIGYSTKKKFQTDVLYRDRTSQIAAINKTFDDASK 195
Query: 447 LRPIHATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESR 506
H T K + VE LPLLPDF+ + F FDG P + D +++
Sbjct: 196 TVLQHPTKKGVTAVEELPLLPDFDNWMHPFALVVFDGDPIPQN-----------DKVDAQ 244
Query: 507 AIMKSYVATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWD 558
+M + G E+F+ Y +PS L K + D + + F VR+Y+W
Sbjct: 245 TLMPQALIRGMMDEEGEQFVTYFLPSKETLDKRLKDAEDGLEFDSDYVYEYQSVRDYNWL 304
Query: 559 VRGDDAD--DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEGRSNDEVEHFPIPSSIAVR 616
VR + +L + D Y L TK++L +++ R + + VR
Sbjct: 305 VRNKSTKGYEQDNFLFTIRDGAVYYNELETKVSLNRRKTKRIRQ---------ETKLMVR 355
Query: 617 RRA 619
RA
Sbjct: 356 NRA 358
>gi|21758435|dbj|BAC05305.1| unnamed protein product [Homo sapiens]
Length = 485
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 100/335 (29%), Positives = 163/335 (48%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLE+ +K L EPDLG+ +DL++
Sbjct: 20 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLERQHKHDLLTEPDLGVTIDLINP 79
Query: 342 SVYNP-PSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 80 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 129
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 130 EFNRYGISNEKP----EVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 185
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P D S A E +M +
Sbjct: 186 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDPAPK-------DTSGAAALE---MMSQAM 235
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD 565
G +F+AY +P L K D+ E++ ++ REY+W+V+ +
Sbjct: 236 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKASK 295
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + D Y L T++ L K+RA G
Sbjct: 296 GYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 330
>gi|340373169|ref|XP_003385114.1| PREDICTED: RNA polymerase II-associated factor 1 homolog
[Amphimedon queenslandica]
Length = 420
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 97/332 (29%), Positives = 163/332 (49%), Gaps = 46/332 (13%)
Query: 281 FLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLD 340
LC ++F+N LP+ +PK + D RF +Y +SLE+N++ + E DLG+ +DL+
Sbjct: 29 LLCSVQFKNVLPDVPFEPKFITYPFDSLRFIQYNPTSLERNHRHETLTEVDLGVSVDLI- 87
Query: 341 LSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
L+P D+++L ++EV + ++ R + VSWL +T+YIS
Sbjct: 88 -----LHETHTSLEPADDKILLEEEVTAQPE----SKRARQHARNVSWLRRTEYISS--- 135
Query: 401 ESARQSLTEKQAKELREMKGGRSI---LENLN---DRERQIKEIEASFEACKLRPI--HA 452
E R + A E K G ++ L+ LN DR+ QI IE++F A + +PI H
Sbjct: 136 ELTRSHTVGETA----ETKVGYNVKKKLQGLNLYKDRDSQISAIESTFTAAQ-KPILKHY 190
Query: 453 TNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSY 512
+ ++ E+LPL PDF+ + F FD P +Y K K R E M
Sbjct: 191 SKPGVEAREVLPLFPDFKLWHLPFAQVIFDTDPA----VYGK--KEDRQMQE----MSQA 240
Query: 513 VATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSF--------SWVREYHWDVRGDDA 564
+ G N E+F+AY +PS + ++K D+ + +SF S R+Y+W+V+ +
Sbjct: 241 MIRGMMDENNEQFVAYFLPSPDTITKREEDDRDGISFRPDEEYEYSLSRDYNWNVKNKAS 300
Query: 565 D--DPTTYLVSFDDDEARYVPLPTKLNLRKKR 594
+ + + V + Y + T++ L K+R
Sbjct: 301 KGYEESYFFVVRPGEGVYYNEIETRVRLNKRR 332
>gi|7023594|dbj|BAA92020.1| unnamed protein product [Homo sapiens]
Length = 531
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 99/335 (29%), Positives = 162/335 (48%), Gaps = 42/335 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++R +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 30 VCRVKYCNSLPDIPFDPKFITYPFDQNRIVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 89
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 90 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 139
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ +K G S+ + DR+ QI IE +FE A K H +
Sbjct: 140 EFNRYGISNEKPG----VKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 195
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P + D S A E +M +
Sbjct: 196 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGAAALE---MMSQAM 245
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD 565
G +F+AY +P L K D+ E++ ++ REY+W+V+ +
Sbjct: 246 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKASK 305
Query: 566 --DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + D Y L T++ L K+RA G
Sbjct: 306 GYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 340
>gi|55726317|emb|CAH89930.1| hypothetical protein [Pongo abelii]
Length = 534
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 101/336 (30%), Positives = 164/336 (48%), Gaps = 43/336 (12%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 30 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 89
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 90 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 139
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 140 EFNRYGISNEKP----EVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 195
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P + D S A E +M +
Sbjct: 196 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGAAALE---MMSQAM 245
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGDDAD 565
G +F+AY +P L K D+ E++ ++ REY+W+V+ +
Sbjct: 246 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNKASK 305
Query: 566 --DPTTYLVSFDDDEARYVPLPT-KLNLRKKRAIEG 598
+ + + + D Y L T ++ L K+RA G
Sbjct: 306 GYEENYFFIFREGDGVYYNELETRRVRLSKRRAKAG 341
>gi|426388668|ref|XP_004060755.1| PREDICTED: RNA polymerase II-associated factor 1 homolog isoform 2
[Gorilla gorilla gorilla]
Length = 487
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 101/338 (29%), Positives = 162/338 (47%), Gaps = 48/338 (14%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL+
Sbjct: 20 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLI-- 77
Query: 342 SVYNPPSVR----PPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISP 397
NP + R LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 78 ---NPDTYRIDPNVLLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST 129
Query: 398 LSMESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPI 450
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K
Sbjct: 130 ---EFNRYGISNEKP----EVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQ 182
Query: 451 HATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMK 510
H + + PVE++P+ PDF+ + + FD P D S A E +M
Sbjct: 183 HYSKPRVTPVEVMPVFPDFKMWINPCAQVIFDSDPAPK-------DTSGAAALE---MMS 232
Query: 511 SYVATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRGD 562
+ G +F+AY +P L K D+ E++ ++ REY+W+V+
Sbjct: 233 QAMIRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKNK 292
Query: 563 DAD--DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + + D Y L T++ L K+RA G
Sbjct: 293 ASKGYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 330
>gi|312076665|ref|XP_003140963.1| Paf1 family protein [Loa loa]
Length = 385
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 112/365 (30%), Positives = 159/365 (43%), Gaps = 54/365 (14%)
Query: 279 TTFLCKLKFRNELPEPSAQPKLMAL-KKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLD 337
T FLC++K+ N LP+ K +A RFT Y SSLEKN+K +L EPD G+ +D
Sbjct: 31 TDFLCRVKYSNALPDIPFDTKFLACPFVSLSRFTDYKSSSLEKNFKFELLCEPDCGVNID 90
Query: 338 LLDLSVY----NPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQ 393
L++ Y + P P+D E L +DE P R+ K V W+ KT+
Sbjct: 91 LINPETYYVDPDNPKKHHPIDLE----LLEDEQANPQN----LRRSLQHSKMVPWMRKTE 142
Query: 394 YISPLSMESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASF-EACK 446
YIS E R + + E +E + G S + DR QI I +F +A K
Sbjct: 143 YISS---EFTRFGV----SAERQETRIGYSTKKKFQTDVLYRDRTSQIAAINKTFDDASK 195
Query: 447 LRPIHATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESR 506
H T K + VE LPLLPDF+ + F FDG P + D +++
Sbjct: 196 TVLQHPTKKGVTAVEELPLLPDFDNWMHPFALVVFDGDPIPQN-----------DKVDAQ 244
Query: 507 AIMKSYVATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWD 558
+M + G E+F+ Y +PS L K + D + + F VR+Y+W
Sbjct: 245 TLMPQALIRGMMDEEGEQFVTYFLPSKETLDKRLKDAEDGLEFDSDYVYEYQSVRDYNWL 304
Query: 559 VRGDDAD--DPTTYLVSFDDDEARYVPLPTK--LNLRKKRAIEGRSNDEVEHFPIPSSIA 614
VR + +L + D Y L TK LN RK + I RS + +
Sbjct: 305 VRNKSTKGYEQDNFLFTIRDGAVYYNELETKVSLNRRKTKRIRVRS----RFMQQETKLM 360
Query: 615 VRRRA 619
VR RA
Sbjct: 361 VRNRA 365
>gi|270001899|gb|EEZ98346.1| hypothetical protein TcasGA2_TC000801 [Tribolium castaneum]
Length = 465
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 94/321 (29%), Positives = 154/321 (47%), Gaps = 46/321 (14%)
Query: 299 KLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLSVYNPPSVRPP--LDPE 356
K ++ D RF ++ +SLE+NY+ ++ E DLG+ +DL++ +Y +V P LDP
Sbjct: 17 KFISYPFDSSRFIQFNPTSLERNYRYEVLTEHDLGVAIDLINKDIY---AVEPGAMLDPA 73
Query: 357 DEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSMESARQSLTEKQAKELR 416
DE+LL +D ++TP ++ R + VSWL +T+YI S Q+ + Q+ +
Sbjct: 74 DEKLLEED-ILTPQD----SKRSRHHARSVSWLRRTEYI------STEQTRFQPQSMDKV 122
Query: 417 EMKGGRSILENLN------DRERQIKEIEASFEACKLRPI--HATNKNLQPVEILPLLPD 468
E K G SI +NL DR+ QIK IE +F PI H + N+ VE+LP+ PD
Sbjct: 123 EAKVGYSIKKNLKNETLYMDRDSQIKAIEKTFSDTT-NPIEKHYSKPNVTAVEVLPVFPD 181
Query: 469 FERYDDQFVAATFDGAPT-ADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLA 527
F + FD P ++ +++++ M + G E+F+A
Sbjct: 182 FNLWKYPCAQVIFDSDPAPVGKQVPAQIEE-----------MSQAMIRGVMDERGEQFVA 230
Query: 528 YMVPSVNELSKDM--------YDENEDVSFSWVREYHWDVRGDDADD-PTTYLVSFDDDE 578
Y +P+ L K Y ++E+ + REY+W+V+ + Y D
Sbjct: 231 YFLPTEETLVKRREDFANEIPYQDDEEYEYKMAREYNWNVKSKASKGYEENYFFVIRQDG 290
Query: 579 ARYVPLPTKLNLRKKRAIEGR 599
A Y L T++ L K+R G+
Sbjct: 291 AYYNELETRVRLSKRRQKVGQ 311
>gi|45361543|ref|NP_989348.1| RNA polymerase II-associated factor 1 homolog [Xenopus (Silurana)
tropicalis]
gi|82186238|sp|Q6P2Y1.1|PAF1_XENTR RecName: Full=RNA polymerase II-associated factor 1 homolog
gi|39850160|gb|AAH64253.1| Paf1, RNA polymerase II associated factor, homolog [Xenopus
(Silurana) tropicalis]
Length = 520
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 104/366 (28%), Positives = 168/366 (45%), Gaps = 60/366 (16%)
Query: 254 GSRMGDRRAAPLLSGERTENRLKKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRY 313
G R R P SG +C++K+ N LP+ PK + D++RF +Y
Sbjct: 14 GHRSSSHRTVPERSG------------VVCRVKYCNTLPDIPFDPKFITYPFDQNRFVQY 61
Query: 314 TFSSLEKNYKPQLHVEPDLGIPLDLLDLSVYN-PPSVRPPLDPEDEELLRDDEVVTPVKK 372
+SLEK +K L EPDLG+ +DL++ Y P+V LD DE+LL ++E+ P
Sbjct: 62 KATSLEKQHKHDLLTEPDLGVTIDLINPDTYRIDPNV--TLDIADEKLL-EEEIQAPSSS 118
Query: 373 DGIKRKERPTDKGVSWLVKTQYISPLSMESARQSLTEKQAKELREMKGGRSILENLN--- 429
KR ++ K V W+ KT+YIS E R ++ ++ E+K G S+ +
Sbjct: 119 ---KRSQQHA-KVVPWMRKTEYIST---EFNRYGVSNEKP----EVKIGVSVKQQFTEED 167
Query: 430 ---DRERQIKEIEASFEACKLRPI--HATNKNLQPVEILPLLPDFERYDDQFVAATFDGA 484
DR+ QI IE +FE + +PI H + + PVE++P+ PDF+ + + FD
Sbjct: 168 IYKDRDSQISAIEKTFEDAQ-KPISQHYSKPRVTPVEVMPVFPDFKMWINPCAQVIFDSD 226
Query: 485 PTADSEIYSKMDKSVRDAHESRAI--MKSYVATGSDSANPEKFLAYMVPSVNELSKDMYD 542
P +DA S A+ M + G +F+AY +P + K D
Sbjct: 227 PAP------------KDASGSAALDMMSQAMIRGMMDEEGNQFVAYFLPGEETMRKRKRD 274
Query: 543 ENEDV--------SFSWVREYHWDVRGDDAD--DPTTYLVSFDDDEARYVPLPTKLNLRK 592
+ E + + REY+W+V+ + + + + + D Y L T++ L K
Sbjct: 275 QEEGLDYMPEDIYDYKIAREYNWNVKNKASKGYEENYFFIFREGDGVYYNELETRVRLSK 334
Query: 593 KRAIEG 598
+R G
Sbjct: 335 RRVKAG 340
>gi|390342619|ref|XP_003725696.1| PREDICTED: RNA polymerase II-associated factor 1 homolog
[Strongylocentrotus purpuratus]
Length = 515
Score = 114 bits (285), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 98/349 (28%), Positives = 152/349 (43%), Gaps = 75/349 (21%)
Query: 265 LLSGERTENRLKKPTT-------FLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSS 317
+ SGER + K +T +C++K+ N LP+ PK + + +RF +Y +S
Sbjct: 5 IQSGERKDRERKSRSTTGPARSDIVCRVKYANSLPDIPFDPKFITYPFEANRFVQYNATS 64
Query: 318 LEKNYKPQLHVEPDLGIPLDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKR 377
LE+NYK +L E DLG+ +DL++ Y P LDP DE+LL ++E+ TP G +
Sbjct: 65 LERNYKHELLAEHDLGVTIDLINPDTYRIPEEHVELDPADEKLL-EEEIATP----GDSK 119
Query: 378 KERPTDKGVSWLVKTQYISPLSMESARQSLTEKQAKELREMKGGRSILENLNDRERQIKE 437
+ R K VSWL KT+YIS N + +
Sbjct: 120 RSRQHSKTVSWLRKTEYISS-----------------------------EYNRSQHSKDK 150
Query: 438 IEASFEACKLRPIHATNK-NLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMD 496
IE TNK ++ PVEILP+ PDFE + FD P D
Sbjct: 151 IE-------------TNKPHVTPVEILPVFPDFETWIHPCAQVIFDSDPAL-------RD 190
Query: 497 KSVRDAHESRAIMKSYVATGSDSANPEKFLAYMVPS-------VNELSKDM-YDENEDVS 548
KS +H+ M + G + E+F+ Y +P+ + +D+ Y E E+
Sbjct: 191 KS---SHQQLEEMSQAMIRGMVDESEEQFVGYFLPTEETCRKRKRDFEEDIDYVEGEEYE 247
Query: 549 FSWVREYHWDVRGDDAD--DPTTYLVSFDDDEARYVPLPTKLNLRKKRA 595
+ REY+W+V+ + + + V Y L T++ L K+R
Sbjct: 248 YKLTREYNWNVKNKASRGYEENYFFVFRKGKGVFYNELETRVRLSKRRV 296
>gi|194746540|ref|XP_001955738.1| GF18911 [Drosophila ananassae]
gi|190628775|gb|EDV44299.1| GF18911 [Drosophila ananassae]
Length = 548
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 102/349 (29%), Positives = 167/349 (47%), Gaps = 38/349 (10%)
Query: 267 SGERTENRLKKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQL 326
+ +R + + ++ + +C++K+ N LP+ K + D RF +Y +SLE+N+K +
Sbjct: 14 ADKRPQRQTERKSEIICRVKYGNNLPDIPFDLKFLQYPFDSHRFVQYNPTSLERNFKYDV 73
Query: 327 HVEPDLGIPLDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGV 386
E DLG+ +DL++ +Y S+ LDP DE+LL ++E +TP D + + R + V
Sbjct: 74 LTEHDLGVTVDLINRELYQADSM-TLLDPADEKLL-EEETLTPT--DSV--RSRQHSRTV 127
Query: 387 SWLVKTQYISPLSMESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEA 440
SWL K++YI S Q+ + Q E E K G ++ ++L DR+ QIK IE
Sbjct: 128 SWLRKSEYI------STEQTRFQPQNLENIEAKVGYNVKKSLREETLYLDRDAQIKAIEK 181
Query: 441 SFEACKLRPI-HATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSV 499
+F K H + N+ PVE+LP+ PDF + FD P + + K+V
Sbjct: 182 TFSDTKSEITKHYSKPNVVPVEVLPIFPDFINWKYPCAQVIFDSDP-------APVGKNV 234
Query: 500 RDAHESRAIMKSYVATGSDSANPEKFLAYMVPSVNELSKDMYD--------ENEDVSFSW 551
E M + G + E+F+AY +P+ L K D E E+ +
Sbjct: 235 PAQLEE---MSQAMIRGVMDESGEQFVAYFLPTEQTLEKRRADFVAGELYKEEEEYEYKI 291
Query: 552 VREYHWDVRGDDADD-PTTYLVSFDDDEARYVPLPTKLNLRKKRAIEGR 599
REY+W+V+ + Y D Y L T++ L K+R G+
Sbjct: 292 AREYNWNVKTKASKGYEENYFFVMRPDGIYYNELETRVRLNKRRVKVGQ 340
>gi|428183424|gb|EKX52282.1| hypothetical protein GUITHDRAFT_102185 [Guillardia theta CCMP2712]
Length = 498
Score = 113 bits (283), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 90/257 (35%), Positives = 132/257 (51%), Gaps = 17/257 (6%)
Query: 280 TFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLL 339
TF+ K+KFRN+LPE +PK +A + D + T Y+F+SLEKN+K L EP+LGI +DL+
Sbjct: 46 TFISKMKFRNDLPELPFEPKFLASQMDVKKLTSYSFTSLEKNFKYSLLTEPNLGIHIDLI 105
Query: 340 DLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLS 399
+ Y PPS PL PED+ LL ++V T I KERP V WL KT Y
Sbjct: 106 EPETYEPPSNGAPLLPEDKYLLEGEKVETNRSNLRINIKERP---NVPWLHKTAYY---- 158
Query: 400 MESARQSLTEKQAKELREMKGGRSILENLNDRE-RQIKE-IEASFEAC-KLRPI-HATNK 455
+ L + Q +++ I E++ + +Q+ E IE SFE KL + H TN
Sbjct: 159 ---GNEDLYDYQQGHKPQVQLPVPIQEDVQELTPKQLAEKIEKSFELSRKLEDLKHPTNP 215
Query: 456 NLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVAT 515
+L ++ +LPD + + + + FD PT D + + R S A + +
Sbjct: 216 DLVATKVWEILPDTKCWPNDYTELVFDQDPTLDDPKAANTSMADRATLASTAFIN--IGR 273
Query: 516 GSDSANPEKFLAYMVPS 532
AN F AYM+P+
Sbjct: 274 SMQHANDRTF-AYMLPA 289
>gi|297801888|ref|XP_002868828.1| hypothetical protein ARALYDRAFT_916596 [Arabidopsis lyrata subsp.
lyrata]
gi|297314664|gb|EFH45087.1| hypothetical protein ARALYDRAFT_916596 [Arabidopsis lyrata subsp.
lyrata]
Length = 340
Score = 113 bits (282), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 52/69 (75%), Positives = 63/69 (91%)
Query: 403 ARQSLTEKQAKELREMKGGRSILENLNDRERQIKEIEASFEACKLRPIHATNKNLQPVEI 462
+ +SLTEKQAKELREMKGG +IL+NLN+RERQI +IEASFEACK +PIH+TNKN+QPVE+
Sbjct: 232 SMKSLTEKQAKELREMKGGINILQNLNNRERQIMDIEASFEACKSQPIHSTNKNVQPVEV 291
Query: 463 LPLLPDFER 471
LPLL F+R
Sbjct: 292 LPLLAYFDR 300
>gi|170054859|ref|XP_001863321.1| antimeros [Culex quinquefasciatus]
gi|167875008|gb|EDS38391.1| antimeros [Culex quinquefasciatus]
Length = 548
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 105/362 (29%), Positives = 171/362 (47%), Gaps = 46/362 (12%)
Query: 254 GSRMGDRRAAPLLSGERTENRLKKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRY 313
GS D+RA +R+E + ++K+ N LP+ K + + DRF +Y
Sbjct: 8 GSNAADKRAVARPQEKRSE--------LITRVKYCNTLPDIPFDLKFITYPFENDRFIQY 59
Query: 314 TFSSLEKNYKPQLHVEPDLGIPLDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKD 373
+SLE+NY+ ++ E DLG+ +DL++ +Y + LDP DE+LL +D + TP +D
Sbjct: 60 KPTSLERNYRYEVLTEHDLGVTIDLINRDLYQIDHL-AVLDPADEKLLEED-IHTP--QD 115
Query: 374 GIKRKERPTDKGVSWLVKTQYISPLSMESARQSLTEKQAKELREMKGGRSILENLN---- 429
++ K VSWL K++YI S Q+ + Q E E K G ++ ++L
Sbjct: 116 SMRSSRHA--KSVSWLRKSEYI------STEQTRFQPQTMEKVEAKVGFNVKQSLREETL 167
Query: 430 --DRERQIKEIEASFEACKLR-PIHATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPT 486
DRE Q+K I+ +FE K H + + PVE+LP+ PDF + FD P
Sbjct: 168 YMDREAQVKAIDKTFEDNKKEITAHYSKPGVVPVEVLPVFPDFVNWKYPCAQVIFDFDP- 226
Query: 487 ADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLAYMVPSVNELSK---DMYDE 543
+ + K+V E M + G + E+F+AY +P+ L K D+ +E
Sbjct: 227 ------APIGKNVPAQIEE---MSQAMIRGVMDESGEQFVAYFLPTEETLEKRRRDLVNE 277
Query: 544 N-----EDVSFSWVREYHWDVRGDDADD-PTTYLVSFDDDEARYVPLPTKLNLRKKRAIE 597
E+ + REY+W+V+ + Y + D Y L T++ L K+R
Sbjct: 278 TLYEDEEEYEYKMAREYNWNVKSKASKGYEENYYLVLRQDGMYYNELETRVRLSKRRQKV 337
Query: 598 GR 599
G+
Sbjct: 338 GQ 339
>gi|147902583|ref|NP_001086458.1| RNA polymerase II-associated factor 1 homolog [Xenopus laevis]
gi|171769536|sp|A2BD83.1|PAF1_XENLA RecName: Full=RNA polymerase II-associated factor 1 homolog
gi|122936408|gb|AAI30058.1| Paf1 protein [Xenopus laevis]
Length = 524
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 95/336 (28%), Positives = 161/336 (47%), Gaps = 44/336 (13%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 30 VCRVKYCNTLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 89
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LD DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 90 DTYRIDPNV--TLDFADEKLL-EEEIQAPSSS---KRSQQHA-KVVPWMRKTEYIST--- 139
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASF-EACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +F +A K H +
Sbjct: 140 EFNRYGVSNEKP----EVKIGVSVKQQFTEEDIYKDRDSQISAIEKTFDDAQKDISQHYS 195
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPT-ADSEIYSKMDKSVRDAHESRAIMKSY 512
+ PVE++P+ PDF+ + + FD P D+ + +D +M
Sbjct: 196 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDPAPKDASGTAALD-----------MMSQA 244
Query: 513 VATGSDSANPEKFLAYMVPSVNELSKDMYDENEDV--------SFSWVREYHWDVRGDDA 564
+ G +F+AY +P + + K D+ E + + REY+W+V+ +
Sbjct: 245 MIRGMMDEEGNQFVAYFLPGEDTMRKRKRDQEEGLDYMPEDIYDYKIAREYNWNVKNKAS 304
Query: 565 D--DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
+ + + + D Y L T++ L K+R G
Sbjct: 305 KGYEENYFFIFREGDGVYYNELETRVRLSKRRVKAG 340
>gi|324512878|gb|ADY45317.1| RNA polymerase II-associated factor 1 [Ascaris suum]
Length = 470
Score = 110 bits (276), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 104/340 (30%), Positives = 151/340 (44%), Gaps = 44/340 (12%)
Query: 281 FLCKLKFRNELPEPSAQPKLMAL-KKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLL 339
FLC++K+ N LP+ + K +A RF Y +SLEKN+K +L E DLG+ +DL+
Sbjct: 37 FLCRVKYSNTLPDIPFEAKFLACPFTSLSRFIDYKQTSLEKNFKFELLAESDLGVKIDLI 96
Query: 340 DLSVY--NPPS-VRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYIS 396
+ Y +P + + L+ D ELL +DE P R+ K V W+ KT+YIS
Sbjct: 97 NPETYFVDPDAPKKQQLNAIDAELL-EDEQANPQN----SRRSLQHSKMVPWMRKTEYIS 151
Query: 397 PLSMESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASF-EACKLRP 449
E R + + E +E K G S + DR QI I +F EA K
Sbjct: 152 S---EFTRFGV----SNERQETKIGYSTKKKFQNEVLYRDRASQIAAINKTFDEAAKPVT 204
Query: 450 IHATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIM 509
H K + VE L LLPDFE + F FD P SE +M
Sbjct: 205 RHPMKKGVTAVEELYLLPDFENWKHPFALVVFDSDPVPQSE-----------KANPGTLM 253
Query: 510 KSYVATGSDSANPEKFLAYMVPSVNELSKDMYDENE------DVSFSW--VREYHWDVRG 561
+ G E+F+ Y +P+ L K M D D ++ + VR+Y+W VR
Sbjct: 254 SQALIRGMMDEEGEQFVTYFLPTKETLDKRMTDTENGKQFDPDYTYEYYSVRDYNWFVRN 313
Query: 562 DDAD--DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEGR 599
+ +L SF D Y L T+++L +++A R
Sbjct: 314 KSTKGYEQDNFLFSFRDSGVYYNELETRVSLTRRKAKRSR 353
>gi|347965688|ref|XP_321844.5| AGAP001302-PA [Anopheles gambiae str. PEST]
gi|333470395|gb|EAA01198.5| AGAP001302-PA [Anopheles gambiae str. PEST]
Length = 565
Score = 110 bits (276), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 106/356 (29%), Positives = 176/356 (49%), Gaps = 50/356 (14%)
Query: 281 FLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLD 340
+ ++K+ N LP+ K + + DRF +Y +SLE+NY+ ++ E DLG+ +DL++
Sbjct: 26 LISRVKYCNTLPDIPFDLKFITYPFENDRFIQYNPTSLERNYRYEVLTEHDLGVTIDLIN 85
Query: 341 LSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYIS---- 396
+Y LDP DE+LL +D + TP +D ++ K VSWL K++YIS
Sbjct: 86 RDLYQIDH-SAQLDPADEKLLEED-IHTP--QDSMRSSRHA--KSVSWLRKSEYISTEQT 139
Query: 397 ---PLSMESARQSLTEKQAKELREMKGGRSILENL-NDRERQIKEIEASFEACKLRPI-- 450
P +ME + K LRE E L DRE QIK IE +FE +PI
Sbjct: 140 RFNPQTMEKVEAKVGFNVKKSLRE--------ETLYMDREAQIKAIEKTFED-NTKPITT 190
Query: 451 HATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPT-ADSEIYSKMDKSVRDAHESRAIM 509
H + + PVEI+P+ PDF + FD P + + +++++ S+A++
Sbjct: 191 HYSKPGVTPVEIMPVFPDFANWKYPCAQVIFDSDPAPSGKNVPAQIEEM------SQAMI 244
Query: 510 KSYVATGSDSANPEKFLAYMVPSVNELSK---DMYDEN-----EDVSFSWVREYHWDVRG 561
+ G + E+F+AY +P+ + L K D+ +E E+ + REY+W+V+
Sbjct: 245 R-----GVMDESGEQFVAYFLPTDDTLEKRRRDLVNETLYEDEEEYEYKMAREYNWNVKS 299
Query: 562 DDADD-PTTYLVSFDDDEARYVPLPTKLNLRKKRA--IEGRSNDE--VEHFPIPSS 612
+ Y + D Y L T++ L K+R + +SN + V+H P+ +S
Sbjct: 300 KASKGYEENYYLVLRPDGIYYNELETRVRLSKRRQKNAQQQSNTKLVVKHRPLNAS 355
>gi|417402890|gb|JAA48275.1| Putative rna polymerase ii regulator [Desmodus rotundus]
Length = 572
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 102/365 (27%), Positives = 166/365 (45%), Gaps = 69/365 (18%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 30 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 89
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISP--- 397
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 90 DTYRIDPNV--LLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYISTEFN 142
Query: 398 ---------------------------LSMESARQSLTEKQAKELREMKGGRSILENLN- 429
+S E R ++ ++ E+K G S+ +
Sbjct: 143 RYGISNEKPEVXXXXXXVVPWMRKTEYISTEFNRYGISNEKP----EVKIGVSVKQQFTE 198
Query: 430 -----DRERQIKEIEASFE-ACKLRPIHATNKNLQPVEILPLLPDFERYDDQFVAATFDG 483
DR+ QI IE +FE A K H + + PVE++P+ PDF+ + + FD
Sbjct: 199 EEIYKDRDSQITAIEKTFEDAQKSISQHYSKPRVTPVEVMPVFPDFKMWINPCAQVIFDS 258
Query: 484 APTADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLAYMVPSVNELSKDMYDE 543
P + D S A E +M + G +F+AY +P L K D+
Sbjct: 259 DP-------APKDTSGAAALE---MMSQAMIRGMMDEEGNQFVAYFLPVEETLKKRKRDQ 308
Query: 544 NEDVSFS--------WVREYHWDVRGDDAD--DPTTYLVSFDDDEARYVPLPTKLNLRKK 593
E++ ++ REY+W+V+ + + + + + D Y L T++ L K+
Sbjct: 309 EEEMDYAPDDVYDYKIAREYNWNVKNKASKGYEENYFFIFREGDGVYYNELETRVRLSKR 368
Query: 594 RAIEG 598
RA G
Sbjct: 369 RAKAG 373
>gi|312374080|gb|EFR21724.1| hypothetical protein AND_16488 [Anopheles darlingi]
Length = 788
Score = 110 bits (274), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 101/334 (30%), Positives = 164/334 (49%), Gaps = 46/334 (13%)
Query: 281 FLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLD 340
+ ++K+ N LP+ K + + DRF +Y +SLE+NY+ ++ E DLG+ +DL++
Sbjct: 45 LISRVKYCNTLPDIPFDLKFITYPFENDRFIQYNPTSLERNYRYEVLTEHDLGVTIDLIN 104
Query: 341 LSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYIS---- 396
+Y LDP DE+LL +D + TP +D ++ K VSWL K++YIS
Sbjct: 105 RDLYQIDHT-AQLDPADEKLLEED-IHTP--QDSMRSSRHA--KSVSWLRKSEYISTEQT 158
Query: 397 ---PLSMESARQSLTEKQAKELREMKGGRSILENL-NDRERQIKEIEASF-EACKLRPIH 451
P +ME + K LRE E L DRE QIK IE +F + K H
Sbjct: 159 RFNPQTMEKVEAKVGFNVKKSLRE--------ETLYMDREAQIKAIEKTFDDNTKTITTH 210
Query: 452 ATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPT-ADSEIYSKMDKSVRDAHESRAIMK 510
+ + PVE+LPL PDF + FD P + + +++++ S+A+++
Sbjct: 211 YSKPGVTPVEVLPLFPDFVNWKYPCAQVIFDSDPAPSGKNVPAQIEEM------SQAMIR 264
Query: 511 SYVATGSDSANPEKFLAYMVPSVNELSK---DMYDEN-----EDVSFSWVREYHWDVRGD 562
G + E+F+AY +P+ + L K D+ +E E+ + REY+W+V+
Sbjct: 265 -----GVMDESGEQFVAYFLPTEDTLEKRRRDLVNETLYEDEEEYEYKMAREYNWNVKSK 319
Query: 563 DAD--DPTTYLVSFDDDEARYVPLPTKLNLRKKR 594
+ + YLV D Y L T++ L K+R
Sbjct: 320 ASKGYEENYYLV-LRQDGIYYNELETRVRLSKRR 352
>gi|109124676|ref|XP_001086329.1| PREDICTED: RNA polymerase II-associated factor 1 homolog [Macaca
mulatta]
Length = 333
Score = 108 bits (271), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 92/296 (31%), Positives = 147/296 (49%), Gaps = 40/296 (13%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 30 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 89
Query: 342 SVYNP-PSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 90 DTYRIDPNVL--LDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 139
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 140 EFNRYGISNEKP----EVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 195
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P + D S A E +M +
Sbjct: 196 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGAAALE---MMSQAM 245
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRG 561
G +F+AY +P L K D+ E++ ++ REY+W+V+
Sbjct: 246 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKN 301
>gi|51258264|gb|AAH79993.1| Paf1 protein [Xenopus laevis]
Length = 407
Score = 108 bits (270), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 101/364 (27%), Positives = 167/364 (45%), Gaps = 56/364 (15%)
Query: 254 GSRMGDRRAAPLLSGERTENRLKKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRY 313
G R R P SG +C++K+ N LP+ PK + D++RF +Y
Sbjct: 14 GHRSSSHRTVPERSG------------VVCRVKYCNTLPDIPFDPKFITYPFDQNRFVQY 61
Query: 314 TFSSLEKNYKPQLHVEPDLGIPLDLLDLSVYN-PPSVRPPLDPEDEELLRDDEVVTPVKK 372
+SLEK +K L EPDLG+ +DL++ Y P+V LD DE+LL ++E+ P
Sbjct: 62 KATSLEKQHKHDLLTEPDLGVTIDLINPDTYRIDPNV--TLDFADEKLL-EEEIQAPSSS 118
Query: 373 DGIKRKERPTDKGVSWLVKTQYISPLSMESARQSLTEKQAKELREMKGGRSILENLN--- 429
KR ++ K V W+ KT+YIS E R ++ ++ E+K G S+ +
Sbjct: 119 ---KRSQQHA-KVVPWMRKTEYIST---EFNRYGVSNEKP----EVKIGVSVKQQFTEED 167
Query: 430 ---DRERQIKEIEASF-EACKLRPIHATNKNLQPVEILPLLPDFERYDDQFVAATFDGAP 485
DR+ QI IE +F +A K H + + PVE++P+ PDF+ + + FD P
Sbjct: 168 IYKDRDSQISAIEKTFDDAQKDISQHYSKPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP 227
Query: 486 T-ADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLAYMVPSVNELSKDMYDEN 544
D+ + +D +M + G +F+AY +P + + K D+
Sbjct: 228 APKDASGTAALD-----------MMSQAMIRGMMDEEGNQFVAYFLPGEDTMRKRKRDQE 276
Query: 545 EDV--------SFSWVREYHWDVRGDDAD--DPTTYLVSFDDDEARYVPLPTKLNLRKKR 594
E + + REY+W+V+ + + + + + D Y L T++ L K+R
Sbjct: 277 EGLDYMPEDIYDYKIAREYNWNVKNKASKGYEENYFFIFREGDGVYYNELETRVRLSKRR 336
Query: 595 AIEG 598
G
Sbjct: 337 VKAG 340
>gi|417399206|gb|JAA46631.1| Putative rna polymerase ii regulator [Desmodus rotundus]
Length = 338
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 92/296 (31%), Positives = 147/296 (49%), Gaps = 40/296 (13%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 30 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 89
Query: 342 SVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 90 DTYRIDPNVL--LDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST--- 139
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 140 EFNRYGISNEKP----EVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 195
Query: 454 NKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYV 513
+ PVE++P+ PDF+ + + FD P + D S A E +M +
Sbjct: 196 KPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGAAALE---MMSQAM 245
Query: 514 ATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRG 561
G +F+AY +P L K D+ E++ ++ REY+W+V+
Sbjct: 246 IRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVKN 301
>gi|291236098|ref|XP_002737978.1| PREDICTED: antimeros-like [Saccoglossus kowalevskii]
Length = 533
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 102/362 (28%), Positives = 161/362 (44%), Gaps = 68/362 (18%)
Query: 281 FLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLD 340
+C++K+ N LP+ PK + + +RF +Y +SLE++YK +L E DLG+ +DL++
Sbjct: 27 LVCRVKYCNHLPDIPFDPKFITYPFEPNRFVQYNATSLERSYKHELLTEHDLGVTIDLIN 86
Query: 341 LSVYN-PPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLS 399
Y+ PSV +DP DE LL ++E+ TP G ++ R K VSWL KT+YIS
Sbjct: 87 PDTYSQDPSV--IIDPADERLL-EEEISTP----GDSKRSRQHAKPVSWLRKTEYIST-- 137
Query: 400 MESARQSLTEKQAKELREMKGGRSILENL------NDRERQIKEIEASFEACKLRPI--H 451
E R + + +A E K G ++ + DRE QI I+ +F+ K+ PI H
Sbjct: 138 -EYNRFTHSSDKA----ETKVGANLRKQFAEDYLYKDRESQISAIDKTFDTAKV-PITEH 191
Query: 452 ATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKM---------------- 495
+ N+ V++LP+ PDF V+ TF D I +
Sbjct: 192 YSKPNVTAVDVLPVFPDFN------VSFTFIDRMFTDKRICHAVGDVSHPKSVYYLPVAM 245
Query: 496 -----------DKSVRDAHESRAI--MKSYVATGSDSANPEKFLAYMVPSVNELSKDMYD 542
D SV + + M + G + E+F+ Y +P+ K D
Sbjct: 246 DHPCAQVIFDSDPSVSGKQMALQMEEMSQAMIRGMVDESDEQFVGYFLPNEETRQKRKRD 305
Query: 543 ENEDVSFS--------WVREYHWDVRGDDADD-PTTYLVSFDDDEARYVPLPTKLNLRKK 593
E + ++ REY+W+V+ Y DD Y L T++ L K+
Sbjct: 306 VEEGIDYTPEEEYEYKMAREYNWNVKNKATKGYEENYFFVVRDDGVYYNELETRVRLSKR 365
Query: 594 RA 595
R
Sbjct: 366 RV 367
>gi|339239561|ref|XP_003381335.1| 1-pyrroline-5-carboxylate dehydrogenase [Trichinella spiralis]
gi|316975641|gb|EFV59049.1| 1-pyrroline-5-carboxylate dehydrogenase [Trichinella spiralis]
Length = 1198
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 100/352 (28%), Positives = 163/352 (46%), Gaps = 45/352 (12%)
Query: 287 FRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLSVYNP 346
+ N LP+ + K + DR +Y + +E+ Y +L E D+GI +DL+ + Y P
Sbjct: 38 YSNTLPDVPFEAKFLPHPFPADRHVKYETTMMERAYTGELVTEDDVGIFIDLVLIDKYEP 97
Query: 347 -PSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISP----LSME 401
P+ L P DE L D++ + K K R K V W+ +T+YIS ++
Sbjct: 98 NPNEPVQLHPTDELLCSDEDTLLSTMK-----KSRHGTKAVPWMRRTEYISTDYSRFGVQ 152
Query: 402 SARQ--SLTEKQAKELREMKGGRSILENLNDRERQIKEIEASFEACKLRPI--HATNKNL 457
+ RQ L K L+E R DRE Q+ I +FE K +P+ H + +++
Sbjct: 153 TERQETKLGYHIQKVLKEASLYR-------DRESQLTAINKTFEDAK-KPVREHFSKRDV 204
Query: 458 QPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVATGS 517
VE LPLLPDF+ + F FD P + + +++++ RA+++ G
Sbjct: 205 YAVEELPLLPDFDAWKYPFAQVIFDVDPAPRDK--TSLEETLATDQICRAMIR-----GM 257
Query: 518 DSANPEKFLAYMVPSVNELSKDMYDENEDVS---------FSWVREYHWDVRGD-DADDP 567
E+F+AY VP++ + K E +D+S F +REY+W V+
Sbjct: 258 MDEKGEQFVAYFVPTLETIRKLKSVEGKDISEMDLSTPFDFKLLREYNWSVKNKASVGYE 317
Query: 568 TTYLVSFDDDEARYVPLPTKLNLRKKRAIEGRSNDEVEHFPIPSS-IAVRRR 618
+ SF D A Y L T++ L ++R +G H +P+S + VR R
Sbjct: 318 ENHFFSFRDGRAYYNELETRVRLNRRRVKDGHV-----HSAVPNSRLIVRYR 364
>gi|328722735|ref|XP_001944291.2| PREDICTED: RNA polymerase II-associated factor 1 homolog
[Acyrthosiphon pisum]
Length = 568
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 103/371 (27%), Positives = 164/371 (44%), Gaps = 72/371 (19%)
Query: 275 LKKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGI 334
++K + +CK+K+ N LP+ K ++ D R+ +Y +SLE++YK ++ + DLG+
Sbjct: 19 VEKRSDLVCKVKYCNMLPDIPFDLKFLSYPFDATRYIQYNPTSLERSYKFEILADHDLGL 78
Query: 335 PLDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQY 394
+DL++ Y +D DE+LL +D V+ P +D ++ R + VSWL +T+Y
Sbjct: 79 NIDLVNKDKY-TIDYNLQMDNADEKLLEED-VLAP--QDS--KRSRHHARSVSWLRRTEY 132
Query: 395 ISPLSMESARQSLTEK-----QAKELREMKGGRSILENLN------DRERQIKEIEASFE 443
IS TEK Q E E K G SI +N DR+ Q+K IE +FE
Sbjct: 133 IS-----------TEKTRFQPQTIEKVEAKVGFSIQKNFKEETLYMDRDSQMKAIEKTFE 181
Query: 444 ACKLRPI--HATNKNLQPVEILPLLPDFERYDDQFVAATFDG--------APTADSEIYS 493
K +P+ H + N+ VE L + PDF+ + FD P E+
Sbjct: 182 DNK-KPVEKHYSKPNVHAVETLNVFPDFKNWRFPCAQVIFDSDPAPMGRPVPAQIEEMSQ 240
Query: 494 KMDKSVRDAHESRAIMKSYVATGSDSANPEKFLAYMVPSVNELSKDM--------YDENE 545
M + V D + E+F+AY +PS + K YD++
Sbjct: 241 AMIRGVMD------------------ESGEQFVAYFLPSEETIVKRREDAVEARPYDDDY 282
Query: 546 DVSFSWVREYHWDVRGDDADD-PTTYLVSFDDDEARYVPLPTKLNLRKKRAIEGRSNDE- 603
+ + REY+W+V+ + Y D Y L T++ L K+R G +
Sbjct: 283 EYEYKMAREYNWNVKSKSSKGYEENYFFIVQSDGVYYNELETRVRLSKRRQKIGTQSSST 342
Query: 604 -----VEHFPI 609
V H PI
Sbjct: 343 NTRLIVRHVPI 353
>gi|298711177|emb|CBJ32401.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 509
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 89/345 (25%), Positives = 161/345 (46%), Gaps = 32/345 (9%)
Query: 274 RLKKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLG 333
+ +K T FLC +KFRN LP+P P + + D + +Y ++LE +YK +LH E DLG
Sbjct: 17 KWQKQTEFLCTIKFRNSLPDPPLGPHFLDVPLDLQSYVKYKPTTLESDYKWKLHYERDLG 76
Query: 334 IPLDLLDLSVYNPPSVRPPLDPEDEELL--RDDEVVTPVKKDGIKRKER--PTDKGVSWL 389
+P+D++D Y P PL PEDE LL R+ + VKK+ + R D V+WL
Sbjct: 77 VPIDVIDPRNYLLPKKAVPLPPEDEALLNWREGDGPGSVKKEDLSTSARRKTVDTSVTWL 136
Query: 390 VKTQYISPLSMESARQSLTEKQAKELREMKGGRSILENLNDRERQIKEIEASFEACKLRP 449
KT Y++ + Q +E+Q + +E++ + I ++R++ E+ A +P
Sbjct: 137 KKTVYLTNDPFDPVHQFKSEQQTQADKEVELEKEIARG-KKQDRKMLIEESFLHANSNKP 195
Query: 450 I---HATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESR 506
+ A+ ++L ++P++PD + + + FD P AD + + + A E+
Sbjct: 196 LVHQDASKRHLTAEWVMPVMPDVSLWPNTYTTVMFDKDPAADETVKGSEARKRKRAGEA- 254
Query: 507 AIMKSYVATGSDSANPEKFL--AYMVPSVNELSKD--------------MYDENEDVSFS 550
+ V + N + + +Y++P K M E+ V +
Sbjct: 255 --LVCNVKRTTFEGNSHEVITGSYLMPKGKATKKGEGEEEEEEEEEEEKMDVEDGSVGYD 312
Query: 551 WVREY-----HWDVRGDDADDPTTYLVSFDDDEARYVPLPTKLNL 590
+V++Y H+ D + ++++ D EA + L K+ L
Sbjct: 313 FVKDYKFNIVHYAQTQYDGNPHLLFVINKDTQEASFCKLSAKVEL 357
>gi|303274140|ref|XP_003056393.1| PAF1 complex protein [Micromonas pusilla CCMP1545]
gi|226462477|gb|EEH59769.1| PAF1 complex protein [Micromonas pusilla CCMP1545]
Length = 365
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 102/377 (27%), Positives = 173/377 (45%), Gaps = 43/377 (11%)
Query: 273 NRLKKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDL 332
N+LK+ T FLC +FRN+LP + K + + + Y SL + + + DL
Sbjct: 7 NKLKRETAFLCPAEFRNDLPLVPSDWKFLHRPNLCESISDYHHVSLIGEARDAV-LPADL 65
Query: 333 GIPLDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKT 392
G+ +D L S Y RPPL+ D +LL +++ V + R +RP WL+ T
Sbjct: 66 GLTVDPLLSSSYREKQFRPPLEHVDNKLLEEEKDREMVADNHKIRNKRPDMSKALWLMNT 125
Query: 393 QYIS--PLSMESARQSLTEKQAKELREMKGGRSILENLNDRERQIKEIEASFEACKLRPI 450
QYIS PL R + K++R+ + + +QIK I+ SF A K R
Sbjct: 126 QYISSMPLPEHLGRSEKDWAKHKQIRDQDAAPN-----DSHIQQIKSIDRSFTAVKHRLS 180
Query: 451 HATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSE-----IYSKMDKSVRDAHES 505
+++++ P+ LP+LPDF FV +D P+ D+ ++M ++ +
Sbjct: 181 KDSSQDVFPIMELPVLPDFAHSSSSFVHFIYDEDPSDDARPENGSCLTRMHTALEN---- 236
Query: 506 RAIMKSY-VATGSDSANPEKFLAYMVPSVN---------ELSKDMYDENEDVSFSWVREY 555
+ +K Y V EK +A M+PS++ L K Y+E ++W+REY
Sbjct: 237 -SFVKPYSVNCLRGDTGAEKLVALMLPSLSNHDTGARKVNLDKLNYEE-----YNWIREY 290
Query: 556 HWDVRGDDADDPTTYLVSFDDDE-ARYVPLPTKLNLRKKRAIEGRSNDEVEHFPIPSSIA 614
+ + + A+ + FD + RY+ L ++ L KR+ + + +F PS I
Sbjct: 291 QYRLHRERANSHKSMCFFFDKEHNTRYMELSARI-LLSKRSKHSKGTRDGANFR-PSKIT 348
Query: 615 VRRRANVTAIELKEQGA 631
+++ L+ QGA
Sbjct: 349 IKK-------PLRNQGA 358
>gi|170572245|ref|XP_001892038.1| Paf1 family protein [Brugia malayi]
gi|158603082|gb|EDP39151.1| Paf1 family protein [Brugia malayi]
Length = 353
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 97/336 (28%), Positives = 147/336 (43%), Gaps = 40/336 (11%)
Query: 281 FLCKLKFRNELPEPSAQPKLMAL-KKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLL 339
FLC++K+ N LP+ K +A RFT Y SSLEKN+K +L EPD G+ +DL+
Sbjct: 33 FLCRVKYSNALPDIPFDTKFLACPFVSLSRFTDYKSSSLEKNFKFELLCEPDCGVNIDLI 92
Query: 340 DLSVY----NPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYI 395
+ Y + P P+D E L +DE P R+ K V W+ KT+YI
Sbjct: 93 NPETYYVDPDSPKKHHPIDLE----LLEDEQANPQN----LRRSLQHSKMVPWMRKTEYI 144
Query: 396 SP----LSMESARQ-SLTEKQAKELREMKGGRSILENLNDRERQIKEIEASF-EACKLRP 449
S + + RQ + + L + + R QI I +F +A K
Sbjct: 145 SSEFTRFGVSAERQETRVSRFLNWLLYKRRNSKRMYYTGIRASQIAAINKTFDDASKSVL 204
Query: 450 IHATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIM 509
H T K + VE L LLPDF+ + F FDG P + D +++ +M
Sbjct: 205 NHPTKKGVTAVEELSLLPDFDNWMHPFALVVFDGDPIPQN-----------DKVDAQTLM 253
Query: 510 KSYVATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVRG 561
+ G E+F+ Y +PS L K + D +E + F VR+Y+W VR
Sbjct: 254 PQALIRGMMDEEGEQFVTYFLPSRETLDKRLKDADEGLEFDSDYVYEYQSVRDYNWLVRN 313
Query: 562 DDAD--DPTTYLVSFDDDEARYVPLPTKLNLRKKRA 595
+ +L + D Y L TK++L +++
Sbjct: 314 KSTKGYEQDNFLFTIRDGAVYYNELETKVSLTRRKT 349
>gi|281204907|gb|EFA79101.1| RNA polymerase II-associated factor 1 [Polysphondylium pallidum
PN500]
Length = 469
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 70/208 (33%), Positives = 107/208 (51%), Gaps = 6/208 (2%)
Query: 281 FLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLD 340
+ C+L+ N+L + PK + + D ++F Y +SLEKNYK L EP+LGIP++L+D
Sbjct: 98 YACQLRLSNKLADTPFDPKFLVIPSDFNKFVHYKTTSLEKNYKYPLLTEPNLGIPIELID 157
Query: 341 LSVYNPPSVRPPLDPEDEELLRD-DEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLS 399
VYN P + + ED+ LLR DE +++ I R VSWL KT+Y+S
Sbjct: 158 PEVYNIPKTKVAVPEEDQPLLRSLDEQDAELRRAPINRGSALLRPAVSWLRKTEYLSSHD 217
Query: 400 MESARQSLTEKQAKELREMKGGRS-ILENLNDRERQIKEIEASFEAC-KLRPIHATNKNL 457
+ R K+A E +S + I +E +F+A +H TN +L
Sbjct: 218 NQMGRPV---KRASESNAPTSTQSPYQQQQQQDAADIVLVENTFDAINNTNFVHPTNPSL 274
Query: 458 QPVEILPLLPDFERYDDQFVAATFDGAP 485
+PV ILP+ PDF+ + + + FD P
Sbjct: 275 KPVSILPVFPDFDLWANDYTETVFDADP 302
>gi|255070103|ref|XP_002507133.1| PAF1 complex protein [Micromonas sp. RCC299]
gi|226522408|gb|ACO68391.1| PAF1 complex protein [Micromonas sp. RCC299]
Length = 368
Score = 103 bits (257), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 92/352 (26%), Positives = 154/352 (43%), Gaps = 14/352 (3%)
Query: 272 ENRLKKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPD 331
EN+ K+ T FLC F N LP + KLM + +L + + D
Sbjct: 8 ENKWKRETAFLCTADFNNNLPPNPIRWKLMDFPTHRTCLALPGAEALHDELSQRNTLSAD 67
Query: 332 LGIPLDLLDLSVYNPPSVRPPLDPEDEELLRD--DEVVTPVKKD-GIKRKERPTDKGVSW 388
LG +D + L S R L +D LL + + + D + RP W
Sbjct: 68 LGESVDPVLLKKLQVCSGRASLLSQDATLLNEYTNAHRAAARNDFNVSSARRPDLSKALW 127
Query: 389 LVKTQYISPLSMESARQSLTEKQAKELREMKGGRSILENLNDRERQIKEIEASFEACKLR 448
L+ TQYIS +++ + L + + + N QI+ I SF+A +
Sbjct: 128 LMNTQYISSMTLP---EHLGRSEKDWAKRKHSNVLDITRRNSHASQIEAIHESFKAAQNV 184
Query: 449 PIHATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAI 508
P H N ++ P E++ +LPD +R ++ TFD +P +D E + ++
Sbjct: 185 PAHEKNADIHPAEVIEVLPDIDRQHGSYIHFTFDESPASDIEGKDILRNDCPLGQVENSL 244
Query: 509 MKSYVATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFSWVREYHWDVRGDDADDPT 568
+K Y G+ S++P+KF+A+M PS+ + E + W EY + + D P
Sbjct: 245 VKPYSIDGA-SSSPDKFIAFMTPSLPIVYFPKIQEGSH-KYEWRNEYQYRMANDYGTKPD 302
Query: 569 TYLVSFD--DDEARYVPLPTKLNL-RKKRAIEGRSNDEVEHFPIPSSIAVRR 617
T + FD RYV L +++ L R+ + +G++ ++ PSSI V+R
Sbjct: 303 TVCLLFDPGMSCMRYVKLNSRMILSRRSKHAKGKAARRMQR---PSSITVKR 351
>gi|402590790|gb|EJW84720.1| hypothetical protein WUBG_04371, partial [Wuchereria bancrofti]
Length = 291
Score = 100 bits (248), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 90/279 (32%), Positives = 126/279 (45%), Gaps = 34/279 (12%)
Query: 281 FLCKLKFRNELPEPSAQPKLMAL-KKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLL 339
FLC++K+ N LP+ K +A RFT Y SSLEKN+K +L EPD G+ +DL+
Sbjct: 33 FLCRVKYSNALPDIPFDTKFLACPFVSLSRFTDYKSSSLEKNFKFELLCEPDCGVNIDLI 92
Query: 340 DLSVY--NP--PSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYI 395
+ Y NP P P+D E L +DE P R+ K V W+ KT+YI
Sbjct: 93 NPETYYVNPDSPKKHHPIDLE----LLEDEQANPQN----LRRSLQHSKMVPWMRKTEYI 144
Query: 396 SP----LSMESARQSLTEKQAKELREMKGGRSILENLNDRERQIKEIEASF-EACKLRPI 450
S + S RQ E + + K +L DR QI I +F +A K
Sbjct: 145 SSEFTRFGVSSERQ---ETRIGYCTKKKFQTDVL--YRDRASQIAAINKTFDDASKTVLH 199
Query: 451 HATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMK 510
H T K + VE L LLPDF+ + F FDG P + D +++ +M
Sbjct: 200 HPTKKGVTAVEELSLLPDFDNWMHPFALVVFDGDPIPQN-----------DKVDAQTLMP 248
Query: 511 SYVATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSF 549
+ G E+F+ Y +PS L K + D E + F
Sbjct: 249 QALIRGMMDEEGEQFVTYFLPSRETLDKRLKDAEEGLEF 287
>gi|427796871|gb|JAA63887.1| Putative rna polymerase ii-associated factor 1, partial
[Rhipicephalus pulchellus]
Length = 619
Score = 98.6 bits (244), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 114/408 (27%), Positives = 175/408 (42%), Gaps = 90/408 (22%)
Query: 291 LPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLSVY--NPPS 348
LP+ PK ++ + +RF Y +SLE+NYK L E DLG+ +DL+D Y +P +
Sbjct: 122 LPDIPFDPKFISYPFEPNRFVSYKATSLERNYKHDLLTEHDLGVTIDLIDPKTYEIDPNA 181
Query: 349 VRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSMESARQSLT 408
V L P+DE+LL +D +TP +D ++ R + V WL KT+YI+ E +R T
Sbjct: 182 V---LHPDDEKLLEED-TLTP--QDS--KRSRHHNLVVPWLKKTEYIAT---EFSRYGQT 230
Query: 409 EKQAKELREMKGGRSILENLN------DRERQIKEIEASFEACKLRPI--HATNKNLQPV 460
E K G ++ + DRE QI I +FE + +PI H + N++PV
Sbjct: 231 GVNT----ETKVGYNVKKLFKEEDLYMDRESQINAINKTFEEAQ-KPIEAHYSKPNVKPV 285
Query: 461 EILPLLPDFERYDDQFVAATFDGAPT--------ADSEIYSKMDKS-------------- 498
E+LPL PD + + F FD P + + I MD+S
Sbjct: 286 EVLPLFPDSDLWKYPFAQVMFDSDPAPITQLEEMSQAMIRGVMDESGEQFVAYFLPTEDT 345
Query: 499 ----VRDAHESRAI-----------------------MKSYVATGSDSANPEKFLAYMVP 531
RDA E M + G + E+F+AY +P
Sbjct: 346 IKKRKRDAEEGMDYMDDDEYEYRMAXXDPAPITQLEEMSQAMIRGVMDESGEQFVAYFLP 405
Query: 532 SVNELSK---------DMYDENEDVSFSWVREYHWDVRGDDADD-PTTYLVSFDDDEARY 581
+ + + K D D++E + REY+W+V+ + Y F DD Y
Sbjct: 406 TEDTIKKRKRDAEEGMDYMDDDE-YEYRMAREYNWNVKNKASKGYEENYFFVFRDDGVYY 464
Query: 582 VPLPTKLNLRKKRAIEG----RSNDEVEHFPIPSSIAVRRRANVTAIE 625
L T++ L K+R G S V H P+ ++A +T +E
Sbjct: 465 NELETRVRLTKRRLKPGVQPNNSKLVVRHRPLNEMEHKTQQARLTQLE 512
Score = 62.4 bits (150), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 34/92 (36%), Positives = 55/92 (59%), Gaps = 5/92 (5%)
Query: 281 FLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLD 340
+C++K+ N LP+ PK ++ + +RF Y +SLE+NYK L E DLG+ +DL+D
Sbjct: 34 LVCRVKYCNTLPDIPFDPKFISYPFEPNRFVSYKATSLERNYKHDLLTEHDLGVTIDLID 93
Query: 341 LSVY--NPPSVRPPLDPEDEELLRDDEVVTPV 370
Y +P +V L P+DE+LL +D + +
Sbjct: 94 PKTYEIDPNAV---LHPDDEKLLEEDTLTXTL 122
>gi|326918213|ref|XP_003205385.1| PREDICTED: RNA polymerase II-associated factor 1 homolog, partial
[Meleagris gallopavo]
Length = 385
Score = 97.1 bits (240), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 91/310 (29%), Positives = 146/310 (47%), Gaps = 48/310 (15%)
Query: 310 FTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDLSVYNPPSVRPP----LDPEDEELLRDDE 365
F +Y +SLEK +K L EPDLG+ +DL+ NP + R LDP DE+LL ++E
Sbjct: 1 FVQYKATSLEKQHKHDLLTEPDLGVTIDLI-----NPDTYRIDPGVLLDPADEKLL-EEE 54
Query: 366 VVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSMESARQSLTEKQAKELREMKGGRSIL 425
+ P KR ++ K V W+ KT+YIS E R ++ ++ E+K G S+
Sbjct: 55 IQAPTSS---KRSQQHA-KVVPWMRKTEYIST---EFNRYGVSNEKP----EVKIGVSVK 103
Query: 426 ENLN------DRERQIKEIEASFE-ACKLRPIHATNKNLQPVEILPLLPDFERYDDQFVA 478
+ DR+ QI IE +FE A K H + + P+E++P+ PDF+ + +
Sbjct: 104 QQFTEEEIYKDRDSQIAAIEKTFEDAQKAITQHYSKPRVTPIEVMPVFPDFKMWINPCAQ 163
Query: 479 ATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLAYMVPSVNELSK 538
FD P + D S A E +M + G +F+AY +P L K
Sbjct: 164 VIFDSDP-------APKDTSGAAALE---MMSQAMIRGMMDEEGNQFVAYFLPVEETLRK 213
Query: 539 DMYDENEDVSFS--------WVREYHWDVRGDDAD--DPTTYLVSFDDDEARYVPLPTKL 588
D+ E++ ++ REY+W+V+ + + + + + D Y L T++
Sbjct: 214 RKRDQEEEMDYAPEDVYDYKIAREYNWNVKNKASKGYEENYFFIFREGDGVYYNELETRV 273
Query: 589 NLRKKRAIEG 598
L K+RA G
Sbjct: 274 RLSKRRARAG 283
>gi|320169024|gb|EFW45923.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 500
Score = 95.1 bits (235), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 94/342 (27%), Positives = 163/342 (47%), Gaps = 44/342 (12%)
Query: 281 FLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLD 340
F +K++N LP+ PKL+ + D R +Y +SLE+ +K + E +L IP+DL+D
Sbjct: 114 FNWPIKYKNTLPDVPFDPKLLYYQLDPMRSIQYATTSLERQFKFDILTEANLSIPIDLID 173
Query: 341 LSVYNPPSVRPP-LDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYIS--- 396
Y P + L D+ LL D+ + T + K+K R + V+WL TQYIS
Sbjct: 174 PDRYKIPFGQTAQLSAVDQTLLSDEPLSTVMDP---KKKARLEAENVAWLRATQYISAEQ 230
Query: 397 --PLSMESARQSLTEKQAKELREMKGGRSILENL-NDRERQIKEIEASFEACKLRPI--H 451
PL +++L + ++++ I + L DR QI+ I+ +F+ K RPI H
Sbjct: 231 DKPL---EPKENLERRIGVLVQDV-----ITQGLRQDRASQIRAIQETFDHAK-RPIITH 281
Query: 452 ATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKS 511
+ NL + + P+ P + + +Q +FD AP A+ I SK++ S A
Sbjct: 282 PNHPNLTAISVQPIFP-YPHHLEQLAQISFDSAP-ANKPIGSKLEDSDFVA--------- 330
Query: 512 YVATGSDSANPEKFLAYMVPSVNELSKDMY------DENEDVSFSWVREYHWDV-RGDDA 564
++ G A + Y +P+ E+ + D+ + F+ VREY + V +G
Sbjct: 331 -MSAGMLKATRPGVMTYFLPASTEVVRKRRRLESGEDQAATLEFASVREYTFQVPKGTTT 389
Query: 565 DDPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEGRSNDEVEH 606
+D ++ + Y P+ + N+ ++RA + E+EH
Sbjct: 390 EDNFVFMEM--GGQVLYAPIGRRWNMTRRRARSAKH--ELEH 427
>gi|350585207|ref|XP_003355968.2| PREDICTED: RNA polymerase II-associated factor 1 homolog [Sus
scrofa]
Length = 257
Score = 94.4 bits (233), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 68/197 (34%), Positives = 108/197 (54%), Gaps = 22/197 (11%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL++
Sbjct: 30 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLINP 89
Query: 342 SVYNP-PSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y P+V LDP DE+LL ++E+ P ++ + K V W+ KT+YI S
Sbjct: 90 DTYRIDPNVL--LDPADEKLL-EEEIQAPTS----SKRSQQHAKVVPWMRKTEYI---ST 139
Query: 401 ESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFE-ACKLRPIHAT 453
E R ++ ++ E+K G S+ + DR+ QI IE +FE A K H +
Sbjct: 140 EFNRYGISNEKP----EVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQKSISQHYS 195
Query: 454 NKNLQPVEILPLLPDFE 470
+ PVE++P+ PDF+
Sbjct: 196 KPRVTPVEVMPVFPDFK 212
>gi|196016881|ref|XP_002118290.1| hypothetical protein TRIADDRAFT_33941 [Trichoplax adhaerens]
gi|190579121|gb|EDV19224.1| hypothetical protein TRIADDRAFT_33941 [Trichoplax adhaerens]
Length = 347
Score = 93.6 bits (231), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 92/330 (27%), Positives = 144/330 (43%), Gaps = 40/330 (12%)
Query: 281 FLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLD 340
LC++K+RN LP+ PK + + +RFT Y +SLE+ +K +L + D G+ L+D
Sbjct: 4 LLCRIKYRNTLPDIPFDPKFLRYPFENNRFTDYNTTSLEREHKHELLTDTDAGVKNGLID 63
Query: 341 LSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSM 400
Y +D + E D ++ + RK + VSW+ + +Y LS
Sbjct: 64 PDAY-------IIDEDAELDDADKALLGIEGSASMDRKGTRHSRTVSWMRRNEY---LSN 113
Query: 401 ESARQSLTEKQAKELREMKGGRSI------LENLNDRERQIKEIEASFEACKLRPI--HA 452
E R + Q + E K G S+ LE D+E QIK IE +FEA K +PI H
Sbjct: 114 EGVRMA----QNPDKPETKVGYSVRKYAKNLEKYKDKESQIKAIEKTFEATK-KPIIDHP 168
Query: 453 TNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKSY 512
T + PL PDFE + TFD P SK+ + + M +
Sbjct: 169 TKPGVTAKRSYPLFPDFEMWKYSLALVTFDSDP-------SKLTDGSKLTEQQ---MSNS 218
Query: 513 VATGSDSANPEKFLAYMVPSVNELSKDMYDENEDV-------SFSWVREYHWDVRGDDAD 565
V G +F+ Y P + K EN+D ++ REY+W V+ D+
Sbjct: 219 VIKGMQVDTGNQFVGYYTPIATDAKKRKVSENQDSETSRNCRDYNLAREYNWVVKSRDSG 278
Query: 566 DPTTYLVSFDDDEARYVPLPTKLNLRKKRA 595
+ ++ F+ + L ++ L K+RA
Sbjct: 279 RDVSNVIMFEYSFCHFNKLFERVKLSKQRA 308
>gi|198415605|ref|XP_002124310.1| PREDICTED: similar to RNA polymerase II-associated factor 1 homolog
(hPAF1) (Pancreatic differentiation protein 2) [Ciona
intestinalis]
Length = 328
Score = 93.6 bits (231), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 88/332 (26%), Positives = 149/332 (44%), Gaps = 32/332 (9%)
Query: 270 RTENRLKKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVE 329
RT +++ + + ++K+ N LP+ PK + D +RF Y +SLEK +K +LH +
Sbjct: 15 RTRGSIERGSGLVTRVKYCNTLPDIPFDPKFIRYPFDANRFVDYKPTSLEKQHKTELHTD 74
Query: 330 PDLGIPLDLLDLSVY--NPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVS 387
DLG+ +DL++ Y NP + + P DE LL DD P +D + K+ + VS
Sbjct: 75 HDLGVVIDLINPDTYKVNPNTW---IHPADEALLEDD---LPAPQDTKRSKQHSLN--VS 126
Query: 388 WLVKTQYIS-PLSMESARQSLTEKQAKELREMKGGRSILENL-NDRERQIKEIEASFEAC 445
WL +++YIS + + + E + + ++L DR+ QI I+ +FE
Sbjct: 127 WLRRSEYISQEFNRYGVKSTNVETRYLSYNPVPKKYFKQDDLYKDRDGQIAAIQKTFEDS 186
Query: 446 KLRPI-HATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHE 504
K H + N+ PV++LP+ PD + + FD P+ + A E
Sbjct: 187 KKEITKHYSKPNVFPVDVLPIYPDMKMWQHPSAQVIFDTDPSVQG----------KPAAE 236
Query: 505 SRAIMKSYVATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYH 556
M + G ++F+AY +P+ + K D E V F REY+
Sbjct: 237 QLEEMSQAMIRGMVDEEGDQFVAYFLPTPSTRQKRKRDVEEGVDFEPEETYDYRLAREYN 296
Query: 557 WDVRGDDADD-PTTYLVSFDDDEARYVPLPTK 587
W+V+ + Y F ++ Y L T+
Sbjct: 297 WNVKNKASKGYEENYYFIFREEGVYYNELETR 328
>gi|297838423|ref|XP_002887093.1| hypothetical protein ARALYDRAFT_894419 [Arabidopsis lyrata subsp.
lyrata]
gi|297332934|gb|EFH63352.1| hypothetical protein ARALYDRAFT_894419 [Arabidopsis lyrata subsp.
lyrata]
Length = 201
Score = 92.0 bits (227), Expect = 1e-15, Method: Composition-based stats.
Identities = 51/94 (54%), Positives = 61/94 (64%), Gaps = 27/94 (28%)
Query: 405 QSLTEKQAKELREMKGGRSILENLNDR---------------------------ERQIKE 437
+SLTEKQ KELREMKGG +IL+NLN+R ERQI +
Sbjct: 58 KSLTEKQGKELREMKGGINILDNLNNRCLVSQICVFLNKVKKEGLMVYFYLVSRERQIMD 117
Query: 438 IEASFEACKLRPIHATNKNLQPVEILPLLPDFER 471
IEASFEACK +PIH+TNKN+QPVE+LPLL F+R
Sbjct: 118 IEASFEACKSQPIHSTNKNVQPVEVLPLLAYFDR 151
>gi|328769977|gb|EGF80020.1| hypothetical protein BATDEDRAFT_25627 [Batrachochytrium
dendrobatidis JAM81]
Length = 404
Score = 91.7 bits (226), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 121/405 (29%), Positives = 172/405 (42%), Gaps = 79/405 (19%)
Query: 281 FLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPD--LGIPLDL 338
F KL +RN LPE PKL+ KDR +YT++SL + P L D G+PL+L
Sbjct: 11 FAFKLSYRNTLPEIPFDPKLLEHPFPKDRHYKYTYNSLYAS-TPHLVYAADDENGLPLNL 69
Query: 339 LDLSVYNPPSVRPP-----------LDPEDEELLRDDEVVTPVKK-----DGIKRKERPT 382
L Y + R P LDPED ELL V PV + + RP
Sbjct: 70 L-AHGYLENAFRNPHAALLQPNPDALDPEDLELL-----VPPVDAKLASDNAEATRVRPV 123
Query: 383 DKGVSWLVKTQYISPLSMESARQSLTEKQAKELREMKGGRSILENLNDR--ERQIKEIEA 440
V WL +T+Y++ R++ T + K +K + L +L DR E Q+ IE
Sbjct: 124 ---VPWLRRTEYVAVEGKTYGRRADTGVETKMGISIKKDKH-LSSLMDRSEEAQMLAIEN 179
Query: 441 SFEACKLRPI----HATNKNLQPVEILPLLPDFERYDDQFVAATFD----------GAPT 486
+FE + H TN +L+ VEI P+ PDFE + + + TFD GA
Sbjct: 180 TFETAAAATLSNLKHPTNPDLKLVEIFPIYPDFEFWPNLYQLVTFDDDPISRAPGGGATD 239
Query: 487 ADSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEK----FLAYMVPSVNELS----- 537
AD Y D RAI+K ANPE +A+ P+ S
Sbjct: 240 ADGTEYE------HDLKLDRAIIKPI-------ANPEDDTDFTMAFYTPTDASASAVREH 286
Query: 538 -----------KDMYDENEDVSFSWVREYHWDVRGDDADDPTTYLVSFDDDEARYVPLPT 586
D+ DE+ED S + +D ++ P + ++ A Y LP
Sbjct: 287 RERQKHLMAQGHDIEDEDEDQSHEYKFRRDFDESVSKSNYPFCIELRKEEGGAFYSMLPK 346
Query: 587 KLNLRKKRAIEGRSNDEVEHFPIPSSIAVRRRANVTAIELKEQGA 631
++ LRKKRA+ R D + P+ I V R +TA E+ E+
Sbjct: 347 RMTLRKKRALNKRERDYDVEYEKPTKILVSHR-ELTAEEISERNT 390
>gi|268557112|ref|XP_002636545.1| Hypothetical protein CBG23232 [Caenorhabditis briggsae]
Length = 453
Score = 90.5 bits (223), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 84/294 (28%), Positives = 137/294 (46%), Gaps = 33/294 (11%)
Query: 277 KPTTFLCKLKFRNELPEPSAQPKLMAL-KKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIP 335
+ F+ K +F N +P+ K M RF Y S+++++ K + + D+G+
Sbjct: 39 RKVDFMLKPRFTNNVPDVPFDAKFMPCPFVPLSRFVEYKPSTVDRDCKHAIICDDDMGLS 98
Query: 336 LDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYI 395
+DL+DL Y+ +D +D LL D+ K IKR + + K V W+ KT+YI
Sbjct: 99 VDLIDLRKYDEDPADYAMDEKDNILLEDE----SASKMSIKRSAQHS-KLVPWMRKTEYI 153
Query: 396 SPLSMESARQSLT-EKQAKEL-REMKGGRSILENLNDRERQIKEIEASFEACKLRPI--H 451
S E R +T ++Q +L +K + + + D++ QI I +FE + +P+ H
Sbjct: 154 ST---EFNRFGVTADRQETKLGYNLKKNQQVEDMYRDKQSQIDAINKTFEDVR-KPVTAH 209
Query: 452 ATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKS 511
T KN++PVE + PDFE + F FDG T +E E R +S
Sbjct: 210 HTKKNMKPVEECYIFPDFEHWKYLFTHVQFDG-DTITTEF---------GEDEKRQAQES 259
Query: 512 YVATGSDSANPEKFLAYMVPSVNELSKDM--------YDENEDVSFSWVREYHW 557
V G + + +KF A VP++ L+ M +D ++ F REY W
Sbjct: 260 SVIKGMEYED-KKFAALFVPTIEGLTHMMEDLEMDRPFDPDQKYEFLLSREYDW 312
>gi|341892745|gb|EGT48680.1| hypothetical protein CAEBREN_08454 [Caenorhabditis brenneri]
Length = 424
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 103/413 (24%), Positives = 183/413 (44%), Gaps = 59/413 (14%)
Query: 277 KPTTFLCKLKFRNELPEPSAQPKLMA-----LKKDKDRFTRYTFSSLEKNYKPQLHVEPD 331
+ F+ K +F N +P+ K M L RF Y S+++++K + + D
Sbjct: 15 RKVDFMLKPRFTNNVPDVPFDAKFMPCPFVPLS----RFVEYKQCSIDRDFKHAVICDDD 70
Query: 332 LGIPLDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVK 391
+G+ +DL+DL Y+ +D +D+ LL D+ K +KR + + K V W+ K
Sbjct: 71 MGLNVDLIDLQKYDEDKNPIEMDEKDQILLEDESAT----KMSMKRSAQHS-KLVPWMRK 125
Query: 392 TQYISPLSMESARQSLT-EKQAKEL-REMKGGRSILENLNDRERQIKEIEASFEACKLRP 449
T+YIS E R +T ++Q +L +K + + + D++ QI I +FE + +P
Sbjct: 126 TEYIST---EFNRFGITADRQETKLGYNLKKNQQVEDMYRDKQSQIDAINKTFEDVR-KP 181
Query: 450 I--HATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRA 507
+ H T K ++PVE + PDF+ + F FDG T S+I + K A ES +
Sbjct: 182 VKEHHTKKGMKPVEECFIFPDFKHWQYLFTHVQFDG-DTITSDIPEEEKKQ---AQES-S 236
Query: 508 IMKSYVATGSDSANPEKFLAYMVPSVNELSKDM--------YDENEDVSFSWVREYHWDV 559
I+K+ + + ++F A VP++ L+ M YD + F REY W +
Sbjct: 237 IIKAM------NVDDKQFAAVFVPTIECLTNYMDDLELERPYDPDRKYEFLLSREYDWKM 290
Query: 560 RGDDADDPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEGRSNDEVEHFPIPSSIAVRRRA 619
D ++ + D +Y + + + + ++R + ++ + +
Sbjct: 291 EQVPPRDRDAFVFYYRDGIFQYDEIDSNVKMTRRRKM---------------YMSRKSKM 335
Query: 620 NVTAIELKEQGAYSNSKGNSSSSKMGRVDSQEDLERSHNGSRQQDPYQSSGAE 672
VT E E+ K + + + QE LER +D SSGA+
Sbjct: 336 MVTYREFNEEEKAEMDKRTAELYEQPKTRKQEMLERVQEEEENRD---SSGAQ 385
>gi|325181082|emb|CCA15494.1| RNA polymerase IIassociated factor 1 putative [Albugo laibachii
Nc14]
Length = 525
Score = 87.0 bits (214), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 67/220 (30%), Positives = 112/220 (50%), Gaps = 19/220 (8%)
Query: 275 LKKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGI 334
L K + FL L+FRN+LP+ K + + DR ++ ++LE++Y ++H EP+LG+
Sbjct: 115 LGKQSEFLANLEFRNKLPDIPFDTKFLQYPHESDRLIKWKPNTLEQDYVFEIHEEPNLGL 174
Query: 335 PLDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQY 394
+DL+D + Y P+V PL+ DE +L ++ + K K RP VSWL +T+Y
Sbjct: 175 SIDLIDPAKYEVPTVTEPLEAGDEAVL----MMKETQSKDTKSKARPV---VSWLRRTEY 227
Query: 395 ISPLSMESARQSLTEKQAK-ELREMKGGRSILENLNDRERQI---KEIEASFEACK--LR 448
+ E + +E + + ELRE EN E Q+ + E SF+
Sbjct: 228 MGNDLYEPVHKFKSEVEFQSELREG------TENALAEEVQVTLKQRAEDSFKDVNDPEV 281
Query: 449 PIHATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTAD 488
+H KNL+P +I + PD Y +++ ++D P+ D
Sbjct: 282 LVHPFKKNLKPAKIWDVYPDQSLYPNKYAHLSYDVLPSMD 321
>gi|3287674|gb|AAC25503.1| F23149_1 [Homo sapiens]
gi|119577285|gb|EAW56881.1| Paf1, RNA polymerase II associated factor, homolog (S. cerevisiae),
isoform CRA_b [Homo sapiens]
Length = 510
Score = 82.8 bits (203), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 85/297 (28%), Positives = 132/297 (44%), Gaps = 67/297 (22%)
Query: 282 LCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIPLDLLDL 341
+C++K+ N LP+ PK + D++RF +Y +SLEK +K L EPDLG+ +DL+
Sbjct: 20 VCRVKYCNSLPDIPFDPKFITYPFDQNRFVQYKATSLEKQHKHDLLTEPDLGVTIDLI-- 77
Query: 342 SVYNPPSVRPP----LDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISP 397
NP + R LDP DE+LL ++E+ P KR ++ K V W+ KT+YIS
Sbjct: 78 ---NPDTYRIDPNVLLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVPWMRKTEYIST 129
Query: 398 LSMESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEASFEACKLRPIH 451
E R ++ E E+K G S+ + DR+ QI IE +FE +
Sbjct: 130 ---EFNRYGIS----NEKPEVKIGVSVKQQFTEEEIYKDRDSQITAIEKTFEDAQ----- 177
Query: 452 ATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKS 511
+ + I P FD P + D S A E +M
Sbjct: 178 ------KSMWINPC-----------AQVIFDSDP-------APKDTSGAAALE---MMSQ 210
Query: 512 YVATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WVREYHWDVR 560
+ G +F+AY +P L K D+ E++ ++ REY+W+V+
Sbjct: 211 AMIRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIAREYNWNVK 267
>gi|260822425|ref|XP_002606602.1| hypothetical protein BRAFLDRAFT_209622 [Branchiostoma floridae]
gi|229291946|gb|EEN62612.1| hypothetical protein BRAFLDRAFT_209622 [Branchiostoma floridae]
Length = 352
Score = 81.3 bits (199), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 85/266 (31%), Positives = 124/266 (46%), Gaps = 43/266 (16%)
Query: 353 LDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYISPLSMESARQSLTEKQA 412
LDP DE+LL +D + TP +D ++ R K VSWL +T+YIS E R + + ++A
Sbjct: 79 LDPLDEKLLEED-MSTP--QDS--KRSRHHAKNVSWLRRTEYIST---EYNRFTPSSEKA 130
Query: 413 KELREMKGGRSILENLN------DRERQIKEIEASFEACKLRPIHATNKN--LQPVEILP 464
E K G SI + DRE QI IE +FE K H +K +QP+E+L
Sbjct: 131 ----ETKIGHSIKKQFKEEDIYKDRESQIAAIEKTFEDVKTPITHHYSKGDRVQPLEVLE 186
Query: 465 LLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHE--SRAIMKSYVATGSDSANP 522
PDFE + FD P KSV A E S+A+++ V D
Sbjct: 187 FFPDFENWAFPCAQVIFDTDPAPRG-------KSVPAAVEEMSQAMIRGMVDEEGD---- 235
Query: 523 EKFLAYMVPSVNELSKDMYDENEDV--------SFSWVREYHWDVRGDDADD-PTTYLVS 573
+F+ Y +P ++K +E +V + REY+W+V+ + Y V
Sbjct: 236 -QFVGYFLPVEETVTKRKREEESEVDYMPEEEYDYMMAREYNWNVKNKSSRGYEENYFVV 294
Query: 574 FDDDEARYVPLPTKLNLRKKRAIEGR 599
F DD Y L T++ L K+RA G+
Sbjct: 295 FKDDGVYYNELETRVRLSKRRAKGGQ 320
>gi|308479030|ref|XP_003101725.1| hypothetical protein CRE_11174 [Caenorhabditis remanei]
gi|308262936|gb|EFP06889.1| hypothetical protein CRE_11174 [Caenorhabditis remanei]
Length = 426
Score = 79.7 bits (195), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 82/281 (29%), Positives = 130/281 (46%), Gaps = 29/281 (10%)
Query: 277 KPTTFLCKLKFRNELPEPSAQPKLMAL-KKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIP 335
+ F+ K +F N +P+ K M RF Y S ++++ K + + D+G+
Sbjct: 15 RKVDFMLKPRFTNNVPDVPFDAKFMPCPFVPLSRFVEYKQSGIDRDCKHAVICDDDMGLN 74
Query: 336 LDLLDLSVYNPPSV--RPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQ 393
+DL+DL Y+ V L+ +D+ LL D+ K +KR + + K V W+ KT+
Sbjct: 75 VDLIDLRKYDEDVVGEVEELNEKDQYLLEDENT----SKMSLKRSAQHS-KLVPWMRKTE 129
Query: 394 YISPLSMESARQSLT-EKQAKEL-REMKGGRSILENLNDRERQIKEIEASFEACKLRPI- 450
YIS E R +T ++Q +L +K + + + D++ QI I +F+ + +PI
Sbjct: 130 YIST---EFNRFGVTADRQETKLGYNLKKNQQVEDMYRDKQSQIDAINKTFDDVR-KPIL 185
Query: 451 -HATNKNLQPVEILPLLPDFERYDDQFVAATFDG-APTADSEIYSKMDKSVRDAHESRAI 508
H T K ++PVE + PDFE + F FDG T D K R A ES I
Sbjct: 186 EHHTKKGVKPVEEAYIFPDFEHWKHLFAYVQFDGDTVTTDLAGAEK-----RQAQESSVI 240
Query: 509 MKSYVATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSF 549
+ +KF A VP++ L+ M D D SF
Sbjct: 241 -------KAMEFEDQKFAALFVPTIECLTHMMEDLEMDRSF 274
>gi|17559000|ref|NP_505925.1| Protein C55A6.9 [Caenorhabditis elegans]
gi|74961204|sp|P90783.2|PAF1_CAEEL RecName: Full=RNA polymerase II-associated factor 1 homolog
gi|3875280|emb|CAB02869.1| Protein C55A6.9 [Caenorhabditis elegans]
Length = 425
Score = 79.3 bits (194), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 76/294 (25%), Positives = 135/294 (45%), Gaps = 32/294 (10%)
Query: 277 KPTTFLCKLKFRNELPEPSAQPKLMAL-KKDKDRFTRYTFSSLEKNYKPQLHVEPDLGIP 335
+ F+ K +F N +P+ K M RF + +++ ++YK + + D+G+
Sbjct: 15 RKVDFMLKPRFTNTVPDVPFDAKFMTCPFVPLGRFVEFQPAAIYRDYKHAVICDDDMGLN 74
Query: 336 LDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQYI 395
+DL+DL Y+ + +D +D LL DD + K + + K V W+ KT+YI
Sbjct: 75 VDLIDLKKYDEDPIETEIDEKDNILLEDDGAAKLIAK-----RSQQHSKLVPWMRKTEYI 129
Query: 396 SPLSMESARQSLT-EKQAKEL-REMKGGRSILENLNDRERQIKEIEASFEACKLRPI--H 451
S E R +T ++Q +L +K + + + D++ QI I +FE + +P+ H
Sbjct: 130 ST---EFNRFGVTADRQETKLGYNLKKNQQVEDMYRDKQSQIDAINKTFEDVR-KPVKEH 185
Query: 452 ATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHESRAIMKS 511
+ K ++ VE + PDF+ + F FDG T +E + ++ + A ES I
Sbjct: 186 YSKKGVKAVEESFVFPDFDHWKHLFAHVQFDG-DTITTEFEEEDER--QQARESSVI--- 239
Query: 512 YVATGSDSANPEKFLAYMVPSVNELSKDM--------YDENEDVSFSWVREYHW 557
+ +KF A VP++ L+ M +DE+ F REY +
Sbjct: 240 ----KAMEFEDQKFAAVFVPTIGCLTSFMDDLELERPFDEDMKYEFLLSREYTF 289
>gi|301117418|ref|XP_002906437.1| RNA polymerase II-associated factor 1 [Phytophthora infestans
T30-4]
gi|262107786|gb|EEY65838.1| RNA polymerase II-associated factor 1 [Phytophthora infestans
T30-4]
Length = 509
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 90/331 (27%), Positives = 153/331 (46%), Gaps = 36/331 (10%)
Query: 275 LKKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQLHVEPDLGI 334
L K + FL L+FRN LP+ K + + +R +Y ++LE +Y ++H EP+LG+
Sbjct: 90 LGKQSEFLATLEFRNTLPDIPFDTKFVKYPHEPERLIKYKPNTLEMDYTYEIHEEPNLGL 149
Query: 335 PLDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQY 394
+DL+D + Y P PL+ DE+ L E K + K RPT VSWL +T+Y
Sbjct: 150 TIDLIDPTKYEAPLEPEPLEVGDEQTLMMKEDHAGNKG---RSKARPT---VSWLRRTEY 203
Query: 395 ISPLSMESARQSLTEKQAKE-LREMKGGRSILE-----NLNDRERQIKEIEASFEAC--- 445
+ +S + +E + + LRE G + L L DR EASF
Sbjct: 204 MGNDLYDSVHKFKSEAEIQSALRE--GTENALAEVVSVTLEDR------AEASFHDINDS 255
Query: 446 KLRPIHATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHES 505
KL +H NK + +++ + PD +++ ++D P+ D + SK S E
Sbjct: 256 KL-LVHPHNKRKKIIKVWDVFPDQILSANKYAILSYDMLPSED--VKSKGITS----REE 308
Query: 506 RAIMKSYVATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFSWVREYHWDVRGDDAD 565
RA++ V + N + ++P+ S + + N+ FS++REY V D+
Sbjct: 309 RALLCG-VNKKIQAGNELIQGSILLPNA---STEEEESNDREKFSYLREYLMSVDSLDSR 364
Query: 566 DPTTYLVSFD--DDEARYVPLPTKLNLRKKR 594
+ + D ++ Y + ++ LRK +
Sbjct: 365 ESQHIVFMLDPESNQFTYSDVLNRIQLRKTK 395
>gi|348688394|gb|EGZ28208.1| hypothetical protein PHYSODRAFT_551925 [Phytophthora sojae]
Length = 511
Score = 76.6 bits (187), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 88/332 (26%), Positives = 155/332 (46%), Gaps = 37/332 (11%)
Query: 275 LKKPTTFLCKLKFRNELPEPSAQPKLMALKKD-KDRFTRYTFSSLEKNYKPQLHVEPDLG 333
L K + FL L+FRN LP+ K + + + R +Y ++LE +Y ++H EP+LG
Sbjct: 90 LGKQSEFLANLEFRNTLPDIPFDTKFVKYPHEPERRLIKYKPNTLEMDYTYEIHEEPNLG 149
Query: 334 IPLDLLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSWLVKTQ 393
+ +DL+D + Y P+ PL+ DE++L E K + K RPT VSWL +T+
Sbjct: 150 LTIDLIDPAKYEAPAEPEPLEVGDEQVLMMKEDHANTKG---RSKVRPT---VSWLRRTE 203
Query: 394 YISPLSMESARQSLTEKQAKE-LREMKGGRSILE-----NLNDRERQIKEIEASFEAC-- 445
Y+ +S + +E + + LRE G + L L DR EASF
Sbjct: 204 YMGNDLYDSVHKFKSEAEIQSALRE--GTENALAEVVSVTLEDR------AEASFRDIND 255
Query: 446 -KLRPIHATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVRDAHE 504
KL +H NK + ++ + PD +++ ++D P+ D + +K + + E
Sbjct: 256 PKL-LVHPHNKKKKIAKVWDVFPDQHLSANKYAILSYDILPSEDVK-----NKGITN-RE 308
Query: 505 SRAIMKSYVATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFSWVREYHWDVRGDDA 564
RA++ V + N + ++P+ S + + N+ FS++REY V D+
Sbjct: 309 DRALLCG-VNKKIQAGNEHIQGSILLPNA---STEDEESNDREKFSYLREYLMSVESLDS 364
Query: 565 DDPTTYLVSFD--DDEARYVPLPTKLNLRKKR 594
+ + D ++ Y + ++ LRK +
Sbjct: 365 RESQHVVFMLDPESNQFTYSDVLNRIQLRKTK 396
>gi|148692191|gb|EDL24138.1| Paf1, RNA polymerase II associated factor, homolog (S. cerevisiae),
isoform CRA_a [Mus musculus]
Length = 470
Score = 76.3 bits (186), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 83/288 (28%), Positives = 130/288 (45%), Gaps = 49/288 (17%)
Query: 330 PDLGIPLDLLDLSVYNPPSVRPP--LDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVS 387
P LGIP + PP+ P LDP DE+LL ++E+ P KR ++ K V
Sbjct: 18 PGLGIP--------HGPPAPYPLVLLDPADEKLL-EEEIQAPTSS---KRSQQHA-KVVP 64
Query: 388 WLVKTQYISPLSMESARQSLTEKQAKELREMKGGRSILENLN------DRERQIKEIEAS 441
W+ KT+YIS E R ++ E E+K G S+ + DR+ QI IE +
Sbjct: 65 WMRKTEYIST---EFNRYGIS----NEKPEVKIGVSVKQQFTEEEIYKDRDSQITAIEKT 117
Query: 442 FE-ACKLRPIHATNKNLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYSKMDKSVR 500
FE A K H + + PVE++P+ PDF+ + + FD P + D S
Sbjct: 118 FEDAQKSISQHYSKPRVTPVEVMPVFPDFKMWINPCAQVIFDSDP-------APKDTSGA 170
Query: 501 DAHESRAIMKSYVATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFS--------WV 552
A E +M + G +F+AY +P L K D+ E++ ++
Sbjct: 171 AALE---MMSQAMIRGMMDEEGNQFVAYFLPVEETLKKRKRDQEEEMDYAPDDVYDYKIA 227
Query: 553 REYHWDVRGDDAD--DPTTYLVSFDDDEARYVPLPTKLNLRKKRAIEG 598
REY+W+V+ + + + + + D Y L T++ L K+RA G
Sbjct: 228 REYNWNVKNKASKGYEENYFFIFREGDGVYYNELETRVRLSKRRAKAG 275
>gi|290975262|ref|XP_002670362.1| predicted protein [Naegleria gruberi]
gi|284083920|gb|EFC37618.1| predicted protein [Naegleria gruberi]
Length = 620
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 66/220 (30%), Positives = 106/220 (48%), Gaps = 31/220 (14%)
Query: 279 TTFLCKLKFRNELPEPSAQPKLMALKKD-KDRFTRYTFSSLEKNYKPQLHVEPDLGIPLD 337
++FLC +KF+ LPE P L+ KD + F Y +SLEKN+K +L E +LGI ++
Sbjct: 142 SSFLCNMKFQTGLPEILLDPVLLEYPKDFTENFIGYKTTSLEKNHKFELLTENNLGINMN 201
Query: 338 LLDLSVYNPPSVRPPLDPEDEELLRDDEVVTPVK--KDGIKRK------ERPTDKGVSWL 389
L++ VY LDPEDE LL+ PVK K I R+ RP SWL
Sbjct: 202 LINPFVYYKDE-SAKLDPEDELLLK------PVKSAKKEIARELVENLSSRP-----SWL 249
Query: 390 VKTQYISPLSMESAR--------QSLTEKQAKELREMKGGRSILENLNDRERQIKEIEAS 441
Y+ ++ + A Q + E+ ++++ R + E R + K+++A
Sbjct: 250 KAPTYVDMINYKKAVKGYGEDLIQDVEEEHVQKIKPKTVPRKVEE--KSRFEKAKQLDAK 307
Query: 442 FEACKLRPIHATNKNLQPVEILPLLPDFERYDDQFVAATF 481
F++ L + ++ + LLP+F +F+AA F
Sbjct: 308 FQSDTLTRTLPNGQKVKAIACYSLLPNFATMGTKFMAANF 347
>gi|400602635|gb|EJP70237.1| chromatin remodeling complex subunit [Beauveria bassiana ARSEF
2860]
Length = 487
Score = 62.4 bits (150), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 73/249 (29%), Positives = 117/249 (46%), Gaps = 47/249 (18%)
Query: 267 SGERTENRLKKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYT---FSSLEKNYK 323
SGER ++ F+ ++++ N LP P PKL+ + +YT F+S +
Sbjct: 8 SGERVIHQ-----DFIARIRYSNALPPPPNPPKLLDIPNTGLSSGQYTTPGFASRLAREQ 62
Query: 324 PQLHVEPD--LGIPLDLL--------DLSVYNPPSVRPPLDPEDEELLRDDEVVTPVKKD 373
P L++E D LG+PLDL+ D S P+ PP+ P D+ LLR P+
Sbjct: 63 P-LNIEADAELGMPLDLVGMPGVFDGDESSLQAPAQAPPVHPHDKPLLR------PIAAL 115
Query: 374 GIKRKERPTDKGVSWLVKTQYISPLSME-----SARQSLTEKQAKELREMKGGRSILENL 428
G K + + VS+L +T+YIS L+ + S R L + + +R+ + E +
Sbjct: 116 G---KPKVAEANVSFLRRTEYISSLASKRFEGNSPRALLMKAKRPVVRQAEAAADAPETI 172
Query: 429 NDRERQIKEIEASFEACKL------RPIHATNKNLQPVEILPLLPDFERYDDQ--FVAAT 480
++IE SF+ + R H T K+L+ VE+ PL+PD + + D +V
Sbjct: 173 K------RKIEHSFDVAEQDLKDPKRARHPTKKHLKAVEVTPLIPDLDAFPDSGAYVTIK 226
Query: 481 FDGAPTADS 489
F P A S
Sbjct: 227 FLTNPMAGS 235
>gi|299745335|ref|XP_001831645.2| hypothetical protein CC1G_05716 [Coprinopsis cinerea okayama7#130]
gi|298406540|gb|EAU90178.2| hypothetical protein CC1G_05716 [Coprinopsis cinerea okayama7#130]
Length = 424
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 99/374 (26%), Positives = 157/374 (41%), Gaps = 90/374 (24%)
Query: 281 FLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQ-LHVEPDLGIPLDL- 338
L ++++ N +P P PKL+ + + R+ R F + + +P + V+ + G+PLDL
Sbjct: 11 LLVRVRYTNPIPPPPCPPKLLDIPTNPMRYARPEFLNAVASEQPLPMIVDAECGMPLDLS 70
Query: 339 ---------LDLSVYNP-PSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTD----- 383
D S NP P+ P LDP+D +L D P G+ P
Sbjct: 71 QFESLWDEGADDSSLNPDPNNLPRLDPKDAFMLGD-----PSSSTGMYPTSGPIGPTPIA 125
Query: 384 ---KGVSWLVKTQYISPLSMESARQSLTEKQAKELREMKGGRSILENLNDRERQIKEIEA 440
V WL KT+Y LS ES ++ T + E K + + +++ R Q+ IEA
Sbjct: 126 PMPASVPWLRKTEY---LSRESGQRGTTTQ------EPKVATTTVVDVS-RNAQLATIEA 175
Query: 441 SFEACK-------LRPIHATNKNLQPVEILPLLPDFERYDDQFVAATFDGAP------TA 487
SF+AC LR H N NL VE P+LPD + + +Q+ F P
Sbjct: 176 SFKACNDDFDLENLR--HPNNPNLTAVESYPILPDADIWANQYDLFRFSERPGERPADVP 233
Query: 488 DSEIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLA-----YMVPSVNELSKDMYD 542
D + + + ++ H+S + Y+ DSA K L Y VP +
Sbjct: 234 DERLDCAILRPMKTEHDS--FLAYYLTKDDDSALRLKELRASLNPYEVP----------E 281
Query: 543 ENEDVSFSWVREYHWDVRGDDADDPTTYLVSFDDDE---------------------ARY 581
E E+ F +VR+Y + + + P +L++ +DE A Y
Sbjct: 282 EQEETIFQFVRDY--ETVKVEQEVPNEFLLTIQEDEGIRSMADVVNGTDQRPRKEKGAYY 339
Query: 582 VPLPTKLNLRKKRA 595
+ K+ L+KKRA
Sbjct: 340 KNIERKMLLKKKRA 353
>gi|50553684|ref|XP_504253.1| YALI0E22022p [Yarrowia lipolytica]
gi|49650122|emb|CAG79848.1| YALI0E22022p [Yarrowia lipolytica CLIB122]
Length = 386
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 92/371 (24%), Positives = 171/371 (46%), Gaps = 52/371 (14%)
Query: 281 FLCKLKFRNELPEPSAQPKLMALKKDKDRFTRY---TFSSLEKNYKPQLHVEPDLGIPLD 337
+L +++++NELP P PK++ + ++ T S +++ PQ++V+ DLG+PLD
Sbjct: 7 YLVRVRYQNELPPPELPPKMLNIPVPPEKLTSLGINILSDMQRRESPQINVDNDLGMPLD 66
Query: 338 LLDL---------SVYNPPSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTDKGVSW 388
L ++ S P P LDP D LL++ + K +P GV++
Sbjct: 67 LTEIRGVFEQDDESGLLPLENLPELDPRDLVLLKEPQ--------STGNKSQP---GVAF 115
Query: 389 LVKTQYISPLSMESARQSLTEKQAKELREMKGGRSILENLNDRERQIKEIEASFEACK-- 446
L +T+YIS S S + L A + R +++ E+ D E Q++ +EA+FEA
Sbjct: 116 LRRTEYIS--SEVSGGRKLETLSASQRR--LAQKTLEESFQDPESQLRAVEATFEAANTD 171
Query: 447 LRPI-HATNKNLQPVEILPLLPDFERYDDQFVAATFDGA-----PTADSEIYSKMDKSVR 500
++ + H +L VE P+LP+ + +D F+ G+ P D KM S+
Sbjct: 172 IKTLKHPKKPHLTAVESFPILPNSQIFDLTFLTMKMVGSAVGVPPLPDGSPDPKMSVSLF 231
Query: 501 D--AHESRAIMKSYVATGSDSANPEKFLAYMVPSVNELSKDMYDENEDVSFSWVREYHWD 558
E+ M ++ +S + ++ ++ + + D+ F R+Y D
Sbjct: 232 RPMTLETDEWMSMFLTKDEESKDLKR-------QLDSTDEHVADDKHVYRFEHKRDYDMD 284
Query: 559 VRGDDADDPTTYLVSFDDDE----ARYVPLPTKLNLRKKRAIEGRSNDEVEHFPIPSSIA 614
++ +A ++ D D+ A YVP+ K NL+++R ++ + EH +
Sbjct: 285 LQM-NASQFEEIVIDVDLDKPGSVAHYVPVQGKTNLKRRRVLKSLRSQIKEHNIAAIDLT 343
Query: 615 VRRRANVTAIE 625
+R ++TA E
Sbjct: 344 LR---DITAEE 351
>gi|393222197|gb|EJD07681.1| hypothetical protein FOMMEDRAFT_24985 [Fomitiporia mediterranea
MF3/22]
Length = 419
Score = 52.4 bits (124), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 96/365 (26%), Positives = 159/365 (43%), Gaps = 77/365 (21%)
Query: 276 KKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTFSSLEKNYKPQ-LHVEPDLGI 334
K L K++F+N LP P PKL+ + + R+ F++ + P + V+ DLG+
Sbjct: 5 KSKLDLLFKVRFQNPLPPPPFPPKLLNIPTNPTRYAGPDFTASLASETPLPMVVDADLGM 64
Query: 335 PLDL----------LDLSVYNP-PSVRPPLDPEDEELLRDDEVVTPVKKDGIKRKERPTD 383
PLDL D S NP P + PP+DP+D +L D P P+
Sbjct: 65 PLDLSYYECLWGDGQDDSQINPDPEMLPPVDPKDAFMLGD--TSAPTSNGVFTGNGTPSA 122
Query: 384 KGVSWLVKTQYIS------PLSMESARQSLTEKQAKELREMKGGRSILENLNDRERQIKE 437
VSWL KT+YIS P + R++ + A+ + +++ RE Q+++
Sbjct: 123 PQVSWLRKTEYISSTVNRTPTVIRDTRRTAAAEDAR-----------IVDIS-REAQLRD 170
Query: 438 IEASFEAC----KLRPIHATNK-NLQPVEILPLLPDFERYDDQFVAATFD---GAPTADS 489
IE SF + L + NK + VE +LPD + + + + F G AD
Sbjct: 171 IENSFRSLGDNFDLSQLKHPNKPGVTAVESYEILPDTDIWANAYDLFRFSERPGDRAAD- 229
Query: 490 EIYSKMDKSVRDAHESRAIMKSYVATGSDSANPEKFLAYMVPSVNELSKDMYD-----EN 544
V D + I++ A G + FLAY +P +E++++ + EN
Sbjct: 230 ---------VPDPRINCGILRPMEADG------DHFLAYFLPKEDEMAENFVERRRNGEN 274
Query: 545 -EDVSFSWVREYHWDVRGDDADDPTTYLVSFDDDE-------------ARYVPLPTKLNL 590
E+ FS++R+Y + + D +L+ DD + A Y + K+ L
Sbjct: 275 VEETIFSFIRDY--ETVKIEQDVQNEFLLVLDDGDIKSEGDSGRREKGAYYKNIERKILL 332
Query: 591 RKKRA 595
+K+RA
Sbjct: 333 KKRRA 337
>gi|403417479|emb|CCM04179.1| predicted protein [Fibroporia radiculosa]
Length = 433
Score = 47.4 bits (111), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 94/334 (28%), Positives = 137/334 (41%), Gaps = 58/334 (17%)
Query: 276 KKPTTFLCKLKFRNELPEPSAQPKLMALKKDKDRFTRYTF-SSLEKNYKPQLHVEPDLGI 334
K L ++++ N LP P PKL+ D R+ R F + + + V+ DLG+
Sbjct: 5 KSKLDLLIRVRYSNPLPAPPCPPKLLDTPTDPMRYARPEFLDDIAADTPLPMIVDGDLGM 64
Query: 335 PLDLL----------DLSVYNPPSVRPP-LDPEDEELLRDDEVVTPVKKDGIKR------ 377
PLDL D S NP PP LDP+DE LL D TP +G
Sbjct: 65 PLDLSRWECLWGENGDDSELNPDPDNPPVLDPKDELLLADPSTSTPFSSNGFSTSGLTPS 124
Query: 378 KERPTDKGVSWLVKTQYISPLSMESARQSLTEKQAKELREMKGGRSILENLNDRERQIKE 437
+ V WL KT+Y LS E ++ ++AK M I R QI++
Sbjct: 125 SSQTMMPNVPWLRKTEY---LSREGVTRASLSQEAK--HNMDAAIDI-----SRSAQIRD 174
Query: 438 IEASF---EACKLRPIHATNK-NLQPVEILPLLPDFERYDDQFVAATFDGAPTADSEIYS 493
IEASF E L+ + NK N+ VE + PD E + + + F P
Sbjct: 175 IEASFAVTENFDLKTLRHPNKPNVTAVESYEIFPDAEIWANAYDLFRFSERP-------G 227
Query: 494 KMDKSVRDAHESRAIMKSYVATGSDSANPEKFLAYMVPSVNEL-----------SKDMYD 542
+ V D AI++ + G + FLAY + +EL S D +
Sbjct: 228 ERPPDVEDPRLDCAILRPMESDG------DHFLAYYLTKEDELAVEFKKMRLARSPDATE 281
Query: 543 ENEDVSFSWVREYHWDVRGDDADDPTTYLVSFDD 576
E E F +VR+Y + + + P +L+ DD
Sbjct: 282 EEEPTPFHFVRDY--ETVKVEQEVPNEFLLVIDD 313
>gi|361129281|gb|EHL01193.1| putative protein transport protein SEC31 [Glarea lozoyensis 74030]
Length = 1267
Score = 41.6 bits (96), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 50/150 (33%), Positives = 58/150 (38%), Gaps = 16/150 (10%)
Query: 11 PQSSFPPPPPPNQNPSQPPP---PPQQQQQQQRPNPYSQNWGGYSNTGGGAQQHYHQPYS 67
P SF PP P NP P PQ QQ Q PN Y GGY G Q Y P
Sbjct: 833 PSKSFVPPTPAASNPYAPAGNAYAPQGYQQPQAPNAY----GGY-----GQPQAYGAPSP 883
Query: 68 YAQPPPPPPPESSYPPPPPPPPPPPPTQQQTQPSMYYSSNQYNQNSMYPPMQPPLP-PPP 126
Y P PP S + PP PPP + Q+N M ++P
Sbjct: 884 YNAPANGAPP-SQFATGPPRNTPPPTGTGAPYSARNKDMAQWNDTPMV--VKPATARRGT 940
Query: 127 PSSPPPSSSIPPPPPPGSPPPPPPKDVEGR 156
PS+ P +S +P P PPPP GR
Sbjct: 941 PSAAPITSPLPNPQSASMTPPPPQAGFGGR 970
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.309 0.130 0.388
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 13,849,179,993
Number of Sequences: 23463169
Number of extensions: 790285389
Number of successful extensions: 19224625
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 41542
Number of HSP's successfully gapped in prelim test: 94768
Number of HSP's that attempted gapping in prelim test: 9575152
Number of HSP's gapped (non-prelim): 2970494
length of query: 677
length of database: 8,064,228,071
effective HSP length: 150
effective length of query: 527
effective length of database: 8,839,720,017
effective search space: 4658532448959
effective search space used: 4658532448959
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 81 (35.8 bits)