BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 004094
(774 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q6QNU4|DDB1_SOLLC DNA damage-binding protein 1 OS=Solanum lycopersicum GN=DDB1 PE=1 SV=1
Length = 1090
Score = 1431 bits (3705), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 703/767 (91%), Positives = 740/767 (96%), Gaps = 2/767 (0%)
Query: 10 QLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVR 69
QL+KLNLQPD KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVR
Sbjct: 324 QLVKLNLQPDTKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVR 383
Query: 70 NGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEG 129
NGIGINEQASVELQGIKGMWSLRS+TDDP+DTFLVVSFISETR+LAMNLEDELEETEIEG
Sbjct: 384 NGIGINEQASVELQGIKGMWSLRSATDDPYDTFLVVSFISETRVLAMNLEDELEETEIEG 443
Query: 130 FCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLL 189
F SQ QTLFCHDA+YNQLVQVTS SVRLVSSTSR+L+NEW +P GYSVNVATANA+QVLL
Sbjct: 444 FNSQVQTLFCHDAVYNQLVQVTSNSVRLVSSTSRDLKNEWFAPVGYSVNVATANATQVLL 503
Query: 190 ATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIF 249
ATGGGHLVYLEIGDG+L EVK+A+L+Y+ISCLDINPIGENP+YS IAAVGMWTDISVRI+
Sbjct: 504 ATGGGHLVYLEIGDGVLNEVKYAKLDYDISCLDINPIGENPNYSNIAAVGMWTDISVRIY 563
Query: 250 SLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKV 309
SLPDLNLITKE LGGEIIPRSVL+C+FEGISYLLCALGDGHLLNF+L+M TGELTDRKKV
Sbjct: 564 SLPDLNLITKEQLGGEIIPRSVLMCSFEGISYLLCALGDGHLLNFVLSMSTGELTDRKKV 623
Query: 310 SLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPD 369
SLGTQPITLRTFSSK+TTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFN AAFPD
Sbjct: 624 SLGTQPITLRTFSSKDTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNVAAFPD 683
Query: 370 SLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK-NQSCAEESE 428
SLAIAKEGELTIGTID+IQKLHIRSIPLGEH RRI HQEQ+RTFA+CS+K QS A++ E
Sbjct: 684 SLAIAKEGELTIGTIDEIQKLHIRSIPLGEHARRISHQEQTRTFALCSVKYTQSNADDPE 743
Query: 429 MHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGR 488
MHFVRLLDDQTFEFISTYPLD FEYGCSILSCSFSDDSNVYYC+GTAYV+PEENEPTKGR
Sbjct: 744 MHFVRLLDDQTFEFISTYPLDQFEYGCSILSCSFSDDSNVYYCIGTAYVMPEENEPTKGR 803
Query: 489 ILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD-GTRELQSE 547
ILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKW R+D G+RELQ+E
Sbjct: 804 ILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWASREDGGSRELQTE 863
Query: 548 CGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDD 607
CGHHGHILALYVQTRGDFIVVGDLMKSISLLI+KHEEGAIEERARDYNANWMSAVEILDD
Sbjct: 864 CGHHGHILALYVQTRGDFIVVGDLMKSISLLIFKHEEGAIEERARDYNANWMSAVEILDD 923
Query: 608 DIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVG 667
DIYLGAENNFNLFTVRKNSEGATDEER RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVG
Sbjct: 924 DIYLGAENNFNLFTVRKNSEGATDEERSRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVG 983
Query: 668 QIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV 727
QIPTVIFGTVNGVIGVIASLPH+QYLFLEKLQTNLRKVIKGVGGL+HEQWRSF NEKKTV
Sbjct: 984 QIPTVIFGTVNGVIGVIASLPHDQYLFLEKLQTNLRKVIKGVGGLSHEQWRSFYNEKKTV 1043
Query: 728 DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 774
DAKNFLDGDLIESFLDLSR RM+EISK M+V VEEL KRVEELTRLH
Sbjct: 1044 DAKNFLDGDLIESFLDLSRNRMEEISKAMSVPVEELMKRVEELTRLH 1090
>sp|Q6E7D1|DDB1_SOLCE DNA damage-binding protein 1 OS=Solanum cheesmanii GN=DDB1 PE=3 SV=1
Length = 1095
Score = 1431 bits (3704), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 703/767 (91%), Positives = 740/767 (96%), Gaps = 2/767 (0%)
Query: 10 QLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVR 69
QL+KLNLQPD KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVR
Sbjct: 329 QLVKLNLQPDTKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVR 388
Query: 70 NGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEG 129
NGIGINEQASVELQGIKGMWSLRS+TDDP+DTFLVVSFISETR+LAMNLEDELEETEIEG
Sbjct: 389 NGIGINEQASVELQGIKGMWSLRSATDDPYDTFLVVSFISETRVLAMNLEDELEETEIEG 448
Query: 130 FCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLL 189
F SQ QTLFCHDA+YNQLVQVTS SVRLVSSTSR+L+NEW +P GYSVNVATANA+QVLL
Sbjct: 449 FNSQVQTLFCHDAVYNQLVQVTSNSVRLVSSTSRDLKNEWFAPVGYSVNVATANATQVLL 508
Query: 190 ATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIF 249
ATGGGHLVYLEIGDG+L EVK+A+L+Y+ISCLDINPIGENP+YS IAAVGMWTDISVRI+
Sbjct: 509 ATGGGHLVYLEIGDGVLNEVKYAKLDYDISCLDINPIGENPNYSNIAAVGMWTDISVRIY 568
Query: 250 SLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKV 309
SLPDLNLITKE LGGEIIPRSVL+C+FEGISYLLCALGDGHLLNF+L+M TGELTDRKKV
Sbjct: 569 SLPDLNLITKEQLGGEIIPRSVLMCSFEGISYLLCALGDGHLLNFVLSMSTGELTDRKKV 628
Query: 310 SLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPD 369
SLGTQPITLRTFSSK+TTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFN AAFPD
Sbjct: 629 SLGTQPITLRTFSSKDTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNVAAFPD 688
Query: 370 SLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK-NQSCAEESE 428
SLAIAKEGELTIGTID+IQKLHIRSIPLGEH RRI HQEQ+RTFA+CS+K QS A++ E
Sbjct: 689 SLAIAKEGELTIGTIDEIQKLHIRSIPLGEHARRISHQEQTRTFALCSVKYTQSNADDPE 748
Query: 429 MHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGR 488
MHFVRLLDDQTFEFISTYPLD FEYGCSILSCSFSDDSNVYYC+GTAYV+PEENEPTKGR
Sbjct: 749 MHFVRLLDDQTFEFISTYPLDQFEYGCSILSCSFSDDSNVYYCIGTAYVMPEENEPTKGR 808
Query: 489 ILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD-GTRELQSE 547
ILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKW R+D G+RELQ+E
Sbjct: 809 ILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWASREDGGSRELQTE 868
Query: 548 CGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDD 607
CGHHGHILALYVQTRGDFIVVGDLMKSISLLI+KHEEGAIEERARDYNANWMSAVEILDD
Sbjct: 869 CGHHGHILALYVQTRGDFIVVGDLMKSISLLIFKHEEGAIEERARDYNANWMSAVEILDD 928
Query: 608 DIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVG 667
DIYLGAENNFNLFTVRKNSEGATDEER RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVG
Sbjct: 929 DIYLGAENNFNLFTVRKNSEGATDEERSRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVG 988
Query: 668 QIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV 727
QIPTVIFGTVNGVIGVIASLPH+QYLFLEKLQTNLRKVIKGVGGL+HEQWRSF NEKKTV
Sbjct: 989 QIPTVIFGTVNGVIGVIASLPHDQYLFLEKLQTNLRKVIKGVGGLSHEQWRSFYNEKKTV 1048
Query: 728 DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 774
DAKNFLDGDLIESFLDLSR RM+EISK M+V VEEL KRVEELTRLH
Sbjct: 1049 DAKNFLDGDLIESFLDLSRNRMEEISKAMSVPVEELMKRVEELTRLH 1095
>sp|Q9M0V3|DDB1A_ARATH DNA damage-binding protein 1a OS=Arabidopsis thaliana GN=DDB1A PE=1 SV=1
Length = 1088
Score = 1429 bits (3700), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 692/765 (90%), Positives = 737/765 (96%)
Query: 10 QLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVR 69
QL+KLNL PDAKGSYVEVLERY+NLGPIVDFCVVDLERQGQGQVVTCSGA+KDGSLR+VR
Sbjct: 324 QLVKLNLHPDAKGSYVEVLERYINLGPIVDFCVVDLERQGQGQVVTCSGAFKDGSLRVVR 383
Query: 70 NGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEG 129
NGIGINEQASVELQGIKGMWSL+SS D+ FDTFLVVSFISETRILAMNLEDELEETEIEG
Sbjct: 384 NGIGINEQASVELQGIKGMWSLKSSIDEAFDTFLVVSFISETRILAMNLEDELEETEIEG 443
Query: 130 FCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLL 189
F SQ QTLFCHDA+YNQLVQVTS SVRLVSST+RELR+EW +P G++VNVATANASQVLL
Sbjct: 444 FLSQVQTLFCHDAVYNQLVQVTSNSVRLVSSTTRELRDEWHAPAGFTVNVATANASQVLL 503
Query: 190 ATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIF 249
ATGGGHLVYLEIGDG LTEV+HA LEYE+SCLDINPIG+NP+YSQ+AAVGMWTDISVRIF
Sbjct: 504 ATGGGHLVYLEIGDGKLTEVQHALLEYEVSCLDINPIGDNPNYSQLAAVGMWTDISVRIF 563
Query: 250 SLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKV 309
SLP+L LITKE LGGEIIPRSVLLCAFEGISYLLCALGDGHLLNF ++ TG+L DRKKV
Sbjct: 564 SLPELTLITKEQLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFQMDTTTGQLKDRKKV 623
Query: 310 SLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPD 369
SLGTQPITLRTFSSK+ THVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPD
Sbjct: 624 SLGTQPITLRTFSSKSATHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPD 683
Query: 370 SLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEM 429
SLAIA+EGELTIGTIDDIQKLHIR+IPLGEH RRICHQEQ+RTF ICSL NQS +EESEM
Sbjct: 684 SLAIAREGELTIGTIDDIQKLHIRTIPLGEHARRICHQEQTRTFGICSLGNQSNSEESEM 743
Query: 430 HFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRI 489
HFVRLLDDQTFEF+STYPLD+FEYGCSILSCSF++D NVYYCVGTAYVLPEENEPTKGRI
Sbjct: 744 HFVRLLDDQTFEFMSTYPLDSFEYGCSILSCSFTEDKNVYYCVGTAYVLPEENEPTKGRI 803
Query: 490 LVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 549
LVFIVEDG+LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG
Sbjct: 804 LVFIVEDGRLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 863
Query: 550 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 609
HHGHILALYVQTRGDFIVVGDLMKSISLL+YKHEEGAIEERARDYNANWMSAVEILDDDI
Sbjct: 864 HHGHILALYVQTRGDFIVVGDLMKSISLLLYKHEEGAIEERARDYNANWMSAVEILDDDI 923
Query: 610 YLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI 669
YLGAENNFNL TV+KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDS++GQI
Sbjct: 924 YLGAENNFNLLTVKKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGQI 983
Query: 670 PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDA 729
PTVIFGTVNGVIGVIASLP EQY FLEKLQ++LRKVIKGVGGL+HEQWRSFNNEK+T +A
Sbjct: 984 PTVIFGTVNGVIGVIASLPQEQYTFLEKLQSSLRKVIKGVGGLSHEQWRSFNNEKRTAEA 1043
Query: 730 KNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 774
+NFLDGDLIESFLDLSR +M++ISK+MNV VEELCKRVEELTRLH
Sbjct: 1044 RNFLDGDLIESFLDLSRNKMEDISKSMNVQVEELCKRVEELTRLH 1088
>sp|O49552|DDB1B_ARATH DNA damage-binding protein 1b OS=Arabidopsis thaliana GN=DDB1B PE=2 SV=2
Length = 1088
Score = 1383 bits (3580), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 670/765 (87%), Positives = 723/765 (94%)
Query: 10 QLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVR 69
QLIKLNLQPDAKGSYVE+LE+YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVR
Sbjct: 324 QLIKLNLQPDAKGSYVEILEKYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVR 383
Query: 70 NGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEG 129
NGIGINEQASVELQGIKGMWSL+SS D+ FDTFLVVSFISETRILAMN+EDELEETEIEG
Sbjct: 384 NGIGINEQASVELQGIKGMWSLKSSIDEAFDTFLVVSFISETRILAMNIEDELEETEIEG 443
Query: 130 FCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLL 189
F S+ QTLFCHDA+YNQLVQVTS SVRLVSST+RELRN+W +P G+SVNVATANASQVLL
Sbjct: 444 FLSEVQTLFCHDAVYNQLVQVTSNSVRLVSSTTRELRNKWDAPAGFSVNVATANASQVLL 503
Query: 190 ATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIF 249
ATGGGHLVYLEIGDG LTEVKH LEYE+SCLDINPIG+NP+YSQ+AAVGMWTDISVRIF
Sbjct: 504 ATGGGHLVYLEIGDGTLTEVKHVLLEYEVSCLDINPIGDNPNYSQLAAVGMWTDISVRIF 563
Query: 250 SLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKV 309
LPDL LITKE LGGEIIPRSVLLCAFEGISYLLCALGDGHLLNF L+ G+L DRKKV
Sbjct: 564 VLPDLTLITKEELGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFQLDTSCGKLRDRKKV 623
Query: 310 SLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPD 369
SLGT+PITLRTFSSK+ THVFAASDRP VIYS+NKKLLYSNVNLKEVSHMCPFNSAAFPD
Sbjct: 624 SLGTRPITLRTFSSKSATHVFAASDRPAVIYSNNKKLLYSNVNLKEVSHMCPFNSAAFPD 683
Query: 370 SLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEM 429
SLAIA+EGELTIGTIDDIQKLHIR+IP+GEH RRICHQEQ+RTFAI L+N+ AEESE
Sbjct: 684 SLAIAREGELTIGTIDDIQKLHIRTIPIGEHARRICHQEQTRTFAISCLRNEPSAEESES 743
Query: 430 HFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRI 489
HFVRLLD Q+FEF+S+YPLD FE GCSILSCSF+DD NVYYCVGTAYVLPEENEPTKGRI
Sbjct: 744 HFVRLLDAQSFEFLSSYPLDAFECGCSILSCSFTDDKNVYYCVGTAYVLPEENEPTKGRI 803
Query: 490 LVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 549
LVFIVE+G+LQLI EKETKGAVYSLNAFNGKLLA+INQKIQLYKWMLRDDGTRELQSECG
Sbjct: 804 LVFIVEEGRLQLITEKETKGAVYSLNAFNGKLLASINQKIQLYKWMLRDDGTRELQSECG 863
Query: 550 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 609
HHGHILALYVQTRGDFI VGDLMKSISLLIYKHEEGAIEERARDYNANWM+AVEIL+DDI
Sbjct: 864 HHGHILALYVQTRGDFIAVGDLMKSISLLIYKHEEGAIEERARDYNANWMTAVEILNDDI 923
Query: 610 YLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI 669
YLG +N FN+FTV+KN+EGATDEER R+EVVGEYH+GEFVNRFRHGSLVM+LPDSD+GQI
Sbjct: 924 YLGTDNCFNIFTVKKNNEGATDEERARMEVVGEYHIGEFVNRFRHGSLVMKLPDSDIGQI 983
Query: 670 PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDA 729
PTVIFGTV+G+IGVIASLP EQY FLEKLQT+LRKVIKGVGGL+HEQWRSFNNEK+T +A
Sbjct: 984 PTVIFGTVSGMIGVIASLPQEQYAFLEKLQTSLRKVIKGVGGLSHEQWRSFNNEKRTAEA 1043
Query: 730 KNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 774
K +LDGDLIESFLDLSR +M+EISK M+V VEELCKRVEELTRLH
Sbjct: 1044 KGYLDGDLIESFLDLSRGKMEEISKGMDVQVEELCKRVEELTRLH 1088
>sp|A1A4K3|DDB1_BOVIN DNA damage-binding protein 1 OS=Bos taurus GN=DDB1 PE=2 SV=1
Length = 1140
Score = 851 bits (2198), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 431/815 (52%), Positives = 559/815 (68%), Gaps = 56/815 (6%)
Query: 10 QLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVR 69
QL+KLN+ + +GSYV +E + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+R
Sbjct: 332 QLVKLNVDSNEQGSYVVAMETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIR 391
Query: 70 NGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEG 129
NGIGI+E AS++L GIKG+W LRS + D LV+SF+ +TR+L +N E E+EETE+ G
Sbjct: 392 NGIGIHEHASIDLPGIKGLWPLRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMG 450
Query: 130 FCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLL 189
F QT FC + + QL+Q+TS SVRLVS + L +EWK P G +++VA+ N+SQV++
Sbjct: 451 FVDDQQTFFCGNVAHQQLIQITSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVV 510
Query: 190 ATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIF 249
A G L YL+I L ++ H ++E+E++CLDI P+G++ S + A+G+WTDIS RI
Sbjct: 511 AVGRA-LYYLQIHPQELRQISHTEMEHEVACLDITPLGDSNGMSPLCAIGLWTDISARIA 569
Query: 250 SLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKV 309
LP L+ KE LGGEIIPRS+L+ FE YLLCALGDG L F LN++TG L+DRKKV
Sbjct: 570 KLPSFELLHKEMLGGEIIPRSILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKV 629
Query: 310 SLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPD 369
+LGTQP LRTF S +TT+VFA SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PD
Sbjct: 630 TLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPD 689
Query: 370 SLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK---------- 419
SLA+A LTIGTID+IQKLHIR++PL E PR+IC+QE S+ F + S +
Sbjct: 690 SLALANNSTLTIGTIDEIQKLHIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGT 749
Query: 420 -----------------------------NQSCAEESEMHFVRLLDDQTFEFISTYPLDT 450
S EE E+H + ++D TFE + +
Sbjct: 750 TALRPSASTQALSSSVSSSKLFSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQ 809
Query: 451 FEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGA 510
EY S++SC D N Y+ VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGA
Sbjct: 810 NEYALSLVSCKLGKDPNTYFIVGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGA 869
Query: 511 VYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGD 570
VYS+ FNGKLLA+IN ++LY+W +EL++EC H+ +I+ALY++T+GDFI+VGD
Sbjct: 870 VYSMVEFNGKLLASINSTVRLYEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGD 925
Query: 571 LMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGAT 630
LM+S+ LL YK EG EE ARD+N NWMSAVEILDDD +LGAEN FNLF +K+S T
Sbjct: 926 LMRSVLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATT 985
Query: 631 DEERGRLEVVGEYHLGEFVNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPH 689
DEER L+ VG +HLGEFVN F HGSLVM+ L ++ +V+FGTVNG+IG++ SL
Sbjct: 986 DEERQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSE 1045
Query: 690 EQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRM 749
Y L +Q L KVIK VG + H WRSF+ E+KT A F+DGDLIESFLD+SR +M
Sbjct: 1046 SWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKM 1105
Query: 750 DEISKTMN----------VSVEELCKRVEELTRLH 774
E+ + + ++L K VEELTR+H
Sbjct: 1106 QEVVANLQYDDGSGMKREATADDLIKVVEELTRIH 1140
>sp|Q3U1J4|DDB1_MOUSE DNA damage-binding protein 1 OS=Mus musculus GN=Ddb1 PE=1 SV=2
Length = 1140
Score = 851 bits (2198), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 431/815 (52%), Positives = 558/815 (68%), Gaps = 56/815 (6%)
Query: 10 QLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVR 69
QL+KLN+ + +GSYV +E + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+R
Sbjct: 332 QLVKLNVDSNEQGSYVVAMETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIR 391
Query: 70 NGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEG 129
NGIGI+E AS++L GIKG+W LRS D LV+SF+ +TR+L +N E E+EETE+ G
Sbjct: 392 NGIGIHEHASIDLPGIKGLWPLRSDPGRETDDTLVLSFVGQTRVLMLNGE-EVEETELMG 450
Query: 130 FCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLL 189
F QT FC + + QL+Q+TS SVRLVS + L +EWK P G +++VA+ N+SQV++
Sbjct: 451 FVDDQQTFFCGNVAHQQLIQITSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVV 510
Query: 190 ATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIF 249
A G L YL+I L ++ H ++E+E++CLDI P+G++ S + A+G+WTDIS RI
Sbjct: 511 AVGRA-LYYLQIHPQELRQISHTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARIL 569
Query: 250 SLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKV 309
LP L+ KE LGGEIIPRS+L+ FE YLLCALGDG L F LN++TG L+DRKKV
Sbjct: 570 KLPSFELLHKEMLGGEIIPRSILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKV 629
Query: 310 SLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPD 369
+LGTQP LRTF S +TT+VFA SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PD
Sbjct: 630 TLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPD 689
Query: 370 SLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK---------- 419
SLA+A LTIGTID+IQKLHIR++PL E PR+IC+QE S+ F + S +
Sbjct: 690 SLALANNSTLTIGTIDEIQKLHIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDSSGGT 749
Query: 420 -----------------------------NQSCAEESEMHFVRLLDDQTFEFISTYPLDT 450
S EE E+H + ++D TFE + +
Sbjct: 750 TALRPSASTQALSSSVSSSKLFSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQ 809
Query: 451 FEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGA 510
EY S++SC D N Y+ VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGA
Sbjct: 810 NEYALSLVSCKLGKDPNTYFIVGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGA 869
Query: 511 VYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGD 570
VYS+ FNGKLLA+IN ++LY+W +EL++EC H+ +I+ALY++T+GDFI+VGD
Sbjct: 870 VYSMVEFNGKLLASINSTVRLYEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGD 925
Query: 571 LMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGAT 630
LM+S+ LL YK EG EE ARD+N NWMSAVEILDDD +LGAEN FNLF +K+S T
Sbjct: 926 LMRSVLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATT 985
Query: 631 DEERGRLEVVGEYHLGEFVNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPH 689
DEER L+ VG +HLGEFVN F HGSLVM+ L ++ +V+FGTVNG+IG++ SL
Sbjct: 986 DEERQHLQEVGLFHLGEFVNVFCHGSLVMQNLGEASTPTQGSVLFGTVNGMIGLVTSLSE 1045
Query: 690 EQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRM 749
Y L +Q L KVIK VG + H WRSF+ E+KT A F+DGDLIESFLD+SR +M
Sbjct: 1046 SWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKM 1105
Query: 750 DEISKTMN----------VSVEELCKRVEELTRLH 774
E+ + + ++L K VEELTR+H
Sbjct: 1106 QEVVANLQYDDGSGMKREATADDLIKVVEELTRIH 1140
>sp|P33194|DDB1_CHLAE DNA damage-binding protein 1 OS=Chlorocebus aethiops GN=DDB1 PE=1 SV=1
Length = 1140
Score = 850 bits (2196), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 430/815 (52%), Positives = 558/815 (68%), Gaps = 56/815 (6%)
Query: 10 QLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVR 69
QL+KLN+ + +GSYV +E + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+R
Sbjct: 332 QLVKLNVDSNEQGSYVVAMETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIR 391
Query: 70 NGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEG 129
NGIGI+E AS++L GIKG+W LRS + D LV+SF+ +TR+L +N E E+EETE+ G
Sbjct: 392 NGIGIHEHASIDLPGIKGLWPLRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMG 450
Query: 130 FCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLL 189
F QT FC + + QL+Q+TS SVRLVS + L +EWK P +++VA+ N+SQV++
Sbjct: 451 FVDDQQTFFCGNVAHQQLIQITSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVV 510
Query: 190 ATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIF 249
A G L YL+I L ++ H ++E+E++CLDI P+G++ S + A+G+WTDIS RI
Sbjct: 511 AVGRA-LYYLQIHPQELRQISHTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARIL 569
Query: 250 SLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKV 309
LP L+ KE LGGEIIPRS+L+ FE YLLCALGDG L F LN++TG L+DRKKV
Sbjct: 570 KLPSFELLHKEMLGGEIIPRSILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKV 629
Query: 310 SLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPD 369
+LGTQP LRTF S +TT+VFA SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PD
Sbjct: 630 TLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPD 689
Query: 370 SLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK---------- 419
SLA+A LTIGTID+IQKLHIR++PL E PR+IC+QE S+ F + S +
Sbjct: 690 SLALANNSTLTIGTIDEIQKLHIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGT 749
Query: 420 -----------------------------NQSCAEESEMHFVRLLDDQTFEFISTYPLDT 450
S EE E+H + ++D TFE + +
Sbjct: 750 TALRPSASTQALSSSVSSSKLFSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQ 809
Query: 451 FEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGA 510
EY S++SC D N Y+ VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGA
Sbjct: 810 NEYALSLVSCKLGKDPNTYFIVGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGA 869
Query: 511 VYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGD 570
VYS+ FNGKLLA+IN ++LY+W +EL++EC H+ +I+ALY++T+GDFI+VGD
Sbjct: 870 VYSMVEFNGKLLASINSTVRLYEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGD 925
Query: 571 LMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGAT 630
LM+S+ LL YK EG EE ARD+N NWMSAVEILDDD +LGAEN FNLF +K+S T
Sbjct: 926 LMRSVLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATT 985
Query: 631 DEERGRLEVVGEYHLGEFVNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPH 689
DEER L+ VG +HLGEFVN F HGSLVM+ L ++ +V+FGTVNG+IG++ SL
Sbjct: 986 DEERQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSE 1045
Query: 690 EQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRM 749
Y L +Q L KVIK VG + H WRSF+ E+KT A F+DGDLIESFLD+SR +M
Sbjct: 1046 SWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKM 1105
Query: 750 DEISKTMN----------VSVEELCKRVEELTRLH 774
E+ + + ++L K VEELTR+H
Sbjct: 1106 QEVVANLQYDDGSGMKREATADDLIKVVEELTRIH 1140
>sp|Q16531|DDB1_HUMAN DNA damage-binding protein 1 OS=Homo sapiens GN=DDB1 PE=1 SV=1
Length = 1140
Score = 850 bits (2195), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 430/815 (52%), Positives = 558/815 (68%), Gaps = 56/815 (6%)
Query: 10 QLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVR 69
QL+KLN+ + +GSYV +E + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+R
Sbjct: 332 QLVKLNVDSNEQGSYVVAMETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIR 391
Query: 70 NGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEG 129
NGIGI+E AS++L GIKG+W LRS + D LV+SF+ +TR+L +N E E+EETE+ G
Sbjct: 392 NGIGIHEHASIDLPGIKGLWPLRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMG 450
Query: 130 FCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLL 189
F QT FC + + QL+Q+TS SVRLVS + L +EWK P +++VA+ N+SQV++
Sbjct: 451 FVDDQQTFFCGNVAHQQLIQITSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVV 510
Query: 190 ATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIF 249
A G L YL+I L ++ H ++E+E++CLDI P+G++ S + A+G+WTDIS RI
Sbjct: 511 AVGRA-LYYLQIHPQELRQISHTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARIL 569
Query: 250 SLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKV 309
LP L+ KE LGGEIIPRS+L+ FE YLLCALGDG L F LN++TG L+DRKKV
Sbjct: 570 KLPSFELLHKEMLGGEIIPRSILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKV 629
Query: 310 SLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPD 369
+LGTQP LRTF S +TT+VFA SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PD
Sbjct: 630 TLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPD 689
Query: 370 SLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK---------- 419
SLA+A LTIGTID+IQKLHIR++PL E PR+IC+QE S+ F + S +
Sbjct: 690 SLALANNSTLTIGTIDEIQKLHIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGT 749
Query: 420 -----------------------------NQSCAEESEMHFVRLLDDQTFEFISTYPLDT 450
S EE E+H + ++D TFE + +
Sbjct: 750 TALRPSASTQALSSSVSSSKLFSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQ 809
Query: 451 FEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGA 510
EY S++SC D N Y+ VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGA
Sbjct: 810 NEYALSLVSCKLGKDPNTYFIVGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGA 869
Query: 511 VYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGD 570
VYS+ FNGKLLA+IN ++LY+W +EL++EC H+ +I+ALY++T+GDFI+VGD
Sbjct: 870 VYSMVEFNGKLLASINSTVRLYEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGD 925
Query: 571 LMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGAT 630
LM+S+ LL YK EG EE ARD+N NWMSAVEILDDD +LGAEN FNLF +K+S T
Sbjct: 926 LMRSVLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATT 985
Query: 631 DEERGRLEVVGEYHLGEFVNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPH 689
DEER L+ VG +HLGEFVN F HGSLVM+ L ++ +V+FGTVNG+IG++ SL
Sbjct: 986 DEERQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSE 1045
Query: 690 EQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRM 749
Y L +Q L KVIK VG + H WRSF+ E+KT A F+DGDLIESFLD+SR +M
Sbjct: 1046 SWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKM 1105
Query: 750 DEISKTMN----------VSVEELCKRVEELTRLH 774
E+ + + ++L K VEELTR+H
Sbjct: 1106 QEVVANLQYDDGSGMKREATADDLIKVVEELTRIH 1140
>sp|Q805F9|DDB1_CHICK DNA damage-binding protein 1 OS=Gallus gallus GN=DDB1 PE=2 SV=1
Length = 1140
Score = 848 bits (2191), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 430/815 (52%), Positives = 558/815 (68%), Gaps = 56/815 (6%)
Query: 10 QLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVR 69
QL+KLN+ + +GSYV +E + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+R
Sbjct: 332 QLVKLNVDSNEQGSYVVAMETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIR 391
Query: 70 NGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEG 129
NGIGI+E AS++L GIKG+W LRS + D LV+SF+ +TR+L +N E E+EETE+ G
Sbjct: 392 NGIGIHEHASIDLPGIKGLWPLRSDSHREMDNMLVLSFVGQTRVLMLNGE-EVEETELTG 450
Query: 130 FCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLL 189
F QT FC + + QL+Q+TS SVRLVS + L +EWK P G +++VA+ N++QV++
Sbjct: 451 FVDDQQTFFCGNVAHQQLIQITSASVRLVSQEPKALVSEWKEPNGKNISVASCNSNQVVV 510
Query: 190 ATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIF 249
A G L YLEI L ++ ++E+E++CLDI P+G+ S + A+G+WTDIS RI
Sbjct: 511 AVGRA-LYYLEIRPQELRQINCTEMEHEVACLDITPLGDTNGMSPLCAIGLWTDISARIL 569
Query: 250 SLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKV 309
LP L+ KE LGGEIIPRS+L+ FE YLLCALGDG L F L+++TG L+DRKKV
Sbjct: 570 KLPSFELLHKEMLGGEIIPRSILMTTFESSHYLLCALGDGALFYFGLSLETGLLSDRKKV 629
Query: 310 SLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPD 369
+LGTQP LRTF S +TT+VFA SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PD
Sbjct: 630 TLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPD 689
Query: 370 SLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK---------- 419
SLA+A LTIGTID+IQKLHIR++PL E PR+IC+QE S+ F + S +
Sbjct: 690 SLALANNSTLTIGTIDEIQKLHIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDASGGT 749
Query: 420 -----------------------------NQSCAEESEMHFVRLLDDQTFEFISTYPLDT 450
S EE E+H + ++D TFE + +
Sbjct: 750 TALRPSASTQALSSSVSTSKLFSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQ 809
Query: 451 FEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGA 510
EY S++SC D N Y+ VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGA
Sbjct: 810 NEYALSLVSCKLGKDPNTYFIVGTAMVYPEEAEPKQGRIVVFHYSDGKLQSLAEKEVKGA 869
Query: 511 VYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGD 570
VYS+ FNGKLLA+IN ++LY+W +EL++EC H+ +I+ALY++T+GDFI+VGD
Sbjct: 870 VYSMVEFNGKLLASINSTVRLYEWT----AEKELRTECNHYNNIMALYLKTKGDFILVGD 925
Query: 571 LMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGAT 630
LM+S+ LL YK EG EE ARD+N NWMSAVEILDDD +LGAEN FNLF +K+S T
Sbjct: 926 LMRSVLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATT 985
Query: 631 DEERGRLEVVGEYHLGEFVNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPH 689
DEER L+ VG HLGEFVN F HGSLVM+ L ++ +V+FGTVNG+IG++ SL
Sbjct: 986 DEERQHLQEVGLSHLGEFVNVFCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSE 1045
Query: 690 EQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRM 749
Y L +Q L KVIK VG + H WRSF+ E+KT A F+DGDLIESFLD+SR +M
Sbjct: 1046 SWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKM 1105
Query: 750 DEISKTMNV----------SVEELCKRVEELTRLH 774
E+ + + +V++L K VEELTR+H
Sbjct: 1106 QEVVANLQIDDGSGMKREATVDDLIKIVEELTRIH 1140
>sp|Q5R649|DDB1_PONAB DNA damage-binding protein 1 OS=Pongo abelii GN=DDB1 PE=2 SV=1
Length = 1140
Score = 848 bits (2191), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 429/815 (52%), Positives = 557/815 (68%), Gaps = 56/815 (6%)
Query: 10 QLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVR 69
QL+KLN+ + +GSYV +E + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+R
Sbjct: 332 QLVKLNVDSNEQGSYVVAMETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIR 391
Query: 70 NGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEG 129
NGIGI+E AS++L GIKG+W LRS + D LV+SF+ +TR+L +N E E+EETE+ G
Sbjct: 392 NGIGIHEHASIDLPGIKGLWPLRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMG 450
Query: 130 FCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLL 189
F QT FC + + QL+Q+TS SVRLVS + L +EWK P +++VA+ N+SQV++
Sbjct: 451 FVDDQQTFFCGNVAHQQLIQITSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVV 510
Query: 190 ATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIF 249
A G L YL+I L ++ H ++E+E++CLDI P+G++ S + A+G+WTDIS RI
Sbjct: 511 AVGRA-LYYLQIHPQELRQISHTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARIL 569
Query: 250 SLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKV 309
LP L+ KE LGGEIIPRS+L+ FE YLLCALGDG L F LN++TG L+DRKKV
Sbjct: 570 KLPSFELLHKEMLGGEIIPRSILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKV 629
Query: 310 SLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPD 369
+LGTQP LRTF S +TT+VFA SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PD
Sbjct: 630 TLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPD 689
Query: 370 SLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK---------- 419
SLA+A LTIGTID+IQKLHIR++PL E PR+IC+QE S+ F + S +
Sbjct: 690 SLALANNSTLTIGTIDEIQKLHIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGT 749
Query: 420 -----------------------------NQSCAEESEMHFVRLLDDQTFEFISTYPLDT 450
S EE E+H + ++D TFE + +
Sbjct: 750 TALRPSASTQALSSSVSSSKLFSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQ 809
Query: 451 FEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGA 510
EY S++SC D N Y+ VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGA
Sbjct: 810 NEYALSLVSCKLGKDPNTYFIVGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGA 869
Query: 511 VYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGD 570
VY + FNGKLLA+IN ++LY+W +EL++EC H+ +I+ALY++T+GDFI+VGD
Sbjct: 870 VYPMVEFNGKLLASINSTVRLYEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGD 925
Query: 571 LMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGAT 630
LM+S+ LL YK EG EE ARD+N NWMSAVEILDDD +LGAEN FNLF +K+S T
Sbjct: 926 LMRSVLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATT 985
Query: 631 DEERGRLEVVGEYHLGEFVNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPH 689
DEER L+ VG +HLGEFVN F HGSLVM+ L ++ +V+FGTVNG+IG++ SL
Sbjct: 986 DEERQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSE 1045
Query: 690 EQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRM 749
Y L +Q L KVIK VG + H WRSF+ E+KT A F+DGDLIESFLD+SR +M
Sbjct: 1046 SWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKM 1105
Query: 750 DEISKTMN----------VSVEELCKRVEELTRLH 774
E+ + + ++L K VEELTR+H
Sbjct: 1106 QEVVANLQYDDGSGMKREATADDLIKVVEELTRIH 1140
>sp|Q6P6Z0|DDB1_XENLA DNA damage-binding protein 1 OS=Xenopus laevis GN=ddb1 PE=2 SV=1
Length = 1140
Score = 847 bits (2189), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 433/815 (53%), Positives = 559/815 (68%), Gaps = 56/815 (6%)
Query: 10 QLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVR 69
QL+KL + + +GSYV V+E + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+R
Sbjct: 332 QLVKLTTESNEQGSYVVVMETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIR 391
Query: 70 NGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEG 129
NGIGI+E AS++L GIKG+W LR + D D LV+SF+ +TR+L + E E+EET++ G
Sbjct: 392 NGIGIHEHASIDLPGIKGLWPLRVAADRDTDDTLVLSFVGQTRVLTLTGE-EVEETDLAG 450
Query: 130 FCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLL 189
F QT FC + + QL+Q+TS SVRLVS + L +EWK P G V+V + N+ QVLL
Sbjct: 451 FVDDQQTFFCGNVAHQQLIQITSASVRLVSQNPQNLVSEWKEPQGRKVSVCSCNSRQVLL 510
Query: 190 ATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIF 249
A G L YLEI G L + ++E+E++CLD+ P+G N + S + A+G+WTDIS RI
Sbjct: 511 AVGR-VLYYLEIHPGELRQTSCTEMEHEVACLDVTPLGGNDTLSSLCAIGLWTDISARIL 569
Query: 250 SLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKV 309
SLP L+ KE LGGEIIPRS+L+ +FE YLLCALGDG L F LN TG L+DRKKV
Sbjct: 570 SLPGFQLLHKEMLGGEIIPRSILMTSFESSHYLLCALGDGALFYFSLNTDTGLLSDRKKV 629
Query: 310 SLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPD 369
+LGTQP LRTF S +TT+VFA SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PD
Sbjct: 630 TLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSEGYPD 689
Query: 370 SLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ-------- 421
SLA+A LTIGTID+IQKLHIR++PL E PR+IC+QE S+ F + S + +
Sbjct: 690 SLALANNSTLTIGTIDEIQKLHIRTVPLFESPRKICYQEVSQCFGVLSSRIEVQDASGGS 749
Query: 422 ----------------SCA---------------EESEMHFVRLLDDQTFEFISTYPLDT 450
SC+ EE E+H + ++D TFE + T+
Sbjct: 750 SPLRPSASTQALSSSVSCSKLFSGSTSPHETSFGEEVEVHNLLIIDQHTFEVLHTHQFLQ 809
Query: 451 FEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGA 510
EY S++SC D Y+ VGTA V P+E EP +GRI+VF DGKLQ +AEKE KGA
Sbjct: 810 NEYTLSLVSCKLGKDPTTYFVVGTAMVYPDEAEPKQGRIVVFQYNDGKLQTVAEKEVKGA 869
Query: 511 VYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGD 570
VYS+ FNGKLLA+IN ++LY+W +EL++EC H+ +I+ALY++T+GDFI+VGD
Sbjct: 870 VYSMVEFNGKLLASINSTVRLYEWT----AEKELRTECNHYNNIMALYLKTKGDFILVGD 925
Query: 571 LMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGAT 630
LM+S+ LL YK EG EE ARD+N NWMSAVEILDDD +LGAEN FNLF +K+S T
Sbjct: 926 LMRSVLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATT 985
Query: 631 DEERGRLEVVGEYHLGEFVNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPH 689
DEER L+ VG +HLGEFVN F HGSLVM+ L ++ +V+FGTVNG+IG++ SL
Sbjct: 986 DEERQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSPPTQGSVLFGTVNGMIGLVTSLSE 1045
Query: 690 EQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRM 749
Y L +Q L KVIK VG + H WRSF+ E+KT A F+DGDLIESFLD+SR +M
Sbjct: 1046 SWYNLLLDVQNRLNKVIKSVGKIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKM 1105
Query: 750 DEISKTMNV----------SVEELCKRVEELTRLH 774
E+ + + +V++L K VEELTR+H
Sbjct: 1106 QEVIANLQIDDGSGMKRETTVDDLIKVVEELTRIH 1140
>sp|Q9ESW0|DDB1_RAT DNA damage-binding protein 1 OS=Rattus norvegicus GN=Ddb1 PE=2 SV=1
Length = 1140
Score = 838 bits (2164), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 423/815 (51%), Positives = 555/815 (68%), Gaps = 56/815 (6%)
Query: 10 QLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVR 69
Q +KLN+ + +GSYV +E + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+R
Sbjct: 332 QPVKLNVDSNEQGSYVVAMETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIR 391
Query: 70 NGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEG 129
NGIGI+E AS++L GIKG+W LRS + D LV+SF+ +TR+L +N E E+EETE+ G
Sbjct: 392 NGIGIHEHASIDLPGIKGLWPLRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMG 450
Query: 130 FCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLL 189
F QT FC + + QL+Q+TS SVRLVS + L +EWK P +++VA+ N+SQV++
Sbjct: 451 FVDDQQTFFCGNVAHQQLIQITSASVRLVSQEPKALVSEWKEPRAKNISVASCNSSQVVV 510
Query: 190 ATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIF 249
A G L YL+I L ++ H ++E+E++CLD+ P+G++ S + A+G+WTDIS RI
Sbjct: 511 AVGRA-LYYLQIHPQELRQISHTEMEHEVACLDVTPLGDSNGLSPLCAIGLWTDISARIL 569
Query: 250 SLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKV 309
LP L+ KE LGGEIIPRS+L+ FE YLLCALGDG L F LN++TG L+DRKKV
Sbjct: 570 KLPSFELLHKEMLGGEIIPRSILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKV 629
Query: 310 SLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPD 369
+LGTQP LRTF S +TT+VFA SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PD
Sbjct: 630 TLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPD 689
Query: 370 SLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK---------- 419
SLA+A LTIGT+++IQKLHIR++P+ E PR+IC+QE S+ F + S +
Sbjct: 690 SLALANTSTLTIGTMNEIQKLHIRTVPIYESPRKICYQEVSQCFGVLSTRIEVQDTSGGT 749
Query: 420 -----------------------------NQSCAEESEMHFVRLLDDQTFEFISTYPLDT 450
S EE E+H + ++D TFE + +
Sbjct: 750 TALRPSASTQALSSSVSSSKLFSSSAAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQ 809
Query: 451 FEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGA 510
EY S++SC D N Y+ VGTA V PEE EP +GRI+VF GKLQ +AEKE KGA
Sbjct: 810 NEYALSLVSCKLGKDPNTYFIVGTAMVYPEEAEPKQGRIVVFQYSGGKLQTVAEKEVKGA 869
Query: 511 VYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGD 570
VYS+ FNGKLLA+IN ++LY+W +EL++EC H+ +I+ALY++T+GDFI+VGD
Sbjct: 870 VYSMVEFNGKLLASINSTVRLYEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGD 925
Query: 571 LMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGAT 630
LM+S+ LL YK EG EE ARD+N NWMSAVEILDDD +LGAEN FNLF +K+S T
Sbjct: 926 LMRSVLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATT 985
Query: 631 DEERGRLEVVGEYHLGEFVNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPH 689
DEER L+ VG +HLGEFVN F HGSLVM+ L ++ +V+ GTVNG+IG++ SL
Sbjct: 986 DEERQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGSVLLGTVNGMIGLVTSLSE 1045
Query: 690 EQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRM 749
Y L +Q L KVIK VG + H WRSF+ E+KT A F+DGDLIESFLD+SR +M
Sbjct: 1046 SWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKM 1105
Query: 750 DEISKTMN----------VSVEELCKRVEELTRLH 774
E+ + + ++L K VEELTR+H
Sbjct: 1106 QEVVANLQYDDGSGMKREATADDLIKVVEELTRIH 1140
>sp|Q9XYZ5|DDB1_DROME DNA damage-binding protein 1 OS=Drosophila melanogaster GN=pic PE=1
SV=1
Length = 1140
Score = 733 bits (1893), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 390/821 (47%), Positives = 529/821 (64%), Gaps = 67/821 (8%)
Query: 10 QLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVR 69
QL++LN + GSYV +E + NL PI+D VVDL+RQGQGQ++TCSG++KDGSLRI+R
Sbjct: 331 QLVRLNSEA-IDGSYVVPVENFTNLAPILDIAVVDLDRQGQGQIITCSGSFKDGSLRIIR 389
Query: 70 NGIGINEQASVELQGIKGMWSLRSSTDD-PFDTFLVVSFISETRILAMNLEDELEETEIE 128
GIGI E A ++L GIKGMWSL+ D+ P++ LV++F+ TRIL ++ E E+EETEI
Sbjct: 390 IGIGIQEHACIDLPGIKGMWSLKVGVDESPYENTLVLAFVGHTRILTLSGE-EVEETEIP 448
Query: 129 GFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVL 188
GF S QT C + Y+QL+QVTS SVRLVSS ++ L EW+ ++ V + N +Q+L
Sbjct: 449 GFASDLQTFLCSNVDYDQLIQVTSDSVRLVSSATKALVAEWRPTGDRTIGVVSCNTTQIL 508
Query: 189 LATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRI 248
+A+ + Y+ I DG L E L YE++CLDI P+ E S + AVG+WTDIS I
Sbjct: 509 VASAC-DIFYIVIEDGSLREQSRRTLAYEVACLDITPLDETQKKSDLVAVGLWTDISAVI 567
Query: 249 FSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKK 308
SLPDL I E L GEIIPRS+L+ FEGI YLLCALGDG + F+++ TG+LTD+KK
Sbjct: 568 LSLPDLETIYTEKLSGEIIPRSILMTTFEGIHYLLCALGDGSMYYFIMDQTTGQLTDKKK 627
Query: 309 VSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFP 368
V+LGTQP TLRTF S +TT+VFA SDRPTVIYSSN KL++SNVNLKEV+HMC N+ A+P
Sbjct: 628 VTLGTQPTTLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAQAYP 687
Query: 369 DSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK--------- 419
DSLA+A + + +GTID+IQKLHIR++PLGE PRRI +QE S+TFA+ +L+
Sbjct: 688 DSLALANKNAVILGTIDEIQKLHIRTVPLGEGPRRIAYQESSQTFAVSTLRIDVHGRGGA 747
Query: 420 --------------------------------NQSCAEESEMHFVRLLDDQTFEFISTYP 447
N +E ++H + ++D TFE + +
Sbjct: 748 KPLRNSASTQAQNITCSSNFLPKPGGGNSTAANAEVGQEIDVHNLLVIDQNTFEVLHAHQ 807
Query: 448 LDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKET 507
E S++S DD N YY V T+ V+PEE EP GRI++F + KL +AE +
Sbjct: 808 FVAPETISSLMSAKLGDDPNTYYVVATSLVIPEEPEPKVGRIIIFHYHENKLTQVAETKV 867
Query: 508 KGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIV 567
G Y+L FNGK+LA I ++LY+W +EL+ EC I AL+++ +GDFI+
Sbjct: 868 DGTCYALVEFNGKVLAGIGSFVRLYEWT----NEKELRMECNIQNMIAALFLKAKGDFIL 923
Query: 568 VGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSE 627
VGDLM+SI+LL +K EG E ARD WM AVEILDDD +LG+E N NLF +K+S
Sbjct: 924 VGDLMRSITLLQHKQMEGIFVEIARDCEPKWMRAVEILDDDTFLGSETNGNLFVCQKDSA 983
Query: 628 GATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT-----VIFGTVNGVIG 682
TDEER L + +HLG+ VN FRHGSLVM+ +VG+ T V++GT NG IG
Sbjct: 984 ATTDEERQLLPELARFHLGDTVNVFRHGSLVMQ----NVGERTTPINGCVLYGTCNGAIG 1039
Query: 683 VIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFL 742
++ +P + Y FL L+ L+K+IK VG + H +R+F K ++ F+DGDLIESFL
Sbjct: 1040 IVTQIPQDFYDFLHGLEERLKKIIKSVGKIEHTYYRNFQINSKVEPSEGFIDGDLIESFL 1099
Query: 743 DLSRTRMDEISKTMNVS---------VEELCKRVEELTRLH 774
DLSR +M + + + ++ VE++ K VE+LTR+H
Sbjct: 1100 DLSRDKMRDAVQGLELTLNGERKSADVEDVIKIVEDLTRMH 1140
>sp|B0M0P5|DDB1_DICDI DNA damage-binding protein 1 OS=Dictyostelium discoideum GN=repE PE=1
SV=1
Length = 1181
Score = 651 bits (1680), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/827 (43%), Positives = 527/827 (63%), Gaps = 74/827 (8%)
Query: 10 QLIKLNLQPD-AKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIV 68
QLI+LN + D SYV LE + N+GP+VDFCVVD E+QGQ Q+VTCSG Y+DGSLRI+
Sbjct: 362 QLIRLNTEKDQTTDSYVTYLEAFTNIGPVVDFCVVDAEKQGQAQIVTCSGTYRDGSLRII 421
Query: 69 RNGIGINEQASVELQGIKGMWSL---------------------RSSTDDPFDTFLVVSF 107
RNGIGI EQAS+EL+GIKG++ + + D D +L+ SF
Sbjct: 422 RNGIGIAEQASIELEGIKGIFPINNNNNNNNNNNNNNNNNNNNNSNGITDSKDRYLITSF 481
Query: 108 ISETRILAMNLEDELEETEIEGFCSQTQTLFCH--DAIYNQLVQVTSGSVRLVSSTSREL 165
I T++L+ +E+EETE EG S TL+C D + N L+Q+T+ S+ L+ S + +
Sbjct: 482 IECTKVLSFQ-GEEIEETEFEGLESNCSTLYCGTIDKL-NLLIQITNVSINLIDSNTFKR 539
Query: 166 RNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI--GDGILTEVKHAQLEYEISCLDI 223
++W P +N+ + N Q++L+ L+Y +I + + VK +L +EISC+DI
Sbjct: 540 VSQWNVEPSRRINLVSTNQDQIVLSIDKS-LLYFQINSSNKSIQLVKEIELPHEISCIDI 598
Query: 224 NPIGE-NPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYL 282
+P + SQ+ +VG+W DI++RIF LP L I KE LGGEI+PRS+L+ +F+ I Y+
Sbjct: 599 SPFDSFMDTKSQLVSVGLWNDITLRIFKLPTLEEIWKEPLGGEILPRSILMISFDSIDYI 658
Query: 283 LCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSS 342
C+LGDGHL F + + +L D++K++LGTQPI L+ F KNT ++FA SDRPTVIYS
Sbjct: 659 FCSLGDGHLFKFQFDFSSFKLFDKRKLTLGTQPIILKKFKLKNTINIFAISDRPTVIYSH 718
Query: 343 NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEH-P 401
NKKL YS VNLK+V+++ FNS FP+S+AIA LTIGTID+IQKLHI++IPL E
Sbjct: 719 NKKLFYSVVNLKDVTNVTSFNSDGFPNSMAIATTNSLTIGTIDEIQKLHIKTIPLNEEMG 778
Query: 402 RRICHQEQSRTFAICSLKNQS---------CAEESEMHFVRLLDDQTFEFISTYPLDTFE 452
RRI H E +A+ ++KN C E+ E+ ++R+ +DQTFE IS+Y LD +E
Sbjct: 779 RRIVHLEDHSCYAVITVKNNEGLLGGAQDLCEEDEEVSYIRIYNDQTFELISSYKLDPYE 838
Query: 453 YGCSILSCSFS-DDSNVYYCVGTAYVLPEENEPTK--GRILVFIV--------------- 494
G SI C F+ DD N Y VGT+ N P K GR+L+F +
Sbjct: 839 MGWSITPCKFAGDDVNTYLAVGTSI-----NTPIKSSGRVLLFSLSSSSSSNDKDSLDNN 893
Query: 495 --------EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWM-LRDDGTRELQ 545
+GKL L+ E + + +VY L +FNG+L+AA+++++ ++ ++ + +
Sbjct: 894 NNNNNNSGANGKLTLLEEIKFRSSVYFLLSFNGRLIAAVHKRLFSIRYTHSKEKNCKVIS 953
Query: 546 SECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEIL 605
SE H GH + L + +RG FI+VGD+MKS+SLL+ + +G++E+ AR+ W+ +V ++
Sbjct: 954 SESVHKGHTMILKLASRGHFILVGDMMKSMSLLV-EQSDGSLEQIARNPQPIWIRSVAMI 1012
Query: 606 DDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSD 665
+DD ++GAE + N V+KN++ + ER L+ VG YH+GE +N RHGSLV RLPDSD
Sbjct: 1013 NDDYFIGAEASNNFIVVKKNNDSTNELERELLDSVGHYHIGESINSMRHGSLV-RLPDSD 1071
Query: 666 VGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK 725
IPT+++ +VNG IGV+AS+ E ++F KLQ L +V++GVGG +HE WR+F+N+
Sbjct: 1072 QPIIPTILYASVNGSIGVVASISEEDFIFFSKLQKGLNQVVRGVGGFSHETWRAFSNDHH 1131
Query: 726 TVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTR 772
T+D+KNF+DGDLIE+FLDL + + ++ ++ +R+E L +
Sbjct: 1132 TIDSKNFIDGDLIETFLDLKYESQLKAVADLGITPDDAFRRIESLMQ 1178
>sp|Q21554|DDB1_CAEEL DNA damage-binding protein 1 OS=Caenorhabditis elegans GN=ddb-1 PE=1
SV=2
Length = 1134
Score = 473 bits (1217), Expect = e-132, Method: Compositional matrix adjust.
Identities = 282/818 (34%), Positives = 451/818 (55%), Gaps = 65/818 (7%)
Query: 10 QLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVR 69
QLI+L +P+ GSY +LE Y N+GPI D +V E GQ Q+VTC+GA KDGSLR++R
Sbjct: 329 QLIRLMTEPNG-GSYSVILETYSNIGPIRDMVMV--ESDGQPQLVTCTGADKDGSLRVIR 385
Query: 70 NGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEG 129
NGIGI+E ASV+L G+ G++ +R D D +++VS ET +L + E ELE+ ++
Sbjct: 386 NGIGIDELASVDLAGVVGIFPIR--LDSNADNYVIVSLSDETHVLQITGE-ELEDVKLLE 442
Query: 130 FCSQTQTLFCHDAIYNQ----LVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANAS 185
+ T+F ++Q T +RL+SS+ L W+ G ++ + NA+
Sbjct: 443 INTDLPTIFASTLFGPNDSGIILQATEKQIRLMSSSG--LSKFWEPTNGEIISKVSVNAA 500
Query: 186 QVLLATGGGHLVYL------EIGDGILTEVKHAQLEYEISCLDINPIGENPS-YSQIAAV 238
+ VYL E+G + + E EI+CLD++ G++P+ + +
Sbjct: 501 NGQIVLAARDTVYLLTCIVDEMGALDIQLTAEKKFENEIACLDLSNEGDDPNNKATFLVL 560
Query: 239 GMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNM 298
W+ ++ + LPDL + L +IIPRS++ E + YLL A GDG L+ ++ ++
Sbjct: 561 AFWSTFAMEVIQLPDLITVCHTDLPTKIIPRSIIATCIEEVHYLLVAFGDGALVYYVFDI 620
Query: 299 KTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSH 358
KTG + KK ++GT+P +L +KN H+F SDRP +I+S++KKL++SNVN+K V
Sbjct: 621 KTGTHGEPKKSNVGTRPPSLHRVRNKNRQHLFVCSDRPVIIFSASKKLVFSNVNVKLVDT 680
Query: 359 MCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSL 418
+C +S+A+ D L I+ + GT+DDIQK+H+RSIP+GE RI +Q+ + T+ +CS
Sbjct: 681 VCSLSSSAYRDCLVISDGNSMVFGTVDDIQKIHVRSIPMGESVLRIAYQKSTSTYGVCSN 740
Query: 419 KNQSCAEE---SEMHFVR--------------------------LLDDQTFEFISTYPLD 449
+ +S AE S+ V +LD TF+ + ++
Sbjct: 741 RTESKAERVFASKNALVTSQSRPKVASTRADMDESPPNTTSSFMVLDQNTFQVLHSHEFG 800
Query: 450 TFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVED---GKLQLIAEKE 506
+E S +S F++DS+ YY VGT + P+E E GRI+VF V+D KL+ + E
Sbjct: 801 PWETALSCISGQFTNDSSTYYVVGTGLIYPDETETKIGRIVVFEVDDVERSKLRRVHELV 860
Query: 507 TKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFI 566
+G+ ++ NGKL+AAIN I+L++W +EL+ EC H++AL ++ + +
Sbjct: 861 VRGSPLAIRILNGKLVAAINSSIRLFEWTT----DKELRLECSSFNHVIALDLKVMNEEV 916
Query: 567 VVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR-KN 625
V D+M+S+SLL Y+ EG EE A+D+N+ WM E + + LG E + NLFTV
Sbjct: 917 AVADVMRSVSLLSYRMLEGNFEEVAKDWNSQWMVTCEFITAESILGGEAHLNLFTVEVDK 976
Query: 626 SEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIA 685
+ TD+ R LE G ++LGE +LV++ DS + ++FGT G IG+I
Sbjct: 977 TRPITDDGRYVLEPTGYWYLGELPKVMTRSTLVIQPEDSIIQYSQPIMFGTNQGTIGMIV 1036
Query: 686 SLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLS 745
+ + FL ++ + +K + H +R+F +K+ F+DGDL+ES LD+
Sbjct: 1037 QIDDKWKKFLIAIEKAIADSVKNCMHIEHSSYRTFVFQKRAEPPSGFVDGDLVESILDMD 1096
Query: 746 RT-RMDEISKTMNVSVE--------ELCKRVEELTRLH 774
R+ MD +SK + + E+ K +E+L R+H
Sbjct: 1097 RSVAMDILSKVSDKGWDPSLPRDPVEILKVIEDLARMH 1134
>sp|O13807|DDB1_SCHPO DNA damage-binding protein 1 OS=Schizosaccharomyces pombe (strain 972
/ ATCC 24843) GN=ddb1 PE=1 SV=1
Length = 1072
Score = 287 bits (734), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 215/772 (27%), Positives = 399/772 (51%), Gaps = 70/772 (9%)
Query: 25 VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 84
+E+L+ +VN+ PI DF + D Q ++TCSGAYKDG+LRI+RN I I A +E++G
Sbjct: 348 LEILQNFVNIAPISDFIIDD--DQTGSSIITCSGAYKDGTLRIIRNSINIENVALIEMEG 405
Query: 85 IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDA-- 142
IK +S+ + +D ++ +S I ETR + ++ EG S L C ++
Sbjct: 406 IKDFFSVSFRAN--YDNYIFLSLICETRAIIVSP---------EGVFSANHDLSCEESTI 454
Query: 143 ----IY--NQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHL 196
IY +Q++Q+T+ +RL ++L + W SP S+ ++ A V +A GG +
Sbjct: 455 FVSTIYGNSQILQITTKEIRLFD--GKKLHS-WISP--MSITCGSSFADNVCVAVAGGLI 509
Query: 197 VYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWT-DISVRIFSLPDLN 255
++ E GI TEV Q + E+S L EN Y VG+W+ DI + + ++
Sbjct: 510 LFFE---GI-TEVGRYQCDTEVSSLCFTE--ENVVY-----VGLWSADIIMLTYCQDGIS 558
Query: 256 LITKEHLGGEIIPRSVLLCAFEGIS--YLLCALGDGHLLNFLLNMKTGELTDR--KKVSL 311
L L IPRS++ G L + +G++L F N + G++ + ++ L
Sbjct: 559 LTHSLKLTD--IPRSIVYSQKYGDDGGTLYVSTNNGYVLMF--NFQNGQVIEHSLRRNQL 614
Query: 312 GTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSL 371
G PI L+ F SK +FA ++P ++Y + KL+ + ++ E+ ++ + + + ++
Sbjct: 615 GVAPIILKHFDSKEKNAIFALGEKPQLMYYESDKLVITPLSCTEMLNISSYVNPSLGVNM 674
Query: 372 AIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCA--EESEM 429
+++ + +I+ L+++++ + PRRIC F +C +S E+ +
Sbjct: 675 LYCTNSYISLAKMSEIRSLNVQTVSVKGFPRRICSNSLFY-FVLCMQLEESIGTQEQRLL 733
Query: 430 HFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRI 489
F+R+ + T I+ + + +E SI+ +DD V VGT + P+++ P GR+
Sbjct: 734 SFLRVYEKNTLSEIAHHKFNEYEMVESII--LMNDDKRV--VVGTGFNFPDQDAPDSGRL 789
Query: 490 LVF-IVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC 548
+VF + D +++ AE + +G+V +L + ++A IN + ++++ + GT +++
Sbjct: 790 MVFEMTSDNNIEMQAEHKVQGSVNTLVLYKHLIVAGINASVCIFEY---EHGTMHVRNSI 846
Query: 549 GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDD 608
+ + + V D I+ DLMKSI++L + ++ + E ARDY+ W ++VEIL +
Sbjct: 847 RTPTYTIDISVNQ--DEIIAADLMKSITVLQFIDDQ--LIEVARDYHPLWATSVEILSER 902
Query: 609 IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQ 668
Y E + N + +++ +R +L +++LGE +N+ RH + + P
Sbjct: 903 KYFVTEADGNAVILLRDNVSPQLSDRKKLRWYKKFYLGELINKTRHCTFIE--PQDKSLV 960
Query: 669 IPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD 728
P ++ TV+G + ++ L +LQ N+RKVI GGL+H++W+ + E +T
Sbjct: 961 TPQLLCATVDGSLMIVGDAGMSNTPLLLQLQDNIRKVIPSFGGLSHKEWKEYRGENET-S 1019
Query: 729 AKNFLDGDLIESFLDLSRTRMDEI------SKTMNVSVEELCKRVEELTRLH 774
+ +DG LIES L L ++EI +++SV++L +E L +LH
Sbjct: 1020 PSDLIDGSLIESILGLREPILNEIVNGGHEGTKLDISVQDLKSIIENLEKLH 1071
>sp|Q54SA7|SF3B3_DICDI Probable splicing factor 3B subunit 3 OS=Dictyostelium discoideum
GN=sf3b3 PE=3 SV=1
Length = 1256
Score = 206 bits (524), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 206/857 (24%), Positives = 365/857 (42%), Gaps = 141/857 (16%)
Query: 33 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSL 91
+L PI+DF V+DL R+ Q+ + G + SL+++R+G+ + + L G+ G+W++
Sbjct: 416 SLSPIIDFKVLDLVREENPQLYSLCGTGLNSSLKVLRHGLSVTTITTANLPGVPSGIWTV 475
Query: 92 RSSTD----DPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQL 147
ST D D ++VVSF+ T +L++ D ++E G T TL + +
Sbjct: 476 PKSTSPNAIDQTDKYIVVSFVGTTSVLSVG--DTIQENHESGILETTTTLLVKSMGDDAI 533
Query: 148 VQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGI-L 206
+QV R + S R NEW++P ++ A+AN SQ+ +A GG ++Y E+ L
Sbjct: 534 IQVFPTGFRHIKSDLR--INEWRAPGRKTIVRASANQSQLAIALSGGEIIYFELDQASNL 591
Query: 207 TEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL--GG 264
E+ L +I+C++I+PI + + ++ AV W +R+ SL N + + +
Sbjct: 592 IEIIKKDLRRDIACIEISPIPKGRNMARFIAVSDWEG-PIRVLSLDRDNCLGQVSMLDTD 650
Query: 265 EIIPRSVLLCAFE----GIS-------------------------------YLLCALGDG 289
++ S+ + + GI +L L +G
Sbjct: 651 KVYIESLSIIEMQLNEMGIETKKSQSQTGQTTTTTTSTSSASSSVTSGGSLFLFVGLKNG 710
Query: 290 HLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIY--------- 340
+ L+ TGEL+D + LG +P+ L + + + A S R + Y
Sbjct: 711 VVKRATLDSVTGELSDIRTRLLGRKPVKLFKVKVRGSNAMLALSSRVWLNYINQGKLDIV 770
Query: 341 -------------------------SSNKKLLYSNVNLKEVSHMCPFNSAAFPD------ 369
S NK +++S L ++ + A P
Sbjct: 771 PLSIEPLENASNLSSEQSAESIVATSENKIIIFSIDKLGDLFNQETIKLNATPKRFIIHP 830
Query: 370 --SLAIAKEGELTIGTID-DIQKLHIRSIPLGEHPRRICHQEQSRTF----------AIC 416
S I E E T + DI K++ +S L ++ QE
Sbjct: 831 QTSYIIILETETNYNTDNIDIDKINEQSEKLLLEKQKELQQEMDIDDDDQNNNNEIEPFK 890
Query: 417 SLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVG--T 474
L + +++++D T E + + L+ E G S+ +CSF + ++ VG T
Sbjct: 891 KLFKPKAGKGKWKSYIKIMDPITHESLESLMLEDGEAGFSVCTCSFGESGEIFLVVGCVT 950
Query: 475 AYVL-PEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYK 533
VL P+ ++ + FI KL+L+ + E + VY++ F GKL+ + + I++Y
Sbjct: 951 DMVLNPKSHKSAHLNLYRFIDGGKKLELLYKTEVEEPVYAMAQFQGKLVCGVGKSIRIY- 1009
Query: 534 WMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERAR 592
D G ++L +C + + + GD +VVGD+ +SI + YK E + A
Sbjct: 1010 ----DMGKKKLLRKCETKNLPNTIVNIHSLGDRLVVGDIQESIHFIKYKRSENMLYVFAD 1065
Query: 593 DYNANWMSAVEILDDDIYLGAENNFNLFTVR------------------KNSEGATDEER 634
D WM++ +LD D GA+ N+F +R K G +
Sbjct: 1066 DLAPRWMTSSVMLDYDTVAGADKFGNIFVLRLPLLISDEVEEDPTGTKLKFESGTLNGAP 1125
Query: 635 GRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG-VIASLPHEQYL 693
+L+ + + +G+ V SLV VG +++ T++G IG +I E
Sbjct: 1126 HKLDHIANFFVGDTVTTLNKTSLV-------VGGPEVILYTTISGAIGALIPFTSREDVD 1178
Query: 694 FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEIS 753
F L+ N+R + G +H +RS+ KN +DGDL E F L+ + IS
Sbjct: 1179 FFSTLEMNMRSDCLPLCGRDHLAYRSY-----YFPVKNIIDGDLCEQFSTLNYQKQLSIS 1233
Query: 754 KTMNVSVEELCKRVEEL 770
+ ++ S E+ K++EE+
Sbjct: 1234 EELSRSPSEVIKKLEEI 1250
>sp|Q15393|SF3B3_HUMAN Splicing factor 3B subunit 3 OS=Homo sapiens GN=SF3B3 PE=1 SV=4
Length = 1217
Score = 200 bits (509), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 213/866 (24%), Positives = 378/866 (43%), Gaps = 136/866 (15%)
Query: 3 TFYVLPKQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFC-VVDLERQGQGQVVTCSGAYK 61
TF+ P+ L L L ++ +L PI+ FC + DL + Q+ G
Sbjct: 384 TFFFQPRPLKNLVL-----------VDELDSLSPIL-FCQIADLANEDTPQLYVACGRGP 431
Query: 62 DGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLED 120
SLR++R+G+ ++E A EL G +W++R +D FD +++VSF++ T +L++ +
Sbjct: 432 RSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRRHIEDEFDAYIIVSFVNATLVLSIG--E 489
Query: 121 ELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVA 180
+EE GF T TL C + LVQV +R + + R NEWK+P ++
Sbjct: 490 TVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPDGIRHIRADKR--VNEWKTPGKKTIVKC 547
Query: 181 TANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQLEYEISCLDINPIGENPSYSQIAAV 238
N QV++A GG LVY E+ G L E + ++ ++ C+ + + S+ AV
Sbjct: 548 AVNQRQVVIALTGGELVYFEMDPSGQLNEYTERKEMSADVVCMSLANVPPGEQRSRFLAV 607
Query: 239 GMWTDISVRIFSLPDLNLITKEHLGGEIIP-RSVLLCAFE----------------GISY 281
G+ D +VRI SL + + + L + +P + LC E G Y
Sbjct: 608 GL-VDNTVRIISLDPSDCL--QPLSMQALPAQPESLCIVEMGGTEKQDELGERGSIGFLY 664
Query: 282 LLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYS 341
L L +G LL +L+ TG+L+D + LG++P+ L + V A S R + YS
Sbjct: 665 LNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRPVKLFRVRMQGQEAVLAMSSRSWLSYS 724
Query: 342 SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIR-SIPLGEH 400
+ + ++ + + F S P+ + L I ++ + + + + PL
Sbjct: 725 YQSRFHLTPLSYETLEFASGFASEQCPEGIVAISTNTLRILALEKLGAVFNQVAFPLQYT 784
Query: 401 PRR-ICHQEQSR-----------TFAICSLKNQSCAEE-------------SEM------ 429
PR+ + H E + T A + + Q AEE +EM
Sbjct: 785 PRKFVIHPESNNLIIIETDHNAYTEATKAQRKQQMAEEMVEAAGEDERELAAEMAAAFLN 844
Query: 430 -------------------HFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYY 470
+R+++ + L+ E S+ C FS+ +Y
Sbjct: 845 ENLPESIFGAPKAGNGQWASVIRVMNPIQGNTLDLVQLEQNEAAFSVAVCRFSNTGEDWY 904
Query: 471 C-VGTAYVLPEENEPTKGRILVF--IVEDG-KLQLIAEKETKGAVYSLNAFNGKLLAAIN 526
VG A L G + +V +G KL+ + + + ++ F G++L +
Sbjct: 905 VLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKLEFLHKTPVEEVPAAIAPFQGRVLIGVG 964
Query: 527 QKIQLYKWMLRDDGTRELQSECGHHGHILALY---VQTRGDFIVVGDLMKSISLLIYKHE 583
+ +++Y D G ++L +C + HI A Y +QT G ++V D+ +S + YK
Sbjct: 965 KLLRVY-----DLGKKKLLRKC-ENKHI-ANYISGIQTIGHRVIVSDVQESFIWVRYKRN 1017
Query: 584 EGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNSEGATDE--------- 632
E + A D W++ +LD D GA+ N+ VR N+ DE
Sbjct: 1018 ENQLIIFADDTYPRWVTTASLLDYDTVAGADKFGNICVVRLPPNTNDEVDEDPTGNKALW 1077
Query: 633 ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIA 685
+RG L EV+ YH+GE V + +L+ G ++++ T++G IG++
Sbjct: 1078 DRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI-------PGGSESLVYTTLSGGIGILV 1130
Query: 686 SL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDL 744
HE + F + ++ +LR + G +H +RS+ KN +DGDL E F +
Sbjct: 1131 PFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSFRSY-----YFPVKNVIDGDLCEQFNSM 1185
Query: 745 SRTRMDEISKTMNVSVEELCKRVEEL 770
+ +S+ ++ + E+ K++E++
Sbjct: 1186 EPNKQKNVSEELDRTPPEVSKKLEDI 1211
>sp|A0JN52|SF3B3_BOVIN Splicing factor 3B subunit 3 OS=Bos taurus GN=SF3B3 PE=2 SV=1
Length = 1217
Score = 200 bits (509), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 213/866 (24%), Positives = 378/866 (43%), Gaps = 136/866 (15%)
Query: 3 TFYVLPKQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFC-VVDLERQGQGQVVTCSGAYK 61
TF+ P+ L L L ++ +L PI+ FC + DL + Q+ G
Sbjct: 384 TFFFQPRPLKNLVL-----------VDELDSLSPIL-FCQIADLANEDTPQLYVACGRGP 431
Query: 62 DGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLED 120
SLR++R+G+ ++E A EL G +W++R +D FD +++VSF++ T +L++ +
Sbjct: 432 RSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRRHIEDEFDAYIIVSFVNATLVLSIG--E 489
Query: 121 ELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVA 180
+EE GF T TL C + LVQV +R + + R NEWK+P ++
Sbjct: 490 TVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPDGIRHIRADKR--VNEWKTPGKKTIVKC 547
Query: 181 TANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQLEYEISCLDINPIGENPSYSQIAAV 238
N QV++A GG LVY E+ G L E + ++ ++ C+ + + S+ AV
Sbjct: 548 AVNQRQVVIALTGGELVYFEMDPSGQLNEYTERKEMSADVVCMSLANVPPGEQRSRFLAV 607
Query: 239 GMWTDISVRIFSLPDLNLITKEHLGGEIIP-RSVLLCAFE----------------GISY 281
G+ D +VRI SL + + + L + +P + LC E G Y
Sbjct: 608 GL-VDNTVRIISLDPSDCL--QPLSMQALPAQPESLCIVEMGGTEKQDELGERGSIGFLY 664
Query: 282 LLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYS 341
L L +G LL +L+ TG+L+D + LG++P+ L + V A S R + YS
Sbjct: 665 LNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRPVKLFRVRMQGQEAVLAMSSRSWLSYS 724
Query: 342 SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIR-SIPLGEH 400
+ + ++ + + F S P+ + L I ++ + + + + PL
Sbjct: 725 YQSRFHLTPLSYETLEFASGFASEQCPEGIVAISTNTLRILALEKLGAVFNQVAFPLQYT 784
Query: 401 PRR-ICHQEQSR-----------TFAICSLKNQSCAEE-------------SEM------ 429
PR+ + H E + T A + + Q AEE +EM
Sbjct: 785 PRKFVIHPESNNLIIIETDHNAYTEATKAQRKQQMAEEMVEAAGEDERELAAEMAAAFLN 844
Query: 430 -------------------HFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYY 470
+R+++ + L+ E S+ C FS+ +Y
Sbjct: 845 ENLPESIFGAPKAGNGQWASVIRVMNPIQGNTLDLVQLEQNEAAFSVAVCRFSNTGEDWY 904
Query: 471 C-VGTAYVLPEENEPTKGRILVF--IVEDG-KLQLIAEKETKGAVYSLNAFNGKLLAAIN 526
VG A L G + +V +G KL+ + + + ++ F G++L +
Sbjct: 905 VLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKLEFLHKTPVEEVPAAIAPFQGRVLIGVG 964
Query: 527 QKIQLYKWMLRDDGTRELQSECGHHGHILALY---VQTRGDFIVVGDLMKSISLLIYKHE 583
+ +++Y D G ++L +C + HI A Y +QT G ++V D+ +S + YK
Sbjct: 965 KLLRVY-----DLGKKKLLRKC-ENKHI-ANYISGIQTIGHRVIVSDVQESFIWVRYKRN 1017
Query: 584 EGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNSEGATDE--------- 632
E + A D W++ +LD D GA+ N+ VR N+ DE
Sbjct: 1018 ENQLIIFADDTYPRWVTTASLLDYDTVAGADKFGNICVVRLPPNTNDEVDEDPTGNKALW 1077
Query: 633 ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIA 685
+RG L EV+ YH+GE V + +L+ G ++++ T++G IG++
Sbjct: 1078 DRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI-------PGGSESLVYTTLSGGIGILV 1130
Query: 686 SL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDL 744
HE + F + ++ +LR + G +H +RS+ KN +DGDL E F +
Sbjct: 1131 PFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSFRSY-----YFPVKNVIDGDLCEQFNSM 1185
Query: 745 SRTRMDEISKTMNVSVEELCKRVEEL 770
+ +S+ ++ + E+ K++E++
Sbjct: 1186 EPNKQKNVSEELDRTPPEVSKKLEDI 1211
>sp|Q921M3|SF3B3_MOUSE Splicing factor 3B subunit 3 OS=Mus musculus GN=Sf3b3 PE=2 SV=1
Length = 1217
Score = 200 bits (509), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 213/866 (24%), Positives = 378/866 (43%), Gaps = 136/866 (15%)
Query: 3 TFYVLPKQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFC-VVDLERQGQGQVVTCSGAYK 61
TF+ P+ L L L ++ +L PI+ FC + DL + Q+ G
Sbjct: 384 TFFFQPRPLKNLVL-----------VDELDSLSPIL-FCQIADLANEDTPQLYVACGRGP 431
Query: 62 DGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLED 120
SLR++R+G+ ++E A EL G +W++R +D FD +++VSF++ T +L++ +
Sbjct: 432 RSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRRHIEDEFDAYIIVSFVNATLVLSIG--E 489
Query: 121 ELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVA 180
+EE GF T TL C + LVQV +R + + R NEWK+P ++
Sbjct: 490 TVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPDGIRHIRADKR--VNEWKTPGKKTIVKC 547
Query: 181 TANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQLEYEISCLDINPIGENPSYSQIAAV 238
N QV++A GG LVY E+ G L E + ++ ++ C+ + + S+ AV
Sbjct: 548 AVNQRQVVIALTGGELVYFEMDPSGQLNEYTERKEMSADVVCMSLANVPPGEQRSRFLAV 607
Query: 239 GMWTDISVRIFSLPDLNLITKEHLGGEIIP-RSVLLCAFE----------------GISY 281
G+ D +VRI SL + + + L + +P + LC E G Y
Sbjct: 608 GL-VDNTVRIISLDPSDCL--QPLSMQALPAQPESLCIVEMGGTEKQDELGERGSIGFLY 664
Query: 282 LLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYS 341
L L +G LL +L+ TG+L+D + LG++P+ L + V A S R + YS
Sbjct: 665 LNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRPVKLFRVRMQGQEAVLAMSSRSWLSYS 724
Query: 342 SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIR-SIPLGEH 400
+ + ++ + + F S P+ + L I ++ + + + + PL
Sbjct: 725 YQSRFHLTPLSYETLEFASGFASEQCPEGIVAISTNTLRILALEKLGAVFNQVAFPLQYT 784
Query: 401 PRR-ICHQEQSR-----------TFAICSLKNQSCAEE-------------SEM------ 429
PR+ + H E + T A + + Q AEE +EM
Sbjct: 785 PRKFVIHPESNNLIIIETDHNAYTEATKAQRKQQMAEEMVEAAGEDERELAAEMAAAFLN 844
Query: 430 -------------------HFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYY 470
+R+++ + L+ E S+ C FS+ +Y
Sbjct: 845 ENLPESIFGAPKAGNGQWASVIRVMNPIQGNTLDLVQLEQNEAAFSVAVCRFSNTGEDWY 904
Query: 471 C-VGTAYVLPEENEPTKGRILVF--IVEDG-KLQLIAEKETKGAVYSLNAFNGKLLAAIN 526
VG A L G + +V +G KL+ + + + ++ F G++L +
Sbjct: 905 VLVGVAKDLILSPRSVAGGFVYTYKLVNNGEKLEFLHKTPVEEVPAAIAPFQGRVLIGVG 964
Query: 527 QKIQLYKWMLRDDGTRELQSECGHHGHILALY---VQTRGDFIVVGDLMKSISLLIYKHE 583
+ +++Y D G ++L +C + HI A Y +QT G ++V D+ +S + YK
Sbjct: 965 KLLRVY-----DLGKKKLLRKC-ENKHI-ANYISGIQTIGHRVIVSDVQESFIWVRYKRN 1017
Query: 584 EGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNSEGATDE--------- 632
E + A D W++ +LD D GA+ N+ VR N+ DE
Sbjct: 1018 ENQLIIFADDTYPRWVTTASLLDYDTVAGADKFGNICVVRLPPNTNDEVDEDPTGNKALW 1077
Query: 633 ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIA 685
+RG L EV+ YH+GE V + +L+ G ++++ T++G IG++
Sbjct: 1078 DRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI-------PGGSESLVYTTLSGGIGILV 1130
Query: 686 SL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDL 744
HE + F + ++ +LR + G +H +RS+ KN +DGDL E F +
Sbjct: 1131 PFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSFRSY-----YFPVKNVIDGDLCEQFNSM 1185
Query: 745 SRTRMDEISKTMNVSVEELCKRVEEL 770
+ +S+ ++ + E+ K++E++
Sbjct: 1186 EPNKQKNVSEELDRTPPEVSKKLEDI 1211
>sp|Q5RBI5|SF3B3_PONAB Splicing factor 3B subunit 3 OS=Pongo abelii GN=SF3B3 PE=2 SV=1
Length = 1217
Score = 200 bits (509), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 213/866 (24%), Positives = 378/866 (43%), Gaps = 136/866 (15%)
Query: 3 TFYVLPKQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFC-VVDLERQGQGQVVTCSGAYK 61
TF+ P+ L L L ++ +L PI+ FC + DL + Q+ G
Sbjct: 384 TFFFQPRPLKNLVL-----------VDELDSLSPIL-FCQIADLANEDTPQLYVACGRGP 431
Query: 62 DGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLED 120
SLR++R+G+ ++E A EL G +W++R +D FD +++VSF++ T +L++ +
Sbjct: 432 RSSLRVLRHGLEVSETAVSELPGNPNAVWTVRRHIEDEFDAYIIVSFVNATLVLSIG--E 489
Query: 121 ELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVA 180
+EE GF T TL C + LVQV +R + + R NEWK+P ++
Sbjct: 490 TVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPDGIRHIRADKR--VNEWKTPGKKTIVKC 547
Query: 181 TANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQLEYEISCLDINPIGENPSYSQIAAV 238
N QV++A GG LVY E+ G L E + ++ ++ C+ + + S+ AV
Sbjct: 548 AVNQRQVVIALTGGELVYFEMDPSGQLNEYTERKEMSADVVCMSLANVPPGEQRSRFLAV 607
Query: 239 GMWTDISVRIFSLPDLNLITKEHLGGEIIP-RSVLLCAFE----------------GISY 281
G+ D +VRI SL + + + L + +P + LC E G Y
Sbjct: 608 GL-VDNTVRIISLDPSDCL--QPLSMQALPAQPESLCIVEMGGTEKQDELGERGSIGFLY 664
Query: 282 LLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYS 341
L L +G LL +L+ TG+L+D + LG++P+ L + V A S R + YS
Sbjct: 665 LNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRPVKLFRVRMQGQEAVLAMSSRSWLSYS 724
Query: 342 SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIR-SIPLGEH 400
+ + ++ + + F S P+ + L I ++ + + + + PL
Sbjct: 725 YQSRFHLTPLSYETLEFASGFASEQCPEGIVAISTNTLRILALEKLGAVFNQVAFPLQYT 784
Query: 401 PRR-ICHQEQSR-----------TFAICSLKNQSCAEE-------------SEM------ 429
PR+ + H E + T A + + Q AEE +EM
Sbjct: 785 PRKFVIHPESNNLIIIETDHNAYTEATKAQRKQQMAEEMVEAAGEDERELAAEMAAAFLN 844
Query: 430 -------------------HFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYY 470
+R+++ + L+ E S+ C FS+ +Y
Sbjct: 845 ENLPESIFGAPKAGSGQWASVIRVMNPIQGNTLDLVQLEQNEAAFSVAVCRFSNTGEDWY 904
Query: 471 C-VGTAYVLPEENEPTKGRILVF--IVEDG-KLQLIAEKETKGAVYSLNAFNGKLLAAIN 526
VG A L G + +V +G KL+ + + + ++ F G++L +
Sbjct: 905 VLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKLEFLHKTPVEEVPAAIAPFQGRVLIGVG 964
Query: 527 QKIQLYKWMLRDDGTRELQSECGHHGHILALY---VQTRGDFIVVGDLMKSISLLIYKHE 583
+ +++Y D G ++L +C + HI A Y +QT G ++V D+ +S + YK
Sbjct: 965 KLLRVY-----DLGKKKLLRKC-ENKHI-ANYISGIQTIGHRVIVSDVQESFIWVRYKRN 1017
Query: 584 EGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNSEGATDE--------- 632
E + A D W++ +LD D GA+ N+ VR N+ DE
Sbjct: 1018 ENQLIIFADDTYPRWVTTASLLDYDTVAGADKFGNICVVRLPPNTNDEVDEDPTGNKALR 1077
Query: 633 ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIA 685
+RG L EV+ YH+GE V + +L+ G ++++ T++G IG++
Sbjct: 1078 DRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI-------PGGSESLVYTTLSGGIGILV 1130
Query: 686 SL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDL 744
HE + F + ++ +LR + G +H +RS+ KN +DGDL E F +
Sbjct: 1131 PFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSFRSY-----YFPVKNVIDGDLCEQFNSM 1185
Query: 745 SRTRMDEISKTMNVSVEELCKRVEEL 770
+ +S+ ++ + E+ K++E++
Sbjct: 1186 EPNKQKNVSEELDRTPPEVTKKLEDI 1211
>sp|Q1LVE8|SF3B3_DANRE Splicing factor 3B subunit 3 OS=Danio rerio GN=sf3b3 PE=2 SV=1
Length = 1217
Score = 199 bits (507), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 203/863 (23%), Positives = 373/863 (43%), Gaps = 130/863 (15%)
Query: 3 TFYVLPKQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD 62
TF+ P+ L L L ++ +L PI+ + DL + Q+ G
Sbjct: 384 TFFFQPRPLKNLVL-----------VDEQESLSPIMSCQIADLANEDTPQLYVACGRGPR 432
Query: 63 GSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDE 121
+LR++R+G+ ++E A EL G +W++R +D FD +++VSF++ T +L++ +
Sbjct: 433 STLRVLRHGLEVSEMAVSELPGNPNAVWTVRRHVEDEFDAYIIVSFVNATLVLSIG--ET 490
Query: 122 LEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVAT 181
+EE GF T TL C + LVQV +R + + R NEWK+P ++
Sbjct: 491 VEEVTDSGFLGTTPTLSCSLLGEDALVQVYPDGIRHIRADKR--VNEWKTPGKKTIIRCA 548
Query: 182 ANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQLEYEISCLDINPIGENPSYSQIAAVG 239
N QV++A GG LVY E+ G L E + ++ ++ C+ + + S+ AVG
Sbjct: 549 VNQRQVVIALTGGELVYFEMDPSGQLNEYTERKEMSADVVCMSLANVPPGEQRSRFLAVG 608
Query: 240 MWTDISVRIFSLPD---LNLITKEHLGGEIIPRSVLLCAFEGIS--------------YL 282
+ D +VRI SL L ++ + L + P S+ + G+ YL
Sbjct: 609 L-VDNTVRIISLDPSDCLQPLSMQALPAQ--PESLCIVEMGGVEKQDELGEKGTIGFLYL 665
Query: 283 LCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSS 342
L +G LL +L+ TG+L+D + LG++P+ L + V A S R + YS
Sbjct: 666 NIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRPVKLFRVRMQGQEAVLAMSSRSWLSYSY 725
Query: 343 NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHP 401
+ + ++ + + + F S P+ + L I ++ + + + + PL P
Sbjct: 726 QSRFHLTPLSYETLEYASGFASEQCPEGIVAISTNTLRILALEKLGAVFNQVAFPLQYTP 785
Query: 402 RR-ICHQE-----------QSRTFAICSLKNQSCAEE-------------SEM------- 429
R+ + H E + T A + + Q AEE +EM
Sbjct: 786 RKFVIHPETNNLILIETDHNAYTEATKAQRKQQMAEEMVEAAGEDERELAAEMAAAFLNE 845
Query: 430 ------------------HFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 471
VRL++ + L+ E S+ C F + + +Y
Sbjct: 846 NLPEAIFGAPKAGSGQWASLVRLINPIQGNTLDLVQLEQNEAAFSVAICRFLNGGDDWYV 905
Query: 472 -VGTAY-VLPEENEPTKGRILVFIVEDG--KLQLIAEKETKGAVYSLNAFNGKLLAAINQ 527
VG A ++ G I + + G KL+ + + + ++ F G++L + +
Sbjct: 906 LVGVARDMILNPRSVGGGYIYTYRIVGGGDKLEFLHKTPVEDVPLAIAPFQGRVLVGVGK 965
Query: 528 KIQLYKWMLRDDGTRELQSEC-GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGA 586
+++Y D G ++L +C H L + T G ++V D+ +S+ + Y+ E
Sbjct: 966 LLRIY-----DLGKKKLLRKCENKHVPNLVTGIHTIGQRVIVSDVQESLFWVRYRRNENQ 1020
Query: 587 IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNSEGATDE---------ERG 635
+ A D W++ +LD D A+ N+ VR N+ DE +RG
Sbjct: 1021 LIIFADDTYPRWITTACLLDYDTMASADKFGNICVVRLPPNTSDDVDEDPTGNKALWDRG 1080
Query: 636 RL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL- 687
L E++ YH+GE V + +L+ G ++++ T++G IG++
Sbjct: 1081 LLNGASQKAEIIINYHIGETVLSLQKTTLI-------PGGSESLVYTTLSGGIGILVPFT 1133
Query: 688 PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRT 747
HE + F + L+ ++R + G +H +RS+ KN +DGDL E F +
Sbjct: 1134 SHEDHDFFQHLEMHMRSEFPPLCGRDHLSFRSY-----YFPVKNVIDGDLCEQFNSMDPH 1188
Query: 748 RMDEISKTMNVSVEELCKRVEEL 770
+ +S+ ++ + E+ K++E++
Sbjct: 1189 KQKSVSEELDRTPPEVSKKLEDI 1211
>sp|Q4WLI5|RSE1_ASPFU Pre-mRNA-splicing factor rse1 OS=Neosartorya fumigata (strain ATCC
MYA-4609 / Af293 / CBS 101355 / FGSC A1100) GN=rse1 PE=3
SV=1
Length = 1225
Score = 195 bits (495), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 203/815 (24%), Positives = 354/815 (43%), Gaps = 97/815 (11%)
Query: 25 VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 84
+ ++E +L P++D +V+L Q+ T SG+ S R +++G+ ++E EL
Sbjct: 411 LNLVETLNSLNPLIDSKIVNLNEDDAPQIYTVSGSGARSSFRTLKHGLEVSEIVESELPS 470
Query: 85 I-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 143
+ +W+ + + D FD ++++SF + T +L++ + +EE GF S TL
Sbjct: 471 VPSAVWTTKLTRADEFDAYIILSFANGTLVLSIG--ETVEEVTDTGFLSTAPTLAVQQLG 528
Query: 144 YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-G 202
+ L+QV +R + + R NEW +P S+ A N QV +A G +VY E+
Sbjct: 529 EDSLIQVHPRGIRHILADRR--VNEWPAPQHRSIVAAATNERQVAVALSSGEIVYFEMDA 586
Query: 203 DGILTEV-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKE 260
DG L E + Q+ ++CL + + E S AVG D +VRI SL PD L K
Sbjct: 587 DGTLAEYDERRQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKS 645
Query: 261 HLGGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 314
P ++ + + S YL L G L +L+ TGEL+D + LG +
Sbjct: 646 VQALTSAPSALNIMSMADSSSGGTTLYLHIGLYSGVYLRTVLDEVTGELSDTRTRFLGAK 705
Query: 315 PITLRTFSSKNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 372
P+ L S K T V A S RP + YS K + + ++ + F+S + +
Sbjct: 706 PVKLFRVSVKGQTAVLALSSRPWLGYSDIQTKGFMLTPLDYVGLEWGWNFSSEQCVEGMV 765
Query: 373 IAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHF 431
+ L I +I+ + + SIPL PRR+ + F + N + +
Sbjct: 766 GIQAQNLRIFSIEKLDNNILQESIPLSNTPRRMLKHPEQPLFYVIESDNNVLSPATRARL 825
Query: 432 VR----------LLDDQTFEF----------------------ISTYPLDTFEYGCSILS 459
+ +L + F + IST L+ E S+ +
Sbjct: 826 IEDSKARNGETNVLPPEDFGYPRATGHWASCIQIVDPLDAKAVISTIELEENEAAVSMAA 885
Query: 460 CSFSD-DSNVYYCVGTAYVLPEENEPTKGRILVFIV---EDGK-LQLIAEKETKGAVYSL 514
FS D + VGTA + N P+ + I EDGK L+ I + + + +L
Sbjct: 886 VPFSSQDDETFLVVGTAKDM-IVNPPSSAGGFIHIYRFQEDGKELEFIHKTKVEEPPLAL 944
Query: 515 NAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYV---QTRGDFIVVGDL 571
F G+LLA I +++Y D G ++L +C +++ + QT+G IVV D+
Sbjct: 945 LGFQGRLLAGIGSTLRIY-----DLGMKQLLRKC--QAQVVSKTIVGLQTQGSRIVVSDV 997
Query: 572 MKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEG 628
+S++ ++YK+++ + D + W ++ ++D + G + NL+ VR K SE
Sbjct: 998 RESVTYVVYKYQDNILIPFVDDSVSRWTTSTTMVDYETVAGGDKFGNLWLVRCPKKASEE 1057
Query: 629 ATDE--------ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVI 673
A ++ ERG RL+++ + + LV G ++
Sbjct: 1058 ADEDGSGAHLIHERGYLHGAPNRLDLMIHTYTQDIPTSLHKTQLV-------AGGRDILV 1110
Query: 674 FGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNF 732
+ G IG++ + E F + L+ L + G +H +RS+ K V
Sbjct: 1111 WTGFQGTIGMLVPFVSREDVDFFQNLEMQLASQCPPLAGRDHLIYRSYYAPVKGV----- 1165
Query: 733 LDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRV 767
+DGDL E + L I+ ++ SV E+ +++
Sbjct: 1166 IDGDLCEMYFLLPNDTKMMIAAELDRSVREIERKI 1200
>sp|Q5B1X8|RSE1_EMENI Pre-mRNA-splicing factor rse1 OS=Emericella nidulans (strain FGSC A4
/ ATCC 38163 / CBS 112.46 / NRRL 194 / M139) GN=rse1 PE=3
SV=2
Length = 1209
Score = 194 bits (492), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 208/815 (25%), Positives = 353/815 (43%), Gaps = 91/815 (11%)
Query: 25 VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 84
+ ++E +L P+VD VV++ Q+ T SG + R +++G+ ++E EL
Sbjct: 411 LNLVEAINSLNPLVDSKVVNISEDDAPQIFTVSGTGARSTFRTLKHGLEVSEIVESELPS 470
Query: 85 I-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 143
+ +W+ + + D FD ++V+SF + T +L++ + +EE GF S TL
Sbjct: 471 VPSAVWTTKLTRADEFDAYIVLSFANGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLG 528
Query: 144 YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-G 202
+ L+Q+ +R + + R NEW +P S+ A N QV +A G +VY E+
Sbjct: 529 EDSLIQIHPRGIRHILADRR--VNEWPAPQHRSIVAAATNERQVAVALSSGEIVYFELDA 586
Query: 203 DGILTEV-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKE 260
DG L E + Q+ ++CL + + E S AVG D +VRI SL PD L K
Sbjct: 587 DGSLAEYDERRQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDTTLENKS 645
Query: 261 HLGGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 314
P ++ + A S YL L G L L+ TGEL+D + LG++
Sbjct: 646 VQALTAAPSALNIIAMADSSSGGTTLYLHIGLHSGVYLRTALDEVTGELSDTRTRFLGSK 705
Query: 315 PITLRTFSSKNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 372
+ L S T V A S RP + YS K + + ++ + F+S + +
Sbjct: 706 AVKLFQVSVTGQTAVLALSSRPWLGYSDTQTKGFMLTPLDYVGLEWGWNFSSEQCVEGMV 765
Query: 373 IAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHF 431
+ L I +I+ + + +SIPL PR + F + N + +
Sbjct: 766 GIQGQNLRIFSIEKLDNNMLQQSIPLAYTPRHFIKHPEEPLFYVIEADNNVLSPATR--- 822
Query: 432 VRLLDDQTFEFIST---------YP--------------------------LDTFEYGCS 456
RLL+D T YP L+ E S
Sbjct: 823 ARLLEDSKARGGDTTVLPPEDFGYPRGTGHWASCIQIIDPLDAKAVVGAVELEENEAAVS 882
Query: 457 ILSCSF-SDDSNVYYCVGTAYVLPEENEPTK--GRILVF-IVEDGK-LQLIAEKETKGAV 511
I + F S D + VGTA + N P+ G I ++ EDGK L+ I + + +
Sbjct: 883 IAAVPFTSQDDETFLVVGTAKDM-TVNPPSSAGGYIHIYRFQEDGKELEFIHKTKVEEPP 941
Query: 512 YSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGD 570
+L F G+LLA + +++Y D G ++L +C A+ +QT+G IVV D
Sbjct: 942 LALLGFQGRLLAGVGSVLRIY-----DLGMKQLLRKCQAAVAPKAIVGLQTQGSRIVVSD 996
Query: 571 LMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSE 627
+ +S++ ++YK+++ + D A W +A ++D + G + NL+ VR K SE
Sbjct: 997 VRESVTYVVYKYQDNVLIPFVDDSIARWTTAATMVDYETTAGGDKFGNLWLVRCPKKASE 1056
Query: 628 GATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV-----------GQIPTVIFGT 676
A +E G + +L NR L++ + D+ G +++
Sbjct: 1057 EADEEGSGAHLIHDRGYLQGTPNRLE---LMIHVFTQDIPTSLHKTQLVAGGRDILVWTG 1113
Query: 677 VNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDG 735
G IG++ + E F + L+ L + G +H +RS+ K V +DG
Sbjct: 1114 FQGTIGILVPFVSREDVDFFQSLEMQLASQCPPLAGRDHLIYRSYYAPVKGV-----IDG 1168
Query: 736 DLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 770
DL E + LS I+ ++ SV E+ +++ ++
Sbjct: 1169 DLCEQYFLLSNDTKMMIAAELDRSVREIERKISDM 1203
>sp|Q52E49|RSE1_MAGO7 Pre-mRNA-splicing factor RSE1 OS=Magnaporthe oryzae (strain 70-15 /
ATCC MYA-4617 / FGSC 8958) GN=RSE1 PE=3 SV=2
Length = 1216
Score = 187 bits (475), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 211/829 (25%), Positives = 357/829 (43%), Gaps = 84/829 (10%)
Query: 8 PKQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRI 67
P + + +P + VE ++ ++ P++D V +L + Q+ T SG + R+
Sbjct: 400 PYEPVYFYPRPTENLALVESID---SMNPLMDLKVANLTEEDAPQIYTVSGKGARSTFRM 456
Query: 68 VRNGIGINEQASVELQGI-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETE 126
+++G+ +NE + +L G +W+ + DD +D ++V+SF + T +L++ + +EE
Sbjct: 457 LKHGLEVNEIVASQLPGTPSAVWTTKLRRDDEYDAYIVLSFTNGTLVLSIG--ETVEEVS 514
Query: 127 IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQ 186
GF S TL + LVQV +R + + + NEW SP S+ A N Q
Sbjct: 515 DTGFLSSVPTLAVQQLGDDGLVQVHPKGIRHIRNG---VVNEWSSPQHRSIVAAATNERQ 571
Query: 187 VLLATGGGHLVYLEIG-DGILTEVKHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDI 244
V +A G +VY E+ DG L E + + ++ L + + E S AVG D
Sbjct: 572 VAVALSSGEIVYFEMDTDGSLAEYDEKKEMFGTVTSLSLGEVPEGRLRSSYLAVGC-DDC 630
Query: 245 SVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLN 297
+VRI SL P+ L +K P ++ + + E S YL L G L +L+
Sbjct: 631 TVRILSLDPESTLESKSVQALTAAPSALSIMSMEDSSSGGTTLYLHIGLNSGVYLRTVLD 690
Query: 298 MKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKE 355
TGELTD ++ LG + + L S + T V A S R + +S K + +N +E
Sbjct: 691 EVTGELTDTRQKFLGPKAVRLFQVSVQKRTCVLALSSRSWLGFSDPVTKGFTMTPLNYEE 750
Query: 356 VSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHI-RSIPLGEHPRRICHQEQSRTFA 414
+ F S + + L I I+ + I +SIPL PR++ R F
Sbjct: 751 LEWGWNFVSEQCEEGMVGVNGQFLRIFAIEKLGDNVIQKSIPLTYTPRKLAKHPTQRIFY 810
Query: 415 ICSLKNQSCAEESEMHFV----------RLLDDQTFEF---------------------- 442
N + A E + R+L F +
Sbjct: 811 TIEADNNTLAPELREQLMAAPTAVNGDARVLPPDEFGYPRGNGRWASCISVVDPLGDGEE 870
Query: 443 -----ISTYPLDTFEYGCSILSCSF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IV 494
+ LD E S+ SF S D + VGT ++ T+G I V+
Sbjct: 871 LEPGVVQRIDLDNNEAALSMAVVSFASQDGESFLVVGTGKDMVVNPRRFTEGYIHVYRFS 930
Query: 495 EDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGH 553
EDG+ L+ I + + + +L F G+L+A I + +++Y LR R+ Q+E
Sbjct: 931 EDGRELEFIHKTKVEEPPTALLPFQGRLVAGIGRMLRIYDLGLR-QLLRKAQAEVAPQ-- 987
Query: 554 ILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGA 613
L + + T+G I+VGD+ + + YK E + A D A W + ++D D GA
Sbjct: 988 -LIVSLNTQGSRIIVGDVQHGLIYVAYKSETNRLIPFADDTIARWTTCTTMVDYDSTAGA 1046
Query: 614 ENNFNLFTVR--KNSEGATDEERGRLEVV-GEYHLGEFVNRFRHGSLV--MRLPDS---- 664
+ NL+ +R + + +DE + +V +L NR + V +P S
Sbjct: 1047 DKFGNLWILRCPEKASQESDEPGSEVHLVHSRDYLHGTSNRLALMAHVYTQDIPTSICKT 1106
Query: 665 --DVGQIPTVIFGTVNGVIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 721
VG +++G G IGV I + E F + L+ +LR + G +H +R
Sbjct: 1107 NLVVGGQEVLLWGGFQGTIGVLIPFVSREDADFFQSLEQHLRSEDPPLAGRDHLMYRGC- 1165
Query: 722 NEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 770
V K +DGDL E + L + I+ ++ SV E+ +++ ++
Sbjct: 1166 ----YVPVKGVIDGDLCERYTMLPNDKKQMIAGELDRSVREIERKISDI 1210
>sp|Q9UTT2|RSE1_SCHPO Pre-mRNA-splicing factor prp12 OS=Schizosaccharomyces pombe (strain
972 / ATCC 24843) GN=prp12 PE=1 SV=1
Length = 1206
Score = 184 bits (468), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 195/811 (24%), Positives = 348/811 (42%), Gaps = 93/811 (11%)
Query: 25 VEVLERYVNLGPIVDFCVVDLERQGQG-QVVTCSGAYKDGSLRIVRNGIGINEQASVELQ 83
+ ++E +L + D ++ G+ Q+ T G + SLR +R G+ E + EL
Sbjct: 419 LSLVEEIPSLYSLTDTLLMKAPSSGEANQLYTVCGRGSNSSLRQLRRGLETTEIVASELP 478
Query: 84 GIK-GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDA 142
G +W+L+ + D +D+++++SF + T +L++ + +EE GF S TL
Sbjct: 479 GAPIAIWTLKLNQTDVYDSYIILSFTNGTLVLSIG--ETVEEISDSGFLSSVSTLNARQM 536
Query: 143 IYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG 202
+ LVQ+ +R + + + +EWK P V + N Q+++A G LVY E+
Sbjct: 537 GRDSLVQIHPKGIRYIRANKQT--SEWKLPQDVYVVQSAINDMQIVVALSNGELVYFEMS 594
Query: 203 D----GILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLI 257
D G L E + + L ++ L + P+ E S + D +VR+ SL DL
Sbjct: 595 DDVEGGQLNEYQERKTLTANVTSLALGPVQEGSRRSNFMCLAC-DDATVRVLSL-DL-YT 651
Query: 258 TKEHLGGE----------IIPRSVLLCAFEGIS--YLLCALGDGHLLNFLLNMKTGELTD 305
T E+L + IIP +V G+S YL L +G L ++++ +G+L D
Sbjct: 652 TLENLSVQALSSPANSLCIIPMNV-----NGVSTLYLHIGLMNGVYLRTVIDVTSGQLLD 706
Query: 306 RKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSA 365
+ LG + + + + KN V A S R + YS + L S + + H F S
Sbjct: 707 TRTRFLGPRAVKIYPITMKNQNTVLAVSSRTFLAYSYQQNLQLSPIAYSAIDHASSFASE 766
Query: 366 AFPDSLAIAKEGELTIGTIDDIQ-KLHIRSIPLGEHPRRICHQ---------EQSRTF-- 413
P+ + ++ L I T+D +Q L PL PR+I + R F
Sbjct: 767 QCPEGIVAIQKNTLKIFTVDSLQDDLKSDIYPLICTPRKIVKHPNFPVLYILQSERNFDS 826
Query: 414 ------------AICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCS 461
+ K +S + F+ + D + + I PL E S+ +
Sbjct: 827 FKYAQENGDVGSSYTKEKQNEHTSKSWVSFISVFDMISKKIIHESPLGDNEAAFSMTAAF 886
Query: 462 FSDDSNVYYCVGTAYVLPEENEPTKG---RILVFIVEDGKLQLIAEKETKGAVYSLNAFN 518
F + + G+A + E R+ F E KL+LI+ E G +L F
Sbjct: 887 FKNRDEFFLVAGSATNMDLECRTCSHGNFRVYRFHDEGKKLELISHTEIDGIPMALTPFQ 946
Query: 519 GKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKS 574
G++LA + + +++Y K MLR EL + + ++ + IVV D S
Sbjct: 947 GRMLAGVGRFLRIYDLGNKKMLRKG---ELSAV-----PLFITHITVQASRIVVADSQYS 998
Query: 575 ISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNSEGATDE 632
+ ++YK E+ + A D W + ++D D G + N++ +R ++ DE
Sbjct: 999 VRFVVYKPEDNHLLTFADDTIHRWTTTNVLVDYDTLAGGDKFGNIWLLRCPEHVSKLADE 1058
Query: 633 ERGRLEVVGEYHLGEFVNRFRHG-SLVMRLPDSDV-----------GQIPTVIFGTVNGV 680
E +++ H F+N H L+ +D+ G +++ + G
Sbjct: 1059 ENSESKLI---HEKPFLNSTPHKLDLMAHFFTNDIPTSLQKVQLVEGAREVLLWTGLLGT 1115
Query: 681 IGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIE 739
+GV + E F ++L+ LRK + G +H +RS+ K V +DGDL E
Sbjct: 1116 VGVFTPFINQEDVRFFQQLEFLLRKECPPLAGRDHLAYRSYYAPVKCV-----IDGDLCE 1170
Query: 740 SFLDLSRTRMDEISKTMNVSVEELCKRVEEL 770
+ L + I+ ++ ++ E+ K++E+
Sbjct: 1171 MYYSLPHPVQEMIANELDRTIAEVSKKIEDF 1201
>sp|Q7RYR4|RSE1_NEUCR Pre-mRNA-splicing factor rse-1 OS=Neurospora crassa (strain ATCC
24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987)
GN=rse-1 PE=3 SV=2
Length = 1209
Score = 180 bits (456), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 202/808 (25%), Positives = 357/808 (44%), Gaps = 84/808 (10%)
Query: 27 VLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK 86
++E ++ P VD V +L + Q+ + G + R++++G+ ++E + EL G
Sbjct: 416 LVESIDSMNPQVDCKVANLTGEDAPQIYSVCGNGARSTFRMLKHGLEVSEIVASELPGTP 475
Query: 87 -GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYN 145
+W+ + + D +D ++V+SF + T +L++ + +EE GF + TL +
Sbjct: 476 SAVWTTKLTKYDQYDAYIVLSFTNGTLVLSIG--ETVEEVSDSGFLTTAPTLAVQQMGED 533
Query: 146 QLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDG 204
L+QV +R + NEW +P S+ ATAN +QV++A G +VY E+ DG
Sbjct: 534 GLIQVHPKGIRHIVQGRV---NEWPAPQHRSIVAATANENQVVIALSSGEIVYFEMDSDG 590
Query: 205 ILTEV-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHL 262
L E + ++ ++ L + + E S AVG D +VRI SL PD L K
Sbjct: 591 SLAEYDEKKEMSGTVTSLSVGQVPEGLKRSSFLAVGC-DDCTVRILSLDPDSTLEMKSIQ 649
Query: 263 GGEIIPRSVLLCAFE-----GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPIT 317
P ++ + + E YL L G L +L+ TGELTD ++ LG +P
Sbjct: 650 ALTAAPSALSIMSMEDSFGGSTLYLHIGLHSGVYLRTVLDEVTGELTDTRQKFLGPKPTR 709
Query: 318 LRTFSSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 375
L S ++ V A S RP + Y+ K + + ++ E+ + F+S + +
Sbjct: 710 LFQVSVQDQPCVLALSSRPWLGYTDPLTKGFMMTPLSYTELEYGWNFSSEQCLEGMVGIH 769
Query: 376 EGELTIGTIDDIQKLHI-RSIPLGEHPRRIC-HQEQSRTFAICSLKN-------QSCAEE 426
L I +I+ + I +SIPL P+ + H EQ + I S N E+
Sbjct: 770 ANYLRIFSIEKLGDNMIQKSIPLTYTPKHLVKHPEQPYFYTIESDNNTLPPELRAKLLEQ 829
Query: 427 SEMHFVRLLDDQTFEF----------------ISTYP-------LDTFEYGCSILSCSF- 462
+L + F + IS P LD E S F
Sbjct: 830 QSNGDATVLPPEDFGYPRAKGRWASCISIIDPISEEPRVLQRIDLDNNEAAVSAAIVPFA 889
Query: 463 SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNG 519
S + + VGT ++ + + T+G I V+ EDG+ L+ I + + +L F G
Sbjct: 890 SQEGESFLVVGTGKDMVLDPRQFTEGYIHVYRFHEDGRDLEFIHKTRVEEPPLALIPFQG 949
Query: 520 KLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLI 579
+LLA + + +++Y L+ R+ Q++ L + +Q++G+ I+VGDL + I+ ++
Sbjct: 950 RLLAGVGKTLRIYDLGLK-QLLRKAQADV---TPTLIVSLQSQGNRIIVGDLQQGITYVV 1005
Query: 580 YKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEV 639
YK E + A D W + ++D + G + N++ VR + + + E
Sbjct: 1006 YKAEGNRLIPFADDTLNRWTTCTTMVDYESVAGGDKFGNIYIVRCPERVSQETD----EP 1061
Query: 640 VGEYHLGEFVNRFRHGS----------LVMRLPDS------DVGQIPTVIFGTVNGVIGV 683
E HL N + HG+ LP S VG +++ + G +GV
Sbjct: 1062 GSEIHLMHARN-YLHGTPNRLSLQVHFYTQDLPTSICKTSLVVGGQDVLLWSGLQGTVGV 1120
Query: 684 -IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFL 742
I + E F + L+ ++R + G +H +R + K V +DGDL E F
Sbjct: 1121 FIPFVSREDVDFFQNLENHMRAEDPPLAGRDHLIYRGYYTPVKGV-----IDGDLCERFS 1175
Query: 743 DLSRTRMDEISKTMNVSVEELCKRVEEL 770
L + I+ ++ SV E+ +++ ++
Sbjct: 1176 LLPNDKKQMIAGELDRSVREIERKISDI 1203
>sp|P0CR22|RSE1_CRYNJ Pre-mRNA-splicing factor RSE1 OS=Cryptococcus neoformans var.
neoformans serotype D (strain JEC21 / ATCC MYA-565)
GN=RSE1 PE=3 SV=1
Length = 1217
Score = 171 bits (434), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 206/816 (25%), Positives = 353/816 (43%), Gaps = 102/816 (12%)
Query: 33 NLGPIVDFCVVDL--ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMW 89
+L PI D VV+L Q+ G + R +++G+ + E S L G+ +W
Sbjct: 420 SLDPITDAHVVNLLGASSDTPQIYAACGRGARSTFRTLKHGLDVAEMVSSPLPGVPTNVW 479
Query: 90 SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 149
+L+ + DD +D+++V+SF + T +L++ + +EE GF S TL L+Q
Sbjct: 480 TLKLTEDDEYDSYIVLSFPNGTLVLSIG--ETIEEVNDTGFLSSGPTLAVQQLGNAGLLQ 537
Query: 150 VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE 208
V +R + + R +EW +PPG ++ AT N QV++A LVY E+ +G L+E
Sbjct: 538 VHPYGLRHIRAADRV--DEWPAPPGQTIVAATTNRRQVVIALSTAELVYFELDPEGSLSE 595
Query: 209 VKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEI 266
+ + L +C+ I + E + AVG + +V I SL PD L T
Sbjct: 596 YQEKKALPGNATCVTIAEVPEGRRRTSFLAVGC-DNQTVSIISLEPDSTLDTLSLQALTA 654
Query: 267 IPRSVLLCAFEGIS--------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITL 318
P S+ L S +L L +G LL +++ G L+D + LG +P L
Sbjct: 655 PPTSICLAEIFDTSIDKNRATMFLNIGLMNGVLLRTVVDPVDGSLSDTRLRFLGAKPPKL 714
Query: 319 RTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGE 378
+ + V A S R ++Y+ L + + + ++A PD L
Sbjct: 715 VRANVQGQPSVMAFSSRTWLLYTYQDMLQTQPLIYDTLEYAWSLSAAMCPDGLIGISGNT 774
Query: 379 LTIGTIDDI-QKLHIRSIPLGEHPRR-ICHQEQS---------RTFAICSLKNQSCAEES 427
L I I + +KL S L PR+ I H S RT++ +++ +ES
Sbjct: 775 LRIFNIPKLGEKLKQDSTALTYTPRKFISHPFNSVFYMIEADHRTYSKSAIERIVKQKES 834
Query: 428 E-----------------------MHF---VRLLDDQTFEFISTYPLDTFEYGCSILSCS 461
E H+ VR+LD E I T LD E SI
Sbjct: 835 EGRRVDTLLLDLPANEFGRPRAPAGHWASCVRVLDPLANETIMTLDLDEDEAAFSIAIAY 894
Query: 462 FS-DDSNVYYCVGTAYVLPEENEPTK-GRILVF-IVEDGK-LQLIAEKETKGAVYSLNAF 517
F + VGT + + K G + V+ I E G+ L+ + + +T L F
Sbjct: 895 FERGGGEPFLVVGTGVKTTLQPKGCKEGYLRVYAIKEQGRILEFLHKTKTDDIPLCLAGF 954
Query: 518 NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGDLMKSIS 576
G LLA I + ++LY+ G + L +C ++G A+ + +G I+VGD+ +S
Sbjct: 955 QGFLLAGIGKSLRLYEM-----GKKALLRKCENNGFPTAVVTINVQGARIIVGDMQESTF 1009
Query: 577 LLIYKHEEGAIEER-----ARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKN---SEG 628
+Y+ +I R A D W++ V +D + + N+F R + SE
Sbjct: 1010 YCVYR----SIPTRQLLIFADDSQPRWITCVTSVDYETVACGDKFGNIFINRLDPSISEK 1065
Query: 629 ATDEERG------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV-------IFG 675
D+ G + ++G H E + + GS+V + + +IP V ++
Sbjct: 1066 VDDDPTGATILHEKSFLMGAAHKTEMIGHYNIGSVV-----TSITKIPLVAGGRDVLVYT 1120
Query: 676 TVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLD 734
T++G +G + + + F+ L+ ++R + G +H +R + V K +D
Sbjct: 1121 TISGAVGALVPFVSSDDIEFMSTLEMHMRTQDISLVGRDHIAYRGY-----YVPIKGVVD 1175
Query: 735 GDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 770
GDL ESF L + I+ ++ SV ++ K++E++
Sbjct: 1176 GDLCESFSLLPYPKQQAIALDLDRSVGDVLKKLEQM 1211
>sp|P0CR23|RSE1_CRYNB Pre-mRNA-splicing factor RSE1 OS=Cryptococcus neoformans var.
neoformans serotype D (strain B-3501A) GN=RSE1 PE=3 SV=1
Length = 1217
Score = 171 bits (434), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 206/816 (25%), Positives = 353/816 (43%), Gaps = 102/816 (12%)
Query: 33 NLGPIVDFCVVDL--ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMW 89
+L PI D VV+L Q+ G + R +++G+ + E S L G+ +W
Sbjct: 420 SLDPITDAHVVNLLGASSDTPQIYAACGRGARSTFRTLKHGLDVAEMVSSPLPGVPTNVW 479
Query: 90 SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 149
+L+ + DD +D+++V+SF + T +L++ + +EE GF S TL L+Q
Sbjct: 480 TLKLTEDDEYDSYIVLSFPNGTLVLSIG--ETIEEVNDTGFLSSGPTLAVQQLGNAGLLQ 537
Query: 150 VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE 208
V +R + + R +EW +PPG ++ AT N QV++A LVY E+ +G L+E
Sbjct: 538 VHPYGLRHIRAADRV--DEWPAPPGQTIVAATTNRRQVVIALSTAELVYFELDPEGSLSE 595
Query: 209 VKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEI 266
+ + L +C+ I + E + AVG + +V I SL PD L T
Sbjct: 596 YQEKKALPGNATCVTIAEVPEGRRRTSFLAVGC-DNQTVSIISLEPDSTLDTLSLQALTA 654
Query: 267 IPRSVLLCAFEGIS--------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITL 318
P S+ L S +L L +G LL +++ G L+D + LG +P L
Sbjct: 655 PPTSICLAEIFDTSIDKNRATMFLNIGLMNGVLLRTVVDPVDGSLSDTRLRFLGAKPPKL 714
Query: 319 RTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGE 378
+ + V A S R ++Y+ L + + + ++A PD L
Sbjct: 715 VRANVQGQPSVMAFSSRTWLLYTYQDMLQTQPLIYDTLEYAWSLSAAMCPDGLIGISGNT 774
Query: 379 LTIGTIDDI-QKLHIRSIPLGEHPRR-ICHQEQS---------RTFAICSLKNQSCAEES 427
L I I + +KL S L PR+ I H S RT++ +++ +ES
Sbjct: 775 LRIFNIPKLGEKLKQDSTALTYTPRKFISHPFNSVFYMIEADHRTYSKSAIERIVKQKES 834
Query: 428 E-----------------------MHF---VRLLDDQTFEFISTYPLDTFEYGCSILSCS 461
E H+ VR+LD E I T LD E SI
Sbjct: 835 EGRRVDTLLLDLPANEFGRPRAPAGHWASCVRVLDPLANETIMTLDLDEDEAAFSIAIAY 894
Query: 462 FS-DDSNVYYCVGTAYVLPEENEPTK-GRILVF-IVEDGK-LQLIAEKETKGAVYSLNAF 517
F + VGT + + K G + V+ I E G+ L+ + + +T L F
Sbjct: 895 FERGGGEPFLVVGTGVKTTLQPKGCKEGYLRVYAIKEQGRILEFLHKTKTDDIPLCLAGF 954
Query: 518 NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGDLMKSIS 576
G LLA I + ++LY+ G + L +C ++G A+ + +G I+VGD+ +S
Sbjct: 955 QGFLLAGIGKSLRLYEM-----GKKALLRKCENNGFPTAVVTINVQGARIIVGDMQESTF 1009
Query: 577 LLIYKHEEGAIEER-----ARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKN---SEG 628
+Y+ +I R A D W++ V +D + + N+F R + SE
Sbjct: 1010 YCVYR----SIPTRQLLIFADDSQPRWITCVTSVDYETVACGDKFGNIFINRLDPSISEK 1065
Query: 629 ATDEERG------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV-------IFG 675
D+ G + ++G H E + + GS+V + + +IP V ++
Sbjct: 1066 VDDDPTGATILHEKSFLMGAAHKTEMIGHYNIGSVV-----TSITKIPLVAGGRDVLVYT 1120
Query: 676 TVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLD 734
T++G +G + + + F+ L+ ++R + G +H +R + V K +D
Sbjct: 1121 TISGAVGALVPFVSSDDIEFMSTLEMHMRTQDISLVGRDHIAYRGY-----YVPIKGVVD 1175
Query: 735 GDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 770
GDL ESF L + I+ ++ SV ++ K++E++
Sbjct: 1176 GDLCESFSLLPYPKQQAIALDLDRSVGDVLKKLEQM 1211
>sp|Q4PGM6|RSE1_USTMA Pre-mRNA-splicing factor RSE1 OS=Ustilago maydis (strain 521 / FGSC
9021) GN=RSE1 PE=3 SV=1
Length = 1221
Score = 155 bits (393), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 193/811 (23%), Positives = 354/811 (43%), Gaps = 89/811 (10%)
Query: 33 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSL 91
+L PI+D ++ Q+ G S +++R+G+ + E S +L G+ +W+
Sbjct: 420 SLDPILDAKPLNPLAADSPQIFAACGRGARSSFKMLRHGLEVQEAVSSDLPGVPSAVWTT 479
Query: 92 RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVT 151
+ + D +D+++++SF++ T +L++ + +EE GF + + TL + L+QV
Sbjct: 480 KITQQDEYDSYIILSFVNGTLVLSIG--ETIEEVSDSGFLTSSSTLAVQQLGQDALLQVH 537
Query: 152 SGSVRLVSSTSRELRNEWKSP--PG--YSVNVAT-ANASQVLLATGGGHLVYLEIG-DGI 205
+R V +++ NEW +P P + VAT N QV++A LVY E+ DG
Sbjct: 538 PHGIRHVL-VDKQI-NEWATPSLPNGRQTTIVATCTNERQVVVALSSNELVYFELDMDGQ 595
Query: 206 LTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGG 264
L E + + + + + + E + AVG D +VRI SL + + +
Sbjct: 596 LNEYQERKAMGAGVLTMSMPDCPEGRQRTPYLAVGC-DDSTVRIISLEPNSTLASISIQA 654
Query: 265 EIIPRSVLLCAFE----------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 314
P S + C E +++ L +G LL +L+ TG+LTD + LG++
Sbjct: 655 LTAPASSI-CMAEMLDATIDRNHATTFVNIGLQNGVLLRTILDAVTGQLTDTRTRFLGSK 713
Query: 315 PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 374
+ L V A S R + Y+ +L + + + H F++ P+ L
Sbjct: 714 AVRLIRTKVHGQAAVMALSTRTWLSYTYQDRLQFVPLIFDVLDHAWSFSAELCPEGLIGI 773
Query: 375 KEGELTIGTIDDI-QKLHIRSIPLGEHPRRIC-HQEQSRTFAICSLKNQSCA-------- 424
L I TI + KL S+ L PR+I H + F + ++++ +
Sbjct: 774 VGSTLRIFTIPSLASKLKQDSVALSYTPRKIANHPNEQGLFYVVEAEHRTLSPGAQRRRT 833
Query: 425 ----EESEMHFVRLLDDQTFEFISTYP------------------------LDTFEYGCS 456
+E + H +LD EF + +D E S
Sbjct: 834 EMLGKELKPHQRGVLDLNPAEFGAIRAEAGNWASCIRAVDGVQAQTTHRLEMDDNEAAFS 893
Query: 457 ILSCSF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVY 512
I F S + V VG+A V+ K + + ++++G+ L+L+ + E
Sbjct: 894 IAVVPFASAEKEVMLVVGSAVDVVLSPRSCKKAYLTTYRLLDNGRELELLHKTEVDDIPL 953
Query: 513 SLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGDL 571
L AF G+LLA I + +++Y D G ++L +C + A+ + +G IVVGD+
Sbjct: 954 VLRAFQGRLLAGIGKALRIY-----DLGKKKLLRKCENRSFPTAVVSLDAQGSRIVVGDM 1008
Query: 572 MKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNSEGA 629
+SI YK E + A D +++ +LD D A+ N++ +R N+ +
Sbjct: 1009 QESIIFASYKPLENRLVTFADDVMPKFVTRCTMLDYDTVAAADKFGNIYVLRLDGNTSRS 1068
Query: 630 TDEERGRLEVV-------GEYHLGEFVNRFRHGSLVMRLPDSDV--GQIPTVIFGTVNGV 680
DE+ + +V G H V F G ++ L + + G +++ ++G
Sbjct: 1069 VDEDPTGMTIVHEKPVLMGAAHKASLVAHFFVGDIITSLHRTAMVAGGREVLLYTGLSGS 1128
Query: 681 IGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIE 739
IG + + E L L+++LR+ + G +H +RS K+V +DGDL E
Sbjct: 1129 IGALVPFVSKEDVDTLSTLESHLRQENNSIVGRDHLAYRSSYAPVKSV-----IDGDLCE 1183
Query: 740 SFLDLSRTRMDEISKTMNVSVEELCKRVEEL 770
+F LS + + I+ ++ E+ K++ +L
Sbjct: 1184 TFGLLSPAKQNAIAGELDRKPGEINKKLAQL 1214
>sp|Q10426|RIK1_SCHPO Chromatin modification-related protein rik1 OS=Schizosaccharomyces
pombe (strain 972 / ATCC 24843) GN=rik1 PE=1 SV=2
Length = 1040
Score = 110 bits (276), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 171/749 (22%), Positives = 314/749 (41%), Gaps = 84/749 (11%)
Query: 33 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLR 92
NLGPI D V L+ + + C+G ++ SL ++ + ++ ++ GI L
Sbjct: 341 NLGPIHDLLV--LKNDIEKSFLVCAGTPRNASLIYFQHALKLDILGQTKISGILRAMVLP 398
Query: 93 SSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTS 152
S + L + F SET +A N++++ + E++ S + + VQVTS
Sbjct: 399 SYPEHK----LFLGFPSET--VAFNIKEDFQ-LELDPSLSTKERTIALSGTNGEFVQVTS 451
Query: 153 GSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHA 212
+ + S R + N A ++ G ++ + TEV
Sbjct: 452 TFLCIYDSAKRSRLVYIEK----ITNAACYQEYSAIVINGTALAIFKKD-----TEVARK 502
Query: 213 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLIT-KEHLGGEIIPRSV 271
E EISCLD + + QI VG W+ V I + D + I+ +PR++
Sbjct: 503 VFESEISCLDFS------AQFQIG-VGFWSK-QVMILTFSDNSSISCAFQTNVPSLPRNI 554
Query: 272 LLCAFEGI----SYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT 327
+L EG+ + LL + G G +++L ++ K GT P++ R F+ T
Sbjct: 555 IL---EGVGVDRNLLLVSSGSGEFKSYVLFKNNLVFSETKH--FGTTPVSFRRFTMNIGT 609
Query: 328 HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI 387
++ +D P ++Y N L Y +++ + +C F + D L G L ++ +
Sbjct: 610 YIICNNDCPHMVYGFNGALCYMPLSMPQSYDVCQFRDNSGKDFLISVSLGGLKFLQLNPL 669
Query: 388 QKLHIRSIPLGEHP-RRICHQEQ--SRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFIS 444
+L R + L P + I Q + RT +S E + V DD +F S
Sbjct: 670 PELTPRKVLLEHVPLQAIIFQNKLLLRTLENRYEDYESYKENYHLELVDSYDDNSFRVFS 729
Query: 445 TYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILV--FIVEDGKLQLI 502
+ E I S VGT+ + ++ P GR+++ F E L+++
Sbjct: 730 FTENERCEKVLKINESSL--------LVGTSIIEQDKLVPVNGRLILLEFEKELQSLKVV 781
Query: 503 AEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTR 562
+ AV L +N + + A Q++ + K + + S +L L V+
Sbjct: 782 SSMVLSAAVIDLGVYNDRYIVAFGQQVAIVKLT---EERLMIDSRISLGSIVLQLIVE-- 836
Query: 563 GDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTV 622
G+ I + D + +++ + ++ + R + N + A + + +Y+ A N+ L +
Sbjct: 837 GNEIAIADSIGRFTIMYFDGQKFIVVARYL-FGENIVKAA-LYEGTVYIIATNSGLLKLL 894
Query: 623 RKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI--PTVIFGTVNGV 680
R N + +R E V YHL + V++F++ P ++ P ++F T G
Sbjct: 895 RYNKDAKNFNDRFICESV--YHLHDKVSKFQN------FPITNTNSFLEPKMLFATEIGA 946
Query: 681 IGVIASLPHEQYLFLEKLQTNLRKV-IKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIE 739
IG I SL ++ L LE+L +RK+ + +++E E + F+DGDL+
Sbjct: 947 IGSIVSLKDKE-LELEELTRKIRKLKFSYLSSMDYESI-----EADLISPVPFIDGDLV- 999
Query: 740 SFLDLSRTRMDEISKTMNVSVEELCKRVE 768
+D+ R E+ + LC+ VE
Sbjct: 1000 --IDVKRWASSELFR--------LCRSVE 1018
>sp|Q9EPU4|CPSF1_MOUSE Cleavage and polyadenylation specificity factor subunit 1 OS=Mus
musculus GN=Cpsf1 PE=1 SV=1
Length = 1441
Score = 100 bits (250), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 112/466 (24%), Positives = 193/466 (41%), Gaps = 74/466 (15%)
Query: 356 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 410
+ PF++ P L ++GEL I + +R IPL + + +S
Sbjct: 970 IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1029
Query: 411 RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 445
+ +A+ + N C + E ++L+ ++E I +
Sbjct: 1030 KVYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPVSWEAIPNAR 1089
Query: 446 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 495
L+ +E+ + + S + V Y GT + EE +GRIL+ + E
Sbjct: 1090 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1148
Query: 496 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 549
G K +++ EKE KG V +L NG L++AI QKI L W LR EL
Sbjct: 1149 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1203
Query: 550 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 609
+ + + +FI+ D+MKSISLL Y+ E + +RD + +V+ + D+
Sbjct: 1204 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1263
Query: 610 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 666
LG ++ + NL E RL ++H+G VN F R P
Sbjct: 1264 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1317
Query: 667 GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 715
+ P+ F T++G IG++ + + Y L LQ L ++ GLN
Sbjct: 1318 AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1377
Query: 716 QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 759
+R + +++ + +N LDG+L+ +L LS E++K + +
Sbjct: 1378 AFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1423
Score = 41.2 bits (95), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 73/172 (42%), Gaps = 36/172 (20%)
Query: 26 EVLERYVNLGPIVDFCVVD---LERQGQG------QVVTCSGAYKDGSLRIVRNGIGINE 76
EV + +N+GP + V + L + Q ++V CSG K+G+L +++ I
Sbjct: 463 EVCDSMLNIGPCANAAVGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQV 522
Query: 77 QASVELQGIKGMWSL------------------------RSSTDDPFDTFLVVSFISETR 112
+ EL G MW++ ++ D FL++S T
Sbjct: 523 VTTFELPGCYDMWTVIAPVRKEEEETPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTM 582
Query: 113 ILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSSTSR 163
IL E+ E + GF +Q T+F + N+ +VQV+ +RL+ ++
Sbjct: 583 ILQTG--QEIMELDTSGFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEGVNQ 632
>sp|Q10570|CPSF1_HUMAN Cleavage and polyadenylation specificity factor subunit 1 OS=Homo
sapiens GN=CPSF1 PE=1 SV=2
Length = 1443
Score = 100 bits (250), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 117/474 (24%), Positives = 201/474 (42%), Gaps = 62/474 (13%)
Query: 356 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 410
V PF++ P L ++GEL I + +R IPL + + +S
Sbjct: 972 VDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1031
Query: 411 RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 445
+ +A+ + N CA + E ++L+ ++E I +
Sbjct: 1032 KVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSWEAIPNAR 1091
Query: 446 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 495
L +E+ + + S + V Y GT + EE +GRIL+ + E
Sbjct: 1092 IELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1150
Query: 496 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 549
G K +++ EKE KG V +L NG L++AI QKI L W LR EL
Sbjct: 1151 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1205
Query: 550 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 609
+ + + +FI+ D+MKSISLL Y+ E + +RD + +V+ + D+
Sbjct: 1206 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1265
Query: 610 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR----HGSLVMRLP 662
LG ++ + NL E RL ++H+G VN F G+
Sbjct: 1266 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSK 1325
Query: 663 DSDVGQIPTVI-FGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 721
S V + + F T++G IG++ + + Y L LQ L ++ GLN +R +
Sbjct: 1326 KSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLH 1385
Query: 722 NEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 773
+++T+ +N LDG+L+ +L LS E++K + + + + + E R+
Sbjct: 1386 VDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILDDLLETDRV 1439
Score = 40.8 bits (94), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 74/173 (42%), Gaps = 37/173 (21%)
Query: 26 EVLERYVNLGPIVDFCVVD---LERQGQG------QVVTCSGAYKDGSLRIVRNGIGINE 76
EV + +N+GP + V + L + Q ++V CSG K+G+L +++ I
Sbjct: 464 EVCDSILNIGPCANAAVGEPAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQV 523
Query: 77 QASVELQGIKGMWSL-----RSSTDDP--------------------FDTFLVVSFISET 111
+ EL G MW++ + D+P FL++S T
Sbjct: 524 VTTFELPGCYDMWTVIAPVRKEEEDNPKGEGTEQEPSTTPEADDDGRRHGFLILSREDST 583
Query: 112 RILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSSTSR 163
IL E+ E + GF +Q T+F + N+ +VQV+ +RL+ ++
Sbjct: 584 MILQTG--QEIMELDTSGFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEGVNQ 634
>sp|Q10569|CPSF1_BOVIN Cleavage and polyadenylation specificity factor subunit 1 OS=Bos
taurus GN=CPSF1 PE=1 SV=1
Length = 1444
Score = 98.6 bits (244), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 111/466 (23%), Positives = 193/466 (41%), Gaps = 74/466 (15%)
Query: 356 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 410
+ PF++ P L ++GEL I + +R IPL + + +S
Sbjct: 973 IDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1032
Query: 411 RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 445
+ +A+ + + C + E ++L+ ++E I +
Sbjct: 1033 KVYAVATSTSTPCTRVPRMTGEEKEFETIERDERYVHPQQEAFCIQLISPVSWEAIPNAR 1092
Query: 446 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 495
L+ +E+ + + S + V Y GT + EE +GRIL+ + E
Sbjct: 1093 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1151
Query: 496 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 549
G K +++ EKE KG V +L NG L++AI QKI L W LR EL
Sbjct: 1152 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1206
Query: 550 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 609
+ + + +FI+ D+MKSISLL Y+ E + +RD + +V+ + D+
Sbjct: 1207 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1266
Query: 610 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 666
LG ++ + NL E RL ++H+G VN F R P
Sbjct: 1267 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1320
Query: 667 GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 715
+ P+ F T++G IG++ + + Y L LQ L ++ GLN
Sbjct: 1321 AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1380
Query: 716 QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 759
+R + +++ + +N LDG+L+ +L LS E++K + +
Sbjct: 1381 AFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTT 1426
Score = 38.5 bits (88), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 40/172 (23%), Positives = 72/172 (41%), Gaps = 36/172 (20%)
Query: 26 EVLERYVNLGPIVDFCVVD---LERQGQG------QVVTCSGAYKDGSLRIVRNGIGINE 76
EV + +N+GP + + + L + Q ++V CSG K+G+L +++ I
Sbjct: 466 EVCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQV 525
Query: 77 QASVELQGIKGMWSL------------------------RSSTDDPFDTFLVVSFISETR 112
+ EL G MW++ + D FL++S T
Sbjct: 526 VTTFELPGCYDMWTVIAPVRKEQEETLKGEGTEPEPGAPEAEDDGRRHGFLILSREDSTM 585
Query: 113 ILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSSTSR 163
IL E+ E + GF +Q T+F + N+ +VQV+ +RL+ ++
Sbjct: 586 ILQTG--QEIMELDASGFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEGVNQ 635
>sp|Q5A7S5|RSE1_CANAL Pre-mRNA-splicing factor RSE1 OS=Candida albicans (strain SC5314 /
ATCC MYA-2876) GN=RSE1 PE=3 SV=1
Length = 1219
Score = 95.5 bits (236), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 161/755 (21%), Positives = 313/755 (41%), Gaps = 98/755 (12%)
Query: 88 MWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQL 147
+++ + S + D +LV+S ++ L +++ + +E+ E F T+ +
Sbjct: 485 IFTTKLSLESANDEYLVISSSLSSKTLVLSIGEVVEDVEDSEFVLDQPTIAVQQVGIASV 544
Query: 148 VQVTSGSVRLVSSTSRELRN-EWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGIL 206
VQ+ S ++ V + + + +W P G ++ AT N QVL+A +VY EI D
Sbjct: 545 VQIYSNGIKHVRTVNGNKKTTDWFPPAGITITHATTNNQQVLIALSNLSVVYFEI-DATD 603
Query: 207 TEVKHAQLEYEI-SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL--- 262
++ Q EI + + I EN S A+ +D ++++ SL + N + + L
Sbjct: 604 DQLIEYQDRLEIATTITAMAIQENISEKSPFAIIGCSDETIQVVSLQEHNCLEIKSLQAL 663
Query: 263 -GGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF 321
+ + E +++ + +G ++ G L++ + +G++P++L
Sbjct: 664 SANSSSLKMLKSSGKE--THVHIGMENGVYARIKIDTINGNLSNSRVKYIGSKPVSLSVI 721
Query: 322 SSKNTTH-VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFP-DSLAIAKEGEL 379
N + A S P + Y + + ++++ F S + + K+ L
Sbjct: 722 KFSNEIEGILAISSAPWISYLYRDSFKITPLLEIDITNGSSFISEDIGGEGIVGIKDNNL 781
Query: 380 TIGTI-------DDIQKLHIRSIPLGEHPRRICHQEQSRTFAI-----------CSLKNQ 421
I ++ D Q L I + L PR++ +R F C++
Sbjct: 782 IIFSVGKEDSVFDPSQDLTIATTKLRYTPRKMI-TNGNRLFISESEYNVQGPFKCNINGD 840
Query: 422 SCAEESEMHF---------------VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDS 466
E ++ ++++D ++ + I + LD E S+ + SF+ S
Sbjct: 841 VKENVDEDYYEAFGYEWKQNSWASCIQVVDSKSNQVIQSLQLDGNESIVSMSAVSFNKTS 900
Query: 467 N-----VYYCVGTAY---VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN 518
+ VG +LP N K + F + LQL+ + E L F
Sbjct: 901 TPSVPASHLVVGVCTNQTILP--NSYDKSYLYTFKIGKKHLQLVHKTELDHIPQVLENFQ 958
Query: 519 GKLLAAINQKIQLYKWMLRDDGTRELQSEC----GHHGHILALYVQTRGDFIVVGDLMKS 574
KLL A I+LY D G ++L + +I + QT + I++ D KS
Sbjct: 959 DKLLVASGNHIRLY-----DIGQKQLLKKSTTIIDFSTNINKIIPQT--NRIIICDSHKS 1011
Query: 575 ISLLIYKHEEGAIE--ERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--------- 623
S++ K +E + A D ++++ LD D +G + N+F R
Sbjct: 1012 -SIVFAKFDESQNQFVPFADDVMKRQITSIMNLDIDTLIGGDKFGNIFVTRIDEDISKQA 1070
Query: 624 -------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT 676
K +G + +L+ + E+H+G+ + F G L ++ +VI+
Sbjct: 1071 DDDWTILKTQDGILNSCPYKLQNLIEFHIGDIITSFNLGCL-------NLAGTESVIYTG 1123
Query: 677 VNGVIGVIASLPHEQYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDG 735
+ G IG++ L + + L LQ +++ + G +H + RS+ N KN +DG
Sbjct: 1124 LQGTIGLLIPLVSKSEVELLFNLQLYMQQSQNNLVGKDHLKLRSYYNP-----IKNVIDG 1178
Query: 736 DLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 770
DL+E FL+ + EIS+ +N SV ++ K++ +L
Sbjct: 1179 DLLERFLEFDISLKIEISRKLNKSVNDIEKKLIDL 1213
>sp|Q7XWP1|CPSF1_ORYSJ Probable cleavage and polyadenylation specificity factor subunit 1
OS=Oryza sativa subsp. japonica GN=Os04g0252200 PE=3 SV=2
Length = 1441
Score = 89.7 bits (221), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 87/353 (24%), Positives = 159/353 (45%), Gaps = 33/353 (9%)
Query: 440 FEFISTYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVF--I 493
+E ST P+ FE ++ L + + ++ +GTAYVL E+ +GR+L+F
Sbjct: 1095 WETKSTIPMQLFENALTVRIVTLHNTTTKENETLLAIGTAYVL-GEDVAARGRVLLFSFT 1153
Query: 494 VEDGKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGH 550
+ L+ E KE+KGAV ++ + G LL A KI L KW EL + +
Sbjct: 1154 KSENSQNLVTEVYSKESKGAVSAVASLQGHLLIASGPKITLNKWT-----GAELTAVAFY 1208
Query: 551 HGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIY 610
+ + + +F++ GD+ KSI L +K + + A+D+ + A E L D
Sbjct: 1209 DAPLHVVSLNIVKNFVLFGDIHKSIYFLSWKEQGSQLSLLAKDFGSLDCFATEFLIDGST 1268
Query: 611 LG-----AENNFNLFTVRKNSEGATDEERG-RLEVVGEYHLGEFVNRFRHGSLVMRLPDS 664
L ++ N +F + + +G +L E+H+G + +F + LP
Sbjct: 1269 LSLVASDSDKNVQIFYY---APKMVESWKGQKLLSRAEFHVGAHITKFLR---LQMLPTQ 1322
Query: 665 DVGQIPT----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF 720
+ T ++FG ++G IG IA + + L+ LQ L + V GLN +R F
Sbjct: 1323 GLSSEKTNRFALLFGNLDGGIGCIAPIDELTFRRLQSLQRKLVDAVPHVCGLNPRSFRQF 1382
Query: 721 --NNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELT 771
N + N +D +L+ S+ LS ++++ + + ++ +++
Sbjct: 1383 HSNGKGHRPGPDNIIDFELLCSYEMLSLDEQLDVAQQIGTTRSQILSNFSDIS 1435
Score = 48.5 bits (114), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/141 (26%), Positives = 69/141 (48%), Gaps = 21/141 (14%)
Query: 15 NLQPDAKGSYVEVLERYVNLGPIVDFC----------VVDLERQGQGQVVTCSGAYKDGS 64
+L+ K SY+ V + +N+GP+ DF + +Q ++V CSG K+GS
Sbjct: 499 SLESAQKISYI-VRDALINVGPLKDFSYGLRANADPNAMGNAKQSNYELVCCSGHGKNGS 557
Query: 65 LRIVRNGIGINEQASVELQGIKGMWSL-------RSSTDDPFDTFLVVSFISETRILAMN 117
L +++ I + VEL +G+W++ + + D+ + +L++S E R + +
Sbjct: 558 LSVLQQSIRPDLITEVELPSCRGIWTVYYKSYRGQMAEDNEYHAYLIISL--ENRTMVLE 615
Query: 118 LEDELEE-TEIEGFCSQTQTL 137
D+L E TE + Q T+
Sbjct: 616 TGDDLGEVTETVDYFVQASTI 636
>sp|Q9V726|CPSF1_DROME Cleavage and polyadenylation specificity factor subunit 1
OS=Drosophila melanogaster GN=Cpsf160 PE=1 SV=1
Length = 1455
Score = 87.8 bits (216), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 108/436 (24%), Positives = 198/436 (45%), Gaps = 64/436 (14%)
Query: 392 IRSIPLGEHPRRICHQEQSRTFAICSL-------------KNQSCAEESE-MHFVR---- 433
+R +PL PR++ + ++R + + + +++ +EES F+
Sbjct: 1026 VRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKYYRFNGEDKELSEESRGERFIYPIGS 1085
Query: 434 -----LLDDQTFEFISTYPLDTFE-----YGCSILSCSFSDDSN---VYYCVGTAYVLPE 480
L+ +T+E + + TFE I+ S+ + Y C+GT +
Sbjct: 1086 QFEMVLISPETWEIVPDASI-TFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNY-S 1143
Query: 481 ENEPTKGRILVF-----IVEDGK------LQLIAEKETKGAVYSLNAFNGKLLAAINQKI 529
E+ ++G I ++ + E GK ++ I +KE KG V +++ G L+ + QKI
Sbjct: 1144 EDITSRGNIHIYDIIEVVPEPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKI 1203
Query: 530 QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 589
Y W LRD +L +I + T I + D+ KSISLL ++ E +
Sbjct: 1204 --YIWQLRDG---DLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSL 1258
Query: 590 RARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDEERGRLEVVGEYH 644
+RD+N + +E + D+ LG AE N ++ + + + ++ L +YH
Sbjct: 1259 ASRDFNPLEVYGIEFMVDNSNLGFLVTDAERNIIVYMYQPEARESLGGQK--LLRKADYH 1316
Query: 645 LGEFVNR-FR----HGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQ 699
LG+ VN FR L R P + V++GT++G +G LP + Y LQ
Sbjct: 1317 LGQVVNTMFRVQCHQKGLHQRQPFLYENK-HFVVYGTLDGALGYCLPLPEKVYRRFLMLQ 1375
Query: 700 TNLRKVIKGVGGLNHEQWRSFNNEKK--TVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 757
L + + GLN +++R+ + KK ++ +DGDLI S+ ++ + +E++K +
Sbjct: 1376 NVLLSYQEHLCGLNPKEYRTLKSSKKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKIG 1435
Query: 758 VSVEELCKRVEELTRL 773
EE+ + E+ RL
Sbjct: 1436 TRTEEILGDLLEIERL 1451
Score = 43.1 bits (100), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 44/159 (27%), Positives = 72/159 (45%), Gaps = 26/159 (16%)
Query: 26 EVLERYVNLGPIVDFCV---VDLERQG-------------QGQVVTCSGAYKDGSLRIVR 69
EV + +N+ PI C V+ E G + ++V +G K+G+L +
Sbjct: 485 EVCDSLMNVAPINYMCAGERVEFEEDGVTLRPHAESLQDLKIELVAATGHSKNGALSVFV 544
Query: 70 NGIGINEQASVELQGIKGMWSL------RSSTDDPFDTFLVVSFISETRILAMNLEDELE 123
N I S EL G +W++ +SS +D D F+++S + T +L E+
Sbjct: 545 NCINPQIITSFELDGCLDVWTVFDDATKKSSRNDQHD-FMLLSQRNSTLVLQTG--QEIN 601
Query: 124 ETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSST 161
E E GF T+F + + +VQVT+ VRL+ T
Sbjct: 602 EIENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGT 640
>sp|A8XPU7|CPSF1_CAEBR Probable cleavage and polyadenylation specificity factor subunit 1
OS=Caenorhabditis briggsae GN=cpsf-1 PE=3 SV=1
Length = 1454
Score = 87.0 bits (214), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 84/315 (26%), Positives = 146/315 (46%), Gaps = 35/315 (11%)
Query: 477 VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 536
V+PE +PT R K++++ +KE KG V L A NG LL+ + QK+ + W
Sbjct: 1153 VVPEPGQPTSNR---------KIKVLYDKEQKGPVTGLCAINGLLLSGMGQKV--FIWQF 1201
Query: 537 RDDGTRELQSECGHHGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYN 595
+D+ + S H ++ L+ ++T + D +S+SL+ ++ E A+ +RD
Sbjct: 1202 KDNDLMGI-SFLDMHYYVYQLHSIRT---IALALDARESMSLIRFQEENKAMSIASRDDR 1257
Query: 596 --ANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEF 648
A A E L D +++G N LF+ + + E RL V ++G
Sbjct: 1258 KCAQAPMASEFLVDGMHIGFLLSDEHGNITLFSYSPEAPESNGGE--RLTVKAAINIGTN 1315
Query: 649 VNRFRHGSLVMRLPDS-------DVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTN 701
+N F L DS ++ Q IFG+++G G I L + Y L LQT
Sbjct: 1316 INAFLRVKGHTSLLDSSSPEERENIEQRMNTIFGSLDGSFGYIRPLTEKSYRRLHFLQTF 1375
Query: 702 LRKVIKGVGGLNHEQWRSFNNEKKTV---DAKNFLDGDLIESFLDLSRTRMDEISKTMNV 758
+ V + GL+ + RS + V +A+N +DGD++E +L LS ++++ + V
Sbjct: 1376 IGSVTPQIAGLHIKGARSSKPSQPIVNGRNARNLIDGDVVEQYLHLSVYDKTDLARRLGV 1435
Query: 759 SVEELCKRVEELTRL 773
+ + +L R+
Sbjct: 1436 GRYHILDDLMQLRRM 1450
>sp|Q5BDG7|CFT1_EMENI Protein cft1 OS=Emericella nidulans (strain FGSC A4 / ATCC 38163 /
CBS 112.46 / NRRL 194 / M139) GN=cft1 PE=3 SV=1
Length = 1339
Score = 86.3 bits (212), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 96/438 (21%), Positives = 192/438 (43%), Gaps = 70/438 (15%)
Query: 392 IRSIPLGEHPRRICHQEQSRTFAI--CSLKNQSCAEESEMH-----------------FV 432
+R++P+G+ ++ + S T+ + C E+ E+H +
Sbjct: 907 MRTVPIGQQIDKLTYVSASDTYVLGTCQRCEFRLPEDDELHPEWRNEEISFLPEVNQSSL 966
Query: 433 RLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY-----YCVGTAYVLPEENEPTKG 487
+++ +T+ I +YPL+ E+ + + S N + VGT+ E+ P++G
Sbjct: 967 KVVSPKTWSVIDSYPLEPAEHIMVMKTMSLEVSENTHERRDMIVVGTSLAR-GEDIPSRG 1025
Query: 488 RILVF----IVEDG-------KLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYKW 534
I VF +V D +L+LI ++ KGAV +L+ G+ L+AA QK +
Sbjct: 1026 CIYVFEVIEVVPDPEQPETNRRLKLIGKEPVKGAVTALSEIGGQGFLIAAQGQKSMVRG- 1084
Query: 535 MLRDDGT----RELQSECGHHGHILALYVQTRGD-FIVVGDLMKSISLLIYKHEEGAIEE 589
L++DG+ + +C +++ + +G + GD +K + Y E +
Sbjct: 1085 -LKEDGSLLPVAFMDMQC-----FVSVIKELKGTGMCIFGDAVKGLWFAGYSEEPYKMSL 1138
Query: 590 RARDYNANWMSAVEILDDD---IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLG 646
A+D + + A + L D + A+++ NL+ ++ + E +L ++H G
Sbjct: 1139 FAKDLDYLEVLAADFLPDGNKLFIVVADSDCNLYVLQYDPEDPNSSNGDKLLNRSKFHTG 1198
Query: 647 EFVN-------------RFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYL 693
F + R GS M + + + V+ + NG IG++ +P E Y
Sbjct: 1199 NFASTVTLLPRTLVSSERAMSGSDKMDI--DNTAPLHQVLVTSHNGSIGLVTCVPEESYR 1256
Query: 694 FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEIS 753
L LQ+ L ++ GLN +R+ ++ + LD +L+ +LD+S+ R EI+
Sbjct: 1257 RLSALQSQLTNTLEHPCGLNPRAYRAVESDASA--GRGMLDSNLLLQYLDMSKQRKAEIA 1314
Query: 754 KTMNVSVEELCKRVEELT 771
+ + E+ +E ++
Sbjct: 1315 GRVGATEWEIRADLEAIS 1332
>sp|A1DB13|CFT1_NEOFI Protein cft1 OS=Neosartorya fischeri (strain ATCC 1020 / DSM 3700 /
FGSC A1164 / NRRL 181) GN=cft1 PE=3 SV=1
Length = 1400
Score = 84.0 bits (206), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 88/371 (23%), Positives = 160/371 (43%), Gaps = 37/371 (9%)
Query: 432 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY-----YCVGTAYVLPEENEPTK 486
++++ +T+ I +Y L EY ++ + N + VGTA+ E+ P++
Sbjct: 1027 LKVVSPRTWTVIDSYSLGPAEYVMAVKNMDLEVSENTHERRNMIVVGTAFAW-GEDIPSR 1085
Query: 487 GRILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYK 533
G I VF V D KL+LI ++ KGAV +L+ G+ L+AA QK +
Sbjct: 1086 GCIYVFEVIKVVPDPEKPETDRKLKLIGKELVKGAVTALSQIGGQGFLIAAQGQKCMVRG 1145
Query: 534 WMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 593
L++DG+ + ++ + ++GD +K + Y E + +D
Sbjct: 1146 --LKEDGSLLPVAFMDMQCYVNVVKELKGTGMCIMGDAVKGLWFAGYSEEPYKMSLFGKD 1203
Query: 594 YNANWMSAVEILDDD---IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 650
+ A E L D L A+++ NL ++ + E RL ++H+G F
Sbjct: 1204 QGYLEVVAAEFLPDGDKLFILVADSDCNLHVLQYDPEDPKSSNGDRLLARSKFHMGHFAT 1263
Query: 651 RFRHGSLVM-----RLPDSDVGQIPT------VIFGTVNGVIGVIASLPHEQYLFLEKLQ 699
M + D D +I + V+ + +G +G++ S+P E Y L LQ
Sbjct: 1264 TMTLLPRTMVSSEKAMADPDSMEIDSQTISQQVLITSQSGSVGIVTSVPEESYRRLSALQ 1323
Query: 700 TNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS 759
+ L ++ GLN +R+ E + LDG+L+ +LD+ + R EI+ +
Sbjct: 1324 SQLTNSLEHPCGLNPRAYRAV--ESDGTAGRGMLDGNLLYQWLDMGQHRKMEIAARVGAH 1381
Query: 760 VEELCKRVEEL 770
E+ +E +
Sbjct: 1382 EWEIKADLEAI 1392
>sp|Q9N4C2|CPSF1_CAEEL Probable cleavage and polyadenylation specificity factor subunit 1
OS=Caenorhabditis elegans GN=cpsf-1 PE=3 SV=2
Length = 1454
Score = 83.6 bits (205), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 76/312 (24%), Positives = 146/312 (46%), Gaps = 29/312 (9%)
Query: 477 VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 536
V+PE ++PT R K++++ +KE KG V L A NG LL + QK+ + W
Sbjct: 1153 VVPEPDQPTSNR---------KIKVLFDKEQKGPVTGLCAINGLLLCGMGQKV--FIWQF 1201
Query: 537 RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYN- 595
+D+ + S H ++ L+ + + D +S+SL+ ++ + A+ +RD
Sbjct: 1202 KDNDLMGI-SFLDMHYYVYQLH--SLRTIAIACDARESMSLIRFQEDNKAMSIASRDDRK 1258
Query: 596 -ANWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN- 650
A A +++ D ++G ++ N+ E RL V ++G +N
Sbjct: 1259 CAQPPMASQLVVDGAHVGFLLSDETGNITMFNYAPEAPESNGGERLTVRAAINIGTNINA 1318
Query: 651 --RFRHGSLVMRLPDSD----VGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 704
R R + +++L + D + Q T +F +++G G + L + Y L LQT +
Sbjct: 1319 FVRLRGHTSLLQLNNEDEKEAIEQRMTTVFASLDGSFGFVRPLTEKSYRRLHFLQTFIGS 1378
Query: 705 VIKGVGGLNHEQWRSFNNEKKTV---DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVE 761
V + GL+ + RS + V +A+N +DGD++E +L LS ++++ + V
Sbjct: 1379 VTPQIAGLHIKGSRSAKPSQPIVNGRNARNLIDGDVVEQYLHLSLYDKTDLARRLGVGRY 1438
Query: 762 ELCKRVEELTRL 773
+ + +L R+
Sbjct: 1439 HIIDDLMQLRRM 1450
>sp|Q4WCL1|CFT1_ASPFU Protein cft1 OS=Neosartorya fumigata (strain ATCC MYA-4609 / Af293 /
CBS 101355 / FGSC A1100) GN=cft1 PE=3 SV=2
Length = 1401
Score = 83.2 bits (204), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 91/371 (24%), Positives = 159/371 (42%), Gaps = 37/371 (9%)
Query: 432 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY-----YCVGTAYVLPEENEPTK 486
++++ +T+ I +Y L EY ++ + N + VGTA+ E+ P++
Sbjct: 1028 LKVVSPRTWTVIDSYSLGPDEYVMAVKNMDLEVSENTHERRNMIVVGTAFAR-GEDIPSR 1086
Query: 487 GRILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYK 533
G I VF V D KL+LI ++ KGAV +L+ G+ L+AA QK +
Sbjct: 1087 GCIYVFEVIKVVPDPEKPETDRKLKLIGKELVKGAVTALSQIGGQGFLIAAQGQKCMVRG 1146
Query: 534 WMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 593
L++DG+ + ++ L ++GD +K + Y E + +D
Sbjct: 1147 --LKEDGSLLPVAFMDMQCYVNVLKELKGTGMCIMGDAVKGLWFAGYSEEPYKMSLFGKD 1204
Query: 594 YNANWMSAVEILDDD---IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 650
+ A E L D L A+++ NL ++ + E RL ++H+G F
Sbjct: 1205 QGYLEVVAAEFLPDGDKLFILVADSDCNLHVLQYDPEDPKSSNGDRLLARSKFHMGHFAT 1264
Query: 651 RFR-------HGSLVMRLPDS---DVGQIPT-VIFGTVNGVIGVIASLPHEQYLFLEKLQ 699
M PDS D I V+ + +G +G++ S+P E Y L LQ
Sbjct: 1265 TMTLLPRTMVSSEKAMANPDSMEIDSQTISQQVLITSQSGSVGIVTSVPEESYRRLSALQ 1324
Query: 700 TNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS 759
+ L ++ GLN +R+ E + LDG+L+ +LD+ + R EI+ +
Sbjct: 1325 SQLANSLEHPCGLNPRAYRAV--ESDGTAGRGMLDGNLLYQWLDMGQHRKMEIAARVGAH 1382
Query: 760 VEELCKRVEEL 770
E+ +E +
Sbjct: 1383 EWEIKADLEAI 1393
>sp|A1C3U1|CFT1_ASPCL Protein cft1 OS=Aspergillus clavatus (strain ATCC 1007 / CBS 513.65 /
DSM 816 / NCTC 3887 / NRRL 1) GN=cft1 PE=3 SV=1
Length = 1401
Score = 83.2 bits (204), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 102/436 (23%), Positives = 181/436 (41%), Gaps = 57/436 (13%)
Query: 369 DSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ--SCAEE 426
DS + + +L TI D ++ + +GEH + + S T+ + + + E+
Sbjct: 947 DSKDVVRICQLPPETIYDYS-WTLKKVAIGEHVDHLAYSISSETYVLGTSHSADFKLPED 1005
Query: 427 SEMH-----------------FVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 469
E+H ++++ +T+ I +Y L E ++ + + N +
Sbjct: 1006 DELHPEWRNEAISFLPELRQCCLKVVHPKTWTVIDSYTLGPDEEIMAVKNMNLEVSENTH 1065
Query: 470 -----YCVGTAYVLPEENEPTKGRILVFIV-----------EDGKLQLIAEKETKGAVYS 513
VGTA E+ P +G I VF V D KL+LI ++ KGAV +
Sbjct: 1066 ERKNMIVVGTALAR-GEDIPARGCIYVFEVIKVVPDPEKPETDRKLKLIGKELVKGAVTA 1124
Query: 514 LNAFNGK--LLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDL 571
L+ G+ L+AA QK + L++DG+ + ++ L +VGD
Sbjct: 1125 LSEIGGQGFLIAAQGQKCMVRG--LKEDGSLLPVAFMDVQCYVNVLKELKGTGMCIVGDA 1182
Query: 572 MKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDD---IYLGAENNFNLFTVRKNSEG 628
K I Y E + +D + A + L D L A+++ NL ++ E
Sbjct: 1183 FKGIWFAGYSEEPYKMSLFGKDLEYPEVVAADFLPDGDKLFILVADSDCNLHVLQYEPED 1242
Query: 629 ATDEERGRLEVVGEYHLGEFVNRFR-----HGSLVMRLPDSDVGQIPT------VIFGTV 677
+L V ++H+G F + S + DSD ++ V+ +
Sbjct: 1243 PMSSNGDKLLVRSKFHMGHFTSTLTLLPRTTASYEIPSADSDSMEVDPRITPQQVLITSQ 1302
Query: 678 NGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDL 737
+G IG++ S+P E Y L LQ+ L ++ GLN +R+ E + LDG+L
Sbjct: 1303 SGSIGIVTSIPEESYRRLSALQSQLANTVEHPCGLNPRAYRAI--ESDGTAGRGMLDGNL 1360
Query: 738 IESFLDLSRTRMDEIS 753
+ +L +S+ R EI+
Sbjct: 1361 LYQWLSMSKQRRMEIA 1376
>sp|A2R919|CFT1_ASPNC Protein cft1 OS=Aspergillus niger (strain CBS 513.88 / FGSC A1513)
GN=cft1 PE=3 SV=1
Length = 1383
Score = 82.8 bits (203), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 97/438 (22%), Positives = 179/438 (40%), Gaps = 61/438 (13%)
Query: 392 IRSIPLGEHPRRICHQEQSRTFAI--CSLKNQSCAEESEMH-----------------FV 432
++ + LGE + + S + + C + E+ E+H F+
Sbjct: 942 LKRVHLGEQVDHLAYSTSSGMYVLGTCHATDFKLPEDDELHPEWRNEAISFFPSARGSFI 1001
Query: 433 RLLDDQTFE---------FISTYPLDTFEYGCSILSCSFSDDSNVY-----YCVGTAYVL 478
+L+ D + + ++ L EY +I + S N + VGTA+
Sbjct: 1002 KLVWDHHLQRQDSVILIFHLHSFSLGADEYVMAIKNISLEVSENTHERKDMIVVGTAFAR 1061
Query: 479 PEENEPTKGRILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQ 527
E+ P++G I VF V D KL+LI ++ KGAV +L+ G+ + Q
Sbjct: 1062 -GEDIPSRGCIYVFEVVQVVPDPDHPETDRKLKLIGKEPVKGAVTALSEIGGQGFVLVAQ 1120
Query: 528 KIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAI 587
+ L++DG+ + ++ + ++GD +K + Y E +
Sbjct: 1121 GQKCMVRGLKEDGSLLPVAFMDMQCYVSVVKELKGTGMCILGDAVKGVWFAGYSEEPYKM 1180
Query: 588 EERARDYNANWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYH 644
A+D + + A E L D L A+++ N+ ++ + E RL ++H
Sbjct: 1181 SLFAKDLDYLEVCAAEFLPDGKRLFIVVADSDCNIHVLQYDPEDPKSSNGDRLLSRSKFH 1240
Query: 645 LGEFVNRFRHGSLVM----RLPDSDVG-----QIP--TVIFGTVNGVIGVIASLPHEQYL 693
+G F + M ++ S G Q P V+ T NG +G+I +P E Y
Sbjct: 1241 MGNFASTLTLLPRTMVSSEKMVSSSDGMDIDNQSPLHQVLMTTQNGSLGLITCIPEESYR 1300
Query: 694 FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEIS 753
L LQ+ L ++ GLN +R+ E + LDG+L+ ++D+S+ R EI+
Sbjct: 1301 RLSALQSQLTNTLEHPCGLNPRAFRAV--ESDGTAGRGMLDGNLLFKWIDMSKQRKTEIA 1358
Query: 754 KTMNVSVEELCKRVEELT 771
+ E+ +E ++
Sbjct: 1359 GRVGAREWEIKADLEAIS 1376
>sp|Q6BYK1|RSE1_DEBHA Pre-mRNA-splicing factor RSE1 OS=Debaryomyces hansenii (strain ATCC
36239 / CBS 767 / JCM 1990 / NBRC 0083 / IGC 2968)
GN=RSE1 PE=3 SV=2
Length = 1256
Score = 79.7 bits (195), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 83/306 (27%), Positives = 133/306 (43%), Gaps = 52/306 (16%)
Query: 499 LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTREL----QSECGHHGHI 554
L+ + + E ++ FNG+LL ++ ++LY D G R+L S + +I
Sbjct: 963 LEFVHKTELDYQPTAIIPFNGRLLVGMSNFLRLY-----DLGQRQLLRKASSNIEYLKNI 1017
Query: 555 LALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAE 614
+ L Q G IVVGD S + + Y E A D ++A+ LD D +G +
Sbjct: 1018 IRLTHQG-GSRIVVGDSSMSTTFVKYDSTENQFIPFADDIMKRQITALVTLDYDTIIGGD 1076
Query: 615 NNFNLFTVR----------------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLV 658
N+F R + E + RL+ + E++L + F GSLV
Sbjct: 1077 KFGNIFVSRVPETISQQSDKDWSLLRYQESYLNGSGSRLKNICEFYLQDIPTSFTKGSLV 1136
Query: 659 MRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYL-FLEKLQTNLRKVI--------KGV 709
M G ++I+ + G +G++ L E + FL LQ LRK K
Sbjct: 1137 M-------GGKESIIYTGIQGTLGLLLPLSTENEVKFLGDLQLLLRKYFDYNFDDFDKDK 1189
Query: 710 GGLN-----HEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELC 764
G N H ++RS+ N KN +DGDLIE F +LS++ I +N + E+
Sbjct: 1190 NGYNLLGKDHLKFRSYYNP-----VKNVMDGDLIERFYELSQSMKIRIGTELNRTPREIE 1244
Query: 765 KRVEEL 770
K++ E+
Sbjct: 1245 KKISEM 1250
Score = 45.4 bits (106), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 79/341 (23%), Positives = 142/341 (41%), Gaps = 34/341 (9%)
Query: 25 VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGS-LRIVRNGIGINEQASVELQ 83
V+++E L PI D +++ R A S L+ + +GI N S L
Sbjct: 409 VDIME---TLNPITDGALIETLRPEVPDPFKQLTALSSHSYLKTLTHGISTNTVVSSPLP 465
Query: 84 GIK--GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHD 141
IK + + R + D +LV+S ++ L +++ + +EE F + T+
Sbjct: 466 -IKPTAIHTTRIFAESANDEYLVISSTLSSQTLVLSIGEVVEEVNDSQFVTNEPTINVQQ 524
Query: 142 AIYNQLVQVTSGSVRLVSSTSR-----ELRNEWKSPPGYSVNVATANASQVLLATGGGHL 196
+ +VQ+ S +R + T R + +W P G S+ A+ N QV++ +
Sbjct: 525 VGKSSVVQIYSNGIRHIKHTMRNDTIEKKYTDWYPPAGISIIQASTNNEQVIIGLSNREI 584
Query: 197 VYLEIG--DGILTEVKHAQLEYE--------ISCLDINPIGENPSYSQIAAVGMWTDISV 246
Y EI D L E + +LE IS I+ + SY A VG +D ++
Sbjct: 585 CYFEIDPHDDQLVEYQE-RLEMSGGSISALAISSSSISKLQRKSSY---AIVG-CSDETI 639
Query: 247 RIFSLPD---LNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGEL 303
+ SL L ++T + L S+ + + + + +G + ++ TG+L
Sbjct: 640 QAISLKPHNCLEIVTLQALSAN--SSSIAMVPHGYSTSVHIGMENGLYVRVTIDEITGKL 697
Query: 304 TDRKKVSLGTQPITLRTFSSKNTTH--VFAASDRPTVIYSS 342
+D + LG++P+ L + A S RP + Y S
Sbjct: 698 SDTRIQFLGSKPVQLSVIGLPQLQQNGLLAISSRPWIGYYS 738
>sp|Q9FGR0|CPSF1_ARATH Cleavage and polyadenylation specificity factor subunit 1
OS=Arabidopsis thaliana GN=CPSF160 PE=1 SV=2
Length = 1442
Score = 78.2 bits (191), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 78/302 (25%), Positives = 137/302 (45%), Gaps = 24/302 (7%)
Query: 440 FEFISTYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE 495
+E + P+ T E+ ++ L + + ++ VGTAYV E+ +GR+L+F
Sbjct: 1097 WETKAKIPMQTSEHALTVRVVTLLNASTGENETLLAVGTAYV-QGEDVAARGRVLLFSFG 1155
Query: 496 ---DGKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 549
D ++ E +E KGA+ ++ + G LL + KI L+KW +GT
Sbjct: 1156 KNGDNSQNVVTEVYSRELKGAISAVASIQGHLLISSGPKIILHKW----NGTELNGVAFF 1211
Query: 550 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDD-- 607
+ + + FI++GD+ KSI L +K + + A+D+ + A E L D
Sbjct: 1212 DAPPLYVVSMNVVKSFILLGDVHKSIYFLSWKEQGSQLSLLAKDFESLDCFATEFLIDGS 1271
Query: 608 DIYLGAENNFNLFTVRKNSEGATDEERG-RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 666
+ L + V + + +G +L E+H+G V++F L +++ S
Sbjct: 1272 TLSLAVSDEQKNIQVFYYAPKMIESWKGLKLLSRAEFHVGAHVSKF----LRLQMVSSGA 1327
Query: 667 GQIP--TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEK 724
+I ++FGT++G G IA L + L+ LQ L + V GLN +R F +
Sbjct: 1328 DKINRFALLFGTLDGSFGCIAPLDEVTFRRLQSLQKKLVDAVPHVAGLNPLAFRQFRSSG 1387
Query: 725 KT 726
K
Sbjct: 1388 KA 1389
Score = 48.5 bits (114), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 62/136 (45%), Gaps = 27/136 (19%)
Query: 27 VLERYVNLGPIVDFC----------VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINE 76
V + VN+GP+ DF + +Q ++V CSG K+G+L ++R I
Sbjct: 507 VRDSLVNVGPVKDFAYGLRINADANATGVSKQSNYELVCCSGHGKNGALCVLRQSIRPEM 566
Query: 77 QASVELQGIKGMWSL--------------RSSTDDPFDTFLVVSFISETRILAMNLEDEL 122
VEL G KG+W++ ++ +D + +L++S E R + + D L
Sbjct: 567 ITEVELPGCKGIWTVYHKSSRGHNADSSKMAADEDEYHAYLIISL--EARTMVLETADLL 624
Query: 123 EE-TEIEGFCSQTQTL 137
E TE + Q +T+
Sbjct: 625 TEVTESVDYYVQGRTI 640
>sp|Q2TZ19|CFT1_ASPOR Protein cft1 OS=Aspergillus oryzae (strain ATCC 42149 / RIB 40)
GN=cft1 PE=3 SV=1
Length = 1393
Score = 70.5 bits (171), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 76/331 (22%), Positives = 139/331 (41%), Gaps = 40/331 (12%)
Query: 472 VGTAYVLPEENEPTKGRILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGK 520
VGTA+ E+ ++G + VF V D KL+L+ ++ KGAV +L+ G+
Sbjct: 1065 VGTAFARGEDIA-SRGCVYVFEVIKVVPDPKRPEMDRKLRLVGKEPVKGAVTALSEIGGQ 1123
Query: 521 LLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIY 580
+ Q + L++DG+ + H+ + ++ D +K + Y
Sbjct: 1124 GFLIVAQGQKCIVRGLKEDGSLLPVAFMDVQCHVSVVKELKGTGMCIIADAVKGLWFAGY 1183
Query: 581 KHEEGAIEERARDYNANWMSAVEILDDD---IYLGAENNFNLFTVRKNSEGATDEERGRL 637
E + A+D + + A + L D L A+++ NL ++ + E RL
Sbjct: 1184 SEEPYKMSLFAKDLDYLEVLAADFLPDGNKLFILVADSDCNLHVLQYDPEDPKSSNGDRL 1243
Query: 638 EVVGEYHLGEFVNRFRHGSLVMRLPDSDVG---------------QIP--TVIFGTVNGV 680
++H G F+ S + LP + V +IP ++ + NG
Sbjct: 1244 LSRSKFHTGNFI------STLTLLPRTSVSSEQMISDVDAMDVDIKIPRHQMLITSQNGS 1297
Query: 681 IGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIES 740
+G++ + E Y L LQ+ L I+ GLN +R+ E + LDG L+
Sbjct: 1298 VGLVTCVSEESYRRLSALQSQLTNTIEHPCGLNPRAFRAV--ESDGTAGRGMLDGKLLFQ 1355
Query: 741 FLDLSRTRMDEISKTMNVSVEELCKRVEELT 771
+LD+S+ R EI+ + + E+ E ++
Sbjct: 1356 WLDMSKQRKVEIASRVGANEWEIKADFEAIS 1386
>sp|Q1E5B0|CFT1_COCIM Protein CFT1 OS=Coccidioides immitis (strain RS) GN=CFT1 PE=3 SV=1
Length = 1387
Score = 68.2 bits (165), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 87/372 (23%), Positives = 159/372 (42%), Gaps = 37/372 (9%)
Query: 432 VRLLDDQTFEFISTYPLDTFEYGCSILSCSF-----SDDSNVYYCVGTAYVLPEENEPTK 486
++LL +T+ + +Y L E + + + + + VGTA V E+ P +
Sbjct: 1015 IKLLSPRTWSVVDSYELGDAERVMCMKTINMEISEITHEMKDMLVVGTATVRGEDITP-R 1073
Query: 487 GRILVF-IVE----------DGKLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYK 533
G I VF I+E + KL++ A+ + KGAV +++ G+ L+ A QK +
Sbjct: 1074 GSIYVFEIIEVAPDPDRPETNRKLKIFAKDDVKGAVTAVSGIGGQGFLIMAQGQKCMVRG 1133
Query: 534 WMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 593
L++DG+ + ++ L ++GD +K I Y E + +D
Sbjct: 1134 --LKEDGSLLPVAFMDMQCYVKVLKELQGTGLCIMGDALKGIWFAGYSEEPYRLTLFGKD 1191
Query: 594 YNANWMSAVEILDDD--IY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 650
+ A + L D +Y L A+++ + + + E T + RL +H G F +
Sbjct: 1192 NEYLQVIAADFLPDGKRLYILVADDDCTIHVLEYDPEDPTSSKGDRLLHRSSFHTGHFTS 1251
Query: 651 RF-----RHGSLVMRLP---DSDVGQIPT---VIFGTVNGVIGVIASLPHEQYLFLEKLQ 699
S P D DV +P V+ + G IGV+ L + Y L LQ
Sbjct: 1252 TMTLLPEHSSSPSADDPEEDDMDVDYVPKSYQVLVTSQEGSIGVVTPLTEDSYRRLSALQ 1311
Query: 700 TNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS 759
+ L ++ GLN + +R+ E + +DG+L+ +LD+ R EI+ +
Sbjct: 1312 SQLVTSMEHPCGLNPKAYRAV--ESDGFGGRGIVDGNLLLRWLDMGVQRKAEIAGRVGAD 1369
Query: 760 VEELCKRVEELT 771
+E + +E ++
Sbjct: 1370 IESIRVDLETIS 1381
>sp|Q6FLQ6|RSE1_CANGA Pre-mRNA-splicing factor RSE1 OS=Candida glabrata (strain ATCC 2001
/ CBS 138 / JCM 3761 / NBRC 0622 / NRRL Y-65) GN=RSE1
PE=3 SV=1
Length = 1296
Score = 67.8 bits (164), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 72/306 (23%), Positives = 140/306 (45%), Gaps = 46/306 (15%)
Query: 19 DAKGSYVEVLERYVNLGPI-VDFCVVD------LERQGQGQVVTCSGAYKDGSLRIVRNG 71
D + + V+ ++ N+ PI ++ C+++ + QG + + I+RN
Sbjct: 382 DNENENISVISKHTNINPIALNLCLMENMPLTFMHFQGGNRTTDSE------KVNIIRNA 435
Query: 72 IGINEQASVEL-QGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGF 130
I + E S L QG+ ++++++ + +F+ ++ I+ T ++ ++ + IE +
Sbjct: 436 IPLKEYVSSPLPQGVSNIFTIKTQYQ-SYHSFIFLTMINFTTVIL-----KIADDSIEQY 489
Query: 131 CSQTQTLFCHDAIY--------NQLVQVTSGSVRLVSSTSRELRN-----EWKSPPGYSV 177
+ T D + N ++QV R + S++ N +W P G S+
Sbjct: 490 IPASDTFKLKDDMTIHVATMGDNSIIQVCKDEFRQILLDSKDEENFKMNLKWYPPAGVSI 549
Query: 178 NVATANASQVLLATGGGHLVYLEIGDGILTEVKH-AQLEYEISCLDINPIGENPSYSQIA 236
A +N SQ++LA +VYL++ + L E K+ +L I+ L + + +N S+I
Sbjct: 550 LSAVSNFSQLILALSNNEIVYLQLENNTLIEYKNRPELPDVITSLAL--LNDNTKKSEIL 607
Query: 237 AVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGH-LLNFL 295
AVG +D V + SL I E + E +V+ A + I L L GH L+N
Sbjct: 608 AVGT-SDNMVNVLSLE----IVDEAISFE----TVVFQALDAIPSSLLILNQGHKLVNLH 658
Query: 296 LNMKTG 301
+ ++ G
Sbjct: 659 IGVEDG 664
>sp|Q6CAH5|RSE1_YARLI Pre-mRNA-splicing factor RSE1 OS=Yarrowia lipolytica (strain CLIB 122
/ E 150) GN=RSE1 PE=3 SV=1
Length = 1143
Score = 64.3 bits (155), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 159/732 (21%), Positives = 284/732 (38%), Gaps = 106/732 (14%)
Query: 88 MWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQL 147
+W++R D ++V+S+ + T L + + D + ET G TL C ++ +
Sbjct: 459 LWTMRDGAGS--DKYIVLSYANAT--LVLEIGDSVVETTSSGLTLDKPTLHC-GSVGSSY 513
Query: 148 VQVTSGSVRLVSSTSRELRNE------WKSPPGYSVNVATANASQVLLATGGGHLVYLEI 201
VQV + + ++ SRE +E W +P G V A++++ QV+L L Y E
Sbjct: 514 VQVMTDGMNVIP-MSREGSSESLPATKWTAPSG-QVICASSSSHQVVLGLTSS-LFYFED 570
Query: 202 GDGILTEVKHAQLEYEIS----CLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNL 256
G +E+ YE+S + + P+ S AV D +VRI S+ P+
Sbjct: 571 TPG--SELSAYDGAYELSSPPTAVAVAPVPAGRVRSPFVAVAT-DDETVRIVSVDPESMF 627
Query: 257 ITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDR--KKVSLG-- 312
T G S+ L + + YL L +G + L+ TGE+ K V LG
Sbjct: 628 ETVAVQGLMATASSLALLSVGQVLYLHMGLANGVYVRVELDPLTGEIVGSWSKFVGLGRL 687
Query: 313 -TQPITLRT-----FSSKNTT----HVFAASDR--PTVIYSSNKKLLYS--NVNLKEVSH 358
P+T SS+ HV A SD PT N ++ ++ + +
Sbjct: 688 SVVPVTCGGEESILVSSRGVKTCLGHVNATSDTWVPT---GGNSAPFFALDAISGEPLDL 744
Query: 359 MCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSL 418
F++ P + L I T++ QK + L +R+ Q + T I
Sbjct: 745 AHSFHTQDCPHGVIGVAGSTLKIFTVNTAQKWTENEVKLEGTAKRLI-QHDATTLTITQN 803
Query: 419 KNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVL 478
++ + +D+ D SI F D Y+ VG +
Sbjct: 804 PDRLVS----------VDNGAVGITK----DLGGPPTSICEVMFGDGKR-YFAVGGSRDG 848
Query: 479 PEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRD 538
T G I +F L + E + +L A+NG L+A I +++LY L+
Sbjct: 849 SPGTSGTSGYISIF--SSSSLGHVHTTEVEAPPLALCAYNGLLVAGIGSQVRLYALGLKQ 906
Query: 539 DGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGA--IEERARDYNA 596
R+ Q E LA + + + + VGD+ +S+++ + E+ I D +
Sbjct: 907 V-LRKAQIELSKRVTCLAHFAGS--NRVAVGDIRQSVTVCVVLEEDSGHVIYPLVCDKIS 963
Query: 597 NWMSAVEILD-DDIYLGAE-NNFNLFTVRKNSEGATDEERGRLEVV-------GEYHLGE 647
++ + +D + + LG F + + + DE+ + + G H
Sbjct: 964 RQVTCLFFVDYETVALGDRFGGFTMLRIPSEASKLADEDHNAVHLRQLEPTLNGPAHF-- 1021
Query: 648 FVNRFRHGSLVMRLPDSDVGQIPTVI------------FGTVNGVIGVIASLPHEQYLFL 695
RF H + + +P I GTV+ + V++ +Q L
Sbjct: 1022 ---RFDH------VASFHIEDVPVAIHMYNDYLVVCGLLGTVSAFVPVVSP---KQSRDL 1069
Query: 696 EKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKT 755
+ ++ + G+ G +H ++R + V K +DGD++ L + R +E+ +
Sbjct: 1070 KTIEKFVCASDPGLMGRDHGRFRGYY-----VPVKEVVDGDMLREVLVMDEKRREEVGEK 1124
Query: 756 MNVSVEELCKRV 767
+ VE RV
Sbjct: 1125 TGLGVEGAVGRV 1136
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.319 0.136 0.397
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 286,527,424
Number of Sequences: 539616
Number of extensions: 12489594
Number of successful extensions: 29265
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 47
Number of HSP's successfully gapped in prelim test: 20
Number of HSP's that attempted gapping in prelim test: 28963
Number of HSP's gapped (non-prelim): 124
length of query: 774
length of database: 191,569,459
effective HSP length: 125
effective length of query: 649
effective length of database: 124,117,459
effective search space: 80552230891
effective search space used: 80552230891
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 65 (29.6 bits)