BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 001655
(1036 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q9M0V3|DDB1A_ARATH DNA damage-binding protein 1a OS=Arabidopsis thaliana GN=DDB1A PE=1
SV=1
Length = 1088
Score = 1974 bits (5114), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 952/1036 (91%), Positives = 1005/1036 (97%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWD ESSELITRAMGDVSDRIG
Sbjct: 53 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDPESSELITRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL+GC
Sbjct: 113 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLFGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
AKPTI VLYQDNKDARHVKTYEV+LKDKDFVEGPWSQN+LDNGADLLIPVPPPLCGVLII
Sbjct: 173 AKPTIAVLYQDNKDARHVKTYEVSLKDKDFVEGPWSQNSLDNGADLLIPVPPPLCGVLII 232
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GEETIVYCSA+AFKAIPIRPSITKAYGRVD DGSRYLLGDHAG++HLLVITHEKEKVTGL
Sbjct: 233 GEETIVYCSASAFKAIPIRPSITKAYGRVDVDGSRYLLGDHAGMIHLLVITHEKEKVTGL 292
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
KIELLGETSIASTISYLDNAVV++GSSYGDSQL+KLNL PDAKGSYVEVLERY+NLGPIV
Sbjct: 293 KIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKLNLHPDAKGSYVEVLERYINLGPIV 352
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVVDLERQGQGQVVTCSGA+KDGSLR+VRNGIGINEQASVELQGIKGMWSL+SS D+
Sbjct: 353 DFCVVDLERQGQGQVVTCSGAFKDGSLRVVRNGIGINEQASVELQGIKGMWSLKSSIDEA 412
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
FDTFLVVSFISETRILAMNLEDELEETEIEGF SQ QTLFCHDA+YNQLVQVTS SVRLV
Sbjct: 413 FDTFLVVSFISETRILAMNLEDELEETEIEGFLSQVQTLFCHDAVYNQLVQVTSNSVRLV 472
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
SST+RELR+EW +P G++VNVATANASQVLLATGGGHLVYLEIGDG LTEV+HA LEYE+
Sbjct: 473 SSTTRELRDEWHAPAGFTVNVATANASQVLLATGGGHLVYLEIGDGKLTEVQHALLEYEV 532
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIG+NP+YSQ+AAVGMWTDISVRIFSLP+L LITKE LGGEIIPRSVLLCAFEG
Sbjct: 533 SCLDINPIGDNPNYSQLAAVGMWTDISVRIFSLPELTLITKEQLGGEIIPRSVLLCAFEG 592
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
ISYLLCALGDGHLLNF ++ TG+L DRKKVSLGTQPITLRTFSSK+ THVFAASDRPTV
Sbjct: 593 ISYLLCALGDGHLLNFQMDTTTGQLKDRKKVSLGTQPITLRTFSSKSATHVFAASDRPTV 652
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA+EGELTIGTIDDIQKLHIR+IPLG
Sbjct: 653 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAREGELTIGTIDDIQKLHIRTIPLG 712
Query: 661 EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSIL 720
EH RRICHQEQ+RTF ICSL NQS +EESEMHFVRLLDDQTFEF+STYPLD+FEYGCSIL
Sbjct: 713 EHARRICHQEQTRTFGICSLGNQSNSEESEMHFVRLLDDQTFEFMSTYPLDSFEYGCSIL 772
Query: 721 SCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN 780
SCSF++D NVYYCVGTAYVLPEENEPTKGRILVFIVEDG+LQLIAEKETKGAVYSLNAFN
Sbjct: 773 SCSFTEDKNVYYCVGTAYVLPEENEPTKGRILVFIVEDGRLQLIAEKETKGAVYSLNAFN 832
Query: 781 GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 840
GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL
Sbjct: 833 GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 892
Query: 841 IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE 900
+YKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNL TV+KNSEGATDEERGRLE
Sbjct: 893 LYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLLTVKKNSEGATDEERGRLE 952
Query: 901 VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
VVGEYHLGEFVNRFRHGSLVMRLPDS++GQIPTVIFGTVNGVIGVIASLP EQY FLEKL
Sbjct: 953 VVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIPTVIFGTVNGVIGVIASLPQEQYTFLEKL 1012
Query: 961 QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
Q++LRKVIKGVGGL+HEQWRSFNNEK+T +A+NFLDGDLIESFLDLSR +M++ISK+MNV
Sbjct: 1013 QSSLRKVIKGVGGLSHEQWRSFNNEKRTAEARNFLDGDLIESFLDLSRNKMEDISKSMNV 1072
Query: 1021 SVEELCKRVEELTRLH 1036
VEELCKRVEELTRLH
Sbjct: 1073 QVEELCKRVEELTRLH 1088
>sp|Q6QNU4|DDB1_SOLLC DNA damage-binding protein 1 OS=Solanum lycopersicum GN=DDB1 PE=1
SV=1
Length = 1090
Score = 1967 bits (5095), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 959/1038 (92%), Positives = 1004/1038 (96%), Gaps = 2/1038 (0%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
MLDVPIYGRIATLELFRPHGE QD LFIATERYKFCVLQWD E+SE+ITRAMGDVSDRIG
Sbjct: 53 MLDVPIYGRIATLELFRPHGETQDLLFIATERYKFCVLQWDTEASEVITRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
KPTIVVLYQDNKDARHVKTYEV+LKDKDF+EGPW+QNNLDNGA LLIPVPPPLCGVLII
Sbjct: 173 PKPTIVVLYQDNKDARHVKTYEVSLKDKDFIEGPWAQNNLDNGASLLIPVPPPLCGVLII 232
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GEETIVYCSA+AFKAIPIRPSIT+AYGRVDADGSRYLLGDH GLLHLLVITHEKEKVTGL
Sbjct: 233 GEETIVYCSASAFKAIPIRPSITRAYGRVDADGSRYLLGDHNGLLHLLVITHEKEKVTGL 292
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
KIELLGETSIASTISYLDNA V+IGSSYGDSQL+KLNLQPD KGSYVEVLERYVNLGPIV
Sbjct: 293 KIELLGETSIASTISYLDNAFVFIGSSYGDSQLVKLNLQPDTKGSYVEVLERYVNLGPIV 352
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRS+TDDP
Sbjct: 353 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSATDDP 412
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
+DTFLVVSFISETR+LAMNLEDELEETEIEGF SQ QTLFCHDA+YNQLVQVTS SVRLV
Sbjct: 413 YDTFLVVSFISETRVLAMNLEDELEETEIEGFNSQVQTLFCHDAVYNQLVQVTSNSVRLV 472
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
SSTSR+L+NEW +P GYSVNVATANA+QVLLATGGGHLVYLEIGDG+L EVK+A+L+Y+I
Sbjct: 473 SSTSRDLKNEWFAPVGYSVNVATANATQVLLATGGGHLVYLEIGDGVLNEVKYAKLDYDI 532
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIGENP+YS IAAVGMWTDISVRI+SLPDLNLITKE LGGEIIPRSVL+C+FEG
Sbjct: 533 SCLDINPIGENPNYSNIAAVGMWTDISVRIYSLPDLNLITKEQLGGEIIPRSVLMCSFEG 592
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
ISYLLCALGDGHLLNF+L+M TGELTDRKKVSLGTQPITLRTFSSK+TTHVFAASDRPTV
Sbjct: 593 ISYLLCALGDGHLLNFVLSMSTGELTDRKKVSLGTQPITLRTFSSKDTTHVFAASDRPTV 652
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYSSNKKLLYSNVNLKEVSHMCPFN AAFPDSLAIAKEGELTIGTID+IQKLHIRSIPLG
Sbjct: 653 IYSSNKKLLYSNVNLKEVSHMCPFNVAAFPDSLAIAKEGELTIGTIDEIQKLHIRSIPLG 712
Query: 661 EHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
EH RRI HQEQ+RTFA+CS+K QS A++ EMHFVRLLDDQTFEFISTYPLD FEYGCSI
Sbjct: 713 EHARRISHQEQTRTFALCSVKYTQSNADDPEMHFVRLLDDQTFEFISTYPLDQFEYGCSI 772
Query: 720 LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
LSCSFSDDSNVYYC+GTAYV+PEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF
Sbjct: 773 LSCSFSDDSNVYYCIGTAYVMPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 832
Query: 780 NGKLLAAINQKIQLYKWMLRDD-GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
NGKLLAAINQKIQLYKW R+D G+RELQ+ECGHHGHILALYVQTRGDFIVVGDLMKSIS
Sbjct: 833 NGKLLAAINQKIQLYKWASREDGGSRELQTECGHHGHILALYVQTRGDFIVVGDLMKSIS 892
Query: 839 LLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGR 898
LLI+KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEER R
Sbjct: 893 LLIFKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERSR 952
Query: 899 LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLE 958
LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPH+QYLFLE
Sbjct: 953 LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHDQYLFLE 1012
Query: 959 KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
KLQTNLRKVIKGVGGL+HEQWRSF NEKKTVDAKNFLDGDLIESFLDLSR RM+EISK M
Sbjct: 1013 KLQTNLRKVIKGVGGLSHEQWRSFYNEKKTVDAKNFLDGDLIESFLDLSRNRMEEISKAM 1072
Query: 1019 NVSVEELCKRVEELTRLH 1036
+V VEEL KRVEELTRLH
Sbjct: 1073 SVPVEELMKRVEELTRLH 1090
>sp|Q6E7D1|DDB1_SOLCE DNA damage-binding protein 1 OS=Solanum cheesmanii GN=DDB1 PE=3 SV=1
Length = 1095
Score = 1964 bits (5088), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 959/1038 (92%), Positives = 1004/1038 (96%), Gaps = 2/1038 (0%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
MLDVPIYGRIATLELFRPHGE QD LFIATERYKFCVLQWD E+SE+ITRAMGDVSDRIG
Sbjct: 58 MLDVPIYGRIATLELFRPHGETQDLLFIATERYKFCVLQWDTEASEVITRAMGDVSDRIG 117
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 118 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 177
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
KPTIVVLYQDNKDARHVKTYEV+LKDKDF+EGPW+QNNLDNGA LLIPVPPPLCGVLII
Sbjct: 178 PKPTIVVLYQDNKDARHVKTYEVSLKDKDFIEGPWAQNNLDNGASLLIPVPPPLCGVLII 237
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GEETIVYCSA+AFKAIPIRPSIT+AYGRVDADGSRYLLGDH GLLHLLVITHEKEKVTGL
Sbjct: 238 GEETIVYCSASAFKAIPIRPSITRAYGRVDADGSRYLLGDHNGLLHLLVITHEKEKVTGL 297
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
KIELLGETSIASTISYLDNA V+IGSSYGDSQL+KLNLQPD KGSYVEVLERYVNLGPIV
Sbjct: 298 KIELLGETSIASTISYLDNAFVFIGSSYGDSQLVKLNLQPDTKGSYVEVLERYVNLGPIV 357
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRS+TDDP
Sbjct: 358 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSATDDP 417
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
+DTFLVVSFISETR+LAMNLEDELEETEIEGF SQ QTLFCHDA+YNQLVQVTS SVRLV
Sbjct: 418 YDTFLVVSFISETRVLAMNLEDELEETEIEGFNSQVQTLFCHDAVYNQLVQVTSNSVRLV 477
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
SSTSR+L+NEW +P GYSVNVATANA+QVLLATGGGHLVYLEIGDG+L EVK+A+L+Y+I
Sbjct: 478 SSTSRDLKNEWFAPVGYSVNVATANATQVLLATGGGHLVYLEIGDGVLNEVKYAKLDYDI 537
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIGENP+YS IAAVGMWTDISVRI+SLPDLNLITKE LGGEIIPRSVL+C+FEG
Sbjct: 538 SCLDINPIGENPNYSNIAAVGMWTDISVRIYSLPDLNLITKEQLGGEIIPRSVLMCSFEG 597
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
ISYLLCALGDGHLLNF+L+M TGELTDRKKVSLGTQPITLRTFSSK+TTHVFAASDRPTV
Sbjct: 598 ISYLLCALGDGHLLNFVLSMSTGELTDRKKVSLGTQPITLRTFSSKDTTHVFAASDRPTV 657
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYSSNKKLLYSNVNLKEVSHMCPFN AAFPDSLAIAKEGELTIGTID+IQKLHIRSIPLG
Sbjct: 658 IYSSNKKLLYSNVNLKEVSHMCPFNVAAFPDSLAIAKEGELTIGTIDEIQKLHIRSIPLG 717
Query: 661 EHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
EH RRI HQEQ+RTFA+CS+K QS A++ EMHFVRLLDDQTFEFISTYPLD FEYGCSI
Sbjct: 718 EHARRISHQEQTRTFALCSVKYTQSNADDPEMHFVRLLDDQTFEFISTYPLDQFEYGCSI 777
Query: 720 LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
LSCSFSDDSNVYYC+GTAYV+PEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF
Sbjct: 778 LSCSFSDDSNVYYCIGTAYVMPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 837
Query: 780 NGKLLAAINQKIQLYKWMLRDD-GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
NGKLLAAINQKIQLYKW R+D G+RELQ+ECGHHGHILALYVQTRGDFIVVGDLMKSIS
Sbjct: 838 NGKLLAAINQKIQLYKWASREDGGSRELQTECGHHGHILALYVQTRGDFIVVGDLMKSIS 897
Query: 839 LLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGR 898
LLI+KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEER R
Sbjct: 898 LLIFKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERSR 957
Query: 899 LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLE 958
LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPH+QYLFLE
Sbjct: 958 LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHDQYLFLE 1017
Query: 959 KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
KLQTNLRKVIKGVGGL+HEQWRSF NEKKTVDAKNFLDGDLIESFLDLSR RM+EISK M
Sbjct: 1018 KLQTNLRKVIKGVGGLSHEQWRSFYNEKKTVDAKNFLDGDLIESFLDLSRNRMEEISKAM 1077
Query: 1019 NVSVEELCKRVEELTRLH 1036
+V VEEL KRVEELTRLH
Sbjct: 1078 SVPVEELMKRVEELTRLH 1095
>sp|O49552|DDB1B_ARATH DNA damage-binding protein 1b OS=Arabidopsis thaliana GN=DDB1B PE=2
SV=2
Length = 1088
Score = 1920 bits (4973), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 925/1036 (89%), Positives = 989/1036 (95%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+LDVP+YGRIAT+ELFRPHGEAQDFLF+ATERYKFCVLQWD ESSELITRAMGDVSDRIG
Sbjct: 53 ILDVPLYGRIATMELFRPHGEAQDFLFVATERYKFCVLQWDYESSELITRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGIIDPDCR+IGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113 RPTDNGQIGIIDPDCRVIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
KPTI VLYQDNKDARHVKTYEV+LKDK+FVEGPWSQNNLDNGADLLIPVP PLCGVLII
Sbjct: 173 TKPTIAVLYQDNKDARHVKTYEVSLKDKNFVEGPWSQNNLDNGADLLIPVPSPLCGVLII 232
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GEETIVYCSANAFKAIPIRPSITKAYGRVD DGSRYLLGDHAGL+HLLVITHEKEKVTGL
Sbjct: 233 GEETIVYCSANAFKAIPIRPSITKAYGRVDLDGSRYLLGDHAGLIHLLVITHEKEKVTGL 292
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
KIELLGETSIAS+ISYLDNAVV++GSSYGDSQLIKLNLQPDAKGSYVE+LE+YVNLGPIV
Sbjct: 293 KIELLGETSIASSISYLDNAVVFVGSSYGDSQLIKLNLQPDAKGSYVEILEKYVNLGPIV 352
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL+SS D+
Sbjct: 353 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLKSSIDEA 412
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
FDTFLVVSFISETRILAMN+EDELEETEIEGF S+ QTLFCHDA+YNQLVQVTS SVRLV
Sbjct: 413 FDTFLVVSFISETRILAMNIEDELEETEIEGFLSEVQTLFCHDAVYNQLVQVTSNSVRLV 472
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
SST+RELRN+W +P G+SVNVATANASQVLLATGGGHLVYLEIGDG LTEVKH LEYE+
Sbjct: 473 SSTTRELRNKWDAPAGFSVNVATANASQVLLATGGGHLVYLEIGDGTLTEVKHVLLEYEV 532
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIG+NP+YSQ+AAVGMWTDISVRIF LPDL LITKE LGGEIIPRSVLLCAFEG
Sbjct: 533 SCLDINPIGDNPNYSQLAAVGMWTDISVRIFVLPDLTLITKEELGGEIIPRSVLLCAFEG 592
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
ISYLLCALGDGHLLNF L+ G+L DRKKVSLGT+PITLRTFSSK+ THVFAASDRP V
Sbjct: 593 ISYLLCALGDGHLLNFQLDTSCGKLRDRKKVSLGTRPITLRTFSSKSATHVFAASDRPAV 652
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYS+NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA+EGELTIGTIDDIQKLHIR+IP+G
Sbjct: 653 IYSNNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAREGELTIGTIDDIQKLHIRTIPIG 712
Query: 661 EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSIL 720
EH RRICHQEQ+RTFAI L+N+ AEESE HFVRLLD Q+FEF+S+YPLD FE GCSIL
Sbjct: 713 EHARRICHQEQTRTFAISCLRNEPSAEESESHFVRLLDAQSFEFLSSYPLDAFECGCSIL 772
Query: 721 SCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN 780
SCSF+DD NVYYCVGTAYVLPEENEPTKGRILVFIVE+G+LQLI EKETKGAVYSLNAFN
Sbjct: 773 SCSFTDDKNVYYCVGTAYVLPEENEPTKGRILVFIVEEGRLQLITEKETKGAVYSLNAFN 832
Query: 781 GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 840
GKLLA+INQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFI VGDLMKSISLL
Sbjct: 833 GKLLASINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIAVGDLMKSISLL 892
Query: 841 IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE 900
IYKHEEGAIEERARDYNANWM+AVEIL+DDIYLG +N FN+FTV+KN+EGATDEER R+E
Sbjct: 893 IYKHEEGAIEERARDYNANWMTAVEILNDDIYLGTDNCFNIFTVKKNNEGATDEERARME 952
Query: 901 VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
VVGEYH+GEFVNRFRHGSLVM+LPDSD+GQIPTVIFGTV+G+IGVIASLP EQY FLEKL
Sbjct: 953 VVGEYHIGEFVNRFRHGSLVMKLPDSDIGQIPTVIFGTVSGMIGVIASLPQEQYAFLEKL 1012
Query: 961 QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
QT+LRKVIKGVGGL+HEQWRSFNNEK+T +AK +LDGDLIESFLDLSR +M+EISK M+V
Sbjct: 1013 QTSLRKVIKGVGGLSHEQWRSFNNEKRTAEAKGYLDGDLIESFLDLSRGKMEEISKGMDV 1072
Query: 1021 SVEELCKRVEELTRLH 1036
VEELCKRVEELTRLH
Sbjct: 1073 QVEELCKRVEELTRLH 1088
>sp|Q3U1J4|DDB1_MOUSE DNA damage-binding protein 1 OS=Mus musculus GN=Ddb1 PE=1 SV=2
Length = 1140
Score = 1150 bits (2975), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 579/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPGRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P G +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDSSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGEASTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140
>sp|P33194|DDB1_CHLAE DNA damage-binding protein 1 OS=Chlorocebus aethiops GN=DDB1 PE=1
SV=1
Length = 1140
Score = 1150 bits (2974), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 578/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140
>sp|A1A4K3|DDB1_BOVIN DNA damage-binding protein 1 OS=Bos taurus GN=DDB1 PE=2 SV=1
Length = 1140
Score = 1150 bits (2974), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 579/1094 (52%), Positives = 760/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P G +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGMSPLCAIGLWTDISARIAKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140
>sp|Q16531|DDB1_HUMAN DNA damage-binding protein 1 OS=Homo sapiens GN=DDB1 PE=1 SV=1
Length = 1140
Score = 1149 bits (2972), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 578/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140
>sp|Q5R649|DDB1_PONAB DNA damage-binding protein 1 OS=Pongo abelii GN=DDB1 PE=2 SV=1
Length = 1140
Score = 1147 bits (2968), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 577/1094 (52%), Positives = 758/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNVCILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVY + FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYPMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140
>sp|Q805F9|DDB1_CHICK DNA damage-binding protein 1 OS=Gallus gallus GN=DDB1 PE=2 SV=1
Length = 1140
Score = 1147 bits (2967), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 576/1094 (52%), Positives = 761/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+ A +ELFRP GE++D LFI T +Y C+L++ + ++ ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKTAVMELFRPKGESKDLLFILTAKYNACILEYKQNGDNIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + + +AFNIRLEELQV+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRENKELKAFNIRLEELQVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDSHREMDNMLVLSFVGQTRVLMLNGE-EVEETELTGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P G +++VA+ N++QV++A G L YLEI L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPNGKNISVASCNSNQVVVAVGRA-LYYLEIRPQELRQIN 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
++E+E++CLDI P+G+ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 CTEMEHEVACLDITPLGDTNGMSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F L+++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLSLETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDASGGTTALRPSASTQALSSSVSTSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFHYSDGKLQSLAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWT----AEKELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLSHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + + +V
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQIDDGSGMKREATV 1126
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1127 DDLIKIVEELTRIH 1140
>sp|Q6P6Z0|DDB1_XENLA DNA damage-binding protein 1 OS=Xenopus laevis GN=ddb1 PE=2 SV=1
Length = 1140
Score = 1144 bits (2958), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 579/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ +S ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGDSIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDPDCR+IGL LYDGLFKVIP + + +AFNIRLEEL V+D+KFLY C
Sbjct: 114 RPSETGIIGIIDPDCRMIGLRLYDGLFKVIPLERDNKELKAFNIRLEELHVIDVKFLYSC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFSKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDVNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KL + + +GSYV V+E
Sbjct: 293 GSVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLTTESNEQGSYVVVMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LR + D D LV+SF+ +TR+L + E E+EET++ GF QT FC + + QL+Q+
Sbjct: 413 LRVAADRDTDDTLVLSFVGQTRVLTLTGE-EVEETDLAGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P G V+V + N+ QVLLA G L YLEI G L +
Sbjct: 472 TSASVRLVSQNPQNLVSEWKEPQGRKVSVCSCNSRQVLLAVGR-VLYYLEIHPGELRQTS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
++E+E++CLD+ P+G N + S + A+G+WTDIS RI SLP L+ KE LGGEIIPRS
Sbjct: 531 CTEMEHEVACLDVTPLGGNDTLSSLCAIGLWTDISARILSLPGFQLLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ +FE YLLCALGDG L F LN TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTSFESSHYLLCALGDGALFYFSLNTDTGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSEGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLKNQ------------------------SCA-- 686
HIR++PL E PR+IC+QE S+ F + S + + SC+
Sbjct: 711 HIRTVPLFESPRKICYQEVSQCFGVLSSRIEVQDASGGSSPLRPSASTQALSSSVSCSKL 770
Query: 687 -------------EESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
EE E+H + ++D TFE + T+ EY S++SC D Y+
Sbjct: 771 FSGSTSPHETSFGEEVEVHNLLIIDQHTFEVLHTHQFLQNEYTLSLVSCKLGKDPTTYFV 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V P+E EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPDEAEPKQGRIVVFQYNDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWT----AEKELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSPPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDVQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + + +V
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVIANLQIDDGSGMKRETTV 1126
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140
>sp|Q9ESW0|DDB1_RAT DNA damage-binding protein 1 OS=Rattus norvegicus GN=Ddb1 PE=2 SV=1
Length = 1140
Score = 1137 bits (2940), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 571/1094 (52%), Positives = 756/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQ +KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQPVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPRAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLD+ P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDVTPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGT+++IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANTSTLTIGTMNEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++P+ E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPIYESPRKICYQEVSQCFGVLSTRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSAAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF GKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSGGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+ GTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLLGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140
>sp|Q9XYZ5|DDB1_DROME DNA damage-binding protein 1 OS=Drosophila melanogaster GN=pic PE=1
SV=1
Length = 1140
Score = 962 bits (2486), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 509/1099 (46%), Positives = 708/1099 (64%), Gaps = 77/1099 (7%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS--ELITRAMGDVSDRIG 60
++ I G IA + FRP +D LFI T RY +L+ + ++T+A G+VSD +G
Sbjct: 54 EININGTIAVMRHFRPPDSNKDLLFILTRRYNVMILEARMVNDVITVVTKANGNVSDSVG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
P++ G I IDP R+IG+ LY GLF +IP D +A N+R++EL V D++FL+GC
Sbjct: 114 IPSEGGVIAAIDPKARVIGMCLYQGLFTIIPMDKDASELKATNLRMDELNVYDVEFLHGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PT++V+++D+ D RHVK++E+ L+DK+F++ W Q+N++ A +LIPVP P+ GV++I
Sbjct: 174 LNPTVIVIHKDS-DGRHVKSHEINLRDKEFMKIAWKQDNVETEATMLIPVPSPIGGVIVI 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI-THEKEK 236
G E+IVY + + A+ R S Y RV ++G RYLLG+ G L++L + T E K
Sbjct: 233 GRESIVYHDGSNYHAVAPLTFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTAETSK 292
Query: 237 ---VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERY 293
V +K+E LGE SI I+YLDN +YIG+ +GDSQL++LN + GSYV +E +
Sbjct: 293 GVTVKDIKVEQLGEISIPECITYLDNGFLYIGARHGDSQLVRLNSEA-IDGSYVVPVENF 351
Query: 294 VNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL 353
NL PI+D VVDL+RQGQGQ++TCSG++KDGSLRI+R GIGI E A ++L GIKGMWSL
Sbjct: 352 TNLAPILDIAVVDLDRQGQGQIITCSGSFKDGSLRIIRIGIGIQEHACIDLPGIKGMWSL 411
Query: 354 RSSTDD-PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
+ D+ P++ LV++F+ TRIL ++ E E+EETEI GF S QT C + Y+QL+QV
Sbjct: 412 KVGVDESPYENTLVLAFVGHTRILTLSGE-EVEETEIPGFASDLQTFLCSNVDYDQLIQV 470
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVSS ++ L EW+ ++ V + N +Q+L+A+ + Y+ I DG L E
Sbjct: 471 TSDSVRLVSSATKALVAEWRPTGDRTIGVVSCNTTQILVASAC-DIFYIVIEDGSLREQS 529
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
L YE++CLDI P+ E S + AVG+WTDIS I SLPDL I E L GEIIPRS
Sbjct: 530 RRTLAYEVACLDITPLDETQKKSDLVAVGLWTDISAVILSLPDLETIYTEKLSGEIIPRS 589
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FEGI YLLCALGDG + F+++ TG+LTD+KKV+LGTQP TLRTF S +TT+VF
Sbjct: 590 ILMTTFEGIHYLLCALGDGSMYYFIMDQTTGQLTDKKKVTLGTQPTTLRTFRSLSTTNVF 649
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV+HMC N+ A+PDSLA+A + + +GTID+IQKL
Sbjct: 650 ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAQAYPDSLALANKNAVILGTIDEIQKL 709
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PLGE PRRI +QE S+TFA+ +L+
Sbjct: 710 HIRTVPLGEGPRRIAYQESSQTFAVSTLRIDVHGRGGAKPLRNSASTQAQNITCSSNFLP 769
Query: 682 ----------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
N +E ++H + ++D TFE + + E S++S DD N Y
Sbjct: 770 KPGGGNSTAANAEVGQEIDVHNLLVIDQNTFEVLHAHQFVAPETISSLMSAKLGDDPNTY 829
Query: 732 YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
Y V T+ V+PEE EP GRI++F + KL +AE + G Y+L FNGK+LA I +
Sbjct: 830 YVVATSLVIPEEPEPKVGRIIIFHYHENKLTQVAETKVDGTCYALVEFNGKVLAGIGSFV 889
Query: 792 QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
+LY+W +EL+ EC I AL+++ +GDFI+VGDLM+SI+LL +K EG E
Sbjct: 890 RLYEWT----NEKELRMECNIQNMIAALFLKAKGDFILVGDLMRSITLLQHKQMEGIFVE 945
Query: 852 RARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFV 911
ARD WM AVEILDDD +LG+E N NLF +K+S TDEER L + +HLG+ V
Sbjct: 946 IARDCEPKWMRAVEILDDDTFLGSETNGNLFVCQKDSAATTDEERQLLPELARFHLGDTV 1005
Query: 912 NRFRHGSLVMRLPDSDVGQIPT-----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
N FRHGSLVM+ +VG+ T V++GT NG IG++ +P + Y FL L+ L+K
Sbjct: 1006 NVFRHGSLVMQ----NVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKK 1061
Query: 967 VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS----- 1021
+IK VG + H +R+F K ++ F+DGDLIESFLDLSR +M + + + ++
Sbjct: 1062 IIKSVGKIEHTYYRNFQINSKVEPSEGFIDGDLIESFLDLSRDKMRDAVQGLELTLNGER 1121
Query: 1022 ----VEELCKRVEELTRLH 1036
VE++ K VE+LTR+H
Sbjct: 1122 KSADVEDVIKIVEDLTRMH 1140
>sp|B0M0P5|DDB1_DICDI DNA damage-binding protein 1 OS=Dictyostelium discoideum GN=repE PE=1
SV=1
Length = 1181
Score = 929 bits (2402), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 499/1138 (43%), Positives = 719/1138 (63%), Gaps = 114/1138 (10%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
M DV IYGRI+ L+LF G QD+LFI+TE +KFC+L +D E E+IT+A G+ D IG
Sbjct: 51 MFDVNIYGRISVLKLFSVAGSKQDYLFISTESFKFCILAYDYEKKEIITKASGNAEDTIG 110
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQL---------------------- 98
RPT+ GQ+GIIDPD R++ LHLY+GL K+I DN
Sbjct: 111 RPTEAGQLGIIDPDGRIVALHLYEGLLKLITLDNNNTPNKINNNNNNNNNNNNNNNNNNN 170
Query: 99 --------------------KEAFNIRLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHV 138
K N+RLEELQVLD+ FLYGC PTI VL++D KD +H+
Sbjct: 171 NNINNNNFNINNNNNNSPIQKNVNNVRLEELQVLDMTFLYGCKVPTIAVLFKDTKDEKHI 230
Query: 139 KTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPI 198
TYE++ KD + V GPWSQ+N+ + LL+PVP L GVL++ + I Y + +++ +
Sbjct: 231 STYEISSKDTELVVGPWSQSNVGVYSSLLVPVP--LGGVLVVADNGITYLNGKVTRSVAV 288
Query: 199 RPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLD 258
+ A+ RVD DGSR+L GDH G L +LV+ H+++KV LK E LG SI S+ISYLD
Sbjct: 289 SYTKFLAFTRVDKDGSRFLFGDHFGRLSVLVLIHQQQKVMELKFEQLGRISIPSSISYLD 348
Query: 259 NAVVYIGSSYGDSQLIKLNLQPD-AKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVT 317
+ VVYIGSS GDSQLI+LN + D SYV LE + N+GP+VDFCVVD E+QGQ Q+VT
Sbjct: 349 SGVVYIGSSSGDSQLIRLNTEKDQTTDSYVTYLEAFTNIGPVVDFCVVDAEKQGQAQIVT 408
Query: 318 CSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL---------------------RSS 356
CSG Y+DGSLRI+RNGIGI EQAS+EL+GIKG++ + +
Sbjct: 409 CSGTYRDGSLRIIRNGIGIAEQASIELEGIKGIFPINNNNNNNNNNNNNNNNNNNNNSNG 468
Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCH--DAIYNQLVQVTS 414
D D +L+ SFI T++L+ +E+EETE EG S TL+C D + N L+Q+T+
Sbjct: 469 ITDSKDRYLITSFIECTKVLSFQ-GEEIEETEFEGLESNCSTLYCGTIDKL-NLLIQITN 526
Query: 415 GSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI--GDGILTEVK 472
S+ L+ S + + ++W P +N+ + N Q++L+ L+Y +I + + VK
Sbjct: 527 VSINLIDSNTFKRVSQWNVEPSRRINLVSTNQDQIVLSIDKS-LLYFQINSSNKSIQLVK 585
Query: 473 HAQLEYEISCLDINPIGE-NPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPR 531
+L +EISC+DI+P + SQ+ +VG+W DI++RIF LP L I KE LGGEI+PR
Sbjct: 586 EIELPHEISCIDISPFDSFMDTKSQLVSVGLWNDITLRIFKLPTLEEIWKEPLGGEILPR 645
Query: 532 SVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHV 591
S+L+ +F+ I Y+ C+LGDGHL F + + +L D++K++LGTQPI L+ F KNT ++
Sbjct: 646 SILMISFDSIDYIFCSLGDGHLFKFQFDFSSFKLFDKRKLTLGTQPIILKKFKLKNTINI 705
Query: 592 FAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQK 651
FA SDRPTVIYS NKKL YS VNLK+V+++ FNS FP+S+AIA LTIGTID+IQK
Sbjct: 706 FAISDRPTVIYSHNKKLFYSVVNLKDVTNVTSFNSDGFPNSMAIATTNSLTIGTIDEIQK 765
Query: 652 LHIRSIPLGEH-PRRICHQEQSRTFAICSLKNQS---------CAEESEMHFVRLLDDQT 701
LHI++IPL E RRI H E +A+ ++KN C E+ E+ ++R+ +DQT
Sbjct: 766 LHIKTIPLNEEMGRRIVHLEDHSCYAVITVKNNEGLLGGAQDLCEEDEEVSYIRIYNDQT 825
Query: 702 FEFISTYPLDTFEYGCSILSCSFS-DDSNVYYCVGTAYVLPEENEPTKGRILVFIV---- 756
FE IS+Y LD +E G SI C F+ DD N Y VGT+ P + + GR+L+F +
Sbjct: 826 FELISSYKLDPYEMGWSITPCKFAGDDVNTYLAVGTSINTPIK---SSGRVLLFSLSSSS 882
Query: 757 -------------------EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWM 797
+GKL L+ E + + +VY L +FNG+L+AA+++++ ++
Sbjct: 883 SSNDKDSLDNNNNNNNNSGANGKLTLLEEIKFRSSVYFLLSFNGRLIAAVHKRLFSIRYT 942
Query: 798 -LRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDY 856
++ + + SE H GH + L + +RG FI+VGD+MKS+SLL+ + +G++E+ AR+
Sbjct: 943 HSKEKNCKVISSESVHKGHTMILKLASRGHFILVGDMMKSMSLLV-EQSDGSLEQIARNP 1001
Query: 857 NANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRH 916
W+ +V +++DD ++GAE + N V+KN++ + ER L+ VG YH+GE +N RH
Sbjct: 1002 QPIWIRSVAMINDDYFIGAEASNNFIVVKKNNDSTNELERELLDSVGHYHIGESINSMRH 1061
Query: 917 GSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNH 976
GSLV RLPDSD IPT+++ +VNG IGV+AS+ E ++F KLQ L +V++GVGG +H
Sbjct: 1062 GSLV-RLPDSDQPIIPTILYASVNGSIGVVASISEEDFIFFSKLQKGLNQVVRGVGGFSH 1120
Query: 977 EQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTR 1034
E WR+F+N+ T+D+KNF+DGDLIE+FLDL + + ++ ++ +R+E L +
Sbjct: 1121 ETWRAFSNDHHTIDSKNFIDGDLIETFLDLKYESQLKAVADLGITPDDAFRRIESLMQ 1178
>sp|Q21554|DDB1_CAEEL DNA damage-binding protein 1 OS=Caenorhabditis elegans GN=ddb-1 PE=1
SV=2
Length = 1134
Score = 596 bits (1536), Expect = e-169, Method: Compositional matrix adjust.
Identities = 368/1098 (33%), Positives = 603/1098 (54%), Gaps = 81/1098 (7%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP 62
++PIYG++ T+ L + + + L + TE++ +L + +++TRA G ++D GR
Sbjct: 54 EIPIYGQVLTIALVKCKRDKRHSLIVVTEKWHMAILAY--RDGKVVTRAAGCIADPTGRA 111
Query: 63 TDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL-YGCA 121
TDN + LI + ++G K+I +++ L+ FN+R + V D KF+ G
Sbjct: 112 TDN-LFSLTIHRNGLIAIRAFEGSVKMIQWESGTDLRH-FNVRFDYPNVSDFKFVDTGED 169
Query: 122 KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIG 181
V D+ +H++ ++ + DK+F Q ++ + +LIPVP + GV+++G
Sbjct: 170 DVYRVAFIYDDDHGKHLQFSDLNMHDKEF-RTYSRQASIAADSSVLIPVPHAIGGVIVLG 228
Query: 182 EETIVY-CSANAFKAIPIRPSITK-----AYGRVDADGSRYLLGDHAG-LLHLLVITHEK 234
+++Y + N + +P S+ + +G VDA G R+LL D G LL LL+ E
Sbjct: 229 SNSVLYKPNDNLGEVVPYTCSLLENTTFTCHGIVDASGERFLLSDTDGRLLMLLLNVTES 288
Query: 235 EK---VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLE 291
+ V ++I+ LGETSIA +I+Y+DN VV++GS GDSQLI+L +P+ GSY +LE
Sbjct: 289 QSGYTVKEMRIDYLGETSIADSINYIDNGVVFVGSRLGDSQLIRLMTEPNG-GSYSVILE 347
Query: 292 RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMW 351
Y N+GPI D +V E GQ Q+VTC+GA KDGSLR++RNGIGI+E ASV+L G+ G++
Sbjct: 348 TYSNIGPIRDMVMV--ESDGQPQLVTCTGADKDGSLRVIRNGIGIDELASVDLAGVVGIF 405
Query: 352 SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ--- 408
+R D D +++VS ET +L + E ELE+ ++ + T+F
Sbjct: 406 PIR--LDSNADNYVIVSLSDETHVLQITGE-ELEDVKLLEINTDLPTIFASTLFGPNDSG 462
Query: 409 -LVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYL------ 461
++Q T +RL+SS+ L W+ G ++ + NA+ + VYL
Sbjct: 463 IILQATEKQIRLMSSSG--LSKFWEPTNGEIISKVSVNAANGQIVLAARDTVYLLTCIVD 520
Query: 462 EIGDGILTEVKHAQLEYEISCLDINPIGENPS-YSQIAAVGMWTDISVRIFSLPDLNLIT 520
E+G + + E EI+CLD++ G++P+ + + W+ ++ + LPDL +
Sbjct: 521 EMGALDIQLTAEKKFENEIACLDLSNEGDDPNNKATFLVLAFWSTFAMEVIQLPDLITVC 580
Query: 521 KEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITL 580
L +IIPRS++ E + YLL A GDG L+ ++ ++KTG + KK ++GT+P +L
Sbjct: 581 HTDLPTKIIPRSIIATCIEEVHYLLVAFGDGALVYYVFDIKTGTHGEPKKSNVGTRPPSL 640
Query: 581 RTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGE 640
+KN H+F SDRP +I+S++KKL++SNVN+K V +C +S+A+ D L I+
Sbjct: 641 HRVRNKNRQHLFVCSDRPVIIFSASKKLVFSNVNVKLVDTVCSLSSSAYRDCLVISDGNS 700
Query: 641 LTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEE------------ 688
+ GT+DDIQK+H+RSIP+GE RI +Q+ + T+ +CS + +S AE
Sbjct: 701 MVFGTVDDIQKIHVRSIPMGESVLRIAYQKSTSTYGVCSNRTESKAERVFASKNALVTSQ 760
Query: 689 -----------------SEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
+ +LD TF+ + ++ +E S +S F++DS+ Y
Sbjct: 761 SRPKVASTRADMDESPPNTTSSFMVLDQNTFQVLHSHEFGPWETALSCISGQFTNDSSTY 820
Query: 732 YCVGTAYVLPEENEPTKGRILVFIVED---GKLQLIAEKETKGAVYSLNAFNGKLLAAIN 788
Y VGT + P+E E GRI+VF V+D KL+ + E +G+ ++ NGKL+AAIN
Sbjct: 821 YVVGTGLIYPDETETKIGRIVVFEVDDVERSKLRRVHELVVRGSPLAIRILNGKLVAAIN 880
Query: 789 QKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGA 848
I+L++W +EL+ EC H++AL ++ + + V D+M+S+SLL Y+ EG
Sbjct: 881 SSIRLFEWTT----DKELRLECSSFNHVIALDLKVMNEEVAVADVMRSVSLLSYRMLEGN 936
Query: 849 IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR-KNSEGATDEERGRLEVVGEYHL 907
EE A+D+N+ WM E + + LG E + NLFTV + TD+ R LE G ++L
Sbjct: 937 FEEVAKDWNSQWMVTCEFITAESILGGEAHLNLFTVEVDKTRPITDDGRYVLEPTGYWYL 996
Query: 908 GEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKV 967
GE +LV++ DS + ++FGT G IG+I + + FL ++ +
Sbjct: 997 GELPKVMTRSTLVIQPEDSIIQYSQPIMFGTNQGTIGMIVQIDDKWKKFLIAIEKAIADS 1056
Query: 968 IKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRT-RMDEISKTMNVSVE--- 1023
+K + H +R+F +K+ F+DGDL+ES LD+ R+ MD +SK + +
Sbjct: 1057 VKNCMHIEHSSYRTFVFQKRAEPPSGFVDGDLVESILDMDRSVAMDILSKVSDKGWDPSL 1116
Query: 1024 -----ELCKRVEELTRLH 1036
E+ K +E+L R+H
Sbjct: 1117 PRDPVEILKVIEDLARMH 1134
>sp|O13807|DDB1_SCHPO DNA damage-binding protein 1 OS=Schizosaccharomyces pombe (strain 972
/ ATCC 24843) GN=ddb1 PE=1 SV=1
Length = 1072
Score = 410 bits (1053), Expect = e-113, Method: Compositional matrix adjust.
Identities = 295/1072 (27%), Positives = 549/1072 (51%), Gaps = 96/1072 (8%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITR-AMGDVSDRIGRPTD 64
I+ +I ++ F+P D + +AT+ +++ L WDA + + + D S+R R +
Sbjct: 55 IFAKIVNVKAFKPVSSPTDHIIVATDSFRYFTLFWDANDNTVSNGIKIQDCSERSLRESQ 114
Query: 65 NGQIGIIDPDCRLIGLHLYDGLFKVIPF----------DNKGQLKEAFNIRLEELQVLDI 114
+G + ++DP R+I LH+Y GL +IP N L + F++R++EL V+DI
Sbjct: 115 SGPLLLVDPFQRVICLHVYQGLLTIIPIFKSKKRFMTSHNNPSLHDNFSVRIQELNVVDI 174
Query: 115 KFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPL 174
LY ++P++ VLY+D+K H+ TY++ +++++ E ++++ G +P
Sbjct: 175 AMLYNSSRPSLAVLYKDSKSIVHLSTYKINVREQEIDEDDVVCHDIEEGK----LIPSEN 230
Query: 175 CGVLIIGEETIVYCSAN--------AFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLH 226
GV + GE + Y S + + PSI+ D S Y++ D +G+L+
Sbjct: 231 GGVFVFGEMYVYYISKDIQVSKLLLTYPITAFSPSISND-PETGLDSSIYIVADESGMLY 289
Query: 227 LLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSY 286
E V+ +++E LGE+SIAS + L + +++GS + +S L++L
Sbjct: 290 KFKALFTDETVS-MELEKLGESSIASCLIALPDNHLFVGSHFNNSVLLQLP-SITKNNHK 347
Query: 287 VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
+E+L+ +VN+ PI DF + D Q ++TCSGAYKDG+LRI+RN I I A +E++G
Sbjct: 348 LEILQNFVNIAPISDFIIDD--DQTGSSIITCSGAYKDGTLRIIRNSINIENVALIEMEG 405
Query: 347 IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDA-- 404
IK +S+ + +D ++ +S I ETR + ++ EG S L C ++
Sbjct: 406 IKDFFSVSFRAN--YDNYIFLSLICETRAIIVSP---------EGVFSANHDLSCEESTI 454
Query: 405 ----IY--NQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHL 458
IY +Q++Q+T+ +RL + W SP S+ ++ A V +A GG +
Sbjct: 455 FVSTIYGNSQILQITTKEIRLFDGKK---LHSWISP--MSITCGSSFADNVCVAVAGGLI 509
Query: 459 VYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWT-DISVRIFSLPDLN 517
++ E GI TEV Q + E+S L EN Y VG+W+ DI + + ++
Sbjct: 510 LFFE---GI-TEVGRYQCDTEVSSLCFTE--ENVVY-----VGLWSADIIMLTYCQDGIS 558
Query: 518 LITKEHLGGEIIPRSVLLCAFEGIS--YLLCALGDGHLLNFLLNMKTGELTDR--KKVSL 573
L L IPRS++ G L + +G++L F N + G++ + ++ L
Sbjct: 559 LTHSLKLTD--IPRSIVYSQKYGDDGGTLYVSTNNGYVLMF--NFQNGQVIEHSLRRNQL 614
Query: 574 GTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSL 633
G PI L+ F SK +FA ++P ++Y + KL+ + ++ E+ ++ + + + ++
Sbjct: 615 GVAPIILKHFDSKEKNAIFALGEKPQLMYYESDKLVITPLSCTEMLNISSYVNPSLGVNM 674
Query: 634 AIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCA--EESEM 691
+++ + +I+ L+++++ + PRRIC F +C +S E+ +
Sbjct: 675 LYCTNSYISLAKMSEIRSLNVQTVSVKGFPRRICSNSLFY-FVLCMQLEESIGTQEQRLL 733
Query: 692 HFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRI 751
F+R+ + T I+ + + +E SI+ +DD V VGT + P+++ P GR+
Sbjct: 734 SFLRVYEKNTLSEIAHHKFNEYEMVESII--LMNDDKRV--VVGTGFNFPDQDAPDSGRL 789
Query: 752 LVF-IVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC 810
+VF + D +++ AE + +G+V +L + ++A IN + ++++ + GT +++
Sbjct: 790 MVFEMTSDNNIEMQAEHKVQGSVNTLVLYKHLIVAGINASVCIFEY---EHGTMHVRNSI 846
Query: 811 GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDD 870
+ + + V D I+ DLMKSI++L + ++ + E ARDY+ W ++VEIL +
Sbjct: 847 RTPTYTIDISVNQ--DEIIAADLMKSITVLQFIDDQ--LIEVARDYHPLWATSVEILSER 902
Query: 871 IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQ 930
Y E + N + +++ +R +L +++LGE +N+ RH + + P
Sbjct: 903 KYFVTEADGNAVILLRDNVSPQLSDRKKLRWYKKFYLGELINKTRHCTFIE--PQDKSLV 960
Query: 931 IPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD 990
P ++ TV+G + ++ L +LQ N+RKVI GGL+H++W+ + E +T
Sbjct: 961 TPQLLCATVDGSLMIVGDAGMSNTPLLLQLQDNIRKVIPSFGGLSHKEWKEYRGENET-S 1019
Query: 991 AKNFLDGDLIESFLDLSRTRMDEI------SKTMNVSVEELCKRVEELTRLH 1036
+ +DG LIES L L ++EI +++SV++L +E L +LH
Sbjct: 1020 PSDLIDGSLIESILGLREPILNEIVNGGHEGTKLDISVQDLKSIIENLEKLH 1071
>sp|Q54SA7|SF3B3_DICDI Probable splicing factor 3B subunit 3 OS=Dictyostelium discoideum
GN=sf3b3 PE=3 SV=1
Length = 1256
Score = 231 bits (588), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 245/1065 (23%), Positives = 456/1065 (42%), Gaps = 192/1065 (18%)
Query: 133 KDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPL---CGVLIIGEETIVYCS 189
K + + YE+ L + V WS + +D+ A++++ VP GVL+ E+ IVY +
Sbjct: 213 KKKKLLTYYELDLGLNNVVR-KWS-DQVDDSANIVMTVPGGTEGPGGVLVASEDYIVYRN 270
Query: 190 ANAFKAIPIRPSITKAYGRVDADGSR--------------YLLGDHAGLLHLLVITHEKE 235
+ + +R I + YG G +L+ G L+ + + ++ +
Sbjct: 271 QDHAE---VRSRIPRRYGSDPNKGVLIISHSSHKQKGMFFFLVQSEHGDLYKITLDYQGD 327
Query: 236 KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLI--------------------- 274
+V+ + + +A+ ++ L N ++ S +GD L
Sbjct: 328 QVSEVNVNYFDTIVLANCLTVLKNGFLFAASEFGDHTLYFFKSIGDEEEEGQAKRLEDKD 387
Query: 275 -KLNLQP-DAKGSYVEVLE------RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGS 326
L P ++ G+ +E L+ +L PI+DF V+DL R+ Q+ + G + S
Sbjct: 388 GHLWFTPRNSCGTKMEELKNLEPTSHLSSLSPIIDFKVLDLVREENPQLYSLCGTGLNSS 447
Query: 327 LRIVRNGIGINEQASVELQGI-KGMWSLRSSTD----DPFDTFLVVSFISETRILAMNLE 381
L+++R+G+ + + L G+ G+W++ ST D D ++VVSF+ T +L++
Sbjct: 448 LKVLRHGLSVTTITTANLPGVPSGIWTVPKSTSPNAIDQTDKYIVVSFVGTTSVLSVG-- 505
Query: 382 DELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNV 441
D ++E G T TL + ++QV R + S R NEW++P ++
Sbjct: 506 DTIQENHESGILETTTTLLVKSMGDDAIIQVFPTGFRHIKSDLR--INEWRAPGRKTIVR 563
Query: 442 ATANASQVLLATGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDINPIGENPSYSQIAAV 500
A+AN SQ+ +A GG ++Y E+ L E+ L +I+C++I+PI + + ++ AV
Sbjct: 564 ASANQSQLAIALSGGEIIYFELDQASNLIEIIKKDLRRDIACIEISPIPKGRNMARFIAV 623
Query: 501 GMWTDISVRIFSLPDLNLITKEHL--GGEIIPRSVLLCAFE----GIS------------ 542
W + +R+ SL N + + + ++ S+ + + GI
Sbjct: 624 SDW-EGPIRVLSLDRDNCLGQVSMLDTDKVYIESLSIIEMQLNEMGIETKKSQSQTGQTT 682
Query: 543 -------------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF 583
+L L +G + L+ TGEL+D + LG +P+ L
Sbjct: 683 TTTTSTSSASSSVTSGGSLFLFVGLKNGVVKRATLDSVTGELSDIRTRLLGRKPVKLFKV 742
Query: 584 SSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTI 643
+ + + A S R + Y + KL ++++ + + +S +S+ E ++ I
Sbjct: 743 KVRGSNAMLALSSRVWLNYINQGKLDIVPLSIEPLENASNLSSEQSAESIVATSENKIII 802
Query: 644 GTIDDIQKL-HIRSIPLGEHPRR-ICHQEQSRTFAICSLKNQSC--------AEESEM-- 691
+ID + L + +I L P+R I H + S + + N + E+SE
Sbjct: 803 FSIDKLGDLFNQETIKLNATPKRFIIHPQTSYIIILETETNYNTDNIDIDKINEQSEKLL 862
Query: 692 -----------------------------------------HFVRLLDDQTFEFISTYPL 710
+++++D T E + + L
Sbjct: 863 LEKQKELQQEMDIDDDDQNNNNEIEPFKKLFKPKAGKGKWKSYIKIMDPITHESLESLML 922
Query: 711 DTFEYGCSILSCSFSDDSNVYYCVG--TAYVL-PEENEPTKGRILVFIVEDGKLQLIAEK 767
+ E G S+ +CSF + ++ VG T VL P+ ++ + FI KL+L+ +
Sbjct: 923 EDGEAGFSVCTCSFGESGEIFLVVGCVTDMVLNPKSHKSAHLNLYRFIDGGKKLELLYKT 982
Query: 768 ETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGD 826
E + VY++ F GKL+ + + I++Y D G ++L +C + + + GD
Sbjct: 983 EVEEPVYAMAQFQGKLVCGVGKSIRIY-----DMGKKKLLRKCETKNLPNTIVNIHSLGD 1037
Query: 827 FIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR- 885
+VVGD+ +SI + YK E + A D WM++ +LD D GA+ N+F +R
Sbjct: 1038 RLVVGDIQESIHFIKYKRSENMLYVFADDLAPRWMTSSVMLDYDTVAGADKFGNIFVLRL 1097
Query: 886 -----------------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
K G + +L+ + + +G+ V SLV V
Sbjct: 1098 PLLISDEVEEDPTGTKLKFESGTLNGAPHKLDHIANFFVGDTVTTLNKTSLV-------V 1150
Query: 929 GQIPTVIFGTVNGVIG-VIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK 987
G +++ T++G IG +I E F L+ N+R + G +H +RS+
Sbjct: 1151 GGPEVILYTTISGAIGALIPFTSREDVDFFSTLEMNMRSDCLPLCGRDHLAYRSY----- 1205
Query: 988 TVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
KN +DGDL E F L+ + IS+ ++ S E+ K++EE+
Sbjct: 1206 YFPVKNIIDGDLCEQFSTLNYQKQLSISEELSRSPSEVIKKLEEI 1250
>sp|Q15393|SF3B3_HUMAN Splicing factor 3B subunit 3 OS=Homo sapiens GN=SF3B3 PE=1 SV=4
Length = 1217
Score = 214 bits (545), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 272/1193 (22%), Positives = 496/1193 (41%), Gaps = 195/1193 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D++ + ++ + +L++ S + + + + G
Sbjct: 53 LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ +DP R + + + L ++ D +L + + + L +
Sbjct: 112 CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A + T E A + F E N+ L+
Sbjct: 172 GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
+ LI VP P GVLI E I Y N IR I + +D D R
Sbjct: 232 GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+L G + + + +++ VT ++++ +A+ + L
Sbjct: 287 FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346
Query: 262 VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
+++ S +G+ L ++ QP + V +++ +L
Sbjct: 347 LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405
Query: 298 PIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
PI+ FC + DL + Q+ G SLR++R+G+ ++E A EL G +W++R
Sbjct: 406 PIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRR 464
Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
+D FD +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 465 HIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPD 522
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKH 473
+R + + R NEWK+P ++ N QV++A GG LVY E+ G L E +
Sbjct: 523 GIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTER 580
Query: 474 AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RS 532
++ ++ C+ + + S+ AVG+ D +VRI SL + + + L + +P +
Sbjct: 581 KEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQP 637
Query: 533 VLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
LC E G YL L +G LL +L+ TG+L+D + LG++
Sbjct: 638 ESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSR 697
Query: 577 PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
P+ L + V A S R + YS + + ++ + + F S P+ +
Sbjct: 698 PVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAI 757
Query: 637 KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQ 683
L I ++ + + + + PL PR+ + H E + T A + + Q
Sbjct: 758 STNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQ 817
Query: 684 SCAEE-------------SEM-------------------------HFVRLLDDQTFEFI 705
AEE +EM +R+++ +
Sbjct: 818 QMAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTL 877
Query: 706 STYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KL 761
L+ E S+ C FS+ +Y VG A L G + +V +G KL
Sbjct: 878 DLVQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKL 937
Query: 762 QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY- 820
+ + + + ++ F G++L + + +++Y D G ++L +C + HI A Y
Sbjct: 938 EFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYI 990
Query: 821 --VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
+QT G ++V D+ +S + YK E + A D W++ +LD D GA+
Sbjct: 991 SGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKF 1050
Query: 879 FNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLV 920
N+ VR N+ DE +RG L EV+ YH+GE V + +L+
Sbjct: 1051 GNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI 1110
Query: 921 MRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
G ++++ T++G IG++ HE + F + ++ +LR + G +H +
Sbjct: 1111 -------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSF 1163
Query: 980 RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
RS+ KN +DGDL E F + + +S+ ++ + E+ K++E++
Sbjct: 1164 RSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1211
>sp|A0JN52|SF3B3_BOVIN Splicing factor 3B subunit 3 OS=Bos taurus GN=SF3B3 PE=2 SV=1
Length = 1217
Score = 214 bits (545), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 272/1193 (22%), Positives = 496/1193 (41%), Gaps = 195/1193 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D++ + ++ + +L++ S + + + + G
Sbjct: 53 LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ +DP R + + + L ++ D +L + + + L +
Sbjct: 112 CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A + T E A + F E N+ L+
Sbjct: 172 GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
+ LI VP P GVLI E I Y N IR I + +D D R
Sbjct: 232 GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+L G + + + +++ VT ++++ +A+ + L
Sbjct: 287 FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346
Query: 262 VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
+++ S +G+ L ++ QP + V +++ +L
Sbjct: 347 LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405
Query: 298 PIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
PI+ FC + DL + Q+ G SLR++R+G+ ++E A EL G +W++R
Sbjct: 406 PIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRR 464
Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
+D FD +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 465 HIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPD 522
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKH 473
+R + + R NEWK+P ++ N QV++A GG LVY E+ G L E +
Sbjct: 523 GIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTER 580
Query: 474 AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RS 532
++ ++ C+ + + S+ AVG+ D +VRI SL + + + L + +P +
Sbjct: 581 KEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQP 637
Query: 533 VLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
LC E G YL L +G LL +L+ TG+L+D + LG++
Sbjct: 638 ESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSR 697
Query: 577 PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
P+ L + V A S R + YS + + ++ + + F S P+ +
Sbjct: 698 PVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAI 757
Query: 637 KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQ 683
L I ++ + + + + PL PR+ + H E + T A + + Q
Sbjct: 758 STNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQ 817
Query: 684 SCAEE-------------SEM-------------------------HFVRLLDDQTFEFI 705
AEE +EM +R+++ +
Sbjct: 818 QMAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTL 877
Query: 706 STYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KL 761
L+ E S+ C FS+ +Y VG A L G + +V +G KL
Sbjct: 878 DLVQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKL 937
Query: 762 QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY- 820
+ + + + ++ F G++L + + +++Y D G ++L +C + HI A Y
Sbjct: 938 EFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYI 990
Query: 821 --VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
+QT G ++V D+ +S + YK E + A D W++ +LD D GA+
Sbjct: 991 SGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKF 1050
Query: 879 FNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLV 920
N+ VR N+ DE +RG L EV+ YH+GE V + +L+
Sbjct: 1051 GNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI 1110
Query: 921 MRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
G ++++ T++G IG++ HE + F + ++ +LR + G +H +
Sbjct: 1111 -------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSF 1163
Query: 980 RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
RS+ KN +DGDL E F + + +S+ ++ + E+ K++E++
Sbjct: 1164 RSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1211
>sp|Q5RBI5|SF3B3_PONAB Splicing factor 3B subunit 3 OS=Pongo abelii GN=SF3B3 PE=2 SV=1
Length = 1217
Score = 214 bits (545), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 272/1193 (22%), Positives = 496/1193 (41%), Gaps = 195/1193 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D++ + ++ + +L++ S + + + + G
Sbjct: 53 LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ +DP R + + + L ++ D +L + + + L +
Sbjct: 112 CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A + T E A + F E N+ L+
Sbjct: 172 GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
+ LI VP P GVLI E I Y N IR I + +D D R
Sbjct: 232 GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+L G + + + +++ VT ++++ +A+ + L
Sbjct: 287 FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346
Query: 262 VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
+++ S +G+ L ++ QP + V +++ +L
Sbjct: 347 LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405
Query: 298 PIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
PI+ FC + DL + Q+ G SLR++R+G+ ++E A EL G +W++R
Sbjct: 406 PIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSETAVSELPGNPNAVWTVRR 464
Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
+D FD +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 465 HIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPD 522
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKH 473
+R + + R NEWK+P ++ N QV++A GG LVY E+ G L E +
Sbjct: 523 GIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTER 580
Query: 474 AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RS 532
++ ++ C+ + + S+ AVG+ D +VRI SL + + + L + +P +
Sbjct: 581 KEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQP 637
Query: 533 VLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
LC E G YL L +G LL +L+ TG+L+D + LG++
Sbjct: 638 ESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSR 697
Query: 577 PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
P+ L + V A S R + YS + + ++ + + F S P+ +
Sbjct: 698 PVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAI 757
Query: 637 KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQ 683
L I ++ + + + + PL PR+ + H E + T A + + Q
Sbjct: 758 STNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQ 817
Query: 684 SCAEE-------------SEM-------------------------HFVRLLDDQTFEFI 705
AEE +EM +R+++ +
Sbjct: 818 QMAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGSGQWASVIRVMNPIQGNTL 877
Query: 706 STYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KL 761
L+ E S+ C FS+ +Y VG A L G + +V +G KL
Sbjct: 878 DLVQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKL 937
Query: 762 QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY- 820
+ + + + ++ F G++L + + +++Y D G ++L +C + HI A Y
Sbjct: 938 EFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYI 990
Query: 821 --VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
+QT G ++V D+ +S + YK E + A D W++ +LD D GA+
Sbjct: 991 SGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKF 1050
Query: 879 FNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLV 920
N+ VR N+ DE +RG L EV+ YH+GE V + +L+
Sbjct: 1051 GNICVVRLPPNTNDEVDEDPTGNKALRDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI 1110
Query: 921 MRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
G ++++ T++G IG++ HE + F + ++ +LR + G +H +
Sbjct: 1111 -------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSF 1163
Query: 980 RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
RS+ KN +DGDL E F + + +S+ ++ + E+ K++E++
Sbjct: 1164 RSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVTKKLEDI 1211
>sp|Q1LVE8|SF3B3_DANRE Splicing factor 3B subunit 3 OS=Danio rerio GN=sf3b3 PE=2 SV=1
Length = 1217
Score = 214 bits (545), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 265/1190 (22%), Positives = 491/1190 (41%), Gaps = 189/1190 (15%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L + ++G + +L FR G +D++ + ++ + +L++ S + + + + G
Sbjct: 53 LLTMEVFGVVRSLMAFRLTGGTKDYVVVGSDSGRIVILEYHP-SKNMFEKIHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ +DP R +IG L ++ D +L + + + L +
Sbjct: 112 CRRIVPGQFLAVDPKGRAVMIGATEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A + T E A + F E N+ L+
Sbjct: 172 GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEALEEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
+ LI VP P GVLI E I Y N IR I + +D D R
Sbjct: 232 GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+L G + + + ++E VT ++++ +A+ + L
Sbjct: 287 FVCSATHKTKSMFFFLAQTEQGDIFKVTLETDEEMVTEIRMKYFDTIPVATAMCVLKTGF 346
Query: 262 VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
+++ S +G+ L ++ QP + V V E+ +L
Sbjct: 347 LFVSSEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLVLVDEQE-SLS 405
Query: 298 PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
PI+ + DL + Q+ G +LR++R+G+ ++E A EL G +W++R
Sbjct: 406 PIMSCQIADLANEDTPQLYVACGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVRRH 465
Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
+D FD +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 466 VEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGEDALVQVYPDG 523
Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHA 474
+R + + R NEWK+P ++ N QV++A GG LVY E+ G L E +
Sbjct: 524 IRHIRADKR--VNEWKTPGKKTIIRCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTERK 581
Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD---LNLITKEHLGGEIIPR 531
++ ++ C+ + + S+ AVG+ D +VRI SL L ++ + L + P
Sbjct: 582 EMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCLQPLSMQALPAQ--PE 638
Query: 532 SVLLCAFEGIS--------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
S+ + G+ YL L +G LL +L+ TG+L+D + LG++P
Sbjct: 639 SLCIVEMGGVEKQDELGEKGTIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRP 698
Query: 578 ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
+ L + V A S R + YS + + ++ + + + F S P+ +
Sbjct: 699 VKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEYASGFASEQCPEGIVAIS 758
Query: 638 EGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQE-----------QSRTFAICSLKNQS 684
L I ++ + + + + PL PR+ + H E + T A + + Q
Sbjct: 759 TNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPETNNLILIETDHNAYTEATKAQRKQQ 818
Query: 685 CAEE-------------SEM-------------------------HFVRLLDDQTFEFIS 706
AEE +EM VRL++ +
Sbjct: 819 MAEEMVEAAGEDERELAAEMAAAFLNENLPEAIFGAPKAGSGQWASLVRLINPIQGNTLD 878
Query: 707 TYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAY-VLPEENEPTKGRILVFIVEDG--KLQ 762
L+ E S+ C F + + +Y VG A ++ G I + + G KL+
Sbjct: 879 LVQLEQNEAAFSVAICRFLNGGDDWYVLVGVARDMILNPRSVGGGYIYTYRIVGGGDKLE 938
Query: 763 LIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHHGHILALYV 821
+ + + ++ F G++L + + +++Y D G ++L +C H L +
Sbjct: 939 FLHKTPVEDVPLAIAPFQGRVLVGVGKLLRIY-----DLGKKKLLRKCENKHVPNLVTGI 993
Query: 822 QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNL 881
T G ++V D+ +S+ + Y+ E + A D W++ +LD D A+ N+
Sbjct: 994 HTIGQRVIVSDVQESLFWVRYRRNENQLIIFADDTYPRWITTACLLDYDTMASADKFGNI 1053
Query: 882 FTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRL 923
VR N+ DE +RG L E++ YH+GE V + +L+
Sbjct: 1054 CVVRLPPNTSDDVDEDPTGNKALWDRGLLNGASQKAEIIINYHIGETVLSLQKTTLI--- 1110
Query: 924 PDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF 982
G ++++ T++G IG++ HE + F + L+ ++R + G +H +RS+
Sbjct: 1111 ----PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHLEMHMRSEFPPLCGRDHLSFRSY 1166
Query: 983 NNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
KN +DGDL E F + + +S+ ++ + E+ K++E++
Sbjct: 1167 -----YFPVKNVIDGDLCEQFNSMDPHKQKSVSEELDRTPPEVSKKLEDI 1211
>sp|Q921M3|SF3B3_MOUSE Splicing factor 3B subunit 3 OS=Mus musculus GN=Sf3b3 PE=2 SV=1
Length = 1217
Score = 214 bits (545), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 272/1193 (22%), Positives = 496/1193 (41%), Gaps = 195/1193 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D++ + ++ + +L++ S + + + + G
Sbjct: 53 LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ +DP R + + + L ++ D +L + + + L +
Sbjct: 112 CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A + T E A + F E N+ L+
Sbjct: 172 GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
+ LI VP P GVLI E I Y N IR I + +D D R
Sbjct: 232 GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+L G + + + +++ VT ++++ +A+ + L
Sbjct: 287 FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346
Query: 262 VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
+++ S +G+ L ++ QP + V +++ +L
Sbjct: 347 LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405
Query: 298 PIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
PI+ FC + DL + Q+ G SLR++R+G+ ++E A EL G +W++R
Sbjct: 406 PIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRR 464
Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
+D FD +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 465 HIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPD 522
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKH 473
+R + + R NEWK+P ++ N QV++A GG LVY E+ G L E +
Sbjct: 523 GIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTER 580
Query: 474 AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RS 532
++ ++ C+ + + S+ AVG+ D +VRI SL + + + L + +P +
Sbjct: 581 KEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQP 637
Query: 533 VLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
LC E G YL L +G LL +L+ TG+L+D + LG++
Sbjct: 638 ESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSR 697
Query: 577 PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
P+ L + V A S R + YS + + ++ + + F S P+ +
Sbjct: 698 PVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAI 757
Query: 637 KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQ 683
L I ++ + + + + PL PR+ + H E + T A + + Q
Sbjct: 758 STNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQ 817
Query: 684 SCAEE-------------SEM-------------------------HFVRLLDDQTFEFI 705
AEE +EM +R+++ +
Sbjct: 818 QMAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTL 877
Query: 706 STYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KL 761
L+ E S+ C FS+ +Y VG A L G + +V +G KL
Sbjct: 878 DLVQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILSPRSVAGGFVYTYKLVNNGEKL 937
Query: 762 QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY- 820
+ + + + ++ F G++L + + +++Y D G ++L +C + HI A Y
Sbjct: 938 EFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYI 990
Query: 821 --VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
+QT G ++V D+ +S + YK E + A D W++ +LD D GA+
Sbjct: 991 SGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKF 1050
Query: 879 FNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLV 920
N+ VR N+ DE +RG L EV+ YH+GE V + +L+
Sbjct: 1051 GNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI 1110
Query: 921 MRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
G ++++ T++G IG++ HE + F + ++ +LR + G +H +
Sbjct: 1111 -------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSF 1163
Query: 980 RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
RS+ KN +DGDL E F + + +S+ ++ + E+ K++E++
Sbjct: 1164 RSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1211
>sp|Q4WLI5|RSE1_ASPFU Pre-mRNA-splicing factor rse1 OS=Neosartorya fumigata (strain ATCC
MYA-4609 / Af293 / CBS 101355 / FGSC A1100) GN=rse1 PE=3
SV=1
Length = 1225
Score = 210 bits (535), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 273/1166 (23%), Positives = 477/1166 (40%), Gaps = 171/1166 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I TL FR G ++D++ I ++ + ++++ S R + + G R
Sbjct: 64 VFGIIRTLAAFRLAGSSKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGVRRVV 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
GQ +DP R LI + L V+ +++ +L + + + Q + G
Sbjct: 123 PGQYLAVDPKGRACLIASVEKNKLVYVLNRNSQAELTISSPLEAHKPQTVVFAMTALDVG 182
Query: 120 CAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADL 166
P L D +A T YE+ L V W+ + +D A +
Sbjct: 183 YENPIFAALEVDYSEADQDPTGRAYEESEKLLVYYELDLGLNHVVRK-WA-DPVDRTASM 240
Query: 167 LIPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDADGSRYLLGD 220
L VP GVL+ E+ I Y +N AF+ IP R T+ R + + +
Sbjct: 241 LFQVPGGADGPSGVLVCAEDNITYRHSNQDAFRVPIPRRIGATENPERKRSIVAGVMHKM 300
Query: 221 HAGLLHLL-------------VITHEKEKVTG----LKIELLGETSIASTISYLDNAVVY 263
LL ++ + ++TG LKI+ IAS++ L + +Y
Sbjct: 301 RGAFFFLLQTEDGDLFKVTIDMVEDDNGQLTGEVKRLKIKYFDTVPIASSLLILKSGFLY 360
Query: 264 IGSSYGDSQLIKLN----------------------------LQPDAKGSYVEVLERYVN 295
+ S G+ + QP + ++E +
Sbjct: 361 VASEAGNHHFYQFEKLGDDDEETEFNSENFPADLSVPCEPVFFQPRG-AENLNLVETLNS 419
Query: 296 LGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLR 354
L P++D +V+L Q+ T SG+ S R +++G+ ++E EL + +W+ +
Sbjct: 420 LNPLIDSKIVNLNEDDAPQIYTVSGSGARSSFRTLKHGLEVSEIVESELPSVPSAVWTTK 479
Query: 355 SSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTS 414
+ D FD ++++SF + T +L++ + +EE GF S TL + L+QV
Sbjct: 480 LTRADEFDAYIILSFANGTLVLSIG--ETVEEVTDTGFLSTAPTLAVQQLGEDSLIQVHP 537
Query: 415 GSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV-K 472
+R + + R NEW +P S+ A N QV +A G +VY E+ DG L E +
Sbjct: 538 RGIRHILADRR--VNEWPAPQHRSIVAAATNERQVAVALSSGEIVYFEMDADGTLAEYDE 595
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPR 531
Q+ ++CL + + E S AVG D +VRI SL PD L K P
Sbjct: 596 RRQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKSVQALTSAPS 654
Query: 532 SVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSS 585
++ + + S YL L G L +L+ TGEL+D + LG +P+ L S
Sbjct: 655 ALNIMSMADSSSGGTTLYLHIGLYSGVYLRTVLDEVTGELSDTRTRFLGAKPVKLFRVSV 714
Query: 586 KNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTI 643
K T V A S RP + YS K + + ++ + F+S + + + L I
Sbjct: 715 KGQTAVLALSSRPWLGYSDIQTKGFMLTPLDYVGLEWGWNFSSEQCVEGMVGIQAQNLRI 774
Query: 644 GTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVR------- 695
+I+ + + SIPL PRR+ + F + N + + +
Sbjct: 775 FSIEKLDNNILQESIPLSNTPRRMLKHPEQPLFYVIESDNNVLSPATRARLIEDSKARNG 834
Query: 696 ---LLDDQTFEF----------------------ISTYPLDTFEYGCSILSCSFSD-DSN 729
+L + F + IST L+ E S+ + FS D
Sbjct: 835 ETNVLPPEDFGYPRATGHWASCIQIVDPLDAKAVISTIELEENEAAVSMAAVPFSSQDDE 894
Query: 730 VYYCVGTAYVLPEENEPTKGRILVFIV---EDGK-LQLIAEKETKGAVYSLNAFNGKLLA 785
+ VGTA + N P+ + I EDGK L+ I + + + +L F G+LLA
Sbjct: 895 TFLVVGTAKDM-IVNPPSSAGGFIHIYRFQEDGKELEFIHKTKVEEPPLALLGFQGRLLA 953
Query: 786 AINQKIQLYKWMLRDDGTRELQSECGHHGHILALYV---QTRGDFIVVGDLMKSISLLIY 842
I +++Y D G ++L +C +++ + QT+G IVV D+ +S++ ++Y
Sbjct: 954 GIGSTLRIY-----DLGMKQLLRKC--QAQVVSKTIVGLQTQGSRIVVSDVRESVTYVVY 1006
Query: 843 KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE----- 894
K+++ + D + W ++ ++D + G + NL+ VR K SE A ++
Sbjct: 1007 KYQDNILIPFVDDSVSRWTTSTTMVDYETVAGGDKFGNLWLVRCPKKASEEADEDGSGAH 1066
Query: 895 ---ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG 944
ERG RL+++ + + LV G +++ G IG
Sbjct: 1067 LIHERGYLHGAPNRLDLMIHTYTQDIPTSLHKTQLV-------AGGRDILVWTGFQGTIG 1119
Query: 945 VIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF 1003
++ + E F + L+ L + G +H +RS+ K V +DGDL E +
Sbjct: 1120 MLVPFVSREDVDFFQNLEMQLASQCPPLAGRDHLIYRSYYAPVKGV-----IDGDLCEMY 1174
Query: 1004 LDLSRTRMDEISKTMNVSVEELCKRV 1029
L I+ ++ SV E+ +++
Sbjct: 1175 FLLPNDTKMMIAAELDRSVREIERKI 1200
>sp|Q7RYR4|RSE1_NEUCR Pre-mRNA-splicing factor rse-1 OS=Neurospora crassa (strain ATCC
24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987)
GN=rse-1 PE=3 SV=2
Length = 1209
Score = 209 bits (531), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 279/1164 (23%), Positives = 487/1164 (41%), Gaps = 161/1164 (13%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
I+G + + FR G +D++ +AT+ + ++++ ++++ + R
Sbjct: 64 IFGIVRAIASFRLAGSHKDYIILATDSGRITIIEYLPKTNKFQRIHLETFGKSGVRRVIP 123
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
GQ DP R LI + L V+ +++ +L + + + VL + + G
Sbjct: 124 GQYLAADPKGRACLISALEKNKLVYVLNRNSQAELTISSPLEAHKPGVLVLSLVALDVGY 183
Query: 121 AKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADLL 167
A P L D DA T YE+ L V WS + +D + LL
Sbjct: 184 ANPVFAALELDYTDADQDPTGQAREEVETQLVYYELDLGLNHVVRK-WS-DTVDRTSSLL 241
Query: 168 IPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITK--AYGRVDADGSRYL 217
VP P GVL+ GEE + Y +N AF+ IP R T+ RV G +
Sbjct: 242 FQVPGGNDGP--SGVLVCGEENVTYRHSNQEAFRVPIPRRSGATEDPQRKRVIVSGVMHK 299
Query: 218 LGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLDN 259
L AG L+ T + + +V LKI+ +A+++ L +
Sbjct: 300 LKGSAGAFFFLLQTDDGDLFKVTIDMIEDSDGNPTGEVKRLKIKYFDTIPVATSLCILKS 359
Query: 260 AVVYIGSSYGDSQLIKL---------------NLQPDAKGSYVEV------LERYV---- 294
++ S +G+ + + D SY V LE V
Sbjct: 360 GFLFAASEFGNHHFYQFEKLGDDDEELEFSSDDFPTDPTASYNPVYFHPRPLENLVLVES 419
Query: 295 --NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMW 351
++ P VD V +L + Q+ + G + R++++G+ ++E + EL G +W
Sbjct: 420 IDSMNPQVDCKVANLTGEDAPQIYSVCGNGARSTFRMLKHGLEVSEIVASELPGTPSAVW 479
Query: 352 SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
+ + + D +D ++V+SF + T +L++ + +EE GF + TL + L+Q
Sbjct: 480 TTKLTKYDQYDAYIVLSFTNGTLVLSIG--ETVEEVSDSGFLTTAPTLAVQQMGEDGLIQ 537
Query: 412 VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTE 470
V +R + NEW +P S+ ATAN +QV++A G +VY E+ DG L E
Sbjct: 538 VHPKGIRHIVQGRV---NEWPAPQHRSIVAATANENQVVIALSSGEIVYFEMDSDGSLAE 594
Query: 471 V-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEI 528
+ ++ ++ L + + E S AVG D +VRI SL PD L K
Sbjct: 595 YDEKKEMSGTVTSLSVGQVPEGLKRSSFLAVGC-DDCTVRILSLDPDSTLEMKSIQALTA 653
Query: 529 IPRSVLLCAFE-----GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF 583
P ++ + + E YL L G L +L+ TGELTD ++ LG +P L
Sbjct: 654 APSALSIMSMEDSFGGSTLYLHIGLHSGVYLRTVLDEVTGELTDTRQKFLGPKPTRLFQV 713
Query: 584 SSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGEL 641
S ++ V A S RP + Y+ K + + ++ E+ + F+S + + L
Sbjct: 714 SVQDQPCVLALSSRPWLGYTDPLTKGFMMTPLSYTELEYGWNFSSEQCLEGMVGIHANYL 773
Query: 642 TIGTIDDIQKLHI-RSIPLGEHPRRIC-HQEQSRTFAICSLKN-------QSCAEESEMH 692
I +I+ + I +SIPL P+ + H EQ + I S N E+
Sbjct: 774 RIFSIEKLGDNMIQKSIPLTYTPKHLVKHPEQPYFYTIESDNNTLPPELRAKLLEQQSNG 833
Query: 693 FVRLLDDQTFEF----------------ISTYP-------LDTFEYGCSILSCSF-SDDS 728
+L + F + IS P LD E S F S +
Sbjct: 834 DATVLPPEDFGYPRAKGRWASCISIIDPISEEPRVLQRIDLDNNEAAVSAAIVPFASQEG 893
Query: 729 NVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLA 785
+ VGT ++ + + T+G I V+ EDG+ L+ I + + +L F G+LLA
Sbjct: 894 ESFLVVGTGKDMVLDPRQFTEGYIHVYRFHEDGRDLEFIHKTRVEEPPLALIPFQGRLLA 953
Query: 786 AINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHE 845
+ + +++Y L+ R+ Q++ L + +Q++G+ I+VGDL + I+ ++YK E
Sbjct: 954 GVGKTLRIYDLGLK-QLLRKAQADV---TPTLIVSLQSQGNRIIVGDLQQGITYVVYKAE 1009
Query: 846 EGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEY 905
+ A D W + ++D + G + N++ VR + + + E E
Sbjct: 1010 GNRLIPFADDTLNRWTTCTTMVDYESVAGGDKFGNIYIVRCPERVSQETD----EPGSEI 1065
Query: 906 HLGEFVNRFRHGS----------LVMRLPDS------DVGQIPTVIFGTVNGVIGV-IAS 948
HL N + HG+ LP S VG +++ + G +GV I
Sbjct: 1066 HLMHARN-YLHGTPNRLSLQVHFYTQDLPTSICKTSLVVGGQDVLLWSGLQGTVGVFIPF 1124
Query: 949 LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSR 1008
+ E F + L+ ++R + G +H +R + K V +DGDL E F L
Sbjct: 1125 VSREDVDFFQNLENHMRAEDPPLAGRDHLIYRGYYTPVKGV-----IDGDLCERFSLLPN 1179
Query: 1009 TRMDEISKTMNVSVEELCKRVEEL 1032
+ I+ ++ SV E+ +++ ++
Sbjct: 1180 DKKQMIAGELDRSVREIERKISDI 1203
>sp|Q5B1X8|RSE1_EMENI Pre-mRNA-splicing factor rse1 OS=Emericella nidulans (strain FGSC A4
/ ATCC 38163 / CBS 112.46 / NRRL 194 / M139) GN=rse1 PE=3
SV=2
Length = 1209
Score = 207 bits (527), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 281/1165 (24%), Positives = 477/1165 (40%), Gaps = 163/1165 (13%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I TL FR G +D++ I ++ + ++++ S R + + G R
Sbjct: 64 VFGIIRTLAAFRLAGSNKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGVRRVV 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLD---IKFLYG 119
GQ +DP R LI + L V+ +++ +L + + + Q L + G
Sbjct: 123 PGQYLAVDPKGRACLIASVEKNKLVYVLNRNSQAELTISSPLEAHKPQTLVYSVVALDAG 182
Query: 120 CAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADL 166
P L D ++ T YE+ L V W+ + +D + +
Sbjct: 183 YENPVFAALEVDYSESDQDPTGRAYEEVEKLLVYYELDLGLNHVVRK-WT-DPVDRTSSM 240
Query: 167 LIPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIRPS----------ITKAYGRVD 210
L VP GVL+ E+ I Y +N AF+ IP R IT
Sbjct: 241 LFQVPGGADGPSGVLVCAEDNITYRHSNQDAFRVPIPRRKGAMENPERKRCITAGVMHKM 300
Query: 211 ADGSRYLLGDHAG-LLHLLVITHEKEK------VTGLKIELLGETSIASTISYLDNAVVY 263
+LL G L L + E +K V GLKI+ +AS++ L + +Y
Sbjct: 301 RGAFFFLLQTEDGDLFKLTLDMVEDDKGQLTGEVKGLKIKYFDTVPLASSLLILKSGFLY 360
Query: 264 IGSSYGDSQLIKL-----------------NLQPDAKGSYV----------EVLERYVNL 296
+ + G+ + + P A + V ++E +L
Sbjct: 361 VAAEGGNHHFYQFEKLGDDDEETEFNSDDFSADPAAPCTPVYFQPRGAENLNLVEAINSL 420
Query: 297 GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRS 355
P+VD VV++ Q+ T SG + R +++G+ ++E EL + +W+ +
Sbjct: 421 NPLVDSKVVNISEDDAPQIFTVSGTGARSTFRTLKHGLEVSEIVESELPSVPSAVWTTKL 480
Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
+ D FD ++V+SF + T +L++ + +EE GF S TL + L+Q+
Sbjct: 481 TRADEFDAYIVLSFANGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGEDSLIQIHPR 538
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV-KH 473
+R + + R NEW +P S+ A N QV +A G +VY E+ DG L E +
Sbjct: 539 GIRHILADRR--VNEWPAPQHRSIVAAATNERQVAVALSSGEIVYFELDADGSLAEYDER 596
Query: 474 AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRS 532
Q+ ++CL + + E S AVG D +VRI SL PD L K P +
Sbjct: 597 RQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDTTLENKSVQALTAAPSA 655
Query: 533 VLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
+ + A S YL L G L L+ TGEL+D + LG++ + L S
Sbjct: 656 LNIIAMADSSSGGTTLYLHIGLHSGVYLRTALDEVTGELSDTRTRFLGSKAVKLFQVSVT 715
Query: 587 NTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIG 644
T V A S RP + YS K + + ++ + F+S + + + L I
Sbjct: 716 GQTAVLALSSRPWLGYSDTQTKGFMLTPLDYVGLEWGWNFSSEQCVEGMVGIQGQNLRIF 775
Query: 645 TIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFE 703
+I+ + + +SIPL PR + F + N + + RLL+D
Sbjct: 776 SIEKLDNNMLQQSIPLAYTPRHFIKHPEEPLFYVIEADNNVLSPATR---ARLLEDSKAR 832
Query: 704 FIST---------YP--------------------------LDTFEYGCSILSCSF-SDD 727
T YP L+ E SI + F S D
Sbjct: 833 GGDTTVLPPEDFGYPRGTGHWASCIQIIDPLDAKAVVGAVELEENEAAVSIAAVPFTSQD 892
Query: 728 SNVYYCVGTAYVLPEENEPTK--GRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKL 783
+ VGTA + N P+ G I ++ EDGK L+ I + + + +L F G+L
Sbjct: 893 DETFLVVGTAKDM-TVNPPSSAGGYIHIYRFQEDGKELEFIHKTKVEEPPLALLGFQGRL 951
Query: 784 LAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGDLMKSISLLIY 842
LA + +++Y D G ++L +C A+ +QT+G IVV D+ +S++ ++Y
Sbjct: 952 LAGVGSVLRIY-----DLGMKQLLRKCQAAVAPKAIVGLQTQGSRIVVSDVRESVTYVVY 1006
Query: 843 KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERGRL 899
K+++ + D A W +A ++D + G + NL+ VR K SE A +E G
Sbjct: 1007 KYQDNVLIPFVDDSIARWTTAATMVDYETTAGGDKFGNLWLVRCPKKASEEADEEGSGAH 1066
Query: 900 EVVGEYHLGEFVNRFRHGSLVMRLPDSDV-----------GQIPTVIFGTVNGVIGVIAS 948
+ +L NR L++ + D+ G +++ G IG++
Sbjct: 1067 LIHDRGYLQGTPNRLE---LMIHVFTQDIPTSLHKTQLVAGGRDILVWTGFQGTIGILVP 1123
Query: 949 -LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLS 1007
+ E F + L+ L + G +H +RS+ K V +DGDL E + LS
Sbjct: 1124 FVSREDVDFFQSLEMQLASQCPPLAGRDHLIYRSYYAPVKGV-----IDGDLCEQYFLLS 1178
Query: 1008 RTRMDEISKTMNVSVEELCKRVEEL 1032
I+ ++ SV E+ +++ ++
Sbjct: 1179 NDTKMMIAAELDRSVREIERKISDM 1203
>sp|Q52E49|RSE1_MAGO7 Pre-mRNA-splicing factor RSE1 OS=Magnaporthe oryzae (strain 70-15 /
ATCC MYA-4617 / FGSC 8958) GN=RSE1 PE=3 SV=2
Length = 1216
Score = 206 bits (523), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 287/1167 (24%), Positives = 487/1167 (41%), Gaps = 160/1167 (13%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I L FR G ++D+L IA++ + ++++ + +R + + G R
Sbjct: 64 VFGIIRDLASFRLAGSSKDYLIIASDSGRITIVEY-LPAQNRFSRIHLETFGKSGVRRVV 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
GQ DP R LI L V+ +++ +L + + + L + + G
Sbjct: 123 PGQYLAADPKGRACLIASVERCKLVYVLNRNSQAELTISSPLEAHKNGTLTLSLVALDVG 182
Query: 120 CAKPTIVVLY-------QDNKD--ARHVKT----YEVALKDKDFVEGPWSQNNLDNGADL 166
+ P L QD K A++V+T YE+ L V W + +D+ A +
Sbjct: 183 YSNPVFAALEVDYSEVDQDPKGDAAQNVETVLNYYELDLGLNHVVRK-WF-DVVDSTASM 240
Query: 167 LIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITK--AYGRVDADGSRY 216
L VP P GVL+ GEE I Y +N AF+ IP R T+ + R G +
Sbjct: 241 LFQVPGGSDGP--SGVLVCGEENITYRHSNQDAFRVPIPRRKGATEDPSRKRNIVSGVMH 298
Query: 217 LLGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLD 258
L AG L+ T + + +V LKI+ +++ + L
Sbjct: 299 KLKGSAGAFFFLLQTEDGDLFKVTIDMVEDAEGNPTGEVRRLKIKYFDTIPVSNNLCILK 358
Query: 259 NAVVYIGSSYGDS---QLIKL------------NLQPDAKGSY------------VEVLE 291
+ +++ S +G+ Q KL + D K Y + ++E
Sbjct: 359 SGFLFVASEFGNHLFYQFEKLGDDDEELEFFSSDFPVDPKEPYEPVYFYPRPTENLALVE 418
Query: 292 RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGM 350
++ P++D V +L + Q+ T SG + R++++G+ +NE + +L G +
Sbjct: 419 SIDSMNPLMDLKVANLTEEDAPQIYTVSGKGARSTFRMLKHGLEVNEIVASQLPGTPSAV 478
Query: 351 WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
W+ + DD +D ++V+SF + T +L++ + +EE GF S TL + LV
Sbjct: 479 WTTKLRRDDEYDAYIVLSFTNGTLVLSIG--ETVEEVSDTGFLSSVPTLAVQQLGDDGLV 536
Query: 411 QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILT 469
QV +R + + + NEW SP S+ A N QV +A G +VY E+ DG L
Sbjct: 537 QVHPKGIRHIRNG---VVNEWSSPQHRSIVAAATNERQVAVALSSGEIVYFEMDTDGSLA 593
Query: 470 EVKHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGE 527
E + + ++ L + + E S AVG D +VRI SL P+ L +K
Sbjct: 594 EYDEKKEMFGTVTSLSLGEVPEGRLRSSYLAVGC-DDCTVRILSLDPESTLESKSVQALT 652
Query: 528 IIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLR 581
P ++ + + E S YL L G L +L+ TGELTD ++ LG + + L
Sbjct: 653 AAPSALSIMSMEDSSSGGTTLYLHIGLNSGVYLRTVLDEVTGELTDTRQKFLGPKAVRLF 712
Query: 582 TFSSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
S + T V A S R + +S K + +N +E+ F S + +
Sbjct: 713 QVSVQKRTCVLALSSRSWLGFSDPVTKGFTMTPLNYEELEWGWNFVSEQCEEGMVGVNGQ 772
Query: 640 ELTIGTIDDIQKLHI-RSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFV---- 694
L I I+ + I +SIPL PR++ R F N + A E +
Sbjct: 773 FLRIFAIEKLGDNVIQKSIPLTYTPRKLAKHPTQRIFYTIEADNNTLAPELREQLMAAPT 832
Query: 695 ------RLLDDQTFEF---------------------------ISTYPLDTFEYGCSILS 721
R+L F + + LD E S+
Sbjct: 833 AVNGDARVLPPDEFGYPRGNGRWASCISVVDPLGDGEELEPGVVQRIDLDNNEAALSMAV 892
Query: 722 CSF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLN 777
SF S D + VGT ++ T+G I V+ EDG+ L+ I + + + +L
Sbjct: 893 VSFASQDGESFLVVGTGKDMVVNPRRFTEGYIHVYRFSEDGRELEFIHKTKVEEPPTALL 952
Query: 778 AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSI 837
F G+L+A I + +++Y LR R+ Q+E L + + T+G I+VGD+ +
Sbjct: 953 PFQGRLVAGIGRMLRIYDLGLR-QLLRKAQAEVAPQ---LIVSLNTQGSRIIVGDVQHGL 1008
Query: 838 SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNSEGATDEE 895
+ YK E + A D A W + ++D D GA+ NL+ +R + + +DE
Sbjct: 1009 IYVAYKSETNRLIPFADDTIARWTTCTTMVDYDSTAGADKFGNLWILRCPEKASQESDEP 1068
Query: 896 RGRLEVV-GEYHLGEFVNRFRHGSLV--MRLPDS------DVGQIPTVIFGTVNGVIGV- 945
+ +V +L NR + V +P S VG +++G G IGV
Sbjct: 1069 GSEVHLVHSRDYLHGTSNRLALMAHVYTQDIPTSICKTNLVVGGQEVLLWGGFQGTIGVL 1128
Query: 946 IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLD 1005
I + E F + L+ +LR + G +H +R V K +DGDL E +
Sbjct: 1129 IPFVSREDADFFQSLEQHLRSEDPPLAGRDHLMYRGC-----YVPVKGVIDGDLCERYTM 1183
Query: 1006 LSRTRMDEISKTMNVSVEELCKRVEEL 1032
L + I+ ++ SV E+ +++ ++
Sbjct: 1184 LPNDKKQMIAGELDRSVREIERKISDI 1210
>sp|Q9UTT2|RSE1_SCHPO Pre-mRNA-splicing factor prp12 OS=Schizosaccharomyces pombe (strain
972 / ATCC 24843) GN=prp12 PE=1 SV=1
Length = 1206
Score = 201 bits (512), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 264/1176 (22%), Positives = 476/1176 (40%), Gaps = 177/1176 (15%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L+ +G I + R G +D+L + ++ + +L+++ E ++L+
Sbjct: 59 ILNQNCFGIIRNVAPLRLTGFKRDYLVVTSDSGRITILEYNVEKNKLVPIYQETFGKSGI 118
Query: 61 RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY 118
R G+ ID R +I + L V+ D++ L + LE + +I F
Sbjct: 119 RRVVPGEYLAIDAKGRAAMIASVEKNKLVYVLNRDSEANL--TISSPLEAHKANNICFHL 176
Query: 119 -----GCAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLD 161
G A P L D + H T E + + V WS+ +D
Sbjct: 177 IGLDTGYANPIFAALEVDYSEIDHDSTREAFTSSEKVLSYYELDLGLNHVVKRWSKV-VD 235
Query: 162 NGADLLIPVP-----PPLCGVLIIGEETIVYCS-ANAFKAIPI-------RPSITKAYGR 208
+ +LIPVP P G L+I I Y AF IPI +I+ + +
Sbjct: 236 RNSYMLIPVPGGNDGP--SGTLVISNGWISYRHLQKAFHQIPILRRQAASANAISTPWNQ 293
Query: 209 VDADGSR------------------YLLGDHAGLLHLLVITHEKE-KVTGLKIELLGETS 249
V+++ + YLL G L L I H+ + V L+++
Sbjct: 294 VNSNSANDGPLIVSAVLHKMKGSFFYLLQTGDGDLLKLTIEHDGQGNVVELRLKYFDTVP 353
Query: 250 IASTISYLDNAVVYIGSSYGDSQLIKL---------------------------NLQPDA 282
+A ++ L +++ + +G+ QL + N+
Sbjct: 354 LAVQLNILKTGFLFVATEFGNHQLYQFENLGIDDDELEITSLDFQAQDNEVGTKNVHFGV 413
Query: 283 KG-SYVEVLERYVNLGPIVDFCVVDLERQGQG-QVVTCSGAYKDGSLRIVRNGIGINEQA 340
+G + ++E +L + D ++ G+ Q+ T G + SLR +R G+ E
Sbjct: 414 RGLQNLSLVEEIPSLYSLTDTLLMKAPSSGEANQLYTVCGRGSNSSLRQLRRGLETTEIV 473
Query: 341 SVELQGIK-GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTL 399
+ EL G +W+L+ + D +D+++++SF + T +L++ + +EE GF S TL
Sbjct: 474 ASELPGAPIAIWTLKLNQTDVYDSYIILSFTNGTLVLSIG--ETVEEISDSGFLSSVSTL 531
Query: 400 FCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLV 459
+ LVQ+ +R + + + +EWK P V + N Q+++A G LV
Sbjct: 532 NARQMGRDSLVQIHPKGIRYIRANKQT--SEWKLPQDVYVVQSAINDMQIVVALSNGELV 589
Query: 460 YLEIGD----GILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLP 514
Y E+ D G L E + + L ++ L + P+ E S + D +VR+ SL
Sbjct: 590 YFEMSDDVEGGQLNEYQERKTLTANVTSLALGPVQEGSRRSNFMCLAC-DDATVRVLSL- 647
Query: 515 DLNLITKEHLGGE----------IIPRSVLLCAFEGIS--YLLCALGDGHLLNFLLNMKT 562
DL T E+L + IIP +V G+S YL L +G L ++++ +
Sbjct: 648 DL-YTTLENLSVQALSSPANSLCIIPMNV-----NGVSTLYLHIGLMNGVYLRTVIDVTS 701
Query: 563 GELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMC 622
G+L D + LG + + + + KN V A S R + YS + L S + + H
Sbjct: 702 GQLLDTRTRFLGPRAVKIYPITMKNQNTVLAVSSRTFLAYSYQQNLQLSPIAYSAIDHAS 761
Query: 623 PFNSAAFPDSLAIAKEGELTIGTIDDIQ-KLHIRSIPLGEHPRRICHQ---------EQS 672
F S P+ + ++ L I T+D +Q L PL PR+I +
Sbjct: 762 SFASEQCPEGIVAIQKNTLKIFTVDSLQDDLKSDIYPLICTPRKIVKHPNFPVLYILQSE 821
Query: 673 RTF--------------AICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCS 718
R F + K +S + F+ + D + + I PL E S
Sbjct: 822 RNFDSFKYAQENGDVGSSYTKEKQNEHTSKSWVSFISVFDMISKKIIHESPLGDNEAAFS 881
Query: 719 ILSCSFSDDSNVYYCVGTAYVLPEENEPTKG---RILVFIVEDGKLQLIAEKETKGAVYS 775
+ + F + + G+A + E R+ F E KL+LI+ E G +
Sbjct: 882 MTAAFFKNRDEFFLVAGSATNMDLECRTCSHGNFRVYRFHDEGKKLELISHTEIDGIPMA 941
Query: 776 LNAFNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVG 831
L F G++LA + + +++Y K MLR + + ++ + IVV
Sbjct: 942 LTPFQGRMLAGVGRFLRIYDLGNKKMLRKGELSAVP--------LFITHITVQASRIVVA 993
Query: 832 DLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNSE 889
D S+ ++YK E+ + A D W + ++D D G + N++ +R ++
Sbjct: 994 DSQYSVRFVVYKPEDNHLLTFADDTIHRWTTTNVLVDYDTLAGGDKFGNIWLLRCPEHVS 1053
Query: 890 GATDEERGRLEVVGEYHLGEFVNRFRHG-SLVMRLPDSDV-----------GQIPTVIFG 937
DEE +++ H F+N H L+ +D+ G +++
Sbjct: 1054 KLADEENSESKLI---HEKPFLNSTPHKLDLMAHFFTNDIPTSLQKVQLVEGAREVLLWT 1110
Query: 938 TVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLD 996
+ G +GV + E F ++L+ LRK + G +H +RS+ K V +D
Sbjct: 1111 GLLGTVGVFTPFINQEDVRFFQQLEFLLRKECPPLAGRDHLAYRSYYAPVKCV-----ID 1165
Query: 997 GDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
GDL E + L + I+ ++ ++ E+ K++E+
Sbjct: 1166 GDLCEMYYSLPHPVQEMIANELDRTIAEVSKKIEDF 1201
>sp|P0CR22|RSE1_CRYNJ Pre-mRNA-splicing factor RSE1 OS=Cryptococcus neoformans var.
neoformans serotype D (strain JEC21 / ATCC MYA-565)
GN=RSE1 PE=3 SV=1
Length = 1217
Score = 190 bits (482), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 277/1182 (23%), Positives = 476/1182 (40%), Gaps = 186/1182 (15%)
Query: 7 YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
+G I + FR G +D++ ++ + +L++ + +V + G R
Sbjct: 60 FGTIRNIAGFRLAGMTKDYILATSDSGRLSILEFVISPTPHFESLYQEVFGKSGSRRIVP 119
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
GQ +DP R L+G L V+ + +G+L + + + L + G
Sbjct: 120 GQFLAVDPKGRSCLVGSLEKTKLVYVLNRNTEGKLYPSSPLEAHKNHTLVTHIVGVDQGY 179
Query: 121 AKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDNGADLL 167
P L D ++ +H+ YE+ L V WS+ D A+LL
Sbjct: 180 DNPLYAALETDYSESDQDSTGEAYENTQKHLTFYELDLGLNHVVRK-WSEPT-DRRANLL 237
Query: 168 IPVP------------PPLCGVLIIGEETIVYCSANA-FKAIPIRPSITKAYGRVDADGS 214
+ VP P GVL+ E+ I++ + IPI R D S
Sbjct: 238 VQVPGGQNANSDRFEGP--SGVLVCTEDHIIWKHMDVEAHRIPIPRRRNPLVQR--GDKS 293
Query: 215 R-----------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYL 257
R +LL G L+ + I H E V LKI+ +A+++ L
Sbjct: 294 RGLIIVSAVMHKIKGAFFFLLQSEDGDLYKVWIEHNGEDVVALKIKYFDTVPVANSLCIL 353
Query: 258 DNAVVYIGSSYGDSQLIKLNLQPDAKG-------SYVE---------------------- 288
+Y+ S + D L + + G Y E
Sbjct: 354 KRGYIYVASEFSDQNLYQFQSLAEDDGEQEWSSTDYPENGNIDGPLPFAFFDPQPLRNLL 413
Query: 289 VLERYVNLGPIVDFCVVDL--ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
+++ +L PI D VV+L Q+ G + R +++G+ + E S L G
Sbjct: 414 LVDTVPSLDPITDAHVVNLLGASSDTPQIYAACGRGARSTFRTLKHGLDVAEMVSSPLPG 473
Query: 347 IK-GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
+ +W+L+ + DD +D+++V+SF + T +L++ + +EE GF S TL
Sbjct: 474 VPTNVWTLKLTEDDEYDSYIVLSFPNGTLVLSIG--ETIEEVNDTGFLSSGPTLAVQQLG 531
Query: 406 YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG- 464
L+QV +R + + R +EW +PPG ++ AT N QV++A LVY E+
Sbjct: 532 NAGLLQVHPYGLRHIRAADRV--DEWPAPPGQTIVAATTNRRQVVIALSTAELVYFELDP 589
Query: 465 DGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKE 522
+G L+E + + L +C+ I + E + AVG + +V I SL PD L T
Sbjct: 590 EGSLSEYQEKKALPGNATCVTIAEVPEGRRRTSFLAVGC-DNQTVSIISLEPDSTLDTLS 648
Query: 523 HLGGEIIPRSVLLCAFEGIS--------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLG 574
P S+ L S +L L +G LL +++ G L+D + LG
Sbjct: 649 LQALTAPPTSICLAEIFDTSIDKNRATMFLNIGLMNGVLLRTVVDPVDGSLSDTRLRFLG 708
Query: 575 TQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 634
+P L + + V A S R ++Y+ L + + + ++A PD L
Sbjct: 709 AKPPKLVRANVQGQPSVMAFSSRTWLLYTYQDMLQTQPLIYDTLEYAWSLSAAMCPDGLI 768
Query: 635 IAKEGELTIGTIDDI-QKLHIRSIPLGEHPRR-ICHQEQS---------RTFAICSLKNQ 683
L I I + +KL S L PR+ I H S RT++ +++
Sbjct: 769 GISGNTLRIFNIPKLGEKLKQDSTALTYTPRKFISHPFNSVFYMIEADHRTYSKSAIERI 828
Query: 684 SCAEESE--------------------------MHFVRLLDDQTFEFISTYPLDTFEYGC 717
+ESE VR+LD E I T LD E
Sbjct: 829 VKQKESEGRRVDTLLLDLPANEFGRPRAPAGHWASCVRVLDPLANETIMTLDLDEDEAAF 888
Query: 718 SILSCSFS-DDSNVYYCVGTAYVLPEENEPTK-GRILVF-IVEDGK-LQLIAEKETKGAV 773
SI F + VGT + + K G + V+ I E G+ L+ + + +T
Sbjct: 889 SIAIAYFERGGGEPFLVVGTGVKTTLQPKGCKEGYLRVYAIKEQGRILEFLHKTKTDDIP 948
Query: 774 YSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGD 832
L F G LLA I + ++LY+ G + L +C ++G A+ + +G I+VGD
Sbjct: 949 LCLAGFQGFLLAGIGKSLRLYEM-----GKKALLRKCENNGFPTAVVTINVQGARIIVGD 1003
Query: 833 LMKSISLLIYKHEEGAIEER-----ARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKN 887
+ +S +Y+ +I R A D W++ V +D + + N+F R +
Sbjct: 1004 MQESTFYCVYR----SIPTRQLLIFADDSQPRWITCVTSVDYETVACGDKFGNIFINRLD 1059
Query: 888 ---SEGATDEERG------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV---- 934
SE D+ G + ++G H E + + GS+V + + +IP V
Sbjct: 1060 PSISEKVDDDPTGATILHEKSFLMGAAHKTEMIGHYNIGSVV-----TSITKIPLVAGGR 1114
Query: 935 ---IFGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD 990
++ T++G +G + + + F+ L+ ++R + G +H +R + V
Sbjct: 1115 DVLVYTTISGAVGALVPFVSSDDIEFMSTLEMHMRTQDISLVGRDHIAYRGY-----YVP 1169
Query: 991 AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
K +DGDL ESF L + I+ ++ SV ++ K++E++
Sbjct: 1170 IKGVVDGDLCESFSLLPYPKQQAIALDLDRSVGDVLKKLEQM 1211
>sp|P0CR23|RSE1_CRYNB Pre-mRNA-splicing factor RSE1 OS=Cryptococcus neoformans var.
neoformans serotype D (strain B-3501A) GN=RSE1 PE=3 SV=1
Length = 1217
Score = 190 bits (482), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 277/1182 (23%), Positives = 476/1182 (40%), Gaps = 186/1182 (15%)
Query: 7 YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
+G I + FR G +D++ ++ + +L++ + +V + G R
Sbjct: 60 FGTIRNIAGFRLAGMTKDYILATSDSGRLSILEFVISPTPHFESLYQEVFGKSGSRRIVP 119
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
GQ +DP R L+G L V+ + +G+L + + + L + G
Sbjct: 120 GQFLAVDPKGRSCLVGSLEKTKLVYVLNRNTEGKLYPSSPLEAHKNHTLVTHIVGVDQGY 179
Query: 121 AKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDNGADLL 167
P L D ++ +H+ YE+ L V WS+ D A+LL
Sbjct: 180 DNPLYAALETDYSESDQDSTGEAYENTQKHLTFYELDLGLNHVVRK-WSEPT-DRRANLL 237
Query: 168 IPVP------------PPLCGVLIIGEETIVYCSANA-FKAIPIRPSITKAYGRVDADGS 214
+ VP P GVL+ E+ I++ + IPI R D S
Sbjct: 238 VQVPGGQNANSDRFEGP--SGVLVCTEDHIIWKHMDVEAHRIPIPRRRNPLVQR--GDKS 293
Query: 215 R-----------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYL 257
R +LL G L+ + I H E V LKI+ +A+++ L
Sbjct: 294 RGLIIVSAVMHKIKGAFFFLLQSEDGDLYKVWIEHNGEDVVALKIKYFDTVPVANSLCIL 353
Query: 258 DNAVVYIGSSYGDSQLIKLNLQPDAKG-------SYVE---------------------- 288
+Y+ S + D L + + G Y E
Sbjct: 354 KRGYIYVASEFSDQNLYQFQSLAEDDGEQEWSSTDYPENGNIDGPLPFAFFDPQPLRNLL 413
Query: 289 VLERYVNLGPIVDFCVVDL--ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
+++ +L PI D VV+L Q+ G + R +++G+ + E S L G
Sbjct: 414 LVDTVPSLDPITDAHVVNLLGASSDTPQIYAACGRGARSTFRTLKHGLDVAEMVSSPLPG 473
Query: 347 IK-GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
+ +W+L+ + DD +D+++V+SF + T +L++ + +EE GF S TL
Sbjct: 474 VPTNVWTLKLTEDDEYDSYIVLSFPNGTLVLSIG--ETIEEVNDTGFLSSGPTLAVQQLG 531
Query: 406 YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG- 464
L+QV +R + + R +EW +PPG ++ AT N QV++A LVY E+
Sbjct: 532 NAGLLQVHPYGLRHIRAADRV--DEWPAPPGQTIVAATTNRRQVVIALSTAELVYFELDP 589
Query: 465 DGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKE 522
+G L+E + + L +C+ I + E + AVG + +V I SL PD L T
Sbjct: 590 EGSLSEYQEKKALPGNATCVTIAEVPEGRRRTSFLAVGC-DNQTVSIISLEPDSTLDTLS 648
Query: 523 HLGGEIIPRSVLLCAFEGIS--------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLG 574
P S+ L S +L L +G LL +++ G L+D + LG
Sbjct: 649 LQALTAPPTSICLAEIFDTSIDKNRATMFLNIGLMNGVLLRTVVDPVDGSLSDTRLRFLG 708
Query: 575 TQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 634
+P L + + V A S R ++Y+ L + + + ++A PD L
Sbjct: 709 AKPPKLVRANVQGQPSVMAFSSRTWLLYTYQDMLQTQPLIYDTLEYAWSLSAAMCPDGLI 768
Query: 635 IAKEGELTIGTIDDI-QKLHIRSIPLGEHPRR-ICHQEQS---------RTFAICSLKNQ 683
L I I + +KL S L PR+ I H S RT++ +++
Sbjct: 769 GISGNTLRIFNIPKLGEKLKQDSTALTYTPRKFISHPFNSVFYMIEADHRTYSKSAIERI 828
Query: 684 SCAEESE--------------------------MHFVRLLDDQTFEFISTYPLDTFEYGC 717
+ESE VR+LD E I T LD E
Sbjct: 829 VKQKESEGRRVDTLLLDLPANEFGRPRAPAGHWASCVRVLDPLANETIMTLDLDEDEAAF 888
Query: 718 SILSCSFS-DDSNVYYCVGTAYVLPEENEPTK-GRILVF-IVEDGK-LQLIAEKETKGAV 773
SI F + VGT + + K G + V+ I E G+ L+ + + +T
Sbjct: 889 SIAIAYFERGGGEPFLVVGTGVKTTLQPKGCKEGYLRVYAIKEQGRILEFLHKTKTDDIP 948
Query: 774 YSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGD 832
L F G LLA I + ++LY+ G + L +C ++G A+ + +G I+VGD
Sbjct: 949 LCLAGFQGFLLAGIGKSLRLYEM-----GKKALLRKCENNGFPTAVVTINVQGARIIVGD 1003
Query: 833 LMKSISLLIYKHEEGAIEER-----ARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKN 887
+ +S +Y+ +I R A D W++ V +D + + N+F R +
Sbjct: 1004 MQESTFYCVYR----SIPTRQLLIFADDSQPRWITCVTSVDYETVACGDKFGNIFINRLD 1059
Query: 888 ---SEGATDEERG------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV---- 934
SE D+ G + ++G H E + + GS+V + + +IP V
Sbjct: 1060 PSISEKVDDDPTGATILHEKSFLMGAAHKTEMIGHYNIGSVV-----TSITKIPLVAGGR 1114
Query: 935 ---IFGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD 990
++ T++G +G + + + F+ L+ ++R + G +H +R + V
Sbjct: 1115 DVLVYTTISGAVGALVPFVSSDDIEFMSTLEMHMRTQDISLVGRDHIAYRGY-----YVP 1169
Query: 991 AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
K +DGDL ESF L + I+ ++ SV ++ K++E++
Sbjct: 1170 IKGVVDGDLCESFSLLPYPKQQAIALDLDRSVGDVLKKLEQM 1211
>sp|Q4PGM6|RSE1_USTMA Pre-mRNA-splicing factor RSE1 OS=Ustilago maydis (strain 521 / FGSC
9021) GN=RSE1 PE=3 SV=1
Length = 1221
Score = 172 bits (435), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 259/1175 (22%), Positives = 483/1175 (41%), Gaps = 171/1175 (14%)
Query: 7 YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
+G I +L FR G ++D++ + ++ + +L++ +++ L + + R G R
Sbjct: 62 FGVIRSLAAFRLTGGSKDYVIVGSDSGRIVILEYQPKTNSL-EKVHQETFGRSGSRRIVP 120
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIR-------LEELQVLDIKF 116
GQ DP R +IG L ++ D + L + + + + +D+ F
Sbjct: 121 GQYLATDPKGRATMIGAMEKAMLVYILNRDAQANLTISSPLEAHRPSAIIHHIVGVDVGF 180
Query: 117 ---LYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVE---------GPWSQNNLDNGA 164
L+ C + V Y D+ + +E A K + E WS+ +D +
Sbjct: 181 ENPLFACLE----VDYSDSDHDPSGRAFEEAAKTLTYYELDLGLNHVVRKWSEP-VDPRS 235
Query: 165 DLLIPVPPPL----------CGVLIIGEETIVYCSANAFK-AIPIRPSITKAYGRVDADG 213
+LL+ VP GVL+ E+ I Y + + +PI + + G
Sbjct: 236 NLLVQVPGGYNQNLEKWDGPSGVLVCSEDYITYKHQDQPEHRVPIPKRLNPVEKLSERRG 295
Query: 214 SR--------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDN 259
+ +L+ G L + + H+ +++ LKI+ +AS + L +
Sbjct: 296 TLIVASVLHKMKNAFFFLVQTEDGDLFKITMEHQDDEIRSLKIKYFDTVPVASGLVILRS 355
Query: 260 AVVYIGSSYGDSQLIKLN------------------------------LQPDAKGSYVEV 289
+++ S YG L P + V+V
Sbjct: 356 GFLFVASEYGAQLLYSFQKLGDDDDLPEYISTDYDENGAGRRRPQLPTFTPRPLDNLVQV 415
Query: 290 LERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-K 348
E +L PI+D ++ Q+ G S +++R+G+ + E S +L G+
Sbjct: 416 DE-MPSLDPILDAKPLNPLAADSPQIFAACGRGARSSFKMLRHGLEVQEAVSSDLPGVPS 474
Query: 349 GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ 408
+W+ + + D +D+++++SF++ T +L++ + +EE GF + + TL +
Sbjct: 475 AVWTTKITQQDEYDSYIILSFVNGTLVLSIG--ETIEEVSDSGFLTSSSTLAVQQLGQDA 532
Query: 409 LVQVTSGSVRLVSSTSRELRNEWKSP--PG--YSVNVAT-ANASQVLLATGGGHLVYLEI 463
L+QV +R V +++ NEW +P P + VAT N QV++A LVY E+
Sbjct: 533 LLQVHPHGIRHVL-VDKQI-NEWATPSLPNGRQTTIVATCTNERQVVVALSSNELVYFEL 590
Query: 464 G-DGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITK 521
DG L E + + + + + + E + AVG D +VRI SL + +
Sbjct: 591 DMDGQLNEYQERKAMGAGVLTMSMPDCPEGRQRTPYLAVGC-DDSTVRIISLEPNSTLAS 649
Query: 522 EHLGGEIIPRSVLLCA---------FEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVS 572
+ P S + A +++ L +G LL +L+ TG+LTD +
Sbjct: 650 ISIQALTAPASSICMAEMLDATIDRNHATTFVNIGLQNGVLLRTILDAVTGQLTDTRTRF 709
Query: 573 LGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDS 632
LG++ + L V A S R + Y+ +L + + + H F++ P+
Sbjct: 710 LGSKAVRLIRTKVHGQAAVMALSTRTWLSYTYQDRLQFVPLIFDVLDHAWSFSAELCPEG 769
Query: 633 LAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRIC-HQEQSRTFAICSLKNQSCA---- 686
L L I TI + KL S+ L PR+I H + F + ++++ +
Sbjct: 770 LIGIVGSTLRIFTIPSLASKLKQDSVALSYTPRKIANHPNEQGLFYVVEAEHRTLSPGAQ 829
Query: 687 --------EESEMHFVRLLDDQTFEF------------------------ISTYPLDTFE 714
+E + H +LD EF +D E
Sbjct: 830 RRRTEMLGKELKPHQRGVLDLNPAEFGAIRAEAGNWASCIRAVDGVQAQTTHRLEMDDNE 889
Query: 715 YGCSILSCSF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETK 770
SI F S + V VG+A V+ K + + ++++G+ L+L+ + E
Sbjct: 890 AAFSIAVVPFASAEKEVMLVVGSAVDVVLSPRSCKKAYLTTYRLLDNGRELELLHKTEVD 949
Query: 771 GAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIV 829
L AF G+LLA I + +++Y D G ++L +C + A+ + +G IV
Sbjct: 950 DIPLVLRAFQGRLLAGIGKALRIY-----DLGKKKLLRKCENRSFPTAVVSLDAQGSRIV 1004
Query: 830 VGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KN 887
VGD+ +SI YK E + A D +++ +LD D A+ N++ +R N
Sbjct: 1005 VGDMQESIIFASYKPLENRLVTFADDVMPKFVTRCTMLDYDTVAAADKFGNIYVLRLDGN 1064
Query: 888 SEGATDEERGRLEVV-------GEYHLGEFVNRFRHGSLVMRLPDSDV--GQIPTVIFGT 938
+ + DE+ + +V G H V F G ++ L + + G +++
Sbjct: 1065 TSRSVDEDPTGMTIVHEKPVLMGAAHKASLVAHFFVGDIITSLHRTAMVAGGREVLLYTG 1124
Query: 939 VNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDG 997
++G IG + + E L L+++LR+ + G +H +RS K+V +DG
Sbjct: 1125 LSGSIGALVPFVSKEDVDTLSTLESHLRQENNSIVGRDHLAYRSSYAPVKSV-----IDG 1179
Query: 998 DLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
DL E+F LS + + I+ ++ E+ K++ +L
Sbjct: 1180 DLCETFGLLSPAKQNAIAGELDRKPGEINKKLAQL 1214
>sp|Q10426|RIK1_SCHPO Chromatin modification-related protein rik1 OS=Schizosaccharomyces
pombe (strain 972 / ATCC 24843) GN=rik1 PE=1 SV=2
Length = 1040
Score = 131 bits (330), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 226/1060 (21%), Positives = 428/1060 (40%), Gaps = 127/1060 (11%)
Query: 4 VPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPT 63
+P++ + + +RP G +D+LF+ + + WD + ++I V R+ P
Sbjct: 53 IPLFSTVKHIRPYRPPGLDRDYLFVVLNDDTYFSIYWDEDYQKVIVDH-PPVRYRVTFPW 111
Query: 64 DNGQIGIIDPDCRLIGLHLYDGLFKVI---------PFDNKGQLKEAFNIRLEELQVLDI 114
+ D R+ + L +I + F + D+
Sbjct: 112 NRNAKSYCLVDLRMRAIFLSIDEISMICIRILSAEERLKTGRSIDSGFPFSFPVHLIYDM 171
Query: 115 KFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPL 174
L + PT+VVL+ D D +V + + L K +L G L V P +
Sbjct: 172 CILNDSSTPTLVVLHSDGLDC-YVTAFLLDLSSK----------SLGKGIRLFERVKPSM 220
Query: 175 CGV-----LIIGEETIVYCSANA-FKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLL 228
L++ E ++C F I + + + Y++ D G L +
Sbjct: 221 IMPFGKRGLLVFESLFIHCMYRGNFVTINGPCTTYMHWTPLKGQKMHYIVCDTNGYLFGV 280
Query: 229 VITHEKEKVTGLKIELLG--ETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSY 286
+ + L +E L ++++ + +++IGS +S+LI L+ D
Sbjct: 281 YSSILGKNKWSLVMERLPIPPFDFITSLNSIHEGLLFIGSKNSESKLINLSTLKD----- 335
Query: 287 VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
++ NLGPI D V L+ + + C+G ++ SL ++ + ++ ++ G
Sbjct: 336 ---VDSIPNLGPIHDLLV--LKNDIEKSFLVCAGTPRNASLIYFQHALKLDILGQTKISG 390
Query: 347 IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY 406
I L S + L + F SET +A N++++ + E++ S +
Sbjct: 391 ILRAMVLPSYPEHK----LFLGFPSET--VAFNIKEDFQ-LELDPSLSTKERTIALSGTN 443
Query: 407 NQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDG 466
+ VQVTS + + S R + N A ++ G ++ +
Sbjct: 444 GEFVQVTSTFLCIYDSAKRSRLVYIEK----ITNAACYQEYSAIVINGTALAIFKKD--- 496
Query: 467 ILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLIT-KEHLG 525
TEV E EISCLD + + QI VG W+ V I + D + I+
Sbjct: 497 --TEVARKVFESEISCLDFS------AQFQIG-VGFWSK-QVMILTFSDNSSISCAFQTN 546
Query: 526 GEIIPRSVLLCAFEGI----SYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLR 581
+PR+++L EG+ + LL + G G +++L ++ K GT P++ R
Sbjct: 547 VPSLPRNIIL---EGVGVDRNLLLVSSGSGEFKSYVLFKNNLVFSETKH--FGTTPVSFR 601
Query: 582 TFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGEL 641
F+ T++ +D P ++Y N L Y +++ + +C F + D L G L
Sbjct: 602 RFTMNIGTYIICNNDCPHMVYGFNGALCYMPLSMPQSYDVCQFRDNSGKDFLISVSLGGL 661
Query: 642 TIGTIDDIQKLHIRSIPLGEHP-RRICHQEQSRTFAICSLKN-----QSCAEESEMHFVR 695
++ + +L R + L P + I Q + + +L+N +S E + V
Sbjct: 662 KFLQLNPLPELTPRKVLLEHVPLQAIIFQNK---LLLRTLENRYEDYESYKENYHLELVD 718
Query: 696 LLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILV-- 753
DD +F S + E I S VGT+ + ++ P GR+++
Sbjct: 719 SYDDNSFRVFSFTENERCEKVLKINESSL--------LVGTSIIEQDKLVPVNGRLILLE 770
Query: 754 FIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHH 813
F E L++++ AV L +N + + A Q++ + K + + S
Sbjct: 771 FEKELQSLKVVSSMVLSAAVIDLGVYNDRYIVAFGQQVAIVKL---TEERLMIDSRISLG 827
Query: 814 GHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYL 873
+L L V+ G+ I + D + +++ + ++ + R + N + A + + +Y+
Sbjct: 828 SIVLQLIVE--GNEIAIADSIGRFTIMYFDGQKFIVVARYL-FGENIVKAA-LYEGTVYI 883
Query: 874 GAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI-- 931
A N+ L +R N + +R E V YHL + V++F++ P ++
Sbjct: 884 IATNSGLLKLLRYNKDAKNFNDRFICESV--YHLHDKVSKFQN------FPITNTNSFLE 935
Query: 932 PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKV-IKGVGGLNHEQWRSFNNEKKTVD 990
P ++F T G IG I SL ++ L LE+L +RK+ + +++E E +
Sbjct: 936 PKMLFATEIGAIGSIVSLKDKE-LELEELTRKIRKLKFSYLSSMDYESI-----EADLIS 989
Query: 991 AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVE 1030
F+DGDL+ +D+ R E+ + LC+ VE
Sbjct: 990 PVPFIDGDLV---IDVKRWASSELFR--------LCRSVE 1018
>sp|Q9EPU4|CPSF1_MOUSE Cleavage and polyadenylation specificity factor subunit 1 OS=Mus
musculus GN=Cpsf1 PE=1 SV=1
Length = 1441
Score = 102 bits (253), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 112/466 (24%), Positives = 193/466 (41%), Gaps = 74/466 (15%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
+ PF++ P L ++GEL I + +R IPL + + +S
Sbjct: 970 IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1029
Query: 673 RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
+ +A+ + N C + E ++L+ ++E I +
Sbjct: 1030 KVYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPVSWEAIPNAR 1089
Query: 708 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
L+ +E+ + + S + V Y GT + EE +GRIL+ + E
Sbjct: 1090 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1148
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
G K +++ EKE KG V +L NG L++AI QKI L W LR EL
Sbjct: 1149 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1203
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
+ + + +FI+ D+MKSISLL Y+ E + +RD + +V+ + D+
Sbjct: 1204 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1263
Query: 872 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
LG ++ + NL E RL ++H+G VN F R P
Sbjct: 1264 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1317
Query: 929 GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
+ P+ F T++G IG++ + + Y L LQ L ++ GLN
Sbjct: 1318 AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1377
Query: 978 QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+R + +++ + +N LDG+L+ +L LS E++K + +
Sbjct: 1378 AFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1423
Score = 59.3 bits (142), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 108/565 (19%), Positives = 220/565 (38%), Gaps = 151/565 (26%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
+G + ++ + G +D L ++ + K V+++D + +L T ++ ++ D
Sbjct: 74 FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
+ ++ + DPD R + +Y V+PF +++G + E +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLMGEGQRSSFLPSYIID 192
Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
+R L+E L ++D++FL+G +PT+++L++ N+ A T + + +
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252
Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
P WS +L + VP P+ GV+I +++Y + A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312
Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
R T+ R+ D + + ++ G +++L +IT V + +
Sbjct: 313 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369
Query: 250 IASTISYLDNAVVYIGSSYGDSQLIKL--NLQ---------------PDAKGSYVE---- 288
+ +++ ++ +++GS G+S L+K LQ P +K VE
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASSVREAADKEEPPSKKKRVEPAVG 429
Query: 289 ----------------------------------VLERYVNLGPIVDFCVVD---LERQG 311
V + +N+GP + V + L +
Sbjct: 430 WTGGKTVPQDEVDEIEVYGSEAQSGTQLATYSFEVCDSMLNIGPCANAAVGEPAFLSEEF 489
Query: 312 QG------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL------------ 353
Q ++V CSG K+G+L +++ I + EL G MW++
Sbjct: 490 QNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEETP 549
Query: 354 ------------RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFC 401
++ D FL++S T IL E+ E + GF +Q T+F
Sbjct: 550 KAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVFA 607
Query: 402 HDAIYNQ-LVQVTSGSVRLVSSTSR 425
+ N+ +VQV+ +RL+ ++
Sbjct: 608 GNIGDNRYIVQVSPLGIRLLEGVNQ 632
>sp|Q10570|CPSF1_HUMAN Cleavage and polyadenylation specificity factor subunit 1 OS=Homo
sapiens GN=CPSF1 PE=1 SV=2
Length = 1443
Score = 102 bits (253), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 115/460 (25%), Positives = 195/460 (42%), Gaps = 62/460 (13%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
V PF++ P L ++GEL I + +R IPL + + +S
Sbjct: 972 VDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1031
Query: 673 RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
+ +A+ + N CA + E ++L+ ++E I +
Sbjct: 1032 KVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSWEAIPNAR 1091
Query: 708 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
L +E+ + + S + V Y GT + EE +GRIL+ + E
Sbjct: 1092 IELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1150
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
G K +++ EKE KG V +L NG L++AI QKI L W LR EL
Sbjct: 1151 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1205
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
+ + + +FI+ D+MKSISLL Y+ E + +RD + +V+ + D+
Sbjct: 1206 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1265
Query: 872 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR----HGSLVMRLP 924
LG ++ + NL E RL ++H+G VN F G+
Sbjct: 1266 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSK 1325
Query: 925 DSDVGQIPTVI-FGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
S V + + F T++G IG++ + + Y L LQ L ++ GLN +R +
Sbjct: 1326 KSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLH 1385
Query: 984 NEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+++T+ +N LDG+L+ +L LS E++K + +
Sbjct: 1386 VDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1425
Score = 55.8 bits (133), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 107/567 (18%), Positives = 220/567 (38%), Gaps = 153/567 (26%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
+G + ++ + G +D L ++ + K V+++D + +L T ++ ++ D
Sbjct: 74 FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
+ ++ + DPD R + +Y V+PF +++G + E +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIID 192
Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
+R L+E L ++D++FL+G +PT+++L++ N+ A T + + +
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252
Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
P WS +L + VP P+ GV++ +++Y + A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312
Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
R T+ R+ D + + ++ G +++L +IT V + +
Sbjct: 313 R---TQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369
Query: 250 IASTISYLDNAVVYIGSSYGDSQLI----KLNLQP--------------------DAKGS 285
+ +++ ++ +++GS G+S L+ KL P DA
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDATAG 429
Query: 286 Y--------------------------------VEVLERYVNLGPIVDFCVVD---LERQ 310
+ EV + +N+GP + V + L +
Sbjct: 430 WSAAGKSVPQDEVDEIEVYGSEAQSGTQLATYSFEVCDSILNIGPCANAAVGEPAFLSEE 489
Query: 311 GQG------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL-----RSSTDD 359
Q ++V CSG K+G+L +++ I + EL G MW++ + D+
Sbjct: 490 FQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEDN 549
Query: 360 P--------------------FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTL 399
P FL++S T IL E+ E + GF +Q T+
Sbjct: 550 PKGEGTEQEPSTTPEADDDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTV 607
Query: 400 FCHDAIYNQ-LVQVTSGSVRLVSSTSR 425
F + N+ +VQV+ +RL+ ++
Sbjct: 608 FAGNIGDNRYIVQVSPLGIRLLEGVNQ 634
>sp|Q10569|CPSF1_BOVIN Cleavage and polyadenylation specificity factor subunit 1 OS=Bos
taurus GN=CPSF1 PE=1 SV=1
Length = 1444
Score = 99.0 bits (245), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 111/466 (23%), Positives = 193/466 (41%), Gaps = 74/466 (15%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
+ PF++ P L ++GEL I + +R IPL + + +S
Sbjct: 973 IDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1032
Query: 673 RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
+ +A+ + + C + E ++L+ ++E I +
Sbjct: 1033 KVYAVATSTSTPCTRVPRMTGEEKEFETIERDERYVHPQQEAFCIQLISPVSWEAIPNAR 1092
Query: 708 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
L+ +E+ + + S + V Y GT + EE +GRIL+ + E
Sbjct: 1093 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1151
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
G K +++ EKE KG V +L NG L++AI QKI L W LR EL
Sbjct: 1152 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1206
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
+ + + +FI+ D+MKSISLL Y+ E + +RD + +V+ + D+
Sbjct: 1207 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1266
Query: 872 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
LG ++ + NL E RL ++H+G VN F R P
Sbjct: 1267 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1320
Query: 929 GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
+ P+ F T++G IG++ + + Y L LQ L ++ GLN
Sbjct: 1321 AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1380
Query: 978 QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+R + +++ + +N LDG+L+ +L LS E++K + +
Sbjct: 1381 AFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTT 1426
Score = 46.6 bits (109), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 62/326 (19%), Positives = 141/326 (43%), Gaps = 60/326 (18%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
+G + ++ + G +D L ++ + K V+++D + +L T ++ ++ D
Sbjct: 77 FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 136
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
+ ++ + DPD R + +Y V+PF +++G + E +
Sbjct: 137 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIID 195
Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
+R L+E L ++D++FL+G +PT+++L++ N+ A T + + +
Sbjct: 196 VRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 255
Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
P WS +L + VP P+ GV+I +++Y + A P+
Sbjct: 256 PVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 315
Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
R T+ R+ D + + ++ G +++L +IT V + +
Sbjct: 316 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 372
Query: 250 IASTISYLDNAVVYIGSSYGDSQLIK 275
+ +++ ++ +++GS G+S L+K
Sbjct: 373 LTTSMVTMEPGYLFLGSRLGNSLLLK 398
Score = 38.5 bits (88), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 40/172 (23%), Positives = 72/172 (41%), Gaps = 36/172 (20%)
Query: 288 EVLERYVNLGPIVDFCVVD---LERQGQG------QVVTCSGAYKDGSLRIVRNGIGINE 338
EV + +N+GP + + + L + Q ++V CSG K+G+L +++ I
Sbjct: 466 EVCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQV 525
Query: 339 QASVELQGIKGMWSL------------------------RSSTDDPFDTFLVVSFISETR 374
+ EL G MW++ + D FL++S T
Sbjct: 526 VTTFELPGCYDMWTVIAPVRKEQEETLKGEGTEPEPGAPEAEDDGRRHGFLILSREDSTM 585
Query: 375 ILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSSTSR 425
IL E+ E + GF +Q T+F + N+ +VQV+ +RL+ ++
Sbjct: 586 ILQTG--QEIMELDASGFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEGVNQ 635
>sp|Q5A7S5|RSE1_CANAL Pre-mRNA-splicing factor RSE1 OS=Candida albicans (strain SC5314 /
ATCC MYA-2876) GN=RSE1 PE=3 SV=1
Length = 1219
Score = 96.7 bits (239), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 234/1145 (20%), Positives = 457/1145 (39%), Gaps = 185/1145 (16%)
Query: 24 DFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQIGIIDPD--CRLIGLH 81
D + I ++ +LQ+D ++ + I++ ++ G+ IDP+ C L+
Sbjct: 118 DGVVITSDSGNLSILQYDNKTKKFISKIQEPMTKNGWGRNYVGENLAIDPENRCILVAAM 177
Query: 82 LYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKF----------LYGC-----AKPTIV 126
+ LF I ++ G + + + QVL +K L+G K I+
Sbjct: 178 EKNKLFYKIESNSSGSKELSSPLEAHSKQVLCLKIVALNTDHNNPLFGALELTPEKKCII 237
Query: 127 VLYQDNKDARHVKTYEVALKDKDFVEGPWSQNN--LDNGADLLIPVPPPLCGVLIIGEET 184
Y+ ++ HV +K K P S N+ L N + LIP+P + G+++ G
Sbjct: 238 NYYELDQGLNHV------VKKK-----PNSSNSDPLPNDVNYLIPLPGHIGGMVVCGTNW 286
Query: 185 IVYCSANA---FKAIPIRPSITKAYGRVD-----ADGSRY--LLGDHAGLLHLLVITHE- 233
Y + + +P R T+ V+ ++ LL + G L L + ++
Sbjct: 287 CFYDKLDGPRIYLPLPRRNGQTQDSIIVNHVTHVLKKKKFFILLQNALGDLFKLTVDYDF 346
Query: 234 -KEKVTGLKIELLGETSIASTISYLDNAVVY---------------IGSSYGDSQLI--- 274
KE + + I A +++ N ++ +G + +L+
Sbjct: 347 DKEIIKNISITYFDTIPPALSLNIFKNGFLFANVLNNDKLLYQFEKLGDDLTEGELVINS 406
Query: 275 ----KLNLQPDAKGSY----------VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSG 320
LN ++ S+ ++VLE L PI D ++D ++VT S
Sbjct: 407 SDYESLNSVRESVTSFKLKGLDNLALIDVLE---TLSPITDSKIID------SKLVTLSS 457
Query: 321 AYKDGSLRIVRNGIGINEQASVELQ-GIKGMWSLRSSTDDPFDTFLVVSFISETRILAMN 379
++ + +G+ L +++ + S + D +LV+S ++ L ++
Sbjct: 458 H---SYVKSITHGVPTTTLVESPLPITPTDIFTTKLSLESANDEYLVISSSLSSKTLVLS 514
Query: 380 LEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRN-EWKSPPGYS 438
+ + +E+ E F T+ +VQ+ S ++ V + + + +W P G +
Sbjct: 515 IGEVVEDVEDSEFVLDQPTIAVQQVGIASVVQIYSNGIKHVRTVNGNKKTTDWFPPAGIT 574
Query: 439 VNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI-SCLDINPIGENPSYSQI 497
+ AT N QVL+A +VY EI D ++ Q EI + + I EN S
Sbjct: 575 ITHATTNNQQVLIALSNLSVVYFEI-DATDDQLIEYQDRLEIATTITAMAIQENISEKSP 633
Query: 498 AAVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSVLLCAFEGISYLLCALGDGHL 553
A+ +D ++++ SL + N + + L + + E +++ + +G
Sbjct: 634 FAIIGCSDETIQVVSLQEHNCLEIKSLQALSANSSSLKMLKSSGKE--THVHIGMENGVY 691
Query: 554 LNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYSSNKKLLYSN 612
++ G L++ + +G++P++L N + A S P + Y +
Sbjct: 692 ARIKIDTINGNLSNSRVKYIGSKPVSLSVIKFSNEIEGILAISSAPWISYLYRDSFKITP 751
Query: 613 VNLKEVSHMCPFNSAAFP-DSLAIAKEGELTIGTI-------DDIQKLHIRSIPLGEHPR 664
+ ++++ F S + + K+ L I ++ D Q L I + L PR
Sbjct: 752 LLEIDITNGSSFISEDIGGEGIVGIKDNNLIIFSVGKEDSVFDPSQDLTIATTKLRYTPR 811
Query: 665 RICHQEQSRTFAI-----------CSLKNQSCAEESEMHF---------------VRLLD 698
++ +R F C++ E ++ ++++D
Sbjct: 812 KMI-TNGNRLFISESEYNVQGPFKCNINGDVKENVDEDYYEAFGYEWKQNSWASCIQVVD 870
Query: 699 DQTFEFISTYPLDTFEYGCSILSCSFSDDSN-----VYYCVGTAY---VLPEENEPTKGR 750
++ + I + LD E S+ + SF+ S + VG +LP N K
Sbjct: 871 SKSNQVIQSLQLDGNESIVSMSAVSFNKTSTPSVPASHLVVGVCTNQTILP--NSYDKSY 928
Query: 751 ILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC 810
+ F + LQL+ + E L F KLL A I+LY D G ++L +
Sbjct: 929 LYTFKIGKKHLQLVHKTELDHIPQVLENFQDKLLVASGNHIRLY-----DIGQKQLLKKS 983
Query: 811 ----GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIE--ERARDYNANWMSAV 864
+I + QT + I++ D KS S++ K +E + A D ++++
Sbjct: 984 TTIIDFSTNINKIIPQT--NRIIICDSHKS-SIVFAKFDESQNQFVPFADDVMKRQITSI 1040
Query: 865 EILDDDIYLGAENNFNLFTVR----------------KNSEGATDEERGRLEVVGEYHLG 908
LD D +G + N+F R K +G + +L+ + E+H+G
Sbjct: 1041 MNLDIDTLIGGDKFGNIFVTRIDEDISKQADDDWTILKTQDGILNSCPYKLQNLIEFHIG 1100
Query: 909 EFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYL-FLEKLQTNLRKV 967
+ + F G L ++ +VI+ + G IG++ L + + L LQ +++
Sbjct: 1101 DIITSFNLGCL-------NLAGTESVIYTGLQGTIGLLIPLVSKSEVELLFNLQLYMQQS 1153
Query: 968 IKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCK 1027
+ G +H + RS+ N KN +DGDL+E FL+ + EIS+ +N SV ++ K
Sbjct: 1154 QNNLVGKDHLKLRSYYNP-----IKNVIDGDLLERFLEFDISLKIEISRKLNKSVNDIEK 1208
Query: 1028 RVEEL 1032
++ +L
Sbjct: 1209 KLIDL 1213
>sp|Q7XWP1|CPSF1_ORYSJ Probable cleavage and polyadenylation specificity factor subunit 1
OS=Oryza sativa subsp. japonica GN=Os04g0252200 PE=3 SV=2
Length = 1441
Score = 90.5 bits (223), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 87/353 (24%), Positives = 160/353 (45%), Gaps = 33/353 (9%)
Query: 702 FEFISTYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVF--I 755
+E ST P+ FE ++ L + + ++ +GTAYVL E+ +GR+L+F
Sbjct: 1095 WETKSTIPMQLFENALTVRIVTLHNTTTKENETLLAIGTAYVL-GEDVAARGRVLLFSFT 1153
Query: 756 VEDGKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGH 812
+ L+ E KE+KGAV ++ + G LL A KI L KW EL + +
Sbjct: 1154 KSENSQNLVTEVYSKESKGAVSAVASLQGHLLIASGPKITLNKWT-----GAELTAVAFY 1208
Query: 813 HGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIY 872
+ + + +F++ GD+ KSI L +K + + A+D+ + A E L D
Sbjct: 1209 DAPLHVVSLNIVKNFVLFGDIHKSIYFLSWKEQGSQLSLLAKDFGSLDCFATEFLIDGST 1268
Query: 873 LG-----AENNFNLFTVRKNSEGATDEERG-RLEVVGEYHLGEFVNRFRHGSLVMRLPDS 926
L ++ N +F + + +G +L E+H+G + +F + LP
Sbjct: 1269 LSLVASDSDKNVQIFYY---APKMVESWKGQKLLSRAEFHVGAHITKFLR---LQMLPTQ 1322
Query: 927 DVGQIPT----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF 982
+ T ++FG ++G IG IA + + L+ LQ L + V GLN +R F
Sbjct: 1323 GLSSEKTNRFALLFGNLDGGIGCIAPIDELTFRRLQSLQRKLVDAVPHVCGLNPRSFRQF 1382
Query: 983 NNEKK--TVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELT 1033
++ K N +D +L+ S+ LS ++++ + + ++ +++
Sbjct: 1383 HSNGKGHRPGPDNIIDFELLCSYEMLSLDEQLDVAQQIGTTRSQILSNFSDIS 1435
Score = 66.2 bits (160), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 90/393 (22%), Positives = 160/393 (40%), Gaps = 109/393 (27%)
Query: 110 QVLDIKFLYGCAKPTIVVLYQDNKD------ARH----VKTYEVALKDKDFVEGP--WSQ 157
V D F++G +P +V+L++ ++H + + +++ K + P WS
Sbjct: 250 HVKDFAFVHGYIEPVLVILHEQEPTWAGRILSKHHTCMISAFSISMTLK---QHPVIWSA 306
Query: 158 NNLDNGADLLIPVPPPLCGVLIIGEETIVY------CSA--NAFKAIPI-RPSITKAYGR 208
NL + A L+ VPPP+ GVL+I +I Y CS N F + P P I+K+ +
Sbjct: 307 ANLPHDAYQLLAVPPPISGVLVICANSIHYHSQSTSCSLDLNNFSSHPDGSPEISKSNFQ 366
Query: 209 VDADGSR--------YLLGDHAGLLHLLVITHEKEKVTGL-----KIELLGE--TSIAST 253
V+ D ++ + AG + LL + ++ V L K +L TSI ++
Sbjct: 367 VELDAAKATWLSNDIVMFSTKAGEMLLLTVVYDGRVVQRLDLMKSKASVLSSAVTSIGNS 426
Query: 254 ISYL-----DNAVVYIGSSYGDSQLIKL-------------------------------- 276
+L D+ +V S L L
Sbjct: 427 FFFLGSRLGDSLLVQFSYCASKSVLQDLTNERSADIEGDLPFSKRLKRIPSDVLQDVTSV 486
Query: 277 ------------NLQPDAKGSYVEVLERYVNLGPIVDFC----------VVDLERQGQGQ 314
+L+ K SY+ V + +N+GP+ DF + +Q +
Sbjct: 487 EELSFQNIIAPNSLESAQKISYI-VRDALINVGPLKDFSYGLRANADPNAMGNAKQSNYE 545
Query: 315 VVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL-------RSSTDDPFDTFLVV 367
+V CSG K+GSL +++ I + VEL +G+W++ + + D+ + +L++
Sbjct: 546 LVCCSGHGKNGSLSVLQQSIRPDLITEVELPSCRGIWTVYYKSYRGQMAEDNEYHAYLII 605
Query: 368 SFISETRILAMNLEDELEE-TEIEGFCSQTQTL 399
S E R + + D+L E TE + Q T+
Sbjct: 606 SL--ENRTMVLETGDDLGEVTETVDYFVQASTI 636
>sp|Q9V726|CPSF1_DROME Cleavage and polyadenylation specificity factor subunit 1
OS=Drosophila melanogaster GN=Cpsf160 PE=1 SV=1
Length = 1455
Score = 88.2 bits (217), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 108/436 (24%), Positives = 198/436 (45%), Gaps = 64/436 (14%)
Query: 654 IRSIPLGEHPRRICHQEQSRTFAICSL-------------KNQSCAEESE-MHFVR---- 695
+R +PL PR++ + ++R + + + +++ +EES F+
Sbjct: 1026 VRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKYYRFNGEDKELSEESRGERFIYPIGS 1085
Query: 696 -----LLDDQTFEFISTYPLDTFE-----YGCSILSCSFSDDSN---VYYCVGTAYVLPE 742
L+ +T+E + + TFE I+ S+ + Y C+GT +
Sbjct: 1086 QFEMVLISPETWEIVPDASI-TFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNY-S 1143
Query: 743 ENEPTKGRILVF-----IVEDGK------LQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
E+ ++G I ++ + E GK ++ I +KE KG V +++ G L+ + QKI
Sbjct: 1144 EDITSRGNIHIYDIIEVVPEPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKI 1203
Query: 792 QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
Y W LRD +L +I + T I + D+ KSISLL ++ E +
Sbjct: 1204 --YIWQLRDG---DLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSL 1258
Query: 852 RARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDEERGRLEVVGEYH 906
+RD+N + +E + D+ LG AE N ++ + + + ++ L +YH
Sbjct: 1259 ASRDFNPLEVYGIEFMVDNSNLGFLVTDAERNIIVYMYQPEARESLGGQK--LLRKADYH 1316
Query: 907 LGEFVNR-FR----HGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQ 961
LG+ VN FR L R P + V++GT++G +G LP + Y LQ
Sbjct: 1317 LGQVVNTMFRVQCHQKGLHQRQPFLYENK-HFVVYGTLDGALGYCLPLPEKVYRRFLMLQ 1375
Query: 962 TNLRKVIKGVGGLNHEQWRSFNNEKK--TVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
L + + GLN +++R+ + KK ++ +DGDLI S+ ++ + +E++K +
Sbjct: 1376 NVLLSYQEHLCGLNPKEYRTLKSSKKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKIG 1435
Query: 1020 VSVEELCKRVEELTRL 1035
EE+ + E+ RL
Sbjct: 1436 TRTEEILGDLLEIERL 1451
Score = 56.6 bits (135), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 72/332 (21%), Positives = 136/332 (40%), Gaps = 65/332 (19%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
+YG + +L+ G +D L I+ + K VLQ D ++ L T ++ + R
Sbjct: 76 LYGNVMSLQCVSLAGAMRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDDIRGGWT 135
Query: 66 GQIGI----IDPDCRLIGLHLYDGLFKVIPFDNKGQLKE--------------------- 100
G+ + +DPD R + +Y V+PF L E
Sbjct: 136 GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195
Query: 101 ---AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKDAR-HVKTYE-------VA 144
++ I L +L VLDI+FL+G +PT+++LY+ + +K ++
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255
Query: 145 LKDKDFVEG-PWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY-------------CSA 190
L + V W+ N+L + P+ P+ G L++ ++Y SA
Sbjct: 256 LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVPPYGVSLNSSA 315
Query: 191 NAFKAIPIRP------SITKA-YGRVDADGSRYLLGDHAGLLHLLVITHEKEK-VTGLKI 242
+ A P++P S+ A + +D D + ++ G L++L + + + V
Sbjct: 316 DNSTAFPLKPQDGVRISLDCANFAFIDVD--KLVISLRTGDLYVLTLCVDSMRTVRNFHF 373
Query: 243 ELLGETSIASTISYLDNAVVYIGSSYGDSQLI 274
+ + S I L + +++GS G+S L+
Sbjct: 374 HKAAASVLTSCICVLHSEYIFLGSRLGNSLLL 405
Score = 43.5 bits (101), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 44/159 (27%), Positives = 72/159 (45%), Gaps = 26/159 (16%)
Query: 288 EVLERYVNLGPIVDFCV---VDLERQG-------------QGQVVTCSGAYKDGSLRIVR 331
EV + +N+ PI C V+ E G + ++V +G K+G+L +
Sbjct: 485 EVCDSLMNVAPINYMCAGERVEFEEDGVTLRPHAESLQDLKIELVAATGHSKNGALSVFV 544
Query: 332 NGIGINEQASVELQGIKGMWSL------RSSTDDPFDTFLVVSFISETRILAMNLEDELE 385
N I S EL G +W++ +SS +D D F+++S + T +L E+
Sbjct: 545 NCINPQIITSFELDGCLDVWTVFDDATKKSSRNDQHD-FMLLSQRNSTLVLQTG--QEIN 601
Query: 386 ETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSST 423
E E GF T+F + + +VQVT+ VRL+ T
Sbjct: 602 EIENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGT 640
>sp|A8XPU7|CPSF1_CAEBR Probable cleavage and polyadenylation specificity factor subunit 1
OS=Caenorhabditis briggsae GN=cpsf-1 PE=3 SV=1
Length = 1454
Score = 87.0 bits (214), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 84/315 (26%), Positives = 146/315 (46%), Gaps = 35/315 (11%)
Query: 739 VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
V+PE +PT R K++++ +KE KG V L A NG LL+ + QK+ + W
Sbjct: 1153 VVPEPGQPTSNR---------KIKVLYDKEQKGPVTGLCAINGLLLSGMGQKV--FIWQF 1201
Query: 799 RDDGTRELQSECGHHGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYN 857
+D+ + S H ++ L+ ++T + D +S+SL+ ++ E A+ +RD
Sbjct: 1202 KDNDLMGI-SFLDMHYYVYQLHSIRT---IALALDARESMSLIRFQEENKAMSIASRDDR 1257
Query: 858 --ANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEF 910
A A E L D +++G N LF+ + + E RL V ++G
Sbjct: 1258 KCAQAPMASEFLVDGMHIGFLLSDEHGNITLFSYSPEAPESNGGE--RLTVKAAINIGTN 1315
Query: 911 VNRFRHGSLVMRLPDS-------DVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTN 963
+N F L DS ++ Q IFG+++G G I L + Y L LQT
Sbjct: 1316 INAFLRVKGHTSLLDSSSPEERENIEQRMNTIFGSLDGSFGYIRPLTEKSYRRLHFLQTF 1375
Query: 964 LRKVIKGVGGLNHEQWRSFNNEKKTV---DAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
+ V + GL+ + RS + V +A+N +DGD++E +L LS ++++ + V
Sbjct: 1376 IGSVTPQIAGLHIKGARSSKPSQPIVNGRNARNLIDGDVVEQYLHLSVYDKTDLARRLGV 1435
Query: 1021 SVEELCKRVEELTRL 1035
+ + +L R+
Sbjct: 1436 GRYHILDDLMQLRRM 1450
Score = 68.2 bits (165), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 90/399 (22%), Positives = 158/399 (39%), Gaps = 93/399 (23%)
Query: 65 NGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEEL-----QVLDIKFLYG 119
N I DP R +Y ++PF + ++ I L+++ V D+ FL G
Sbjct: 138 NPPIVRTDPANRCAASLVYGKHIAILPFHENSKRILSYIIPLKQIDPRLDNVADMVFLEG 197
Query: 120 CAKPTIVVLYQD----------NKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIP 169
+PTI+ LY+ D + V + D+ F W NL + L+
Sbjct: 198 YYEPTILFLYEPLQTTPGRACVRYDTMCIMGVSVNIVDRQFAVV-WQTANLPMDCNSLLS 256
Query: 170 VPPPLCGVLIIGEETIVYCSA-------------NAFKAIPIRPSITKAYGRVDADGSRY 216
+P PL G ++ G TIVY + + F P++ + +D S Y
Sbjct: 257 IPKPLGGAVVFGSNTIVYLNQAVPPCGIVLNSCYDGFTKFPLK-DMKHLKMTLDCSTSVY 315
Query: 217 L------LGDHAGLLHL--LVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
+ +G G L+L LV + V L+ + +TSIA T++ +++GS
Sbjct: 316 MEDGRIAVGSREGDLYLLRLVTSSGGATVKSLEFSKVCDTSIAFTLTVCAPGHLFVGSRL 375
Query: 269 GDSQLIKLNL----QPDAK----------------------GSYVEV------------- 289
GDSQL++ L + AK G +E+
Sbjct: 376 GDSQLLEYTLLKVTKESAKKQRLEQQNPSEIELDEDDIELYGGAIEMQQNDDDEQISESL 435
Query: 290 ----LERYVNLGPIVDFC----------VVDLERQGQ-GQVVTCSGAYKDGSLRIVRNGI 334
L+R +N+GP+ C ++D +R+ +VT SG K+G+L + + +
Sbjct: 436 QFRELDRLLNVGPVKSMCFGRPNYMSNDLIDAKRKDPVFDLVTASGHGKNGALCVHQRSM 495
Query: 335 GINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISET 373
S L+G + +W++ ++ +L+VS + T
Sbjct: 496 RPEIITSSLLEGAEQLWAVGRKENESH-KYLIVSRVRST 533
>sp|Q5BDG7|CFT1_EMENI Protein cft1 OS=Emericella nidulans (strain FGSC A4 / ATCC 38163 /
CBS 112.46 / NRRL 194 / M139) GN=cft1 PE=3 SV=1
Length = 1339
Score = 86.3 bits (212), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 96/438 (21%), Positives = 192/438 (43%), Gaps = 70/438 (15%)
Query: 654 IRSIPLGEHPRRICHQEQSRTFAI--CSLKNQSCAEESEMH-----------------FV 694
+R++P+G+ ++ + S T+ + C E+ E+H +
Sbjct: 907 MRTVPIGQQIDKLTYVSASDTYVLGTCQRCEFRLPEDDELHPEWRNEEISFLPEVNQSSL 966
Query: 695 RLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY-----YCVGTAYVLPEENEPTKG 749
+++ +T+ I +YPL+ E+ + + S N + VGT+ E+ P++G
Sbjct: 967 KVVSPKTWSVIDSYPLEPAEHIMVMKTMSLEVSENTHERRDMIVVGTSLAR-GEDIPSRG 1025
Query: 750 RILVF----IVEDG-------KLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYKW 796
I VF +V D +L+LI ++ KGAV +L+ G+ L+AA QK +
Sbjct: 1026 CIYVFEVIEVVPDPEQPETNRRLKLIGKEPVKGAVTALSEIGGQGFLIAAQGQKSMVRG- 1084
Query: 797 MLRDDGT----RELQSECGHHGHILALYVQTRGD-FIVVGDLMKSISLLIYKHEEGAIEE 851
L++DG+ + +C +++ + +G + GD +K + Y E +
Sbjct: 1085 -LKEDGSLLPVAFMDMQC-----FVSVIKELKGTGMCIFGDAVKGLWFAGYSEEPYKMSL 1138
Query: 852 RARDYNANWMSAVEILDDD---IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLG 908
A+D + + A + L D + A+++ NL+ ++ + E +L ++H G
Sbjct: 1139 FAKDLDYLEVLAADFLPDGNKLFIVVADSDCNLYVLQYDPEDPNSSNGDKLLNRSKFHTG 1198
Query: 909 EFVN-------------RFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYL 955
F + R GS M + + + V+ + NG IG++ +P E Y
Sbjct: 1199 NFASTVTLLPRTLVSSERAMSGSDKMDI--DNTAPLHQVLVTSHNGSIGLVTCVPEESYR 1256
Query: 956 FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEIS 1015
L LQ+ L ++ GLN +R+ ++ + LD +L+ +LD+S+ R EI+
Sbjct: 1257 RLSALQSQLTNTLEHPCGLNPRAYRAVESDASA--GRGMLDSNLLLQYLDMSKQRKAEIA 1314
Query: 1016 KTMNVSVEELCKRVEELT 1033
+ + E+ +E ++
Sbjct: 1315 GRVGATEWEIRADLEAIS 1332
>sp|A1DB13|CFT1_NEOFI Protein cft1 OS=Neosartorya fischeri (strain ATCC 1020 / DSM 3700 /
FGSC A1164 / NRRL 181) GN=cft1 PE=3 SV=1
Length = 1400
Score = 84.3 bits (207), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 88/371 (23%), Positives = 160/371 (43%), Gaps = 37/371 (9%)
Query: 694 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY-----YCVGTAYVLPEENEPTK 748
++++ +T+ I +Y L EY ++ + N + VGTA+ E+ P++
Sbjct: 1027 LKVVSPRTWTVIDSYSLGPAEYVMAVKNMDLEVSENTHERRNMIVVGTAFAW-GEDIPSR 1085
Query: 749 GRILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYK 795
G I VF V D KL+LI ++ KGAV +L+ G+ L+AA QK +
Sbjct: 1086 GCIYVFEVIKVVPDPEKPETDRKLKLIGKELVKGAVTALSQIGGQGFLIAAQGQKCMVRG 1145
Query: 796 WMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
L++DG+ + ++ + ++GD +K + Y E + +D
Sbjct: 1146 --LKEDGSLLPVAFMDMQCYVNVVKELKGTGMCIMGDAVKGLWFAGYSEEPYKMSLFGKD 1203
Query: 856 YNANWMSAVEILDDD---IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 912
+ A E L D L A+++ NL ++ + E RL ++H+G F
Sbjct: 1204 QGYLEVVAAEFLPDGDKLFILVADSDCNLHVLQYDPEDPKSSNGDRLLARSKFHMGHFAT 1263
Query: 913 RFRHGSLVM-----RLPDSDVGQIPT------VIFGTVNGVIGVIASLPHEQYLFLEKLQ 961
M + D D +I + V+ + +G +G++ S+P E Y L LQ
Sbjct: 1264 TMTLLPRTMVSSEKAMADPDSMEIDSQTISQQVLITSQSGSVGIVTSVPEESYRRLSALQ 1323
Query: 962 TNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+ L ++ GLN +R+ E + LDG+L+ +LD+ + R EI+ +
Sbjct: 1324 SQLTNSLEHPCGLNPRAYRAV--ESDGTAGRGMLDGNLLYQWLDMGQHRKMEIAARVGAH 1381
Query: 1022 VEELCKRVEEL 1032
E+ +E +
Sbjct: 1382 EWEIKADLEAI 1392
>sp|Q4WCL1|CFT1_ASPFU Protein cft1 OS=Neosartorya fumigata (strain ATCC MYA-4609 / Af293 /
CBS 101355 / FGSC A1100) GN=cft1 PE=3 SV=2
Length = 1401
Score = 83.6 bits (205), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 91/371 (24%), Positives = 159/371 (42%), Gaps = 37/371 (9%)
Query: 694 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY-----YCVGTAYVLPEENEPTK 748
++++ +T+ I +Y L EY ++ + N + VGTA+ E+ P++
Sbjct: 1028 LKVVSPRTWTVIDSYSLGPDEYVMAVKNMDLEVSENTHERRNMIVVGTAFAR-GEDIPSR 1086
Query: 749 GRILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYK 795
G I VF V D KL+LI ++ KGAV +L+ G+ L+AA QK +
Sbjct: 1087 GCIYVFEVIKVVPDPEKPETDRKLKLIGKELVKGAVTALSQIGGQGFLIAAQGQKCMVRG 1146
Query: 796 WMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
L++DG+ + ++ L ++GD +K + Y E + +D
Sbjct: 1147 --LKEDGSLLPVAFMDMQCYVNVLKELKGTGMCIMGDAVKGLWFAGYSEEPYKMSLFGKD 1204
Query: 856 YNANWMSAVEILDDD---IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 912
+ A E L D L A+++ NL ++ + E RL ++H+G F
Sbjct: 1205 QGYLEVVAAEFLPDGDKLFILVADSDCNLHVLQYDPEDPKSSNGDRLLARSKFHMGHFAT 1264
Query: 913 RFR-------HGSLVMRLPDS---DVGQIPT-VIFGTVNGVIGVIASLPHEQYLFLEKLQ 961
M PDS D I V+ + +G +G++ S+P E Y L LQ
Sbjct: 1265 TMTLLPRTMVSSEKAMANPDSMEIDSQTISQQVLITSQSGSVGIVTSVPEESYRRLSALQ 1324
Query: 962 TNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+ L ++ GLN +R+ E + LDG+L+ +LD+ + R EI+ +
Sbjct: 1325 SQLANSLEHPCGLNPRAYRAV--ESDGTAGRGMLDGNLLYQWLDMGQHRKMEIAARVGAH 1382
Query: 1022 VEELCKRVEEL 1032
E+ +E +
Sbjct: 1383 EWEIKADLEAI 1393
Score = 33.5 bits (75), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 53/214 (24%), Positives = 90/214 (42%), Gaps = 38/214 (17%)
Query: 109 LQVLDIKFLYGCAKPTIVVLYQ----------DNKDARHVKTYEVALKDKDFVEGPWSQN 158
L + + FLY +PT +LY + KD+ + + L+ + S
Sbjct: 232 LHPVSLAFLYEYREPTFGILYSQIATSHALLSERKDSIFYTVFTLDLEQRASTTL-LSVP 290
Query: 159 NLDNGADLLIPVPPPLCGVLIIGEETIVYC-SANAFKAIPIR--PSITKAYGRVD-ADGS 214
L + ++ +PPP+ G L+IG +V+ A A+ + A+ VD +D +
Sbjct: 291 KLPSDLFKVVALPPPVGGALLIGSNELVHVDQAGKTNAVGVNEFARQVSAFSMVDQSDLA 350
Query: 215 RYLLG-------DHAGLLHL------LVITH---EKEKVTGLKIELL----GET---SIA 251
L G D G L L +V+ H + V+G+ + L G T S A
Sbjct: 351 LRLEGCVVEHISDSTGDLLLVLSSGNMVLVHFQLDGRSVSGISLRPLPTQAGGTIMKSAA 410
Query: 252 STISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGS 285
S+ ++L + V+ GS DS L+ + P+ K S
Sbjct: 411 SSSAFLGSGRVFFGSEDADSVLLSWSSMPNPKKS 444
>sp|A1C3U1|CFT1_ASPCL Protein cft1 OS=Aspergillus clavatus (strain ATCC 1007 / CBS 513.65 /
DSM 816 / NCTC 3887 / NRRL 1) GN=cft1 PE=3 SV=1
Length = 1401
Score = 83.6 bits (205), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 96/413 (23%), Positives = 172/413 (41%), Gaps = 56/413 (13%)
Query: 654 IRSIPLGEHPRRICHQEQSRTFAICSLKNQ--SCAEESEMH-----------------FV 694
++ + +GEH + + S T+ + + + E+ E+H +
Sbjct: 969 LKKVAIGEHVDHLAYSISSETYVLGTSHSADFKLPEDDELHPEWRNEAISFLPELRQCCL 1028
Query: 695 RLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY-----YCVGTAYVLPEENEPTKG 749
+++ +T+ I +Y L E ++ + + N + VGTA E+ P +G
Sbjct: 1029 KVVHPKTWTVIDSYTLGPDEEIMAVKNMNLEVSENTHERKNMIVVGTALAR-GEDIPARG 1087
Query: 750 RILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYKW 796
I VF V D KL+LI ++ KGAV +L+ G+ L+AA QK +
Sbjct: 1088 CIYVFEVIKVVPDPEKPETDRKLKLIGKELVKGAVTALSEIGGQGFLIAAQGQKCMVRG- 1146
Query: 797 MLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDY 856
L++DG+ + ++ L +VGD K I Y E + +D
Sbjct: 1147 -LKEDGSLLPVAFMDVQCYVNVLKELKGTGMCIVGDAFKGIWFAGYSEEPYKMSLFGKDL 1205
Query: 857 NANWMSAVEILDDD---IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
+ A + L D L A+++ NL ++ E +L V ++H+G F +
Sbjct: 1206 EYPEVVAADFLPDGDKLFILVADSDCNLHVLQYEPEDPMSSNGDKLLVRSKFHMGHFTST 1265
Query: 914 FR-----HGSLVMRLPDSDVGQIPT------VIFGTVNGVIGVIASLPHEQYLFLEKLQT 962
S + DSD ++ V+ + +G IG++ S+P E Y L LQ+
Sbjct: 1266 LTLLPRTTASYEIPSADSDSMEVDPRITPQQVLITSQSGSIGIVTSIPEESYRRLSALQS 1325
Query: 963 NLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEIS 1015
L ++ GLN +R+ E + LDG+L+ +L +S+ R EI+
Sbjct: 1326 QLANTVEHPCGLNPRAYRAI--ESDGTAGRGMLDGNLLYQWLSMSKQRRMEIA 1376
>sp|Q9N4C2|CPSF1_CAEEL Probable cleavage and polyadenylation specificity factor subunit 1
OS=Caenorhabditis elegans GN=cpsf-1 PE=3 SV=2
Length = 1454
Score = 83.6 bits (205), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 76/312 (24%), Positives = 146/312 (46%), Gaps = 29/312 (9%)
Query: 739 VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
V+PE ++PT R K++++ +KE KG V L A NG LL + QK+ + W
Sbjct: 1153 VVPEPDQPTSNR---------KIKVLFDKEQKGPVTGLCAINGLLLCGMGQKV--FIWQF 1201
Query: 799 RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYN- 857
+D+ + S H ++ L+ + + D +S+SL+ ++ + A+ +RD
Sbjct: 1202 KDNDLMGI-SFLDMHYYVYQLH--SLRTIAIACDARESMSLIRFQEDNKAMSIASRDDRK 1258
Query: 858 -ANWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN- 912
A A +++ D ++G ++ N+ E RL V ++G +N
Sbjct: 1259 CAQPPMASQLVVDGAHVGFLLSDETGNITMFNYAPEAPESNGGERLTVRAAINIGTNINA 1318
Query: 913 --RFRHGSLVMRLPDSD----VGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
R R + +++L + D + Q T +F +++G G + L + Y L LQT +
Sbjct: 1319 FVRLRGHTSLLQLNNEDEKEAIEQRMTTVFASLDGSFGFVRPLTEKSYRRLHFLQTFIGS 1378
Query: 967 VIKGVGGLNHEQWRSFNNEKKTV---DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVE 1023
V + GL+ + RS + V +A+N +DGD++E +L LS ++++ + V
Sbjct: 1379 VTPQIAGLHIKGSRSAKPSQPIVNGRNARNLIDGDVVEQYLHLSLYDKTDLARRLGVGRY 1438
Query: 1024 ELCKRVEELTRL 1035
+ + +L R+
Sbjct: 1439 HIIDDLMQLRRM 1450
Score = 70.5 bits (171), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 87/395 (22%), Positives = 153/395 (38%), Gaps = 96/395 (24%)
Query: 72 DPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEEL-----QVLDIKFLYGCAKPTIV 126
DP R +Y ++PF + ++ I L+++ + D+ FL G +PTI+
Sbjct: 145 DPSNRCAACLVYGKHIAILPFHENSKRIHSYVIPLKQIDPRLDNIADMVFLDGYYEPTIL 204
Query: 127 VLYQDNK----------DARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCG 176
LY+ + D + V + D+ F W NL L+P+P PL G
Sbjct: 205 FLYEPIQTTPGRACVRYDTMCIMGVSVNIVDRQFAV-VWQTANLPMDCSQLLPIPKPLGG 263
Query: 177 VLIIGEETIVYCSA-------------NAFKAIPIRPSITKAYGRVDADGSRYL------ 217
L+ G T+VY + + F P++ + +D S Y+
Sbjct: 264 ALVFGSNTVVYLNQAVPPCGLVLNSCYDGFTKFPLK-DLKHLKMTLDCSTSVYMEDGRIA 322
Query: 218 LGDHAGLLHL--LVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIK 275
+G G L L L+ + V L+ + ETSIA +++ +++GS GDSQL++
Sbjct: 323 VGSRDGDLFLLRLMTSSGGGTVKSLEFSKVYETSIAYSLTVCAPGHLFVGSRLGDSQLLE 382
Query: 276 LNL---------------QPDAKGSYVEV------------------------------- 289
L D + +E+
Sbjct: 383 YTLLKTTRDCAVKRLKIDNKDPAAAEIELDEDDMELYGGAIEEQQNDDDEQIDESLQFRE 442
Query: 290 LERYVNLGPIVDFCV----------VDLERQGQ-GQVVTCSGAYKDGSLRIVRNGIGINE 338
L+R N+GP+ CV VD +R+ +VT SG K+G+L + + +
Sbjct: 443 LDRLRNVGPVKSMCVGRPNYMSNDLVDAKRRDPVFDLVTASGHGKNGALCVHQRSLRPEI 502
Query: 339 QASVELQGIKGMWSLRSSTDDPFDTFLVVSFISET 373
S L+G + +W++ ++ +L+VS + T
Sbjct: 503 ITSSLLEGAEQLWAVGRKENESH-KYLIVSRVRST 536
>sp|A2R919|CFT1_ASPNC Protein cft1 OS=Aspergillus niger (strain CBS 513.88 / FGSC A1513)
GN=cft1 PE=3 SV=1
Length = 1383
Score = 83.2 bits (204), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 97/438 (22%), Positives = 179/438 (40%), Gaps = 61/438 (13%)
Query: 654 IRSIPLGEHPRRICHQEQSRTFAI--CSLKNQSCAEESEMH-----------------FV 694
++ + LGE + + S + + C + E+ E+H F+
Sbjct: 942 LKRVHLGEQVDHLAYSTSSGMYVLGTCHATDFKLPEDDELHPEWRNEAISFFPSARGSFI 1001
Query: 695 RLLDDQTFE---------FISTYPLDTFEYGCSILSCSFSDDSNVY-----YCVGTAYVL 740
+L+ D + + ++ L EY +I + S N + VGTA+
Sbjct: 1002 KLVWDHHLQRQDSVILIFHLHSFSLGADEYVMAIKNISLEVSENTHERKDMIVVGTAFAR 1061
Query: 741 PEENEPTKGRILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQ 789
E+ P++G I VF V D KL+LI ++ KGAV +L+ G+ + Q
Sbjct: 1062 -GEDIPSRGCIYVFEVVQVVPDPDHPETDRKLKLIGKEPVKGAVTALSEIGGQGFVLVAQ 1120
Query: 790 KIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAI 849
+ L++DG+ + ++ + ++GD +K + Y E +
Sbjct: 1121 GQKCMVRGLKEDGSLLPVAFMDMQCYVSVVKELKGTGMCILGDAVKGVWFAGYSEEPYKM 1180
Query: 850 EERARDYNANWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYH 906
A+D + + A E L D L A+++ N+ ++ + E RL ++H
Sbjct: 1181 SLFAKDLDYLEVCAAEFLPDGKRLFIVVADSDCNIHVLQYDPEDPKSSNGDRLLSRSKFH 1240
Query: 907 LGEFVNRFRHGSLVM----RLPDSDVG-----QIP--TVIFGTVNGVIGVIASLPHEQYL 955
+G F + M ++ S G Q P V+ T NG +G+I +P E Y
Sbjct: 1241 MGNFASTLTLLPRTMVSSEKMVSSSDGMDIDNQSPLHQVLMTTQNGSLGLITCIPEESYR 1300
Query: 956 FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEIS 1015
L LQ+ L ++ GLN +R+ E + LDG+L+ ++D+S+ R EI+
Sbjct: 1301 RLSALQSQLTNTLEHPCGLNPRAFRAV--ESDGTAGRGMLDGNLLFKWIDMSKQRKTEIA 1358
Query: 1016 KTMNVSVEELCKRVEELT 1033
+ E+ +E ++
Sbjct: 1359 GRVGAREWEIKADLEAIS 1376
Score = 38.1 bits (87), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 73/348 (20%), Positives = 132/348 (37%), Gaps = 105/348 (30%)
Query: 109 LQVLDIKFLYGCAKPTIVVLYQ----------DNKDARHVKTYEVALKDKDFVEGPWSQN 158
L + + FLY +PT +LY + KD + + L+ + S +
Sbjct: 225 LHPISLAFLYEYREPTFGILYSQVATSSALLPERKDVVFYTVFTLDLEQQASTV-LLSVS 283
Query: 159 NLDNGADLLIPVPPPLCGVLIIGEETIVYC----SANAFKAIPIRPSITKAYGRVDADGS 214
L + ++ +PPP+ G L+IG +V+ NA ++ +D +
Sbjct: 284 RLPSDLFRVVALPPPVGGALLIGSNELVHIDQAGKTNAVGVNEFSRQVSSFSMTDQSDLA 343
Query: 215 RYL-------LGDHAGLLHLLVITHEK---------EKVTGLKIELL----GETSI---A 251
L LGD +G + L++ T E V+G+ + LL G TSI A
Sbjct: 344 LRLENCIVECLGDSSGDMLLVLTTGEMAIVKFKLDGRSVSGISVHLLPAHAGLTSIYSAA 403
Query: 252 STISYLDNAVVYIGSSYGDSQLI----------KLNLQ---------------------- 279
+ +++ + +++GS GDS L+ K LQ
Sbjct: 404 AASTFIGDGKIFLGSEDGDSVLLGYSYSSSSTKKHRLQAKQVIDDSADMSEEDQSDDDVY 463
Query: 280 --------PD-------------AKGSY-VEVLERYVNLGPIVDFCV-----VDLERQG- 311
PD A G Y + ++ +N+GP+ D + +LE+ G
Sbjct: 464 EDDLYSTSPDTTLTGRRPSGESSAFGLYDFRIHDKLINIGPLRDITMGKRLSTNLEKTGD 523
Query: 312 -------QGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ Q+V G++K G L ++ I + AS+ L+ + +W+
Sbjct: 524 RTNSTSPELQIVASQGSHKSGGLVVMAREIDPHVVASISLESVDCIWT 571
>sp|Q6BYK1|RSE1_DEBHA Pre-mRNA-splicing factor RSE1 OS=Debaryomyces hansenii (strain ATCC
36239 / CBS 767 / JCM 1990 / NBRC 0083 / IGC 2968)
GN=RSE1 PE=3 SV=2
Length = 1256
Score = 79.3 bits (194), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/306 (27%), Positives = 133/306 (43%), Gaps = 52/306 (16%)
Query: 761 LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTREL----QSECGHHGHI 816
L+ + + E ++ FNG+LL ++ ++LY D G R+L S + +I
Sbjct: 963 LEFVHKTELDYQPTAIIPFNGRLLVGMSNFLRLY-----DLGQRQLLRKASSNIEYLKNI 1017
Query: 817 LALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAE 876
+ L Q G IVVGD S + + Y E A D ++A+ LD D +G +
Sbjct: 1018 IRLTHQG-GSRIVVGDSSMSTTFVKYDSTENQFIPFADDIMKRQITALVTLDYDTIIGGD 1076
Query: 877 NNFNLFTVR----------------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLV 920
N+F R + E + RL+ + E++L + F GSLV
Sbjct: 1077 KFGNIFVSRVPETISQQSDKDWSLLRYQESYLNGSGSRLKNICEFYLQDIPTSFTKGSLV 1136
Query: 921 MRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYL-FLEKLQTNLRKVI--------KGV 971
M G ++I+ + G +G++ L E + FL LQ LRK K
Sbjct: 1137 M-------GGKESIIYTGIQGTLGLLLPLSTENEVKFLGDLQLLLRKYFDYNFDDFDKDK 1189
Query: 972 GGLN-----HEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELC 1026
G N H ++RS+ N KN +DGDLIE F +LS++ I +N + E+
Sbjct: 1190 NGYNLLGKDHLKFRSYYNP-----VKNVMDGDLIERFYELSQSMKIRIGTELNRTPREIE 1244
Query: 1027 KRVEEL 1032
K++ E+
Sbjct: 1245 KKISEM 1250
Score = 63.5 bits (153), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 141/680 (20%), Positives = 264/680 (38%), Gaps = 90/680 (13%)
Query: 7 YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNG 66
+G I ++ R G +D++ I ++ K VL++D E + + S R T G
Sbjct: 67 FGIIQGIDKIRLVGTQKDYVVITSDSGKLVVLEFDIEKLQFVPLFQEPHSKNGLRRTSPG 126
Query: 67 QIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLK-----EAFNIRLEELQV--LDIKF- 116
+ +DP R LIG + L + +++G+L+ E F+ LQ+ +D F
Sbjct: 127 EYLCVDPHNRAILIGAIEKNKLVYKVQSNDEGKLELSSPLETFSKHTLTLQICAMDTGFE 186
Query: 117 --LYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEG------PWSQNNLDNGADLLI 168
++ + QDN + + L + +G S + + LI
Sbjct: 187 NPMFAAIECDYNARQQDNGEEEDAGEASLLLNYYELDQGLNHVVKHKSNEKIPGSSSHLI 246
Query: 169 PVPPPLCGVLIIGEETIVYCSANAFK---AIPIRPS------ITKAYGRVDADGSRYLLG 219
P+P + G+L+ + TI+Y + K IPIR + + R+ + L+
Sbjct: 247 PLPDFIGGLLVCSKSTIIYAHPSKDKLYLPIPIRSNTNETLIVNHVIHRLKKNNFFILVQ 306
Query: 220 DHAGLLHLLVITHEK--EKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS---QLI 274
G + I H++ E + + I ++ +++ + ++ + + Q
Sbjct: 307 SQLGDCFKITIDHDEVNESIENINITYFDTIPLSQSLNIFKSGFLFANVATNNKLFYQFE 366
Query: 275 KL-------NLQPDAKGSYVEVLERYVN--------------------LGPIVDFCVVDL 307
KL LQ Y + E ++ L PI D +++
Sbjct: 367 KLGDDNNNTTLQSCNFSDYNSIFELDISKRSFKVAGLENLALVDIMETLNPITDGALIET 426
Query: 308 ERQGQGQVVTCSGAYKDGS-LRIVRNGIGINEQASVELQGIK--GMWSLRSSTDDPFDTF 364
R A S L+ + +GI N S L IK + + R + D +
Sbjct: 427 LRPEVPDPFKQLTALSSHSYLKTLTHGISTNTVVSSPLP-IKPTAIHTTRIFAESANDEY 485
Query: 365 LVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTS 424
LV+S ++ L +++ + +EE F + T+ + +VQ+ S +R + T
Sbjct: 486 LVISSTLSSQTLVLSIGEVVEEVNDSQFVTNEPTINVQQVGKSSVVQIYSNGIRHIKHTM 545
Query: 425 R-----ELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG--DGILTEVKHAQLE 477
R + +W P G S+ A+ N QV++ + Y EI D L E + +LE
Sbjct: 546 RNDTIEKKYTDWYPPAGISIIQASTNNEQVIIGLSNREICYFEIDPHDDQLVEYQE-RLE 604
Query: 478 YE--------ISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD---LNLITKEHLGG 526
IS I+ + SY A VG +D +++ SL L ++T + L
Sbjct: 605 MSGGSISALAISSSSISKLQRKSSY---AIVG-CSDETIQAISLKPHNCLEIVTLQALSA 660
Query: 527 EIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
S+ + + + + +G + ++ TG+L+D + LG++P+ L
Sbjct: 661 N--SSSIAMVPHGYSTSVHIGMENGLYVRVTIDEITGKLSDTRIQFLGSKPVQLSVIGLP 718
Query: 587 NTTH--VFAASDRPTVIYSS 604
+ A S RP + Y S
Sbjct: 719 QLQQNGLLAISSRPWIGYYS 738
>sp|Q9FGR0|CPSF1_ARATH Cleavage and polyadenylation specificity factor subunit 1
OS=Arabidopsis thaliana GN=CPSF160 PE=1 SV=2
Length = 1442
Score = 78.2 bits (191), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 78/302 (25%), Positives = 137/302 (45%), Gaps = 24/302 (7%)
Query: 702 FEFISTYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE 757
+E + P+ T E+ ++ L + + ++ VGTAYV E+ +GR+L+F
Sbjct: 1097 WETKAKIPMQTSEHALTVRVVTLLNASTGENETLLAVGTAYV-QGEDVAARGRVLLFSFG 1155
Query: 758 ---DGKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
D ++ E +E KGA+ ++ + G LL + KI L+KW +GT
Sbjct: 1156 KNGDNSQNVVTEVYSRELKGAISAVASIQGHLLISSGPKIILHKW----NGTELNGVAFF 1211
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDD-- 869
+ + + FI++GD+ KSI L +K + + A+D+ + A E L D
Sbjct: 1212 DAPPLYVVSMNVVKSFILLGDVHKSIYFLSWKEQGSQLSLLAKDFESLDCFATEFLIDGS 1271
Query: 870 DIYLGAENNFNLFTVRKNSEGATDEERG-RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
+ L + V + + +G +L E+H+G V++F L +++ S
Sbjct: 1272 TLSLAVSDEQKNIQVFYYAPKMIESWKGLKLLSRAEFHVGAHVSKF----LRLQMVSSGA 1327
Query: 929 GQIP--TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEK 986
+I ++FGT++G G IA L + L+ LQ L + V GLN +R F +
Sbjct: 1328 DKINRFALLFGTLDGSFGCIAPLDEVTFRRLQSLQKKLVDAVPHVAGLNPLAFRQFRSSG 1387
Query: 987 KT 988
K
Sbjct: 1388 KA 1389
Score = 77.0 bits (188), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 95/403 (23%), Positives = 159/403 (39%), Gaps = 108/403 (26%)
Query: 103 NIR-LEELQVLDIKFLYGCAKPTIVVLYQDNK------DARHVKTYEVALK-DKDFVEGP 154
N+R LE V D FL+G +P IV+L ++ +H AL + + P
Sbjct: 240 NLRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINSTLKQHP 299
Query: 155 --WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDAD 212
WS NL + A L+ VP P+ GVL++ TI Y S +A A+ + + A +
Sbjct: 300 VIWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELP 359
Query: 213 GSRY-----------------LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTIS 255
S + LL +G L LL + ++ V L + + +AS I+
Sbjct: 360 ASNFSVELDAAHGTWISNDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDIT 419
Query: 256 YLDNAVVYIGSSYGDSQLI------------------------------KLNLQPD---- 281
+ N++ ++GS GDS L+ +L + D
Sbjct: 420 SVGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRLRMTSDTFQD 479
Query: 282 -------------------AKGSY-VEVLERYVNLGPIVDFC----------VVDLERQG 311
A+ S+ V + VN+GP+ DF + +Q
Sbjct: 480 TIGNEELSLFGSTPNNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGVSKQS 539
Query: 312 QGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL--------------RSST 357
++V CSG K+G+L ++R I VEL G KG+W++ ++
Sbjct: 540 NYELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADSSKMAAD 599
Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEE-TEIEGFCSQTQTL 399
+D + +L++S E R + + D L E TE + Q +T+
Sbjct: 600 EDEYHAYLIISL--EARTMVLETADLLTEVTESVDYYVQGRTI 640
>sp|Q2TZ19|CFT1_ASPOR Protein cft1 OS=Aspergillus oryzae (strain ATCC 42149 / RIB 40)
GN=cft1 PE=3 SV=1
Length = 1393
Score = 70.5 bits (171), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 76/331 (22%), Positives = 139/331 (41%), Gaps = 40/331 (12%)
Query: 734 VGTAYVLPEENEPTKGRILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGK 782
VGTA+ E+ ++G + VF V D KL+L+ ++ KGAV +L+ G+
Sbjct: 1065 VGTAFARGEDIA-SRGCVYVFEVIKVVPDPKRPEMDRKLRLVGKEPVKGAVTALSEIGGQ 1123
Query: 783 LLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIY 842
+ Q + L++DG+ + H+ + ++ D +K + Y
Sbjct: 1124 GFLIVAQGQKCIVRGLKEDGSLLPVAFMDVQCHVSVVKELKGTGMCIIADAVKGLWFAGY 1183
Query: 843 KHEEGAIEERARDYNANWMSAVEILDDD---IYLGAENNFNLFTVRKNSEGATDEERGRL 899
E + A+D + + A + L D L A+++ NL ++ + E RL
Sbjct: 1184 SEEPYKMSLFAKDLDYLEVLAADFLPDGNKLFILVADSDCNLHVLQYDPEDPKSSNGDRL 1243
Query: 900 EVVGEYHLGEFVNRFRHGSLVMRLPDSDVG---------------QIP--TVIFGTVNGV 942
++H G F+ S + LP + V +IP ++ + NG
Sbjct: 1244 LSRSKFHTGNFI------STLTLLPRTSVSSEQMISDVDAMDVDIKIPRHQMLITSQNGS 1297
Query: 943 IGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIES 1002
+G++ + E Y L LQ+ L I+ GLN +R+ E + LDG L+
Sbjct: 1298 VGLVTCVSEESYRRLSALQSQLTNTIEHPCGLNPRAFRAV--ESDGTAGRGMLDGKLLFQ 1355
Query: 1003 FLDLSRTRMDEISKTMNVSVEELCKRVEELT 1033
+LD+S+ R EI+ + + E+ E ++
Sbjct: 1356 WLDMSKQRKVEIASRVGANEWEIKADFEAIS 1386
>sp|Q6FLQ6|RSE1_CANGA Pre-mRNA-splicing factor RSE1 OS=Candida glabrata (strain ATCC 2001
/ CBS 138 / JCM 3761 / NBRC 0622 / NRRL Y-65) GN=RSE1
PE=3 SV=1
Length = 1296
Score = 68.9 bits (167), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 80/349 (22%), Positives = 155/349 (44%), Gaps = 58/349 (16%)
Query: 249 SIASTISYLDNAVVYIGSSYGDSQLIKLNLQP-----------DAKGSYVEVLERYVNLG 297
S +S + +N +YI S + + LN + D + + V+ ++ N+
Sbjct: 340 SNSSKLHIFNNGSMYINSQFNYDH-VYLNFESIGDNDENYDKIDNENENISVISKHTNIN 398
Query: 298 PI-VDFCVVD------LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVEL-QGIKG 349
PI ++ C+++ + QG + + I+RN I + E S L QG+
Sbjct: 399 PIALNLCLMENMPLTFMHFQGGNRTTDSE------KVNIIRNAIPLKEYVSSPLPQGVSN 452
Query: 350 MWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY--- 406
++++++ + +F+ ++ I+ T ++ ++ + IE + + T D +
Sbjct: 453 IFTIKTQYQ-SYHSFIFLTMINFTTVIL-----KIADDSIEQYIPASDTFKLKDDMTIHV 506
Query: 407 -----NQLVQVTSGSVRLVSSTSRELRN-----EWKSPPGYSVNVATANASQVLLATGGG 456
N ++QV R + S++ N +W P G S+ A +N SQ++LA
Sbjct: 507 ATMGDNSIIQVCKDEFRQILLDSKDEENFKMNLKWYPPAGVSILSAVSNFSQLILALSNN 566
Query: 457 HLVYLEIGDGILTEVKH-AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD 515
+VYL++ + L E K+ +L I+ L + + +N S+I AVG +D V + SL
Sbjct: 567 EIVYLQLENNTLIEYKNRPELPDVITSLAL--LNDNTKKSEILAVGT-SDNMVNVLSLE- 622
Query: 516 LNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGH-LLNFLLNMKTG 563
I E + E +V+ A + I L L GH L+N + ++ G
Sbjct: 623 ---IVDEAISFE----TVVFQALDAIPSSLLILNQGHKLVNLHIGVEDG 664
>sp|Q1E5B0|CFT1_COCIM Protein CFT1 OS=Coccidioides immitis (strain RS) GN=CFT1 PE=3 SV=1
Length = 1387
Score = 68.2 bits (165), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 87/372 (23%), Positives = 159/372 (42%), Gaps = 37/372 (9%)
Query: 694 VRLLDDQTFEFISTYPLDTFEYGCSILSCSF-----SDDSNVYYCVGTAYVLPEENEPTK 748
++LL +T+ + +Y L E + + + + + VGTA V E+ P +
Sbjct: 1015 IKLLSPRTWSVVDSYELGDAERVMCMKTINMEISEITHEMKDMLVVGTATVRGEDITP-R 1073
Query: 749 GRILVF-IVE----------DGKLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYK 795
G I VF I+E + KL++ A+ + KGAV +++ G+ L+ A QK +
Sbjct: 1074 GSIYVFEIIEVAPDPDRPETNRKLKIFAKDDVKGAVTAVSGIGGQGFLIMAQGQKCMVRG 1133
Query: 796 WMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
L++DG+ + ++ L ++GD +K I Y E + +D
Sbjct: 1134 --LKEDGSLLPVAFMDMQCYVKVLKELQGTGLCIMGDALKGIWFAGYSEEPYRLTLFGKD 1191
Query: 856 YNANWMSAVEILDDD--IY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 912
+ A + L D +Y L A+++ + + + E T + RL +H G F +
Sbjct: 1192 NEYLQVIAADFLPDGKRLYILVADDDCTIHVLEYDPEDPTSSKGDRLLHRSSFHTGHFTS 1251
Query: 913 RF-----RHGSLVMRLP---DSDVGQIPT---VIFGTVNGVIGVIASLPHEQYLFLEKLQ 961
S P D DV +P V+ + G IGV+ L + Y L LQ
Sbjct: 1252 TMTLLPEHSSSPSADDPEEDDMDVDYVPKSYQVLVTSQEGSIGVVTPLTEDSYRRLSALQ 1311
Query: 962 TNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+ L ++ GLN + +R+ E + +DG+L+ +LD+ R EI+ +
Sbjct: 1312 SQLVTSMEHPCGLNPKAYRAV--ESDGFGGRGIVDGNLLLRWLDMGVQRKAEIAGRVGAD 1369
Query: 1022 VEELCKRVEELT 1033
+E + +E ++
Sbjct: 1370 IESIRVDLETIS 1381
>sp|Q6CAH5|RSE1_YARLI Pre-mRNA-splicing factor RSE1 OS=Yarrowia lipolytica (strain CLIB 122
/ E 150) GN=RSE1 PE=3 SV=1
Length = 1143
Score = 64.7 bits (156), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 155/735 (21%), Positives = 283/735 (38%), Gaps = 102/735 (13%)
Query: 350 MWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQL 409
+W++R D ++V+S+ + T L + + D + ET G TL C ++ +
Sbjct: 459 LWTMRDGAGS--DKYIVLSYANAT--LVLEIGDSVVETTSSGLTLDKPTLHC-GSVGSSY 513
Query: 410 VQVTSGSVRLVSSTSRELRNE------WKSPPGYSVNVATANASQVLLATGGGHLVYLEI 463
VQV + + ++ SRE +E W +P G V A++++ QV+L L Y E
Sbjct: 514 VQVMTDGMNVIP-MSREGSSESLPATKWTAPSG-QVICASSSSHQVVLGLTSS-LFYFED 570
Query: 464 GDGILTEVKHAQLEYEIS----CLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNL 518
G +E+ YE+S + + P+ S AV D +VRI S+ P+
Sbjct: 571 TPG--SELSAYDGAYELSSPPTAVAVAPVPAGRVRSPFVAVAT-DDETVRIVSVDPESMF 627
Query: 519 ITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDR--KKVSLGTQ 576
T G S+ L + + YL L +G + L+ TGE+ K V LG
Sbjct: 628 ETVAVQGLMATASSLALLSVGQVLYLHMGLANGVYVRVELDPLTGEIVGSWSKFVGLGRL 687
Query: 577 PITLRTFSSKNTT------------HVFAASDRPTVIYSSNKKLLYS--NVNLKEVSHMC 622
+ T + + HV A SD V N ++ ++ + +
Sbjct: 688 SVVPVTCGGEESILVSSRGVKTCLGHVNATSDT-WVPTGGNSAPFFALDAISGEPLDLAH 746
Query: 623 PFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKN 682
F++ P + L I T++ QK + L +R+ Q + T I +
Sbjct: 747 SFHTQDCPHGVIGVAGSTLKIFTVNTAQKWTENEVKLEGTAKRLI-QHDATTLTITQNPD 805
Query: 683 QSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPE 742
+ + +D+ D SI F D Y+ VG +
Sbjct: 806 RLVS----------VDNGAVGITK----DLGGPPTSICEVMFGDGKR-YFAVGGSRDGSP 850
Query: 743 ENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDG 802
T G I +F L + E + +L A+NG L+A I +++LY L+
Sbjct: 851 GTSGTSGYISIF--SSSSLGHVHTTEVEAPPLALCAYNGLLVAGIGSQVRLYALGLKQV- 907
Query: 803 TRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGA--IEERARDYNANW 860
R+ Q E LA + + + + VGD+ +S+++ + E+ I D +
Sbjct: 908 LRKAQIELSKRVTCLAHFAGS--NRVAVGDIRQSVTVCVVLEEDSGHVIYPLVCDKISRQ 965
Query: 861 MSAVEILD-DDIYLGAE-NNFNLFTVRKNSEGATDEERGRLEVV-------GEYHLGEFV 911
++ + +D + + LG F + + + DE+ + + G H
Sbjct: 966 VTCLFFVDYETVALGDRFGGFTMLRIPSEASKLADEDHNAVHLRQLEPTLNGPAHF---- 1021
Query: 912 NRFRHGSLVMRLPDSDVGQIPTVI------------FGTVNGVIGVIASLPHEQYLFLEK 959
RF H + + +P I GTV+ + V++ +Q L+
Sbjct: 1022 -RFDH------VASFHIEDVPVAIHMYNDYLVVCGLLGTVSAFVPVVSP---KQSRDLKT 1071
Query: 960 LQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
++ + G+ G +H ++R + V K +DGD++ L + R +E+ +
Sbjct: 1072 IEKFVCASDPGLMGRDHGRFRGYY-----VPVKEVVDGDMLREVLVMDEKRREEVGEKTG 1126
Query: 1020 VSVEELCKRVEELTR 1034
+ VE RV + +
Sbjct: 1127 LGVEGAVGRVVNVMK 1141
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.319 0.138 0.404
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 391,240,282
Number of Sequences: 539616
Number of extensions: 17458044
Number of successful extensions: 39427
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 46
Number of HSP's successfully gapped in prelim test: 20
Number of HSP's that attempted gapping in prelim test: 39003
Number of HSP's gapped (non-prelim): 166
length of query: 1036
length of database: 191,569,459
effective HSP length: 128
effective length of query: 908
effective length of database: 122,498,611
effective search space: 111228738788
effective search space used: 111228738788
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 67 (30.4 bits)