BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 001655
         (1036 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|Q9M0V3|DDB1A_ARATH DNA damage-binding protein 1a OS=Arabidopsis thaliana GN=DDB1A PE=1
            SV=1
          Length = 1088

 Score = 1974 bits (5114), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 952/1036 (91%), Positives = 1005/1036 (97%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWD ESSELITRAMGDVSDRIG
Sbjct: 53   MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDPESSELITRAMGDVSDRIG 112

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL+GC
Sbjct: 113  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLFGC 172

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
            AKPTI VLYQDNKDARHVKTYEV+LKDKDFVEGPWSQN+LDNGADLLIPVPPPLCGVLII
Sbjct: 173  AKPTIAVLYQDNKDARHVKTYEVSLKDKDFVEGPWSQNSLDNGADLLIPVPPPLCGVLII 232

Query: 181  GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
            GEETIVYCSA+AFKAIPIRPSITKAYGRVD DGSRYLLGDHAG++HLLVITHEKEKVTGL
Sbjct: 233  GEETIVYCSASAFKAIPIRPSITKAYGRVDVDGSRYLLGDHAGMIHLLVITHEKEKVTGL 292

Query: 241  KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
            KIELLGETSIASTISYLDNAVV++GSSYGDSQL+KLNL PDAKGSYVEVLERY+NLGPIV
Sbjct: 293  KIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKLNLHPDAKGSYVEVLERYINLGPIV 352

Query: 301  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
            DFCVVDLERQGQGQVVTCSGA+KDGSLR+VRNGIGINEQASVELQGIKGMWSL+SS D+ 
Sbjct: 353  DFCVVDLERQGQGQVVTCSGAFKDGSLRVVRNGIGINEQASVELQGIKGMWSLKSSIDEA 412

Query: 361  FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
            FDTFLVVSFISETRILAMNLEDELEETEIEGF SQ QTLFCHDA+YNQLVQVTS SVRLV
Sbjct: 413  FDTFLVVSFISETRILAMNLEDELEETEIEGFLSQVQTLFCHDAVYNQLVQVTSNSVRLV 472

Query: 421  SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
            SST+RELR+EW +P G++VNVATANASQVLLATGGGHLVYLEIGDG LTEV+HA LEYE+
Sbjct: 473  SSTTRELRDEWHAPAGFTVNVATANASQVLLATGGGHLVYLEIGDGKLTEVQHALLEYEV 532

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
            SCLDINPIG+NP+YSQ+AAVGMWTDISVRIFSLP+L LITKE LGGEIIPRSVLLCAFEG
Sbjct: 533  SCLDINPIGDNPNYSQLAAVGMWTDISVRIFSLPELTLITKEQLGGEIIPRSVLLCAFEG 592

Query: 541  ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
            ISYLLCALGDGHLLNF ++  TG+L DRKKVSLGTQPITLRTFSSK+ THVFAASDRPTV
Sbjct: 593  ISYLLCALGDGHLLNFQMDTTTGQLKDRKKVSLGTQPITLRTFSSKSATHVFAASDRPTV 652

Query: 601  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
            IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA+EGELTIGTIDDIQKLHIR+IPLG
Sbjct: 653  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAREGELTIGTIDDIQKLHIRTIPLG 712

Query: 661  EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSIL 720
            EH RRICHQEQ+RTF ICSL NQS +EESEMHFVRLLDDQTFEF+STYPLD+FEYGCSIL
Sbjct: 713  EHARRICHQEQTRTFGICSLGNQSNSEESEMHFVRLLDDQTFEFMSTYPLDSFEYGCSIL 772

Query: 721  SCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN 780
            SCSF++D NVYYCVGTAYVLPEENEPTKGRILVFIVEDG+LQLIAEKETKGAVYSLNAFN
Sbjct: 773  SCSFTEDKNVYYCVGTAYVLPEENEPTKGRILVFIVEDGRLQLIAEKETKGAVYSLNAFN 832

Query: 781  GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 840
            GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL
Sbjct: 833  GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 892

Query: 841  IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE 900
            +YKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNL TV+KNSEGATDEERGRLE
Sbjct: 893  LYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLLTVKKNSEGATDEERGRLE 952

Query: 901  VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
            VVGEYHLGEFVNRFRHGSLVMRLPDS++GQIPTVIFGTVNGVIGVIASLP EQY FLEKL
Sbjct: 953  VVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIPTVIFGTVNGVIGVIASLPQEQYTFLEKL 1012

Query: 961  QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
            Q++LRKVIKGVGGL+HEQWRSFNNEK+T +A+NFLDGDLIESFLDLSR +M++ISK+MNV
Sbjct: 1013 QSSLRKVIKGVGGLSHEQWRSFNNEKRTAEARNFLDGDLIESFLDLSRNKMEDISKSMNV 1072

Query: 1021 SVEELCKRVEELTRLH 1036
             VEELCKRVEELTRLH
Sbjct: 1073 QVEELCKRVEELTRLH 1088


>sp|Q6QNU4|DDB1_SOLLC DNA damage-binding protein 1 OS=Solanum lycopersicum GN=DDB1 PE=1
            SV=1
          Length = 1090

 Score = 1967 bits (5095), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 959/1038 (92%), Positives = 1004/1038 (96%), Gaps = 2/1038 (0%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            MLDVPIYGRIATLELFRPHGE QD LFIATERYKFCVLQWD E+SE+ITRAMGDVSDRIG
Sbjct: 53   MLDVPIYGRIATLELFRPHGETQDLLFIATERYKFCVLQWDTEASEVITRAMGDVSDRIG 112

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 172

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
             KPTIVVLYQDNKDARHVKTYEV+LKDKDF+EGPW+QNNLDNGA LLIPVPPPLCGVLII
Sbjct: 173  PKPTIVVLYQDNKDARHVKTYEVSLKDKDFIEGPWAQNNLDNGASLLIPVPPPLCGVLII 232

Query: 181  GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
            GEETIVYCSA+AFKAIPIRPSIT+AYGRVDADGSRYLLGDH GLLHLLVITHEKEKVTGL
Sbjct: 233  GEETIVYCSASAFKAIPIRPSITRAYGRVDADGSRYLLGDHNGLLHLLVITHEKEKVTGL 292

Query: 241  KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
            KIELLGETSIASTISYLDNA V+IGSSYGDSQL+KLNLQPD KGSYVEVLERYVNLGPIV
Sbjct: 293  KIELLGETSIASTISYLDNAFVFIGSSYGDSQLVKLNLQPDTKGSYVEVLERYVNLGPIV 352

Query: 301  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
            DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRS+TDDP
Sbjct: 353  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSATDDP 412

Query: 361  FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
            +DTFLVVSFISETR+LAMNLEDELEETEIEGF SQ QTLFCHDA+YNQLVQVTS SVRLV
Sbjct: 413  YDTFLVVSFISETRVLAMNLEDELEETEIEGFNSQVQTLFCHDAVYNQLVQVTSNSVRLV 472

Query: 421  SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
            SSTSR+L+NEW +P GYSVNVATANA+QVLLATGGGHLVYLEIGDG+L EVK+A+L+Y+I
Sbjct: 473  SSTSRDLKNEWFAPVGYSVNVATANATQVLLATGGGHLVYLEIGDGVLNEVKYAKLDYDI 532

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
            SCLDINPIGENP+YS IAAVGMWTDISVRI+SLPDLNLITKE LGGEIIPRSVL+C+FEG
Sbjct: 533  SCLDINPIGENPNYSNIAAVGMWTDISVRIYSLPDLNLITKEQLGGEIIPRSVLMCSFEG 592

Query: 541  ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
            ISYLLCALGDGHLLNF+L+M TGELTDRKKVSLGTQPITLRTFSSK+TTHVFAASDRPTV
Sbjct: 593  ISYLLCALGDGHLLNFVLSMSTGELTDRKKVSLGTQPITLRTFSSKDTTHVFAASDRPTV 652

Query: 601  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
            IYSSNKKLLYSNVNLKEVSHMCPFN AAFPDSLAIAKEGELTIGTID+IQKLHIRSIPLG
Sbjct: 653  IYSSNKKLLYSNVNLKEVSHMCPFNVAAFPDSLAIAKEGELTIGTIDEIQKLHIRSIPLG 712

Query: 661  EHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
            EH RRI HQEQ+RTFA+CS+K  QS A++ EMHFVRLLDDQTFEFISTYPLD FEYGCSI
Sbjct: 713  EHARRISHQEQTRTFALCSVKYTQSNADDPEMHFVRLLDDQTFEFISTYPLDQFEYGCSI 772

Query: 720  LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
            LSCSFSDDSNVYYC+GTAYV+PEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF
Sbjct: 773  LSCSFSDDSNVYYCIGTAYVMPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 832

Query: 780  NGKLLAAINQKIQLYKWMLRDD-GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
            NGKLLAAINQKIQLYKW  R+D G+RELQ+ECGHHGHILALYVQTRGDFIVVGDLMKSIS
Sbjct: 833  NGKLLAAINQKIQLYKWASREDGGSRELQTECGHHGHILALYVQTRGDFIVVGDLMKSIS 892

Query: 839  LLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGR 898
            LLI+KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEER R
Sbjct: 893  LLIFKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERSR 952

Query: 899  LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLE 958
            LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPH+QYLFLE
Sbjct: 953  LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHDQYLFLE 1012

Query: 959  KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
            KLQTNLRKVIKGVGGL+HEQWRSF NEKKTVDAKNFLDGDLIESFLDLSR RM+EISK M
Sbjct: 1013 KLQTNLRKVIKGVGGLSHEQWRSFYNEKKTVDAKNFLDGDLIESFLDLSRNRMEEISKAM 1072

Query: 1019 NVSVEELCKRVEELTRLH 1036
            +V VEEL KRVEELTRLH
Sbjct: 1073 SVPVEELMKRVEELTRLH 1090


>sp|Q6E7D1|DDB1_SOLCE DNA damage-binding protein 1 OS=Solanum cheesmanii GN=DDB1 PE=3 SV=1
          Length = 1095

 Score = 1964 bits (5088), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 959/1038 (92%), Positives = 1004/1038 (96%), Gaps = 2/1038 (0%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            MLDVPIYGRIATLELFRPHGE QD LFIATERYKFCVLQWD E+SE+ITRAMGDVSDRIG
Sbjct: 58   MLDVPIYGRIATLELFRPHGETQDLLFIATERYKFCVLQWDTEASEVITRAMGDVSDRIG 117

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 118  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 177

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
             KPTIVVLYQDNKDARHVKTYEV+LKDKDF+EGPW+QNNLDNGA LLIPVPPPLCGVLII
Sbjct: 178  PKPTIVVLYQDNKDARHVKTYEVSLKDKDFIEGPWAQNNLDNGASLLIPVPPPLCGVLII 237

Query: 181  GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
            GEETIVYCSA+AFKAIPIRPSIT+AYGRVDADGSRYLLGDH GLLHLLVITHEKEKVTGL
Sbjct: 238  GEETIVYCSASAFKAIPIRPSITRAYGRVDADGSRYLLGDHNGLLHLLVITHEKEKVTGL 297

Query: 241  KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
            KIELLGETSIASTISYLDNA V+IGSSYGDSQL+KLNLQPD KGSYVEVLERYVNLGPIV
Sbjct: 298  KIELLGETSIASTISYLDNAFVFIGSSYGDSQLVKLNLQPDTKGSYVEVLERYVNLGPIV 357

Query: 301  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
            DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRS+TDDP
Sbjct: 358  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSATDDP 417

Query: 361  FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
            +DTFLVVSFISETR+LAMNLEDELEETEIEGF SQ QTLFCHDA+YNQLVQVTS SVRLV
Sbjct: 418  YDTFLVVSFISETRVLAMNLEDELEETEIEGFNSQVQTLFCHDAVYNQLVQVTSNSVRLV 477

Query: 421  SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
            SSTSR+L+NEW +P GYSVNVATANA+QVLLATGGGHLVYLEIGDG+L EVK+A+L+Y+I
Sbjct: 478  SSTSRDLKNEWFAPVGYSVNVATANATQVLLATGGGHLVYLEIGDGVLNEVKYAKLDYDI 537

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
            SCLDINPIGENP+YS IAAVGMWTDISVRI+SLPDLNLITKE LGGEIIPRSVL+C+FEG
Sbjct: 538  SCLDINPIGENPNYSNIAAVGMWTDISVRIYSLPDLNLITKEQLGGEIIPRSVLMCSFEG 597

Query: 541  ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
            ISYLLCALGDGHLLNF+L+M TGELTDRKKVSLGTQPITLRTFSSK+TTHVFAASDRPTV
Sbjct: 598  ISYLLCALGDGHLLNFVLSMSTGELTDRKKVSLGTQPITLRTFSSKDTTHVFAASDRPTV 657

Query: 601  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
            IYSSNKKLLYSNVNLKEVSHMCPFN AAFPDSLAIAKEGELTIGTID+IQKLHIRSIPLG
Sbjct: 658  IYSSNKKLLYSNVNLKEVSHMCPFNVAAFPDSLAIAKEGELTIGTIDEIQKLHIRSIPLG 717

Query: 661  EHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
            EH RRI HQEQ+RTFA+CS+K  QS A++ EMHFVRLLDDQTFEFISTYPLD FEYGCSI
Sbjct: 718  EHARRISHQEQTRTFALCSVKYTQSNADDPEMHFVRLLDDQTFEFISTYPLDQFEYGCSI 777

Query: 720  LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
            LSCSFSDDSNVYYC+GTAYV+PEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF
Sbjct: 778  LSCSFSDDSNVYYCIGTAYVMPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 837

Query: 780  NGKLLAAINQKIQLYKWMLRDD-GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
            NGKLLAAINQKIQLYKW  R+D G+RELQ+ECGHHGHILALYVQTRGDFIVVGDLMKSIS
Sbjct: 838  NGKLLAAINQKIQLYKWASREDGGSRELQTECGHHGHILALYVQTRGDFIVVGDLMKSIS 897

Query: 839  LLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGR 898
            LLI+KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEER R
Sbjct: 898  LLIFKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERSR 957

Query: 899  LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLE 958
            LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPH+QYLFLE
Sbjct: 958  LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHDQYLFLE 1017

Query: 959  KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
            KLQTNLRKVIKGVGGL+HEQWRSF NEKKTVDAKNFLDGDLIESFLDLSR RM+EISK M
Sbjct: 1018 KLQTNLRKVIKGVGGLSHEQWRSFYNEKKTVDAKNFLDGDLIESFLDLSRNRMEEISKAM 1077

Query: 1019 NVSVEELCKRVEELTRLH 1036
            +V VEEL KRVEELTRLH
Sbjct: 1078 SVPVEELMKRVEELTRLH 1095


>sp|O49552|DDB1B_ARATH DNA damage-binding protein 1b OS=Arabidopsis thaliana GN=DDB1B PE=2
            SV=2
          Length = 1088

 Score = 1920 bits (4973), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 925/1036 (89%), Positives = 989/1036 (95%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +LDVP+YGRIAT+ELFRPHGEAQDFLF+ATERYKFCVLQWD ESSELITRAMGDVSDRIG
Sbjct: 53   ILDVPLYGRIATMELFRPHGEAQDFLFVATERYKFCVLQWDYESSELITRAMGDVSDRIG 112

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RPTDNGQIGIIDPDCR+IGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113  RPTDNGQIGIIDPDCRVIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 172

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
             KPTI VLYQDNKDARHVKTYEV+LKDK+FVEGPWSQNNLDNGADLLIPVP PLCGVLII
Sbjct: 173  TKPTIAVLYQDNKDARHVKTYEVSLKDKNFVEGPWSQNNLDNGADLLIPVPSPLCGVLII 232

Query: 181  GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
            GEETIVYCSANAFKAIPIRPSITKAYGRVD DGSRYLLGDHAGL+HLLVITHEKEKVTGL
Sbjct: 233  GEETIVYCSANAFKAIPIRPSITKAYGRVDLDGSRYLLGDHAGLIHLLVITHEKEKVTGL 292

Query: 241  KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
            KIELLGETSIAS+ISYLDNAVV++GSSYGDSQLIKLNLQPDAKGSYVE+LE+YVNLGPIV
Sbjct: 293  KIELLGETSIASSISYLDNAVVFVGSSYGDSQLIKLNLQPDAKGSYVEILEKYVNLGPIV 352

Query: 301  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
            DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL+SS D+ 
Sbjct: 353  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLKSSIDEA 412

Query: 361  FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
            FDTFLVVSFISETRILAMN+EDELEETEIEGF S+ QTLFCHDA+YNQLVQVTS SVRLV
Sbjct: 413  FDTFLVVSFISETRILAMNIEDELEETEIEGFLSEVQTLFCHDAVYNQLVQVTSNSVRLV 472

Query: 421  SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
            SST+RELRN+W +P G+SVNVATANASQVLLATGGGHLVYLEIGDG LTEVKH  LEYE+
Sbjct: 473  SSTTRELRNKWDAPAGFSVNVATANASQVLLATGGGHLVYLEIGDGTLTEVKHVLLEYEV 532

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
            SCLDINPIG+NP+YSQ+AAVGMWTDISVRIF LPDL LITKE LGGEIIPRSVLLCAFEG
Sbjct: 533  SCLDINPIGDNPNYSQLAAVGMWTDISVRIFVLPDLTLITKEELGGEIIPRSVLLCAFEG 592

Query: 541  ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
            ISYLLCALGDGHLLNF L+   G+L DRKKVSLGT+PITLRTFSSK+ THVFAASDRP V
Sbjct: 593  ISYLLCALGDGHLLNFQLDTSCGKLRDRKKVSLGTRPITLRTFSSKSATHVFAASDRPAV 652

Query: 601  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
            IYS+NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA+EGELTIGTIDDIQKLHIR+IP+G
Sbjct: 653  IYSNNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAREGELTIGTIDDIQKLHIRTIPIG 712

Query: 661  EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSIL 720
            EH RRICHQEQ+RTFAI  L+N+  AEESE HFVRLLD Q+FEF+S+YPLD FE GCSIL
Sbjct: 713  EHARRICHQEQTRTFAISCLRNEPSAEESESHFVRLLDAQSFEFLSSYPLDAFECGCSIL 772

Query: 721  SCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN 780
            SCSF+DD NVYYCVGTAYVLPEENEPTKGRILVFIVE+G+LQLI EKETKGAVYSLNAFN
Sbjct: 773  SCSFTDDKNVYYCVGTAYVLPEENEPTKGRILVFIVEEGRLQLITEKETKGAVYSLNAFN 832

Query: 781  GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 840
            GKLLA+INQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFI VGDLMKSISLL
Sbjct: 833  GKLLASINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIAVGDLMKSISLL 892

Query: 841  IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE 900
            IYKHEEGAIEERARDYNANWM+AVEIL+DDIYLG +N FN+FTV+KN+EGATDEER R+E
Sbjct: 893  IYKHEEGAIEERARDYNANWMTAVEILNDDIYLGTDNCFNIFTVKKNNEGATDEERARME 952

Query: 901  VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
            VVGEYH+GEFVNRFRHGSLVM+LPDSD+GQIPTVIFGTV+G+IGVIASLP EQY FLEKL
Sbjct: 953  VVGEYHIGEFVNRFRHGSLVMKLPDSDIGQIPTVIFGTVSGMIGVIASLPQEQYAFLEKL 1012

Query: 961  QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
            QT+LRKVIKGVGGL+HEQWRSFNNEK+T +AK +LDGDLIESFLDLSR +M+EISK M+V
Sbjct: 1013 QTSLRKVIKGVGGLSHEQWRSFNNEKRTAEAKGYLDGDLIESFLDLSRGKMEEISKGMDV 1072

Query: 1021 SVEELCKRVEELTRLH 1036
             VEELCKRVEELTRLH
Sbjct: 1073 QVEELCKRVEELTRLH 1088


>sp|Q3U1J4|DDB1_MOUSE DNA damage-binding protein 1 OS=Mus musculus GN=Ddb1 PE=1 SV=2
          Length = 1140

 Score = 1150 bits (2975), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 579/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS      D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 413  LRSDPGRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P G +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 472  TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531  HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 711  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDSSGGTTALRPSASTQALSSSVSSSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGEASTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            + 
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140


>sp|P33194|DDB1_CHLAE DNA damage-binding protein 1 OS=Chlorocebus aethiops GN=DDB1 PE=1
            SV=1
          Length = 1140

 Score = 1150 bits (2974), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 578/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 413  LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P   +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 472  TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531  HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 711  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            + 
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140


>sp|A1A4K3|DDB1_BOVIN DNA damage-binding protein 1 OS=Bos taurus GN=DDB1 PE=2 SV=1
          Length = 1140

 Score = 1150 bits (2974), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 579/1094 (52%), Positives = 760/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 413  LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P G +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 472  TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531  HTEMEHEVACLDITPLGDSNGMSPLCAIGLWTDISARIAKLPSFELLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 711  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            + 
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140


>sp|Q16531|DDB1_HUMAN DNA damage-binding protein 1 OS=Homo sapiens GN=DDB1 PE=1 SV=1
          Length = 1140

 Score = 1149 bits (2972), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 578/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 413  LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P   +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 472  TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531  HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 711  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            + 
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140


>sp|Q5R649|DDB1_PONAB DNA damage-binding protein 1 OS=Pongo abelii GN=DDB1 PE=2 SV=1
          Length = 1140

 Score = 1147 bits (2968), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 577/1094 (52%), Positives = 758/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNVCILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 413  LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P   +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 472  TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531  HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 711  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVY +  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYPMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            + 
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140


>sp|Q805F9|DDB1_CHICK DNA damage-binding protein 1 OS=Gallus gallus GN=DDB1 PE=2 SV=1
          Length = 1140

 Score = 1147 bits (2967), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 576/1094 (52%), Positives = 761/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+ A +ELFRP GE++D LFI T +Y  C+L++  + ++ ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKTAVMELFRPKGESKDLLFILTAKYNACILEYKQNGDNIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +  +AFNIRLEELQV+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRENKELKAFNIRLEELQVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS +    D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 413  LRSDSHREMDNMLVLSFVGQTRVLMLNGE-EVEETELTGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P G +++VA+ N++QV++A G   L YLEI    L ++ 
Sbjct: 472  TSASVRLVSQEPKALVSEWKEPNGKNISVASCNSNQVVVAVGRA-LYYLEIRPQELRQIN 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
              ++E+E++CLDI P+G+    S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531  CTEMEHEVACLDITPLGDTNGMSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F L+++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTTFESSHYLLCALGDGALFYFGLSLETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 711  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDASGGTTALRPSASTQALSSSVSTSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPEEAEPKQGRIVVFHYSDGKLQSLAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWT----AEKELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG  HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLSHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   + +          +V
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQIDDGSGMKREATV 1126

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1127 DDLIKIVEELTRIH 1140


>sp|Q6P6Z0|DDB1_XENLA DNA damage-binding protein 1 OS=Xenopus laevis GN=ddb1 PE=2 SV=1
          Length = 1140

 Score = 1144 bits (2958), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 579/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    +S ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGDSIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDPDCR+IGL LYDGLFKVIP +   +  +AFNIRLEEL V+D+KFLY C
Sbjct: 114  RPSETGIIGIIDPDCRMIGLRLYDGLFKVIPLERDNKELKAFNIRLEELHVIDVKFLYSC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFSKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDVNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KL  + + +GSYV V+E 
Sbjct: 293  GSVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLTTESNEQGSYVVVMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LR + D   D  LV+SF+ +TR+L +  E E+EET++ GF    QT FC +  + QL+Q+
Sbjct: 413  LRVAADRDTDDTLVLSFVGQTRVLTLTGE-EVEETDLAGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P G  V+V + N+ QVLLA G   L YLEI  G L +  
Sbjct: 472  TSASVRLVSQNPQNLVSEWKEPQGRKVSVCSCNSRQVLLAVGR-VLYYLEIHPGELRQTS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
              ++E+E++CLD+ P+G N + S + A+G+WTDIS RI SLP   L+ KE LGGEIIPRS
Sbjct: 531  CTEMEHEVACLDVTPLGGNDTLSSLCAIGLWTDISARILSLPGFQLLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+ +FE   YLLCALGDG L  F LN  TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTSFESSHYLLCALGDGALFYFSLNTDTGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSEGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLKNQ------------------------SCA-- 686
            HIR++PL E PR+IC+QE S+ F + S + +                        SC+  
Sbjct: 711  HIRTVPLFESPRKICYQEVSQCFGVLSSRIEVQDASGGSSPLRPSASTQALSSSVSCSKL 770

Query: 687  -------------EESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                         EE E+H + ++D  TFE + T+     EY  S++SC    D   Y+ 
Sbjct: 771  FSGSTSPHETSFGEEVEVHNLLIIDQHTFEVLHTHQFLQNEYTLSLVSCKLGKDPTTYFV 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V P+E EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPDEAEPKQGRIVVFQYNDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWT----AEKELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSPPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDVQNRLNKVIKSVG 1066

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   + +          +V
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVIANLQIDDGSGMKRETTV 1126

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140


>sp|Q9ESW0|DDB1_RAT DNA damage-binding protein 1 OS=Rattus norvegicus GN=Ddb1 PE=2 SV=1
          Length = 1140

 Score = 1137 bits (2940), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 571/1094 (52%), Positives = 756/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQ +KLN+  + +GSYV  +E 
Sbjct: 293  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQPVKLNVDSNEQGSYVVAMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 413  LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P   +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 472  TSASVRLVSQEPKALVSEWKEPRAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLD+ P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531  HTEMEHEVACLDVTPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGT+++IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANTSTLTIGTMNEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++P+ E PR+IC+QE S+ F + S +                               
Sbjct: 711  HIRTVPIYESPRKICYQEVSQCFGVLSTRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771  FSSSAAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF    GKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPEEAEPKQGRIVVFQYSGGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+ GTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLLGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            + 
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140


>sp|Q9XYZ5|DDB1_DROME DNA damage-binding protein 1 OS=Drosophila melanogaster GN=pic PE=1
            SV=1
          Length = 1140

 Score =  962 bits (2486), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 509/1099 (46%), Positives = 708/1099 (64%), Gaps = 77/1099 (7%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS--ELITRAMGDVSDRIG 60
            ++ I G IA +  FRP    +D LFI T RY   +L+    +    ++T+A G+VSD +G
Sbjct: 54   EININGTIAVMRHFRPPDSNKDLLFILTRRYNVMILEARMVNDVITVVTKANGNVSDSVG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
             P++ G I  IDP  R+IG+ LY GLF +IP D      +A N+R++EL V D++FL+GC
Sbjct: 114  IPSEGGVIAAIDPKARVIGMCLYQGLFTIIPMDKDASELKATNLRMDELNVYDVEFLHGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PT++V+++D+ D RHVK++E+ L+DK+F++  W Q+N++  A +LIPVP P+ GV++I
Sbjct: 174  LNPTVIVIHKDS-DGRHVKSHEINLRDKEFMKIAWKQDNVETEATMLIPVPSPIGGVIVI 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI-THEKEK 236
            G E+IVY   + + A+     R S    Y RV ++G RYLLG+  G L++L + T E  K
Sbjct: 233  GRESIVYHDGSNYHAVAPLTFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTAETSK 292

Query: 237  ---VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERY 293
               V  +K+E LGE SI   I+YLDN  +YIG+ +GDSQL++LN +    GSYV  +E +
Sbjct: 293  GVTVKDIKVEQLGEISIPECITYLDNGFLYIGARHGDSQLVRLNSEA-IDGSYVVPVENF 351

Query: 294  VNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL 353
             NL PI+D  VVDL+RQGQGQ++TCSG++KDGSLRI+R GIGI E A ++L GIKGMWSL
Sbjct: 352  TNLAPILDIAVVDLDRQGQGQIITCSGSFKDGSLRIIRIGIGIQEHACIDLPGIKGMWSL 411

Query: 354  RSSTDD-PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            +   D+ P++  LV++F+  TRIL ++ E E+EETEI GF S  QT  C +  Y+QL+QV
Sbjct: 412  KVGVDESPYENTLVLAFVGHTRILTLSGE-EVEETEIPGFASDLQTFLCSNVDYDQLIQV 470

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVSS ++ L  EW+     ++ V + N +Q+L+A+    + Y+ I DG L E  
Sbjct: 471  TSDSVRLVSSATKALVAEWRPTGDRTIGVVSCNTTQILVASAC-DIFYIVIEDGSLREQS 529

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
               L YE++CLDI P+ E    S + AVG+WTDIS  I SLPDL  I  E L GEIIPRS
Sbjct: 530  RRTLAYEVACLDITPLDETQKKSDLVAVGLWTDISAVILSLPDLETIYTEKLSGEIIPRS 589

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FEGI YLLCALGDG +  F+++  TG+LTD+KKV+LGTQP TLRTF S +TT+VF
Sbjct: 590  ILMTTFEGIHYLLCALGDGSMYYFIMDQTTGQLTDKKKVTLGTQPTTLRTFRSLSTTNVF 649

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV+HMC  N+ A+PDSLA+A +  + +GTID+IQKL
Sbjct: 650  ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAQAYPDSLALANKNAVILGTIDEIQKL 709

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PLGE PRRI +QE S+TFA+ +L+                               
Sbjct: 710  HIRTVPLGEGPRRIAYQESSQTFAVSTLRIDVHGRGGAKPLRNSASTQAQNITCSSNFLP 769

Query: 682  ----------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
                      N    +E ++H + ++D  TFE +  +     E   S++S    DD N Y
Sbjct: 770  KPGGGNSTAANAEVGQEIDVHNLLVIDQNTFEVLHAHQFVAPETISSLMSAKLGDDPNTY 829

Query: 732  YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
            Y V T+ V+PEE EP  GRI++F   + KL  +AE +  G  Y+L  FNGK+LA I   +
Sbjct: 830  YVVATSLVIPEEPEPKVGRIIIFHYHENKLTQVAETKVDGTCYALVEFNGKVLAGIGSFV 889

Query: 792  QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
            +LY+W       +EL+ EC     I AL+++ +GDFI+VGDLM+SI+LL +K  EG   E
Sbjct: 890  RLYEWT----NEKELRMECNIQNMIAALFLKAKGDFILVGDLMRSITLLQHKQMEGIFVE 945

Query: 852  RARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFV 911
             ARD    WM AVEILDDD +LG+E N NLF  +K+S   TDEER  L  +  +HLG+ V
Sbjct: 946  IARDCEPKWMRAVEILDDDTFLGSETNGNLFVCQKDSAATTDEERQLLPELARFHLGDTV 1005

Query: 912  NRFRHGSLVMRLPDSDVGQIPT-----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
            N FRHGSLVM+    +VG+  T     V++GT NG IG++  +P + Y FL  L+  L+K
Sbjct: 1006 NVFRHGSLVMQ----NVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKK 1061

Query: 967  VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS----- 1021
            +IK VG + H  +R+F    K   ++ F+DGDLIESFLDLSR +M +  + + ++     
Sbjct: 1062 IIKSVGKIEHTYYRNFQINSKVEPSEGFIDGDLIESFLDLSRDKMRDAVQGLELTLNGER 1121

Query: 1022 ----VEELCKRVEELTRLH 1036
                VE++ K VE+LTR+H
Sbjct: 1122 KSADVEDVIKIVEDLTRMH 1140


>sp|B0M0P5|DDB1_DICDI DNA damage-binding protein 1 OS=Dictyostelium discoideum GN=repE PE=1
            SV=1
          Length = 1181

 Score =  929 bits (2402), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 499/1138 (43%), Positives = 719/1138 (63%), Gaps = 114/1138 (10%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            M DV IYGRI+ L+LF   G  QD+LFI+TE +KFC+L +D E  E+IT+A G+  D IG
Sbjct: 51   MFDVNIYGRISVLKLFSVAGSKQDYLFISTESFKFCILAYDYEKKEIITKASGNAEDTIG 110

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQL---------------------- 98
            RPT+ GQ+GIIDPD R++ LHLY+GL K+I  DN                          
Sbjct: 111  RPTEAGQLGIIDPDGRIVALHLYEGLLKLITLDNNNTPNKINNNNNNNNNNNNNNNNNNN 170

Query: 99   --------------------KEAFNIRLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHV 138
                                K   N+RLEELQVLD+ FLYGC  PTI VL++D KD +H+
Sbjct: 171  NNINNNNFNINNNNNNSPIQKNVNNVRLEELQVLDMTFLYGCKVPTIAVLFKDTKDEKHI 230

Query: 139  KTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPI 198
             TYE++ KD + V GPWSQ+N+   + LL+PVP  L GVL++ +  I Y +    +++ +
Sbjct: 231  STYEISSKDTELVVGPWSQSNVGVYSSLLVPVP--LGGVLVVADNGITYLNGKVTRSVAV 288

Query: 199  RPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLD 258
              +   A+ RVD DGSR+L GDH G L +LV+ H+++KV  LK E LG  SI S+ISYLD
Sbjct: 289  SYTKFLAFTRVDKDGSRFLFGDHFGRLSVLVLIHQQQKVMELKFEQLGRISIPSSISYLD 348

Query: 259  NAVVYIGSSYGDSQLIKLNLQPD-AKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVT 317
            + VVYIGSS GDSQLI+LN + D    SYV  LE + N+GP+VDFCVVD E+QGQ Q+VT
Sbjct: 349  SGVVYIGSSSGDSQLIRLNTEKDQTTDSYVTYLEAFTNIGPVVDFCVVDAEKQGQAQIVT 408

Query: 318  CSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL---------------------RSS 356
            CSG Y+DGSLRI+RNGIGI EQAS+EL+GIKG++ +                      + 
Sbjct: 409  CSGTYRDGSLRIIRNGIGIAEQASIELEGIKGIFPINNNNNNNNNNNNNNNNNNNNNSNG 468

Query: 357  TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCH--DAIYNQLVQVTS 414
              D  D +L+ SFI  T++L+    +E+EETE EG  S   TL+C   D + N L+Q+T+
Sbjct: 469  ITDSKDRYLITSFIECTKVLSFQ-GEEIEETEFEGLESNCSTLYCGTIDKL-NLLIQITN 526

Query: 415  GSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI--GDGILTEVK 472
             S+ L+ S + +  ++W   P   +N+ + N  Q++L+     L+Y +I   +  +  VK
Sbjct: 527  VSINLIDSNTFKRVSQWNVEPSRRINLVSTNQDQIVLSIDKS-LLYFQINSSNKSIQLVK 585

Query: 473  HAQLEYEISCLDINPIGE-NPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPR 531
              +L +EISC+DI+P      + SQ+ +VG+W DI++RIF LP L  I KE LGGEI+PR
Sbjct: 586  EIELPHEISCIDISPFDSFMDTKSQLVSVGLWNDITLRIFKLPTLEEIWKEPLGGEILPR 645

Query: 532  SVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHV 591
            S+L+ +F+ I Y+ C+LGDGHL  F  +  + +L D++K++LGTQPI L+ F  KNT ++
Sbjct: 646  SILMISFDSIDYIFCSLGDGHLFKFQFDFSSFKLFDKRKLTLGTQPIILKKFKLKNTINI 705

Query: 592  FAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQK 651
            FA SDRPTVIYS NKKL YS VNLK+V+++  FNS  FP+S+AIA    LTIGTID+IQK
Sbjct: 706  FAISDRPTVIYSHNKKLFYSVVNLKDVTNVTSFNSDGFPNSMAIATTNSLTIGTIDEIQK 765

Query: 652  LHIRSIPLGEH-PRRICHQEQSRTFAICSLKNQS---------CAEESEMHFVRLLDDQT 701
            LHI++IPL E   RRI H E    +A+ ++KN           C E+ E+ ++R+ +DQT
Sbjct: 766  LHIKTIPLNEEMGRRIVHLEDHSCYAVITVKNNEGLLGGAQDLCEEDEEVSYIRIYNDQT 825

Query: 702  FEFISTYPLDTFEYGCSILSCSFS-DDSNVYYCVGTAYVLPEENEPTKGRILVFIV---- 756
            FE IS+Y LD +E G SI  C F+ DD N Y  VGT+   P +   + GR+L+F +    
Sbjct: 826  FELISSYKLDPYEMGWSITPCKFAGDDVNTYLAVGTSINTPIK---SSGRVLLFSLSSSS 882

Query: 757  -------------------EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWM 797
                                +GKL L+ E + + +VY L +FNG+L+AA+++++   ++ 
Sbjct: 883  SSNDKDSLDNNNNNNNNSGANGKLTLLEEIKFRSSVYFLLSFNGRLIAAVHKRLFSIRYT 942

Query: 798  -LRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDY 856
              ++   + + SE  H GH + L + +RG FI+VGD+MKS+SLL+ +  +G++E+ AR+ 
Sbjct: 943  HSKEKNCKVISSESVHKGHTMILKLASRGHFILVGDMMKSMSLLV-EQSDGSLEQIARNP 1001

Query: 857  NANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRH 916
               W+ +V +++DD ++GAE + N   V+KN++   + ER  L+ VG YH+GE +N  RH
Sbjct: 1002 QPIWIRSVAMINDDYFIGAEASNNFIVVKKNNDSTNELERELLDSVGHYHIGESINSMRH 1061

Query: 917  GSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNH 976
            GSLV RLPDSD   IPT+++ +VNG IGV+AS+  E ++F  KLQ  L +V++GVGG +H
Sbjct: 1062 GSLV-RLPDSDQPIIPTILYASVNGSIGVVASISEEDFIFFSKLQKGLNQVVRGVGGFSH 1120

Query: 977  EQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTR 1034
            E WR+F+N+  T+D+KNF+DGDLIE+FLDL      +    + ++ ++  +R+E L +
Sbjct: 1121 ETWRAFSNDHHTIDSKNFIDGDLIETFLDLKYESQLKAVADLGITPDDAFRRIESLMQ 1178


>sp|Q21554|DDB1_CAEEL DNA damage-binding protein 1 OS=Caenorhabditis elegans GN=ddb-1 PE=1
            SV=2
          Length = 1134

 Score =  596 bits (1536), Expect = e-169,   Method: Compositional matrix adjust.
 Identities = 368/1098 (33%), Positives = 603/1098 (54%), Gaps = 81/1098 (7%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP 62
            ++PIYG++ T+ L +   + +  L + TE++   +L +     +++TRA G ++D  GR 
Sbjct: 54   EIPIYGQVLTIALVKCKRDKRHSLIVVTEKWHMAILAY--RDGKVVTRAAGCIADPTGRA 111

Query: 63   TDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL-YGCA 121
            TDN    +      LI +  ++G  K+I +++   L+  FN+R +   V D KF+  G  
Sbjct: 112  TDN-LFSLTIHRNGLIAIRAFEGSVKMIQWESGTDLRH-FNVRFDYPNVSDFKFVDTGED 169

Query: 122  KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIG 181
                V    D+   +H++  ++ + DK+F      Q ++   + +LIPVP  + GV+++G
Sbjct: 170  DVYRVAFIYDDDHGKHLQFSDLNMHDKEF-RTYSRQASIAADSSVLIPVPHAIGGVIVLG 228

Query: 182  EETIVY-CSANAFKAIPIRPSITK-----AYGRVDADGSRYLLGDHAG-LLHLLVITHEK 234
              +++Y  + N  + +P   S+ +      +G VDA G R+LL D  G LL LL+   E 
Sbjct: 229  SNSVLYKPNDNLGEVVPYTCSLLENTTFTCHGIVDASGERFLLSDTDGRLLMLLLNVTES 288

Query: 235  EK---VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLE 291
            +    V  ++I+ LGETSIA +I+Y+DN VV++GS  GDSQLI+L  +P+  GSY  +LE
Sbjct: 289  QSGYTVKEMRIDYLGETSIADSINYIDNGVVFVGSRLGDSQLIRLMTEPNG-GSYSVILE 347

Query: 292  RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMW 351
             Y N+GPI D  +V  E  GQ Q+VTC+GA KDGSLR++RNGIGI+E ASV+L G+ G++
Sbjct: 348  TYSNIGPIRDMVMV--ESDGQPQLVTCTGADKDGSLRVIRNGIGIDELASVDLAGVVGIF 405

Query: 352  SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ--- 408
             +R   D   D +++VS   ET +L +  E ELE+ ++    +   T+F           
Sbjct: 406  PIR--LDSNADNYVIVSLSDETHVLQITGE-ELEDVKLLEINTDLPTIFASTLFGPNDSG 462

Query: 409  -LVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYL------ 461
             ++Q T   +RL+SS+   L   W+   G  ++  + NA+   +       VYL      
Sbjct: 463  IILQATEKQIRLMSSSG--LSKFWEPTNGEIISKVSVNAANGQIVLAARDTVYLLTCIVD 520

Query: 462  EIGDGILTEVKHAQLEYEISCLDINPIGENPS-YSQIAAVGMWTDISVRIFSLPDLNLIT 520
            E+G   +      + E EI+CLD++  G++P+  +    +  W+  ++ +  LPDL  + 
Sbjct: 521  EMGALDIQLTAEKKFENEIACLDLSNEGDDPNNKATFLVLAFWSTFAMEVIQLPDLITVC 580

Query: 521  KEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITL 580
               L  +IIPRS++    E + YLL A GDG L+ ++ ++KTG   + KK ++GT+P +L
Sbjct: 581  HTDLPTKIIPRSIIATCIEEVHYLLVAFGDGALVYYVFDIKTGTHGEPKKSNVGTRPPSL 640

Query: 581  RTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGE 640
                +KN  H+F  SDRP +I+S++KKL++SNVN+K V  +C  +S+A+ D L I+    
Sbjct: 641  HRVRNKNRQHLFVCSDRPVIIFSASKKLVFSNVNVKLVDTVCSLSSSAYRDCLVISDGNS 700

Query: 641  LTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEE------------ 688
            +  GT+DDIQK+H+RSIP+GE   RI +Q+ + T+ +CS + +S AE             
Sbjct: 701  MVFGTVDDIQKIHVRSIPMGESVLRIAYQKSTSTYGVCSNRTESKAERVFASKNALVTSQ 760

Query: 689  -----------------SEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
                             +      +LD  TF+ + ++    +E   S +S  F++DS+ Y
Sbjct: 761  SRPKVASTRADMDESPPNTTSSFMVLDQNTFQVLHSHEFGPWETALSCISGQFTNDSSTY 820

Query: 732  YCVGTAYVLPEENEPTKGRILVFIVED---GKLQLIAEKETKGAVYSLNAFNGKLLAAIN 788
            Y VGT  + P+E E   GRI+VF V+D    KL+ + E   +G+  ++   NGKL+AAIN
Sbjct: 821  YVVGTGLIYPDETETKIGRIVVFEVDDVERSKLRRVHELVVRGSPLAIRILNGKLVAAIN 880

Query: 789  QKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGA 848
              I+L++W       +EL+ EC    H++AL ++   + + V D+M+S+SLL Y+  EG 
Sbjct: 881  SSIRLFEWTT----DKELRLECSSFNHVIALDLKVMNEEVAVADVMRSVSLLSYRMLEGN 936

Query: 849  IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR-KNSEGATDEERGRLEVVGEYHL 907
             EE A+D+N+ WM   E +  +  LG E + NLFTV    +   TD+ R  LE  G ++L
Sbjct: 937  FEEVAKDWNSQWMVTCEFITAESILGGEAHLNLFTVEVDKTRPITDDGRYVLEPTGYWYL 996

Query: 908  GEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKV 967
            GE        +LV++  DS +     ++FGT  G IG+I  +  +   FL  ++  +   
Sbjct: 997  GELPKVMTRSTLVIQPEDSIIQYSQPIMFGTNQGTIGMIVQIDDKWKKFLIAIEKAIADS 1056

Query: 968  IKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRT-RMDEISKTMNVSVE--- 1023
            +K    + H  +R+F  +K+      F+DGDL+ES LD+ R+  MD +SK  +   +   
Sbjct: 1057 VKNCMHIEHSSYRTFVFQKRAEPPSGFVDGDLVESILDMDRSVAMDILSKVSDKGWDPSL 1116

Query: 1024 -----ELCKRVEELTRLH 1036
                 E+ K +E+L R+H
Sbjct: 1117 PRDPVEILKVIEDLARMH 1134


>sp|O13807|DDB1_SCHPO DNA damage-binding protein 1 OS=Schizosaccharomyces pombe (strain 972
            / ATCC 24843) GN=ddb1 PE=1 SV=1
          Length = 1072

 Score =  410 bits (1053), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 295/1072 (27%), Positives = 549/1072 (51%), Gaps = 96/1072 (8%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITR-AMGDVSDRIGRPTD 64
            I+ +I  ++ F+P     D + +AT+ +++  L WDA  + +     + D S+R  R + 
Sbjct: 55   IFAKIVNVKAFKPVSSPTDHIIVATDSFRYFTLFWDANDNTVSNGIKIQDCSERSLRESQ 114

Query: 65   NGQIGIIDPDCRLIGLHLYDGLFKVIPF----------DNKGQLKEAFNIRLEELQVLDI 114
            +G + ++DP  R+I LH+Y GL  +IP            N   L + F++R++EL V+DI
Sbjct: 115  SGPLLLVDPFQRVICLHVYQGLLTIIPIFKSKKRFMTSHNNPSLHDNFSVRIQELNVVDI 174

Query: 115  KFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPL 174
              LY  ++P++ VLY+D+K   H+ TY++ +++++  E     ++++ G      +P   
Sbjct: 175  AMLYNSSRPSLAVLYKDSKSIVHLSTYKINVREQEIDEDDVVCHDIEEGK----LIPSEN 230

Query: 175  CGVLIIGEETIVYCSAN--------AFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLH 226
             GV + GE  + Y S +         +      PSI+        D S Y++ D +G+L+
Sbjct: 231  GGVFVFGEMYVYYISKDIQVSKLLLTYPITAFSPSISND-PETGLDSSIYIVADESGMLY 289

Query: 227  LLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSY 286
                    E V+ +++E LGE+SIAS +  L +  +++GS + +S L++L          
Sbjct: 290  KFKALFTDETVS-MELEKLGESSIASCLIALPDNHLFVGSHFNNSVLLQLP-SITKNNHK 347

Query: 287  VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
            +E+L+ +VN+ PI DF + D   Q    ++TCSGAYKDG+LRI+RN I I   A +E++G
Sbjct: 348  LEILQNFVNIAPISDFIIDD--DQTGSSIITCSGAYKDGTLRIIRNSINIENVALIEMEG 405

Query: 347  IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDA-- 404
            IK  +S+    +  +D ++ +S I ETR + ++          EG  S    L C ++  
Sbjct: 406  IKDFFSVSFRAN--YDNYIFLSLICETRAIIVSP---------EGVFSANHDLSCEESTI 454

Query: 405  ----IY--NQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHL 458
                IY  +Q++Q+T+  +RL         + W SP   S+   ++ A  V +A  GG +
Sbjct: 455  FVSTIYGNSQILQITTKEIRLFDGKK---LHSWISP--MSITCGSSFADNVCVAVAGGLI 509

Query: 459  VYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWT-DISVRIFSLPDLN 517
            ++ E   GI TEV   Q + E+S L      EN  Y     VG+W+ DI +  +    ++
Sbjct: 510  LFFE---GI-TEVGRYQCDTEVSSLCFTE--ENVVY-----VGLWSADIIMLTYCQDGIS 558

Query: 518  LITKEHLGGEIIPRSVLLCAFEGIS--YLLCALGDGHLLNFLLNMKTGELTDR--KKVSL 573
            L     L    IPRS++     G     L  +  +G++L F  N + G++ +   ++  L
Sbjct: 559  LTHSLKLTD--IPRSIVYSQKYGDDGGTLYVSTNNGYVLMF--NFQNGQVIEHSLRRNQL 614

Query: 574  GTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSL 633
            G  PI L+ F SK    +FA  ++P ++Y  + KL+ + ++  E+ ++  + + +   ++
Sbjct: 615  GVAPIILKHFDSKEKNAIFALGEKPQLMYYESDKLVITPLSCTEMLNISSYVNPSLGVNM 674

Query: 634  AIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCA--EESEM 691
                   +++  + +I+ L+++++ +   PRRIC       F +C    +S    E+  +
Sbjct: 675  LYCTNSYISLAKMSEIRSLNVQTVSVKGFPRRICSNSLFY-FVLCMQLEESIGTQEQRLL 733

Query: 692  HFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRI 751
             F+R+ +  T   I+ +  + +E   SI+    +DD  V   VGT +  P+++ P  GR+
Sbjct: 734  SFLRVYEKNTLSEIAHHKFNEYEMVESII--LMNDDKRV--VVGTGFNFPDQDAPDSGRL 789

Query: 752  LVF-IVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC 810
            +VF +  D  +++ AE + +G+V +L  +   ++A IN  + ++++   + GT  +++  
Sbjct: 790  MVFEMTSDNNIEMQAEHKVQGSVNTLVLYKHLIVAGINASVCIFEY---EHGTMHVRNSI 846

Query: 811  GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDD 870
                + + + V    D I+  DLMKSI++L +  ++  + E ARDY+  W ++VEIL + 
Sbjct: 847  RTPTYTIDISVNQ--DEIIAADLMKSITVLQFIDDQ--LIEVARDYHPLWATSVEILSER 902

Query: 871  IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQ 930
             Y   E + N   + +++      +R +L    +++LGE +N+ RH + +   P      
Sbjct: 903  KYFVTEADGNAVILLRDNVSPQLSDRKKLRWYKKFYLGELINKTRHCTFIE--PQDKSLV 960

Query: 931  IPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD 990
             P ++  TV+G + ++          L +LQ N+RKVI   GGL+H++W+ +  E +T  
Sbjct: 961  TPQLLCATVDGSLMIVGDAGMSNTPLLLQLQDNIRKVIPSFGGLSHKEWKEYRGENET-S 1019

Query: 991  AKNFLDGDLIESFLDLSRTRMDEI------SKTMNVSVEELCKRVEELTRLH 1036
              + +DG LIES L L    ++EI         +++SV++L   +E L +LH
Sbjct: 1020 PSDLIDGSLIESILGLREPILNEIVNGGHEGTKLDISVQDLKSIIENLEKLH 1071


>sp|Q54SA7|SF3B3_DICDI Probable splicing factor 3B subunit 3 OS=Dictyostelium discoideum
            GN=sf3b3 PE=3 SV=1
          Length = 1256

 Score =  231 bits (588), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 245/1065 (23%), Positives = 456/1065 (42%), Gaps = 192/1065 (18%)

Query: 133  KDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPL---CGVLIIGEETIVYCS 189
            K  + +  YE+ L   + V   WS + +D+ A++++ VP       GVL+  E+ IVY +
Sbjct: 213  KKKKLLTYYELDLGLNNVVR-KWS-DQVDDSANIVMTVPGGTEGPGGVLVASEDYIVYRN 270

Query: 190  ANAFKAIPIRPSITKAYGRVDADGSR--------------YLLGDHAGLLHLLVITHEKE 235
             +  +   +R  I + YG     G                +L+    G L+ + + ++ +
Sbjct: 271  QDHAE---VRSRIPRRYGSDPNKGVLIISHSSHKQKGMFFFLVQSEHGDLYKITLDYQGD 327

Query: 236  KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLI--------------------- 274
            +V+ + +       +A+ ++ L N  ++  S +GD  L                      
Sbjct: 328  QVSEVNVNYFDTIVLANCLTVLKNGFLFAASEFGDHTLYFFKSIGDEEEEGQAKRLEDKD 387

Query: 275  -KLNLQP-DAKGSYVEVLE------RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGS 326
              L   P ++ G+ +E L+         +L PI+DF V+DL R+   Q+ +  G   + S
Sbjct: 388  GHLWFTPRNSCGTKMEELKNLEPTSHLSSLSPIIDFKVLDLVREENPQLYSLCGTGLNSS 447

Query: 327  LRIVRNGIGINEQASVELQGI-KGMWSLRSSTD----DPFDTFLVVSFISETRILAMNLE 381
            L+++R+G+ +    +  L G+  G+W++  ST     D  D ++VVSF+  T +L++   
Sbjct: 448  LKVLRHGLSVTTITTANLPGVPSGIWTVPKSTSPNAIDQTDKYIVVSFVGTTSVLSVG-- 505

Query: 382  DELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNV 441
            D ++E    G    T TL       + ++QV     R + S  R   NEW++P   ++  
Sbjct: 506  DTIQENHESGILETTTTLLVKSMGDDAIIQVFPTGFRHIKSDLR--INEWRAPGRKTIVR 563

Query: 442  ATANASQVLLATGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDINPIGENPSYSQIAAV 500
            A+AN SQ+ +A  GG ++Y E+     L E+    L  +I+C++I+PI +  + ++  AV
Sbjct: 564  ASANQSQLAIALSGGEIIYFELDQASNLIEIIKKDLRRDIACIEISPIPKGRNMARFIAV 623

Query: 501  GMWTDISVRIFSLPDLNLITKEHL--GGEIIPRSVLLCAFE----GIS------------ 542
              W +  +R+ SL   N + +  +    ++   S+ +   +    GI             
Sbjct: 624  SDW-EGPIRVLSLDRDNCLGQVSMLDTDKVYIESLSIIEMQLNEMGIETKKSQSQTGQTT 682

Query: 543  -------------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF 583
                               +L   L +G +    L+  TGEL+D +   LG +P+ L   
Sbjct: 683  TTTTSTSSASSSVTSGGSLFLFVGLKNGVVKRATLDSVTGELSDIRTRLLGRKPVKLFKV 742

Query: 584  SSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTI 643
              + +  + A S R  + Y +  KL    ++++ + +    +S    +S+    E ++ I
Sbjct: 743  KVRGSNAMLALSSRVWLNYINQGKLDIVPLSIEPLENASNLSSEQSAESIVATSENKIII 802

Query: 644  GTIDDIQKL-HIRSIPLGEHPRR-ICHQEQSRTFAICSLKNQSC--------AEESEM-- 691
             +ID +  L +  +I L   P+R I H + S    + +  N +          E+SE   
Sbjct: 803  FSIDKLGDLFNQETIKLNATPKRFIIHPQTSYIIILETETNYNTDNIDIDKINEQSEKLL 862

Query: 692  -----------------------------------------HFVRLLDDQTFEFISTYPL 710
                                                      +++++D  T E + +  L
Sbjct: 863  LEKQKELQQEMDIDDDDQNNNNEIEPFKKLFKPKAGKGKWKSYIKIMDPITHESLESLML 922

Query: 711  DTFEYGCSILSCSFSDDSNVYYCVG--TAYVL-PEENEPTKGRILVFIVEDGKLQLIAEK 767
            +  E G S+ +CSF +   ++  VG  T  VL P+ ++     +  FI    KL+L+ + 
Sbjct: 923  EDGEAGFSVCTCSFGESGEIFLVVGCVTDMVLNPKSHKSAHLNLYRFIDGGKKLELLYKT 982

Query: 768  ETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGD 826
            E +  VY++  F GKL+  + + I++Y     D G ++L  +C        +  + + GD
Sbjct: 983  EVEEPVYAMAQFQGKLVCGVGKSIRIY-----DMGKKKLLRKCETKNLPNTIVNIHSLGD 1037

Query: 827  FIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR- 885
             +VVGD+ +SI  + YK  E  +   A D    WM++  +LD D   GA+   N+F +R 
Sbjct: 1038 RLVVGDIQESIHFIKYKRSENMLYVFADDLAPRWMTSSVMLDYDTVAGADKFGNIFVLRL 1097

Query: 886  -----------------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
                             K   G  +    +L+ +  + +G+ V      SLV       V
Sbjct: 1098 PLLISDEVEEDPTGTKLKFESGTLNGAPHKLDHIANFFVGDTVTTLNKTSLV-------V 1150

Query: 929  GQIPTVIFGTVNGVIG-VIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK 987
            G    +++ T++G IG +I     E   F   L+ N+R     + G +H  +RS+     
Sbjct: 1151 GGPEVILYTTISGAIGALIPFTSREDVDFFSTLEMNMRSDCLPLCGRDHLAYRSY----- 1205

Query: 988  TVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
                KN +DGDL E F  L+  +   IS+ ++ S  E+ K++EE+
Sbjct: 1206 YFPVKNIIDGDLCEQFSTLNYQKQLSISEELSRSPSEVIKKLEEI 1250


>sp|Q15393|SF3B3_HUMAN Splicing factor 3B subunit 3 OS=Homo sapiens GN=SF3B3 PE=1 SV=4
          Length = 1217

 Score =  214 bits (545), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 272/1193 (22%), Positives = 496/1193 (41%), Gaps = 195/1193 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V ++G I +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G
Sbjct: 53   LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111

Query: 61   -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   +DP  R + +   +   L  ++  D   +L  +  +   +   L    +
Sbjct: 112  CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
                G   P    L  D ++A +  T E A   +    F E     N+        L+  
Sbjct: 172  GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
             + LI VP     P   GVLI  E  I Y   N      IR  I +    +D D  R   
Sbjct: 232  GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286

Query: 216  --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                          +L     G +  + +  +++ VT ++++      +A+ +  L    
Sbjct: 287  FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346

Query: 262  VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
            +++ S +G+  L ++                          QP    + V +++   +L 
Sbjct: 347  LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405

Query: 298  PIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
            PI+ FC + DL  +   Q+    G     SLR++R+G+ ++E A  EL G    +W++R 
Sbjct: 406  PIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRR 464

Query: 356  STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
              +D FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV   
Sbjct: 465  HIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPD 522

Query: 416  SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKH 473
             +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  + 
Sbjct: 523  GIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTER 580

Query: 474  AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RS 532
             ++  ++ C+ +  +      S+  AVG+  D +VRI SL   + +  + L  + +P + 
Sbjct: 581  KEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQP 637

Query: 533  VLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
              LC  E                G  YL   L +G LL  +L+  TG+L+D +   LG++
Sbjct: 638  ESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSR 697

Query: 577  PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
            P+ L     +    V A S R  + YS   +   + ++ + +     F S   P+ +   
Sbjct: 698  PVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAI 757

Query: 637  KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQ 683
                L I  ++ +  +  + + PL   PR+ + H E +            T A  + + Q
Sbjct: 758  STNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQ 817

Query: 684  SCAEE-------------SEM-------------------------HFVRLLDDQTFEFI 705
              AEE             +EM                           +R+++      +
Sbjct: 818  QMAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTL 877

Query: 706  STYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KL 761
                L+  E   S+  C FS+    +Y  VG A  L        G  +    +V +G KL
Sbjct: 878  DLVQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKL 937

Query: 762  QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY- 820
            + + +   +    ++  F G++L  + + +++Y     D G ++L  +C  + HI A Y 
Sbjct: 938  EFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYI 990

Query: 821  --VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
              +QT G  ++V D+ +S   + YK  E  +   A D    W++   +LD D   GA+  
Sbjct: 991  SGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKF 1050

Query: 879  FNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLV 920
             N+  VR   N+    DE         +RG L       EV+  YH+GE V   +  +L+
Sbjct: 1051 GNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI 1110

Query: 921  MRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
                    G   ++++ T++G IG++     HE + F + ++ +LR     + G +H  +
Sbjct: 1111 -------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSF 1163

Query: 980  RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            RS+         KN +DGDL E F  +   +   +S+ ++ +  E+ K++E++
Sbjct: 1164 RSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1211


>sp|A0JN52|SF3B3_BOVIN Splicing factor 3B subunit 3 OS=Bos taurus GN=SF3B3 PE=2 SV=1
          Length = 1217

 Score =  214 bits (545), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 272/1193 (22%), Positives = 496/1193 (41%), Gaps = 195/1193 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V ++G I +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G
Sbjct: 53   LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111

Query: 61   -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   +DP  R + +   +   L  ++  D   +L  +  +   +   L    +
Sbjct: 112  CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
                G   P    L  D ++A +  T E A   +    F E     N+        L+  
Sbjct: 172  GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
             + LI VP     P   GVLI  E  I Y   N      IR  I +    +D D  R   
Sbjct: 232  GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286

Query: 216  --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                          +L     G +  + +  +++ VT ++++      +A+ +  L    
Sbjct: 287  FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346

Query: 262  VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
            +++ S +G+  L ++                          QP    + V +++   +L 
Sbjct: 347  LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405

Query: 298  PIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
            PI+ FC + DL  +   Q+    G     SLR++R+G+ ++E A  EL G    +W++R 
Sbjct: 406  PIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRR 464

Query: 356  STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
              +D FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV   
Sbjct: 465  HIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPD 522

Query: 416  SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKH 473
             +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  + 
Sbjct: 523  GIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTER 580

Query: 474  AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RS 532
             ++  ++ C+ +  +      S+  AVG+  D +VRI SL   + +  + L  + +P + 
Sbjct: 581  KEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQP 637

Query: 533  VLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
              LC  E                G  YL   L +G LL  +L+  TG+L+D +   LG++
Sbjct: 638  ESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSR 697

Query: 577  PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
            P+ L     +    V A S R  + YS   +   + ++ + +     F S   P+ +   
Sbjct: 698  PVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAI 757

Query: 637  KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQ 683
                L I  ++ +  +  + + PL   PR+ + H E +            T A  + + Q
Sbjct: 758  STNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQ 817

Query: 684  SCAEE-------------SEM-------------------------HFVRLLDDQTFEFI 705
              AEE             +EM                           +R+++      +
Sbjct: 818  QMAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTL 877

Query: 706  STYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KL 761
                L+  E   S+  C FS+    +Y  VG A  L        G  +    +V +G KL
Sbjct: 878  DLVQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKL 937

Query: 762  QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY- 820
            + + +   +    ++  F G++L  + + +++Y     D G ++L  +C  + HI A Y 
Sbjct: 938  EFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYI 990

Query: 821  --VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
              +QT G  ++V D+ +S   + YK  E  +   A D    W++   +LD D   GA+  
Sbjct: 991  SGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKF 1050

Query: 879  FNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLV 920
             N+  VR   N+    DE         +RG L       EV+  YH+GE V   +  +L+
Sbjct: 1051 GNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI 1110

Query: 921  MRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
                    G   ++++ T++G IG++     HE + F + ++ +LR     + G +H  +
Sbjct: 1111 -------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSF 1163

Query: 980  RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            RS+         KN +DGDL E F  +   +   +S+ ++ +  E+ K++E++
Sbjct: 1164 RSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1211


>sp|Q5RBI5|SF3B3_PONAB Splicing factor 3B subunit 3 OS=Pongo abelii GN=SF3B3 PE=2 SV=1
          Length = 1217

 Score =  214 bits (545), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 272/1193 (22%), Positives = 496/1193 (41%), Gaps = 195/1193 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V ++G I +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G
Sbjct: 53   LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111

Query: 61   -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   +DP  R + +   +   L  ++  D   +L  +  +   +   L    +
Sbjct: 112  CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
                G   P    L  D ++A +  T E A   +    F E     N+        L+  
Sbjct: 172  GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
             + LI VP     P   GVLI  E  I Y   N      IR  I +    +D D  R   
Sbjct: 232  GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286

Query: 216  --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                          +L     G +  + +  +++ VT ++++      +A+ +  L    
Sbjct: 287  FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346

Query: 262  VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
            +++ S +G+  L ++                          QP    + V +++   +L 
Sbjct: 347  LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405

Query: 298  PIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
            PI+ FC + DL  +   Q+    G     SLR++R+G+ ++E A  EL G    +W++R 
Sbjct: 406  PIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSETAVSELPGNPNAVWTVRR 464

Query: 356  STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
              +D FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV   
Sbjct: 465  HIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPD 522

Query: 416  SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKH 473
             +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  + 
Sbjct: 523  GIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTER 580

Query: 474  AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RS 532
             ++  ++ C+ +  +      S+  AVG+  D +VRI SL   + +  + L  + +P + 
Sbjct: 581  KEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQP 637

Query: 533  VLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
              LC  E                G  YL   L +G LL  +L+  TG+L+D +   LG++
Sbjct: 638  ESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSR 697

Query: 577  PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
            P+ L     +    V A S R  + YS   +   + ++ + +     F S   P+ +   
Sbjct: 698  PVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAI 757

Query: 637  KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQ 683
                L I  ++ +  +  + + PL   PR+ + H E +            T A  + + Q
Sbjct: 758  STNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQ 817

Query: 684  SCAEE-------------SEM-------------------------HFVRLLDDQTFEFI 705
              AEE             +EM                           +R+++      +
Sbjct: 818  QMAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGSGQWASVIRVMNPIQGNTL 877

Query: 706  STYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KL 761
                L+  E   S+  C FS+    +Y  VG A  L        G  +    +V +G KL
Sbjct: 878  DLVQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKL 937

Query: 762  QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY- 820
            + + +   +    ++  F G++L  + + +++Y     D G ++L  +C  + HI A Y 
Sbjct: 938  EFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYI 990

Query: 821  --VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
              +QT G  ++V D+ +S   + YK  E  +   A D    W++   +LD D   GA+  
Sbjct: 991  SGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKF 1050

Query: 879  FNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLV 920
             N+  VR   N+    DE         +RG L       EV+  YH+GE V   +  +L+
Sbjct: 1051 GNICVVRLPPNTNDEVDEDPTGNKALRDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI 1110

Query: 921  MRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
                    G   ++++ T++G IG++     HE + F + ++ +LR     + G +H  +
Sbjct: 1111 -------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSF 1163

Query: 980  RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            RS+         KN +DGDL E F  +   +   +S+ ++ +  E+ K++E++
Sbjct: 1164 RSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVTKKLEDI 1211


>sp|Q1LVE8|SF3B3_DANRE Splicing factor 3B subunit 3 OS=Danio rerio GN=sf3b3 PE=2 SV=1
          Length = 1217

 Score =  214 bits (545), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 265/1190 (22%), Positives = 491/1190 (41%), Gaps = 189/1190 (15%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L + ++G + +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G
Sbjct: 53   LLTMEVFGVVRSLMAFRLTGGTKDYVVVGSDSGRIVILEYHP-SKNMFEKIHQETFGKSG 111

Query: 61   -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   +DP  R  +IG      L  ++  D   +L  +  +   +   L    +
Sbjct: 112  CRRIVPGQFLAVDPKGRAVMIGATEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
                G   P    L  D ++A +  T E A   +    F E     N+        L+  
Sbjct: 172  GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEALEEH 231

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
             + LI VP     P   GVLI  E  I Y   N      IR  I +    +D D  R   
Sbjct: 232  GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286

Query: 216  --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                          +L     G +  + +  ++E VT ++++      +A+ +  L    
Sbjct: 287  FVCSATHKTKSMFFFLAQTEQGDIFKVTLETDEEMVTEIRMKYFDTIPVATAMCVLKTGF 346

Query: 262  VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
            +++ S +G+  L ++                          QP    + V V E+  +L 
Sbjct: 347  LFVSSEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLVLVDEQE-SLS 405

Query: 298  PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
            PI+   + DL  +   Q+    G     +LR++R+G+ ++E A  EL G    +W++R  
Sbjct: 406  PIMSCQIADLANEDTPQLYVACGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVRRH 465

Query: 357  TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
             +D FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV    
Sbjct: 466  VEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGEDALVQVYPDG 523

Query: 417  VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHA 474
            +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  +  
Sbjct: 524  IRHIRADKR--VNEWKTPGKKTIIRCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTERK 581

Query: 475  QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD---LNLITKEHLGGEIIPR 531
            ++  ++ C+ +  +      S+  AVG+  D +VRI SL     L  ++ + L  +  P 
Sbjct: 582  EMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCLQPLSMQALPAQ--PE 638

Query: 532  SVLLCAFEGIS--------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
            S+ +    G+               YL   L +G LL  +L+  TG+L+D +   LG++P
Sbjct: 639  SLCIVEMGGVEKQDELGEKGTIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRP 698

Query: 578  ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
            + L     +    V A S R  + YS   +   + ++ + + +   F S   P+ +    
Sbjct: 699  VKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEYASGFASEQCPEGIVAIS 758

Query: 638  EGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQE-----------QSRTFAICSLKNQS 684
               L I  ++ +  +  + + PL   PR+ + H E            + T A  + + Q 
Sbjct: 759  TNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPETNNLILIETDHNAYTEATKAQRKQQ 818

Query: 685  CAEE-------------SEM-------------------------HFVRLLDDQTFEFIS 706
             AEE             +EM                           VRL++      + 
Sbjct: 819  MAEEMVEAAGEDERELAAEMAAAFLNENLPEAIFGAPKAGSGQWASLVRLINPIQGNTLD 878

Query: 707  TYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAY-VLPEENEPTKGRILVFIVEDG--KLQ 762
               L+  E   S+  C F +  + +Y  VG A  ++        G I  + +  G  KL+
Sbjct: 879  LVQLEQNEAAFSVAICRFLNGGDDWYVLVGVARDMILNPRSVGGGYIYTYRIVGGGDKLE 938

Query: 763  LIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHHGHILALYV 821
             + +   +    ++  F G++L  + + +++Y     D G ++L  +C   H   L   +
Sbjct: 939  FLHKTPVEDVPLAIAPFQGRVLVGVGKLLRIY-----DLGKKKLLRKCENKHVPNLVTGI 993

Query: 822  QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNL 881
             T G  ++V D+ +S+  + Y+  E  +   A D    W++   +LD D    A+   N+
Sbjct: 994  HTIGQRVIVSDVQESLFWVRYRRNENQLIIFADDTYPRWITTACLLDYDTMASADKFGNI 1053

Query: 882  FTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRL 923
              VR   N+    DE         +RG L       E++  YH+GE V   +  +L+   
Sbjct: 1054 CVVRLPPNTSDDVDEDPTGNKALWDRGLLNGASQKAEIIINYHIGETVLSLQKTTLI--- 1110

Query: 924  PDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF 982
                 G   ++++ T++G IG++     HE + F + L+ ++R     + G +H  +RS+
Sbjct: 1111 ----PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHLEMHMRSEFPPLCGRDHLSFRSY 1166

Query: 983  NNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
                     KN +DGDL E F  +   +   +S+ ++ +  E+ K++E++
Sbjct: 1167 -----YFPVKNVIDGDLCEQFNSMDPHKQKSVSEELDRTPPEVSKKLEDI 1211


>sp|Q921M3|SF3B3_MOUSE Splicing factor 3B subunit 3 OS=Mus musculus GN=Sf3b3 PE=2 SV=1
          Length = 1217

 Score =  214 bits (545), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 272/1193 (22%), Positives = 496/1193 (41%), Gaps = 195/1193 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V ++G I +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G
Sbjct: 53   LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111

Query: 61   -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   +DP  R + +   +   L  ++  D   +L  +  +   +   L    +
Sbjct: 112  CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
                G   P    L  D ++A +  T E A   +    F E     N+        L+  
Sbjct: 172  GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
             + LI VP     P   GVLI  E  I Y   N      IR  I +    +D D  R   
Sbjct: 232  GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286

Query: 216  --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                          +L     G +  + +  +++ VT ++++      +A+ +  L    
Sbjct: 287  FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346

Query: 262  VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
            +++ S +G+  L ++                          QP    + V +++   +L 
Sbjct: 347  LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405

Query: 298  PIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
            PI+ FC + DL  +   Q+    G     SLR++R+G+ ++E A  EL G    +W++R 
Sbjct: 406  PIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRR 464

Query: 356  STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
              +D FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV   
Sbjct: 465  HIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPD 522

Query: 416  SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKH 473
             +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  + 
Sbjct: 523  GIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTER 580

Query: 474  AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RS 532
             ++  ++ C+ +  +      S+  AVG+  D +VRI SL   + +  + L  + +P + 
Sbjct: 581  KEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQP 637

Query: 533  VLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
              LC  E                G  YL   L +G LL  +L+  TG+L+D +   LG++
Sbjct: 638  ESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSR 697

Query: 577  PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
            P+ L     +    V A S R  + YS   +   + ++ + +     F S   P+ +   
Sbjct: 698  PVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAI 757

Query: 637  KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQ 683
                L I  ++ +  +  + + PL   PR+ + H E +            T A  + + Q
Sbjct: 758  STNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQ 817

Query: 684  SCAEE-------------SEM-------------------------HFVRLLDDQTFEFI 705
              AEE             +EM                           +R+++      +
Sbjct: 818  QMAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTL 877

Query: 706  STYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KL 761
                L+  E   S+  C FS+    +Y  VG A  L        G  +    +V +G KL
Sbjct: 878  DLVQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILSPRSVAGGFVYTYKLVNNGEKL 937

Query: 762  QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY- 820
            + + +   +    ++  F G++L  + + +++Y     D G ++L  +C  + HI A Y 
Sbjct: 938  EFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYI 990

Query: 821  --VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
              +QT G  ++V D+ +S   + YK  E  +   A D    W++   +LD D   GA+  
Sbjct: 991  SGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKF 1050

Query: 879  FNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLV 920
             N+  VR   N+    DE         +RG L       EV+  YH+GE V   +  +L+
Sbjct: 1051 GNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI 1110

Query: 921  MRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
                    G   ++++ T++G IG++     HE + F + ++ +LR     + G +H  +
Sbjct: 1111 -------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSF 1163

Query: 980  RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            RS+         KN +DGDL E F  +   +   +S+ ++ +  E+ K++E++
Sbjct: 1164 RSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1211


>sp|Q4WLI5|RSE1_ASPFU Pre-mRNA-splicing factor rse1 OS=Neosartorya fumigata (strain ATCC
            MYA-4609 / Af293 / CBS 101355 / FGSC A1100) GN=rse1 PE=3
            SV=1
          Length = 1225

 Score =  210 bits (535), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 273/1166 (23%), Positives = 477/1166 (40%), Gaps = 171/1166 (14%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I TL  FR  G ++D++ I ++  +  ++++   S     R   +   + G R   
Sbjct: 64   VFGIIRTLAAFRLAGSSKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGVRRVV 122

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
             GQ   +DP  R  LI     + L  V+  +++ +L  +  +   + Q +         G
Sbjct: 123  PGQYLAVDPKGRACLIASVEKNKLVYVLNRNSQAELTISSPLEAHKPQTVVFAMTALDVG 182

Query: 120  CAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADL 166
               P    L  D  +A    T             YE+ L     V   W+ + +D  A +
Sbjct: 183  YENPIFAALEVDYSEADQDPTGRAYEESEKLLVYYELDLGLNHVVRK-WA-DPVDRTASM 240

Query: 167  LIPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDADGSRYLLGD 220
            L  VP       GVL+  E+ I Y  +N  AF+  IP R   T+   R  +  +  +   
Sbjct: 241  LFQVPGGADGPSGVLVCAEDNITYRHSNQDAFRVPIPRRIGATENPERKRSIVAGVMHKM 300

Query: 221  HAGLLHLL-------------VITHEKEKVTG----LKIELLGETSIASTISYLDNAVVY 263
                  LL             ++  +  ++TG    LKI+      IAS++  L +  +Y
Sbjct: 301  RGAFFFLLQTEDGDLFKVTIDMVEDDNGQLTGEVKRLKIKYFDTVPIASSLLILKSGFLY 360

Query: 264  IGSSYGDSQLIKLN----------------------------LQPDAKGSYVEVLERYVN 295
            + S  G+    +                               QP      + ++E   +
Sbjct: 361  VASEAGNHHFYQFEKLGDDDEETEFNSENFPADLSVPCEPVFFQPRG-AENLNLVETLNS 419

Query: 296  LGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLR 354
            L P++D  +V+L      Q+ T SG+    S R +++G+ ++E    EL  +   +W+ +
Sbjct: 420  LNPLIDSKIVNLNEDDAPQIYTVSGSGARSSFRTLKHGLEVSEIVESELPSVPSAVWTTK 479

Query: 355  SSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTS 414
             +  D FD ++++SF + T +L++   + +EE    GF S   TL       + L+QV  
Sbjct: 480  LTRADEFDAYIILSFANGTLVLSIG--ETVEEVTDTGFLSTAPTLAVQQLGEDSLIQVHP 537

Query: 415  GSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV-K 472
              +R + +  R   NEW +P   S+  A  N  QV +A   G +VY E+  DG L E  +
Sbjct: 538  RGIRHILADRR--VNEWPAPQHRSIVAAATNERQVAVALSSGEIVYFEMDADGTLAEYDE 595

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPR 531
              Q+   ++CL +  + E    S   AVG   D +VRI SL PD  L  K        P 
Sbjct: 596  RRQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKSVQALTSAPS 654

Query: 532  SVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSS 585
            ++ + +    S      YL   L  G  L  +L+  TGEL+D +   LG +P+ L   S 
Sbjct: 655  ALNIMSMADSSSGGTTLYLHIGLYSGVYLRTVLDEVTGELSDTRTRFLGAKPVKLFRVSV 714

Query: 586  KNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTI 643
            K  T V A S RP + YS    K  + + ++   +     F+S    + +   +   L I
Sbjct: 715  KGQTAVLALSSRPWLGYSDIQTKGFMLTPLDYVGLEWGWNFSSEQCVEGMVGIQAQNLRI 774

Query: 644  GTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVR------- 695
             +I+ +   +   SIPL   PRR+    +   F +    N   +  +    +        
Sbjct: 775  FSIEKLDNNILQESIPLSNTPRRMLKHPEQPLFYVIESDNNVLSPATRARLIEDSKARNG 834

Query: 696  ---LLDDQTFEF----------------------ISTYPLDTFEYGCSILSCSFSD-DSN 729
               +L  + F +                      IST  L+  E   S+ +  FS  D  
Sbjct: 835  ETNVLPPEDFGYPRATGHWASCIQIVDPLDAKAVISTIELEENEAAVSMAAVPFSSQDDE 894

Query: 730  VYYCVGTAYVLPEENEPTKGRILVFIV---EDGK-LQLIAEKETKGAVYSLNAFNGKLLA 785
             +  VGTA  +   N P+     + I    EDGK L+ I + + +    +L  F G+LLA
Sbjct: 895  TFLVVGTAKDM-IVNPPSSAGGFIHIYRFQEDGKELEFIHKTKVEEPPLALLGFQGRLLA 953

Query: 786  AINQKIQLYKWMLRDDGTRELQSECGHHGHILALYV---QTRGDFIVVGDLMKSISLLIY 842
             I   +++Y     D G ++L  +C     +++  +   QT+G  IVV D+ +S++ ++Y
Sbjct: 954  GIGSTLRIY-----DLGMKQLLRKC--QAQVVSKTIVGLQTQGSRIVVSDVRESVTYVVY 1006

Query: 843  KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE----- 894
            K+++  +     D  + W ++  ++D +   G +   NL+ VR   K SE A ++     
Sbjct: 1007 KYQDNILIPFVDDSVSRWTTSTTMVDYETVAGGDKFGNLWLVRCPKKASEEADEDGSGAH 1066

Query: 895  ---ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG 944
               ERG       RL+++   +  +         LV        G    +++    G IG
Sbjct: 1067 LIHERGYLHGAPNRLDLMIHTYTQDIPTSLHKTQLV-------AGGRDILVWTGFQGTIG 1119

Query: 945  VIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF 1003
            ++   +  E   F + L+  L      + G +H  +RS+    K V     +DGDL E +
Sbjct: 1120 MLVPFVSREDVDFFQNLEMQLASQCPPLAGRDHLIYRSYYAPVKGV-----IDGDLCEMY 1174

Query: 1004 LDLSRTRMDEISKTMNVSVEELCKRV 1029
              L       I+  ++ SV E+ +++
Sbjct: 1175 FLLPNDTKMMIAAELDRSVREIERKI 1200


>sp|Q7RYR4|RSE1_NEUCR Pre-mRNA-splicing factor rse-1 OS=Neurospora crassa (strain ATCC
            24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987)
            GN=rse-1 PE=3 SV=2
          Length = 1209

 Score =  209 bits (531), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 279/1164 (23%), Positives = 487/1164 (41%), Gaps = 161/1164 (13%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            I+G +  +  FR  G  +D++ +AT+  +  ++++  ++++     +        R    
Sbjct: 64   IFGIVRAIASFRLAGSHKDYIILATDSGRITIIEYLPKTNKFQRIHLETFGKSGVRRVIP 123

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
            GQ    DP  R  LI     + L  V+  +++ +L  +  +   +  VL +  +    G 
Sbjct: 124  GQYLAADPKGRACLISALEKNKLVYVLNRNSQAELTISSPLEAHKPGVLVLSLVALDVGY 183

Query: 121  AKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADLL 167
            A P    L  D  DA    T             YE+ L     V   WS + +D  + LL
Sbjct: 184  ANPVFAALELDYTDADQDPTGQAREEVETQLVYYELDLGLNHVVRK-WS-DTVDRTSSLL 241

Query: 168  IPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITK--AYGRVDADGSRYL 217
              VP     P   GVL+ GEE + Y  +N  AF+  IP R   T+     RV   G  + 
Sbjct: 242  FQVPGGNDGP--SGVLVCGEENVTYRHSNQEAFRVPIPRRSGATEDPQRKRVIVSGVMHK 299

Query: 218  LGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLDN 259
            L   AG    L+ T + +                  +V  LKI+      +A+++  L +
Sbjct: 300  LKGSAGAFFFLLQTDDGDLFKVTIDMIEDSDGNPTGEVKRLKIKYFDTIPVATSLCILKS 359

Query: 260  AVVYIGSSYGDSQLIKL---------------NLQPDAKGSYVEV------LERYV---- 294
              ++  S +G+    +                +   D   SY  V      LE  V    
Sbjct: 360  GFLFAASEFGNHHFYQFEKLGDDDEELEFSSDDFPTDPTASYNPVYFHPRPLENLVLVES 419

Query: 295  --NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMW 351
              ++ P VD  V +L  +   Q+ +  G     + R++++G+ ++E  + EL G    +W
Sbjct: 420  IDSMNPQVDCKVANLTGEDAPQIYSVCGNGARSTFRMLKHGLEVSEIVASELPGTPSAVW 479

Query: 352  SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
            + + +  D +D ++V+SF + T +L++   + +EE    GF +   TL       + L+Q
Sbjct: 480  TTKLTKYDQYDAYIVLSFTNGTLVLSIG--ETVEEVSDSGFLTTAPTLAVQQMGEDGLIQ 537

Query: 412  VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTE 470
            V    +R +        NEW +P   S+  ATAN +QV++A   G +VY E+  DG L E
Sbjct: 538  VHPKGIRHIVQGRV---NEWPAPQHRSIVAATANENQVVIALSSGEIVYFEMDSDGSLAE 594

Query: 471  V-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEI 528
              +  ++   ++ L +  + E    S   AVG   D +VRI SL PD  L  K       
Sbjct: 595  YDEKKEMSGTVTSLSVGQVPEGLKRSSFLAVGC-DDCTVRILSLDPDSTLEMKSIQALTA 653

Query: 529  IPRSVLLCAFE-----GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF 583
             P ++ + + E        YL   L  G  L  +L+  TGELTD ++  LG +P  L   
Sbjct: 654  APSALSIMSMEDSFGGSTLYLHIGLHSGVYLRTVLDEVTGELTDTRQKFLGPKPTRLFQV 713

Query: 584  SSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGEL 641
            S ++   V A S RP + Y+    K  + + ++  E+ +   F+S    + +       L
Sbjct: 714  SVQDQPCVLALSSRPWLGYTDPLTKGFMMTPLSYTELEYGWNFSSEQCLEGMVGIHANYL 773

Query: 642  TIGTIDDIQKLHI-RSIPLGEHPRRIC-HQEQSRTFAICSLKN-------QSCAEESEMH 692
             I +I+ +    I +SIPL   P+ +  H EQ   + I S  N           E+    
Sbjct: 774  RIFSIEKLGDNMIQKSIPLTYTPKHLVKHPEQPYFYTIESDNNTLPPELRAKLLEQQSNG 833

Query: 693  FVRLLDDQTFEF----------------ISTYP-------LDTFEYGCSILSCSF-SDDS 728
               +L  + F +                IS  P       LD  E   S     F S + 
Sbjct: 834  DATVLPPEDFGYPRAKGRWASCISIIDPISEEPRVLQRIDLDNNEAAVSAAIVPFASQEG 893

Query: 729  NVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLA 785
              +  VGT   ++ +  + T+G I V+   EDG+ L+ I +   +    +L  F G+LLA
Sbjct: 894  ESFLVVGTGKDMVLDPRQFTEGYIHVYRFHEDGRDLEFIHKTRVEEPPLALIPFQGRLLA 953

Query: 786  AINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHE 845
             + + +++Y   L+    R+ Q++       L + +Q++G+ I+VGDL + I+ ++YK E
Sbjct: 954  GVGKTLRIYDLGLK-QLLRKAQADV---TPTLIVSLQSQGNRIIVGDLQQGITYVVYKAE 1009

Query: 846  EGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEY 905
               +   A D    W +   ++D +   G +   N++ VR     + + +    E   E 
Sbjct: 1010 GNRLIPFADDTLNRWTTCTTMVDYESVAGGDKFGNIYIVRCPERVSQETD----EPGSEI 1065

Query: 906  HLGEFVNRFRHGS----------LVMRLPDS------DVGQIPTVIFGTVNGVIGV-IAS 948
            HL    N + HG+              LP S       VG    +++  + G +GV I  
Sbjct: 1066 HLMHARN-YLHGTPNRLSLQVHFYTQDLPTSICKTSLVVGGQDVLLWSGLQGTVGVFIPF 1124

Query: 949  LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSR 1008
            +  E   F + L+ ++R     + G +H  +R +    K V     +DGDL E F  L  
Sbjct: 1125 VSREDVDFFQNLENHMRAEDPPLAGRDHLIYRGYYTPVKGV-----IDGDLCERFSLLPN 1179

Query: 1009 TRMDEISKTMNVSVEELCKRVEEL 1032
             +   I+  ++ SV E+ +++ ++
Sbjct: 1180 DKKQMIAGELDRSVREIERKISDI 1203


>sp|Q5B1X8|RSE1_EMENI Pre-mRNA-splicing factor rse1 OS=Emericella nidulans (strain FGSC A4
            / ATCC 38163 / CBS 112.46 / NRRL 194 / M139) GN=rse1 PE=3
            SV=2
          Length = 1209

 Score =  207 bits (527), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 281/1165 (24%), Positives = 477/1165 (40%), Gaps = 163/1165 (13%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I TL  FR  G  +D++ I ++  +  ++++   S     R   +   + G R   
Sbjct: 64   VFGIIRTLAAFRLAGSNKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGVRRVV 122

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLD---IKFLYG 119
             GQ   +DP  R  LI     + L  V+  +++ +L  +  +   + Q L    +    G
Sbjct: 123  PGQYLAVDPKGRACLIASVEKNKLVYVLNRNSQAELTISSPLEAHKPQTLVYSVVALDAG 182

Query: 120  CAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADL 166
               P    L  D  ++    T             YE+ L     V   W+ + +D  + +
Sbjct: 183  YENPVFAALEVDYSESDQDPTGRAYEEVEKLLVYYELDLGLNHVVRK-WT-DPVDRTSSM 240

Query: 167  LIPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIRPS----------ITKAYGRVD 210
            L  VP       GVL+  E+ I Y  +N  AF+  IP R            IT       
Sbjct: 241  LFQVPGGADGPSGVLVCAEDNITYRHSNQDAFRVPIPRRKGAMENPERKRCITAGVMHKM 300

Query: 211  ADGSRYLLGDHAG-LLHLLVITHEKEK------VTGLKIELLGETSIASTISYLDNAVVY 263
                 +LL    G L  L +   E +K      V GLKI+      +AS++  L +  +Y
Sbjct: 301  RGAFFFLLQTEDGDLFKLTLDMVEDDKGQLTGEVKGLKIKYFDTVPLASSLLILKSGFLY 360

Query: 264  IGSSYGDSQLIKL-----------------NLQPDAKGSYV----------EVLERYVNL 296
            + +  G+    +                  +  P A  + V           ++E   +L
Sbjct: 361  VAAEGGNHHFYQFEKLGDDDEETEFNSDDFSADPAAPCTPVYFQPRGAENLNLVEAINSL 420

Query: 297  GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRS 355
             P+VD  VV++      Q+ T SG     + R +++G+ ++E    EL  +   +W+ + 
Sbjct: 421  NPLVDSKVVNISEDDAPQIFTVSGTGARSTFRTLKHGLEVSEIVESELPSVPSAVWTTKL 480

Query: 356  STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
            +  D FD ++V+SF + T +L++   + +EE    GF S   TL       + L+Q+   
Sbjct: 481  TRADEFDAYIVLSFANGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGEDSLIQIHPR 538

Query: 416  SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV-KH 473
             +R + +  R   NEW +P   S+  A  N  QV +A   G +VY E+  DG L E  + 
Sbjct: 539  GIRHILADRR--VNEWPAPQHRSIVAAATNERQVAVALSSGEIVYFELDADGSLAEYDER 596

Query: 474  AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRS 532
             Q+   ++CL +  + E    S   AVG   D +VRI SL PD  L  K        P +
Sbjct: 597  RQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDTTLENKSVQALTAAPSA 655

Query: 533  VLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
            + + A    S      YL   L  G  L   L+  TGEL+D +   LG++ + L   S  
Sbjct: 656  LNIIAMADSSSGGTTLYLHIGLHSGVYLRTALDEVTGELSDTRTRFLGSKAVKLFQVSVT 715

Query: 587  NTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIG 644
              T V A S RP + YS    K  + + ++   +     F+S    + +   +   L I 
Sbjct: 716  GQTAVLALSSRPWLGYSDTQTKGFMLTPLDYVGLEWGWNFSSEQCVEGMVGIQGQNLRIF 775

Query: 645  TIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFE 703
            +I+ +   +  +SIPL   PR      +   F +    N   +  +     RLL+D    
Sbjct: 776  SIEKLDNNMLQQSIPLAYTPRHFIKHPEEPLFYVIEADNNVLSPATR---ARLLEDSKAR 832

Query: 704  FIST---------YP--------------------------LDTFEYGCSILSCSF-SDD 727
               T         YP                          L+  E   SI +  F S D
Sbjct: 833  GGDTTVLPPEDFGYPRGTGHWASCIQIIDPLDAKAVVGAVELEENEAAVSIAAVPFTSQD 892

Query: 728  SNVYYCVGTAYVLPEENEPTK--GRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKL 783
               +  VGTA  +   N P+   G I ++   EDGK L+ I + + +    +L  F G+L
Sbjct: 893  DETFLVVGTAKDM-TVNPPSSAGGYIHIYRFQEDGKELEFIHKTKVEEPPLALLGFQGRL 951

Query: 784  LAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGDLMKSISLLIY 842
            LA +   +++Y     D G ++L  +C       A+  +QT+G  IVV D+ +S++ ++Y
Sbjct: 952  LAGVGSVLRIY-----DLGMKQLLRKCQAAVAPKAIVGLQTQGSRIVVSDVRESVTYVVY 1006

Query: 843  KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERGRL 899
            K+++  +     D  A W +A  ++D +   G +   NL+ VR   K SE A +E  G  
Sbjct: 1007 KYQDNVLIPFVDDSIARWTTAATMVDYETTAGGDKFGNLWLVRCPKKASEEADEEGSGAH 1066

Query: 900  EVVGEYHLGEFVNRFRHGSLVMRLPDSDV-----------GQIPTVIFGTVNGVIGVIAS 948
             +    +L    NR     L++ +   D+           G    +++    G IG++  
Sbjct: 1067 LIHDRGYLQGTPNRLE---LMIHVFTQDIPTSLHKTQLVAGGRDILVWTGFQGTIGILVP 1123

Query: 949  -LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLS 1007
             +  E   F + L+  L      + G +H  +RS+    K V     +DGDL E +  LS
Sbjct: 1124 FVSREDVDFFQSLEMQLASQCPPLAGRDHLIYRSYYAPVKGV-----IDGDLCEQYFLLS 1178

Query: 1008 RTRMDEISKTMNVSVEELCKRVEEL 1032
                  I+  ++ SV E+ +++ ++
Sbjct: 1179 NDTKMMIAAELDRSVREIERKISDM 1203


>sp|Q52E49|RSE1_MAGO7 Pre-mRNA-splicing factor RSE1 OS=Magnaporthe oryzae (strain 70-15 /
            ATCC MYA-4617 / FGSC 8958) GN=RSE1 PE=3 SV=2
          Length = 1216

 Score =  206 bits (523), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 287/1167 (24%), Positives = 487/1167 (41%), Gaps = 160/1167 (13%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I  L  FR  G ++D+L IA++  +  ++++   +    +R   +   + G R   
Sbjct: 64   VFGIIRDLASFRLAGSSKDYLIIASDSGRITIVEY-LPAQNRFSRIHLETFGKSGVRRVV 122

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
             GQ    DP  R  LI       L  V+  +++ +L  +  +   +   L +  +    G
Sbjct: 123  PGQYLAADPKGRACLIASVERCKLVYVLNRNSQAELTISSPLEAHKNGTLTLSLVALDVG 182

Query: 120  CAKPTIVVLY-------QDNKD--ARHVKT----YEVALKDKDFVEGPWSQNNLDNGADL 166
             + P    L        QD K   A++V+T    YE+ L     V   W  + +D+ A +
Sbjct: 183  YSNPVFAALEVDYSEVDQDPKGDAAQNVETVLNYYELDLGLNHVVRK-WF-DVVDSTASM 240

Query: 167  LIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITK--AYGRVDADGSRY 216
            L  VP     P   GVL+ GEE I Y  +N  AF+  IP R   T+  +  R    G  +
Sbjct: 241  LFQVPGGSDGP--SGVLVCGEENITYRHSNQDAFRVPIPRRKGATEDPSRKRNIVSGVMH 298

Query: 217  LLGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLD 258
             L   AG    L+ T + +                  +V  LKI+      +++ +  L 
Sbjct: 299  KLKGSAGAFFFLLQTEDGDLFKVTIDMVEDAEGNPTGEVRRLKIKYFDTIPVSNNLCILK 358

Query: 259  NAVVYIGSSYGDS---QLIKL------------NLQPDAKGSY------------VEVLE 291
            +  +++ S +G+    Q  KL            +   D K  Y            + ++E
Sbjct: 359  SGFLFVASEFGNHLFYQFEKLGDDDEELEFFSSDFPVDPKEPYEPVYFYPRPTENLALVE 418

Query: 292  RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGM 350
               ++ P++D  V +L  +   Q+ T SG     + R++++G+ +NE  + +L G    +
Sbjct: 419  SIDSMNPLMDLKVANLTEEDAPQIYTVSGKGARSTFRMLKHGLEVNEIVASQLPGTPSAV 478

Query: 351  WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
            W+ +   DD +D ++V+SF + T +L++   + +EE    GF S   TL       + LV
Sbjct: 479  WTTKLRRDDEYDAYIVLSFTNGTLVLSIG--ETVEEVSDTGFLSSVPTLAVQQLGDDGLV 536

Query: 411  QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILT 469
            QV    +R + +    + NEW SP   S+  A  N  QV +A   G +VY E+  DG L 
Sbjct: 537  QVHPKGIRHIRNG---VVNEWSSPQHRSIVAAATNERQVAVALSSGEIVYFEMDTDGSLA 593

Query: 470  EVKHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGE 527
            E    +  +  ++ L +  + E    S   AVG   D +VRI SL P+  L +K      
Sbjct: 594  EYDEKKEMFGTVTSLSLGEVPEGRLRSSYLAVGC-DDCTVRILSLDPESTLESKSVQALT 652

Query: 528  IIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLR 581
              P ++ + + E  S      YL   L  G  L  +L+  TGELTD ++  LG + + L 
Sbjct: 653  AAPSALSIMSMEDSSSGGTTLYLHIGLNSGVYLRTVLDEVTGELTDTRQKFLGPKAVRLF 712

Query: 582  TFSSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
              S +  T V A S R  + +S    K    + +N +E+     F S    + +      
Sbjct: 713  QVSVQKRTCVLALSSRSWLGFSDPVTKGFTMTPLNYEELEWGWNFVSEQCEEGMVGVNGQ 772

Query: 640  ELTIGTIDDIQKLHI-RSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFV---- 694
             L I  I+ +    I +SIPL   PR++      R F      N + A E     +    
Sbjct: 773  FLRIFAIEKLGDNVIQKSIPLTYTPRKLAKHPTQRIFYTIEADNNTLAPELREQLMAAPT 832

Query: 695  ------RLLDDQTFEF---------------------------ISTYPLDTFEYGCSILS 721
                  R+L    F +                           +    LD  E   S+  
Sbjct: 833  AVNGDARVLPPDEFGYPRGNGRWASCISVVDPLGDGEELEPGVVQRIDLDNNEAALSMAV 892

Query: 722  CSF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLN 777
             SF S D   +  VGT   ++      T+G I V+   EDG+ L+ I + + +    +L 
Sbjct: 893  VSFASQDGESFLVVGTGKDMVVNPRRFTEGYIHVYRFSEDGRELEFIHKTKVEEPPTALL 952

Query: 778  AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSI 837
             F G+L+A I + +++Y   LR    R+ Q+E       L + + T+G  I+VGD+   +
Sbjct: 953  PFQGRLVAGIGRMLRIYDLGLR-QLLRKAQAEVAPQ---LIVSLNTQGSRIIVGDVQHGL 1008

Query: 838  SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNSEGATDEE 895
              + YK E   +   A D  A W +   ++D D   GA+   NL+ +R  + +   +DE 
Sbjct: 1009 IYVAYKSETNRLIPFADDTIARWTTCTTMVDYDSTAGADKFGNLWILRCPEKASQESDEP 1068

Query: 896  RGRLEVV-GEYHLGEFVNRFRHGSLV--MRLPDS------DVGQIPTVIFGTVNGVIGV- 945
               + +V    +L    NR    + V    +P S       VG    +++G   G IGV 
Sbjct: 1069 GSEVHLVHSRDYLHGTSNRLALMAHVYTQDIPTSICKTNLVVGGQEVLLWGGFQGTIGVL 1128

Query: 946  IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLD 1005
            I  +  E   F + L+ +LR     + G +H  +R        V  K  +DGDL E +  
Sbjct: 1129 IPFVSREDADFFQSLEQHLRSEDPPLAGRDHLMYRGC-----YVPVKGVIDGDLCERYTM 1183

Query: 1006 LSRTRMDEISKTMNVSVEELCKRVEEL 1032
            L   +   I+  ++ SV E+ +++ ++
Sbjct: 1184 LPNDKKQMIAGELDRSVREIERKISDI 1210


>sp|Q9UTT2|RSE1_SCHPO Pre-mRNA-splicing factor prp12 OS=Schizosaccharomyces pombe (strain
            972 / ATCC 24843) GN=prp12 PE=1 SV=1
          Length = 1206

 Score =  201 bits (512), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 264/1176 (22%), Positives = 476/1176 (40%), Gaps = 177/1176 (15%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L+   +G I  +   R  G  +D+L + ++  +  +L+++ E ++L+            
Sbjct: 59   ILNQNCFGIIRNVAPLRLTGFKRDYLVVTSDSGRITILEYNVEKNKLVPIYQETFGKSGI 118

Query: 61   RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY 118
            R    G+   ID   R  +I     + L  V+  D++  L    +  LE  +  +I F  
Sbjct: 119  RRVVPGEYLAIDAKGRAAMIASVEKNKLVYVLNRDSEANL--TISSPLEAHKANNICFHL 176

Query: 119  -----GCAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLD 161
                 G A P    L  D  +  H  T E     +            + V   WS+  +D
Sbjct: 177  IGLDTGYANPIFAALEVDYSEIDHDSTREAFTSSEKVLSYYELDLGLNHVVKRWSKV-VD 235

Query: 162  NGADLLIPVP-----PPLCGVLIIGEETIVYCS-ANAFKAIPI-------RPSITKAYGR 208
              + +LIPVP     P   G L+I    I Y     AF  IPI         +I+  + +
Sbjct: 236  RNSYMLIPVPGGNDGP--SGTLVISNGWISYRHLQKAFHQIPILRRQAASANAISTPWNQ 293

Query: 209  VDADGSR------------------YLLGDHAGLLHLLVITHEKE-KVTGLKIELLGETS 249
            V+++ +                   YLL    G L  L I H+ +  V  L+++      
Sbjct: 294  VNSNSANDGPLIVSAVLHKMKGSFFYLLQTGDGDLLKLTIEHDGQGNVVELRLKYFDTVP 353

Query: 250  IASTISYLDNAVVYIGSSYGDSQLIKL---------------------------NLQPDA 282
            +A  ++ L    +++ + +G+ QL +                            N+    
Sbjct: 354  LAVQLNILKTGFLFVATEFGNHQLYQFENLGIDDDELEITSLDFQAQDNEVGTKNVHFGV 413

Query: 283  KG-SYVEVLERYVNLGPIVDFCVVDLERQGQG-QVVTCSGAYKDGSLRIVRNGIGINEQA 340
            +G   + ++E   +L  + D  ++     G+  Q+ T  G   + SLR +R G+   E  
Sbjct: 414  RGLQNLSLVEEIPSLYSLTDTLLMKAPSSGEANQLYTVCGRGSNSSLRQLRRGLETTEIV 473

Query: 341  SVELQGIK-GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTL 399
            + EL G    +W+L+ +  D +D+++++SF + T +L++   + +EE    GF S   TL
Sbjct: 474  ASELPGAPIAIWTLKLNQTDVYDSYIILSFTNGTLVLSIG--ETVEEISDSGFLSSVSTL 531

Query: 400  FCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLV 459
                   + LVQ+    +R + +  +   +EWK P    V  +  N  Q+++A   G LV
Sbjct: 532  NARQMGRDSLVQIHPKGIRYIRANKQT--SEWKLPQDVYVVQSAINDMQIVVALSNGELV 589

Query: 460  YLEIGD----GILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLP 514
            Y E+ D    G L E +  + L   ++ L + P+ E    S    +    D +VR+ SL 
Sbjct: 590  YFEMSDDVEGGQLNEYQERKTLTANVTSLALGPVQEGSRRSNFMCLAC-DDATVRVLSL- 647

Query: 515  DLNLITKEHLGGE----------IIPRSVLLCAFEGIS--YLLCALGDGHLLNFLLNMKT 562
            DL   T E+L  +          IIP +V      G+S  YL   L +G  L  ++++ +
Sbjct: 648  DL-YTTLENLSVQALSSPANSLCIIPMNV-----NGVSTLYLHIGLMNGVYLRTVIDVTS 701

Query: 563  GELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMC 622
            G+L D +   LG + + +   + KN   V A S R  + YS  + L  S +    + H  
Sbjct: 702  GQLLDTRTRFLGPRAVKIYPITMKNQNTVLAVSSRTFLAYSYQQNLQLSPIAYSAIDHAS 761

Query: 623  PFNSAAFPDSLAIAKEGELTIGTIDDIQ-KLHIRSIPLGEHPRRICHQ---------EQS 672
             F S   P+ +   ++  L I T+D +Q  L     PL   PR+I            +  
Sbjct: 762  SFASEQCPEGIVAIQKNTLKIFTVDSLQDDLKSDIYPLICTPRKIVKHPNFPVLYILQSE 821

Query: 673  RTF--------------AICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCS 718
            R F              +    K      +S + F+ + D  + + I   PL   E   S
Sbjct: 822  RNFDSFKYAQENGDVGSSYTKEKQNEHTSKSWVSFISVFDMISKKIIHESPLGDNEAAFS 881

Query: 719  ILSCSFSDDSNVYYCVGTAYVLPEENEPTKG---RILVFIVEDGKLQLIAEKETKGAVYS 775
            + +  F +    +   G+A  +  E         R+  F  E  KL+LI+  E  G   +
Sbjct: 882  MTAAFFKNRDEFFLVAGSATNMDLECRTCSHGNFRVYRFHDEGKKLELISHTEIDGIPMA 941

Query: 776  LNAFNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVG 831
            L  F G++LA + + +++Y    K MLR      +         +   ++  +   IVV 
Sbjct: 942  LTPFQGRMLAGVGRFLRIYDLGNKKMLRKGELSAVP--------LFITHITVQASRIVVA 993

Query: 832  DLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNSE 889
            D   S+  ++YK E+  +   A D    W +   ++D D   G +   N++ +R  ++  
Sbjct: 994  DSQYSVRFVVYKPEDNHLLTFADDTIHRWTTTNVLVDYDTLAGGDKFGNIWLLRCPEHVS 1053

Query: 890  GATDEERGRLEVVGEYHLGEFVNRFRHG-SLVMRLPDSDV-----------GQIPTVIFG 937
               DEE    +++   H   F+N   H   L+     +D+           G    +++ 
Sbjct: 1054 KLADEENSESKLI---HEKPFLNSTPHKLDLMAHFFTNDIPTSLQKVQLVEGAREVLLWT 1110

Query: 938  TVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLD 996
             + G +GV    +  E   F ++L+  LRK    + G +H  +RS+    K V     +D
Sbjct: 1111 GLLGTVGVFTPFINQEDVRFFQQLEFLLRKECPPLAGRDHLAYRSYYAPVKCV-----ID 1165

Query: 997  GDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            GDL E +  L     + I+  ++ ++ E+ K++E+ 
Sbjct: 1166 GDLCEMYYSLPHPVQEMIANELDRTIAEVSKKIEDF 1201


>sp|P0CR22|RSE1_CRYNJ Pre-mRNA-splicing factor RSE1 OS=Cryptococcus neoformans var.
            neoformans serotype D (strain JEC21 / ATCC MYA-565)
            GN=RSE1 PE=3 SV=1
          Length = 1217

 Score =  190 bits (482), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 277/1182 (23%), Positives = 476/1182 (40%), Gaps = 186/1182 (15%)

Query: 7    YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
            +G I  +  FR  G  +D++   ++  +  +L++    +        +V  + G R    
Sbjct: 60   FGTIRNIAGFRLAGMTKDYILATSDSGRLSILEFVISPTPHFESLYQEVFGKSGSRRIVP 119

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
            GQ   +DP  R  L+G      L  V+  + +G+L  +  +   +   L    +    G 
Sbjct: 120  GQFLAVDPKGRSCLVGSLEKTKLVYVLNRNTEGKLYPSSPLEAHKNHTLVTHIVGVDQGY 179

Query: 121  AKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDNGADLL 167
              P    L  D  ++             +H+  YE+ L     V   WS+   D  A+LL
Sbjct: 180  DNPLYAALETDYSESDQDSTGEAYENTQKHLTFYELDLGLNHVVRK-WSEPT-DRRANLL 237

Query: 168  IPVP------------PPLCGVLIIGEETIVYCSANA-FKAIPIRPSITKAYGRVDADGS 214
            + VP            P   GVL+  E+ I++   +     IPI         R   D S
Sbjct: 238  VQVPGGQNANSDRFEGP--SGVLVCTEDHIIWKHMDVEAHRIPIPRRRNPLVQR--GDKS 293

Query: 215  R-----------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYL 257
            R                 +LL    G L+ + I H  E V  LKI+      +A+++  L
Sbjct: 294  RGLIIVSAVMHKIKGAFFFLLQSEDGDLYKVWIEHNGEDVVALKIKYFDTVPVANSLCIL 353

Query: 258  DNAVVYIGSSYGDSQLIKLNLQPDAKG-------SYVE---------------------- 288
                +Y+ S + D  L +     +  G        Y E                      
Sbjct: 354  KRGYIYVASEFSDQNLYQFQSLAEDDGEQEWSSTDYPENGNIDGPLPFAFFDPQPLRNLL 413

Query: 289  VLERYVNLGPIVDFCVVDL--ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
            +++   +L PI D  VV+L        Q+    G     + R +++G+ + E  S  L G
Sbjct: 414  LVDTVPSLDPITDAHVVNLLGASSDTPQIYAACGRGARSTFRTLKHGLDVAEMVSSPLPG 473

Query: 347  IK-GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
            +   +W+L+ + DD +D+++V+SF + T +L++   + +EE    GF S   TL      
Sbjct: 474  VPTNVWTLKLTEDDEYDSYIVLSFPNGTLVLSIG--ETIEEVNDTGFLSSGPTLAVQQLG 531

Query: 406  YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG- 464
               L+QV    +R + +  R   +EW +PPG ++  AT N  QV++A     LVY E+  
Sbjct: 532  NAGLLQVHPYGLRHIRAADRV--DEWPAPPGQTIVAATTNRRQVVIALSTAELVYFELDP 589

Query: 465  DGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKE 522
            +G L+E +  + L    +C+ I  + E    +   AVG   + +V I SL PD  L T  
Sbjct: 590  EGSLSEYQEKKALPGNATCVTIAEVPEGRRRTSFLAVGC-DNQTVSIISLEPDSTLDTLS 648

Query: 523  HLGGEIIPRSVLLCAFEGIS--------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLG 574
                   P S+ L      S        +L   L +G LL  +++   G L+D +   LG
Sbjct: 649  LQALTAPPTSICLAEIFDTSIDKNRATMFLNIGLMNGVLLRTVVDPVDGSLSDTRLRFLG 708

Query: 575  TQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 634
             +P  L   + +    V A S R  ++Y+    L    +    + +    ++A  PD L 
Sbjct: 709  AKPPKLVRANVQGQPSVMAFSSRTWLLYTYQDMLQTQPLIYDTLEYAWSLSAAMCPDGLI 768

Query: 635  IAKEGELTIGTIDDI-QKLHIRSIPLGEHPRR-ICHQEQS---------RTFAICSLKNQ 683
                  L I  I  + +KL   S  L   PR+ I H   S         RT++  +++  
Sbjct: 769  GISGNTLRIFNIPKLGEKLKQDSTALTYTPRKFISHPFNSVFYMIEADHRTYSKSAIERI 828

Query: 684  SCAEESE--------------------------MHFVRLLDDQTFEFISTYPLDTFEYGC 717
               +ESE                             VR+LD    E I T  LD  E   
Sbjct: 829  VKQKESEGRRVDTLLLDLPANEFGRPRAPAGHWASCVRVLDPLANETIMTLDLDEDEAAF 888

Query: 718  SILSCSFS-DDSNVYYCVGTAYVLPEENEPTK-GRILVF-IVEDGK-LQLIAEKETKGAV 773
            SI    F       +  VGT      + +  K G + V+ I E G+ L+ + + +T    
Sbjct: 889  SIAIAYFERGGGEPFLVVGTGVKTTLQPKGCKEGYLRVYAIKEQGRILEFLHKTKTDDIP 948

Query: 774  YSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGD 832
              L  F G LLA I + ++LY+      G + L  +C ++G   A+  +  +G  I+VGD
Sbjct: 949  LCLAGFQGFLLAGIGKSLRLYEM-----GKKALLRKCENNGFPTAVVTINVQGARIIVGD 1003

Query: 833  LMKSISLLIYKHEEGAIEER-----ARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKN 887
            + +S    +Y+    +I  R     A D    W++ V  +D +     +   N+F  R +
Sbjct: 1004 MQESTFYCVYR----SIPTRQLLIFADDSQPRWITCVTSVDYETVACGDKFGNIFINRLD 1059

Query: 888  ---SEGATDEERG------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV---- 934
               SE   D+  G      +  ++G  H  E +  +  GS+V     + + +IP V    
Sbjct: 1060 PSISEKVDDDPTGATILHEKSFLMGAAHKTEMIGHYNIGSVV-----TSITKIPLVAGGR 1114

Query: 935  ---IFGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD 990
               ++ T++G +G +   +  +   F+  L+ ++R     + G +H  +R +      V 
Sbjct: 1115 DVLVYTTISGAVGALVPFVSSDDIEFMSTLEMHMRTQDISLVGRDHIAYRGY-----YVP 1169

Query: 991  AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             K  +DGDL ESF  L   +   I+  ++ SV ++ K++E++
Sbjct: 1170 IKGVVDGDLCESFSLLPYPKQQAIALDLDRSVGDVLKKLEQM 1211


>sp|P0CR23|RSE1_CRYNB Pre-mRNA-splicing factor RSE1 OS=Cryptococcus neoformans var.
            neoformans serotype D (strain B-3501A) GN=RSE1 PE=3 SV=1
          Length = 1217

 Score =  190 bits (482), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 277/1182 (23%), Positives = 476/1182 (40%), Gaps = 186/1182 (15%)

Query: 7    YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
            +G I  +  FR  G  +D++   ++  +  +L++    +        +V  + G R    
Sbjct: 60   FGTIRNIAGFRLAGMTKDYILATSDSGRLSILEFVISPTPHFESLYQEVFGKSGSRRIVP 119

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
            GQ   +DP  R  L+G      L  V+  + +G+L  +  +   +   L    +    G 
Sbjct: 120  GQFLAVDPKGRSCLVGSLEKTKLVYVLNRNTEGKLYPSSPLEAHKNHTLVTHIVGVDQGY 179

Query: 121  AKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDNGADLL 167
              P    L  D  ++             +H+  YE+ L     V   WS+   D  A+LL
Sbjct: 180  DNPLYAALETDYSESDQDSTGEAYENTQKHLTFYELDLGLNHVVRK-WSEPT-DRRANLL 237

Query: 168  IPVP------------PPLCGVLIIGEETIVYCSANA-FKAIPIRPSITKAYGRVDADGS 214
            + VP            P   GVL+  E+ I++   +     IPI         R   D S
Sbjct: 238  VQVPGGQNANSDRFEGP--SGVLVCTEDHIIWKHMDVEAHRIPIPRRRNPLVQR--GDKS 293

Query: 215  R-----------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYL 257
            R                 +LL    G L+ + I H  E V  LKI+      +A+++  L
Sbjct: 294  RGLIIVSAVMHKIKGAFFFLLQSEDGDLYKVWIEHNGEDVVALKIKYFDTVPVANSLCIL 353

Query: 258  DNAVVYIGSSYGDSQLIKLNLQPDAKG-------SYVE---------------------- 288
                +Y+ S + D  L +     +  G        Y E                      
Sbjct: 354  KRGYIYVASEFSDQNLYQFQSLAEDDGEQEWSSTDYPENGNIDGPLPFAFFDPQPLRNLL 413

Query: 289  VLERYVNLGPIVDFCVVDL--ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
            +++   +L PI D  VV+L        Q+    G     + R +++G+ + E  S  L G
Sbjct: 414  LVDTVPSLDPITDAHVVNLLGASSDTPQIYAACGRGARSTFRTLKHGLDVAEMVSSPLPG 473

Query: 347  IK-GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
            +   +W+L+ + DD +D+++V+SF + T +L++   + +EE    GF S   TL      
Sbjct: 474  VPTNVWTLKLTEDDEYDSYIVLSFPNGTLVLSIG--ETIEEVNDTGFLSSGPTLAVQQLG 531

Query: 406  YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG- 464
               L+QV    +R + +  R   +EW +PPG ++  AT N  QV++A     LVY E+  
Sbjct: 532  NAGLLQVHPYGLRHIRAADRV--DEWPAPPGQTIVAATTNRRQVVIALSTAELVYFELDP 589

Query: 465  DGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKE 522
            +G L+E +  + L    +C+ I  + E    +   AVG   + +V I SL PD  L T  
Sbjct: 590  EGSLSEYQEKKALPGNATCVTIAEVPEGRRRTSFLAVGC-DNQTVSIISLEPDSTLDTLS 648

Query: 523  HLGGEIIPRSVLLCAFEGIS--------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLG 574
                   P S+ L      S        +L   L +G LL  +++   G L+D +   LG
Sbjct: 649  LQALTAPPTSICLAEIFDTSIDKNRATMFLNIGLMNGVLLRTVVDPVDGSLSDTRLRFLG 708

Query: 575  TQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 634
             +P  L   + +    V A S R  ++Y+    L    +    + +    ++A  PD L 
Sbjct: 709  AKPPKLVRANVQGQPSVMAFSSRTWLLYTYQDMLQTQPLIYDTLEYAWSLSAAMCPDGLI 768

Query: 635  IAKEGELTIGTIDDI-QKLHIRSIPLGEHPRR-ICHQEQS---------RTFAICSLKNQ 683
                  L I  I  + +KL   S  L   PR+ I H   S         RT++  +++  
Sbjct: 769  GISGNTLRIFNIPKLGEKLKQDSTALTYTPRKFISHPFNSVFYMIEADHRTYSKSAIERI 828

Query: 684  SCAEESE--------------------------MHFVRLLDDQTFEFISTYPLDTFEYGC 717
               +ESE                             VR+LD    E I T  LD  E   
Sbjct: 829  VKQKESEGRRVDTLLLDLPANEFGRPRAPAGHWASCVRVLDPLANETIMTLDLDEDEAAF 888

Query: 718  SILSCSFS-DDSNVYYCVGTAYVLPEENEPTK-GRILVF-IVEDGK-LQLIAEKETKGAV 773
            SI    F       +  VGT      + +  K G + V+ I E G+ L+ + + +T    
Sbjct: 889  SIAIAYFERGGGEPFLVVGTGVKTTLQPKGCKEGYLRVYAIKEQGRILEFLHKTKTDDIP 948

Query: 774  YSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGD 832
              L  F G LLA I + ++LY+      G + L  +C ++G   A+  +  +G  I+VGD
Sbjct: 949  LCLAGFQGFLLAGIGKSLRLYEM-----GKKALLRKCENNGFPTAVVTINVQGARIIVGD 1003

Query: 833  LMKSISLLIYKHEEGAIEER-----ARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKN 887
            + +S    +Y+    +I  R     A D    W++ V  +D +     +   N+F  R +
Sbjct: 1004 MQESTFYCVYR----SIPTRQLLIFADDSQPRWITCVTSVDYETVACGDKFGNIFINRLD 1059

Query: 888  ---SEGATDEERG------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV---- 934
               SE   D+  G      +  ++G  H  E +  +  GS+V     + + +IP V    
Sbjct: 1060 PSISEKVDDDPTGATILHEKSFLMGAAHKTEMIGHYNIGSVV-----TSITKIPLVAGGR 1114

Query: 935  ---IFGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD 990
               ++ T++G +G +   +  +   F+  L+ ++R     + G +H  +R +      V 
Sbjct: 1115 DVLVYTTISGAVGALVPFVSSDDIEFMSTLEMHMRTQDISLVGRDHIAYRGY-----YVP 1169

Query: 991  AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             K  +DGDL ESF  L   +   I+  ++ SV ++ K++E++
Sbjct: 1170 IKGVVDGDLCESFSLLPYPKQQAIALDLDRSVGDVLKKLEQM 1211


>sp|Q4PGM6|RSE1_USTMA Pre-mRNA-splicing factor RSE1 OS=Ustilago maydis (strain 521 / FGSC
            9021) GN=RSE1 PE=3 SV=1
          Length = 1221

 Score =  172 bits (435), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 259/1175 (22%), Positives = 483/1175 (41%), Gaps = 171/1175 (14%)

Query: 7    YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
            +G I +L  FR  G ++D++ + ++  +  +L++  +++ L  +   +   R G R    
Sbjct: 62   FGVIRSLAAFRLTGGSKDYVIVGSDSGRIVILEYQPKTNSL-EKVHQETFGRSGSRRIVP 120

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIR-------LEELQVLDIKF 116
            GQ    DP  R  +IG      L  ++  D +  L  +  +        +  +  +D+ F
Sbjct: 121  GQYLATDPKGRATMIGAMEKAMLVYILNRDAQANLTISSPLEAHRPSAIIHHIVGVDVGF 180

Query: 117  ---LYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVE---------GPWSQNNLDNGA 164
               L+ C +    V Y D+      + +E A K   + E           WS+  +D  +
Sbjct: 181  ENPLFACLE----VDYSDSDHDPSGRAFEEAAKTLTYYELDLGLNHVVRKWSEP-VDPRS 235

Query: 165  DLLIPVPPPL----------CGVLIIGEETIVYCSANAFK-AIPIRPSITKAYGRVDADG 213
            +LL+ VP              GVL+  E+ I Y   +  +  +PI   +       +  G
Sbjct: 236  NLLVQVPGGYNQNLEKWDGPSGVLVCSEDYITYKHQDQPEHRVPIPKRLNPVEKLSERRG 295

Query: 214  SR--------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDN 259
            +               +L+    G L  + + H+ +++  LKI+      +AS +  L +
Sbjct: 296  TLIVASVLHKMKNAFFFLVQTEDGDLFKITMEHQDDEIRSLKIKYFDTVPVASGLVILRS 355

Query: 260  AVVYIGSSYGDSQLIKLN------------------------------LQPDAKGSYVEV 289
              +++ S YG   L                                    P    + V+V
Sbjct: 356  GFLFVASEYGAQLLYSFQKLGDDDDLPEYISTDYDENGAGRRRPQLPTFTPRPLDNLVQV 415

Query: 290  LERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-K 348
             E   +L PI+D   ++       Q+    G     S +++R+G+ + E  S +L G+  
Sbjct: 416  DE-MPSLDPILDAKPLNPLAADSPQIFAACGRGARSSFKMLRHGLEVQEAVSSDLPGVPS 474

Query: 349  GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ 408
             +W+ + +  D +D+++++SF++ T +L++   + +EE    GF + + TL       + 
Sbjct: 475  AVWTTKITQQDEYDSYIILSFVNGTLVLSIG--ETIEEVSDSGFLTSSSTLAVQQLGQDA 532

Query: 409  LVQVTSGSVRLVSSTSRELRNEWKSP--PG--YSVNVAT-ANASQVLLATGGGHLVYLEI 463
            L+QV    +R V    +++ NEW +P  P    +  VAT  N  QV++A     LVY E+
Sbjct: 533  LLQVHPHGIRHVL-VDKQI-NEWATPSLPNGRQTTIVATCTNERQVVVALSSNELVYFEL 590

Query: 464  G-DGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITK 521
              DG L E +  + +   +  + +    E    +   AVG   D +VRI SL   + +  
Sbjct: 591  DMDGQLNEYQERKAMGAGVLTMSMPDCPEGRQRTPYLAVGC-DDSTVRIISLEPNSTLAS 649

Query: 522  EHLGGEIIPRSVLLCA---------FEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVS 572
              +     P S +  A             +++   L +G LL  +L+  TG+LTD +   
Sbjct: 650  ISIQALTAPASSICMAEMLDATIDRNHATTFVNIGLQNGVLLRTILDAVTGQLTDTRTRF 709

Query: 573  LGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDS 632
            LG++ + L          V A S R  + Y+   +L +  +    + H   F++   P+ 
Sbjct: 710  LGSKAVRLIRTKVHGQAAVMALSTRTWLSYTYQDRLQFVPLIFDVLDHAWSFSAELCPEG 769

Query: 633  LAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRIC-HQEQSRTFAICSLKNQSCA---- 686
            L       L I TI  +  KL   S+ L   PR+I  H  +   F +   ++++ +    
Sbjct: 770  LIGIVGSTLRIFTIPSLASKLKQDSVALSYTPRKIANHPNEQGLFYVVEAEHRTLSPGAQ 829

Query: 687  --------EESEMHFVRLLDDQTFEF------------------------ISTYPLDTFE 714
                    +E + H   +LD    EF                             +D  E
Sbjct: 830  RRRTEMLGKELKPHQRGVLDLNPAEFGAIRAEAGNWASCIRAVDGVQAQTTHRLEMDDNE 889

Query: 715  YGCSILSCSF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETK 770
               SI    F S +  V   VG+A  V+       K  +  + ++++G+ L+L+ + E  
Sbjct: 890  AAFSIAVVPFASAEKEVMLVVGSAVDVVLSPRSCKKAYLTTYRLLDNGRELELLHKTEVD 949

Query: 771  GAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIV 829
                 L AF G+LLA I + +++Y     D G ++L  +C +     A+  +  +G  IV
Sbjct: 950  DIPLVLRAFQGRLLAGIGKALRIY-----DLGKKKLLRKCENRSFPTAVVSLDAQGSRIV 1004

Query: 830  VGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KN 887
            VGD+ +SI    YK  E  +   A D    +++   +LD D    A+   N++ +R   N
Sbjct: 1005 VGDMQESIIFASYKPLENRLVTFADDVMPKFVTRCTMLDYDTVAAADKFGNIYVLRLDGN 1064

Query: 888  SEGATDEERGRLEVV-------GEYHLGEFVNRFRHGSLVMRLPDSDV--GQIPTVIFGT 938
            +  + DE+   + +V       G  H    V  F  G ++  L  + +  G    +++  
Sbjct: 1065 TSRSVDEDPTGMTIVHEKPVLMGAAHKASLVAHFFVGDIITSLHRTAMVAGGREVLLYTG 1124

Query: 939  VNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDG 997
            ++G IG +   +  E    L  L+++LR+    + G +H  +RS     K+V     +DG
Sbjct: 1125 LSGSIGALVPFVSKEDVDTLSTLESHLRQENNSIVGRDHLAYRSSYAPVKSV-----IDG 1179

Query: 998  DLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            DL E+F  LS  + + I+  ++    E+ K++ +L
Sbjct: 1180 DLCETFGLLSPAKQNAIAGELDRKPGEINKKLAQL 1214


>sp|Q10426|RIK1_SCHPO Chromatin modification-related protein rik1 OS=Schizosaccharomyces
            pombe (strain 972 / ATCC 24843) GN=rik1 PE=1 SV=2
          Length = 1040

 Score =  131 bits (330), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 226/1060 (21%), Positives = 428/1060 (40%), Gaps = 127/1060 (11%)

Query: 4    VPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPT 63
            +P++  +  +  +RP G  +D+LF+      +  + WD +  ++I      V  R+  P 
Sbjct: 53   IPLFSTVKHIRPYRPPGLDRDYLFVVLNDDTYFSIYWDEDYQKVIVDH-PPVRYRVTFPW 111

Query: 64   DNGQIGIIDPDCRLIGLHLYDGLFKVI---------PFDNKGQLKEAFNIRLEELQVLDI 114
            +         D R+  + L      +I                +   F        + D+
Sbjct: 112  NRNAKSYCLVDLRMRAIFLSIDEISMICIRILSAEERLKTGRSIDSGFPFSFPVHLIYDM 171

Query: 115  KFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPL 174
              L   + PT+VVL+ D  D  +V  + + L  K          +L  G  L   V P +
Sbjct: 172  CILNDSSTPTLVVLHSDGLDC-YVTAFLLDLSSK----------SLGKGIRLFERVKPSM 220

Query: 175  CGV-----LIIGEETIVYCSANA-FKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLL 228
                    L++ E   ++C     F  I    +    +  +      Y++ D  G L  +
Sbjct: 221  IMPFGKRGLLVFESLFIHCMYRGNFVTINGPCTTYMHWTPLKGQKMHYIVCDTNGYLFGV 280

Query: 229  VITHEKEKVTGLKIELLG--ETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSY 286
              +   +    L +E L        ++++ +   +++IGS   +S+LI L+   D     
Sbjct: 281  YSSILGKNKWSLVMERLPIPPFDFITSLNSIHEGLLFIGSKNSESKLINLSTLKD----- 335

Query: 287  VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
               ++   NLGPI D  V  L+   +   + C+G  ++ SL   ++ + ++     ++ G
Sbjct: 336  ---VDSIPNLGPIHDLLV--LKNDIEKSFLVCAGTPRNASLIYFQHALKLDILGQTKISG 390

Query: 347  IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY 406
            I     L S  +      L + F SET  +A N++++ +  E++   S  +         
Sbjct: 391  ILRAMVLPSYPEHK----LFLGFPSET--VAFNIKEDFQ-LELDPSLSTKERTIALSGTN 443

Query: 407  NQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDG 466
             + VQVTS  + +  S  R      +       N A       ++  G    ++ +    
Sbjct: 444  GEFVQVTSTFLCIYDSAKRSRLVYIEK----ITNAACYQEYSAIVINGTALAIFKKD--- 496

Query: 467  ILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLIT-KEHLG 525
              TEV     E EISCLD +      +  QI  VG W+   V I +  D + I+      
Sbjct: 497  --TEVARKVFESEISCLDFS------AQFQIG-VGFWSK-QVMILTFSDNSSISCAFQTN 546

Query: 526  GEIIPRSVLLCAFEGI----SYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLR 581
               +PR+++L   EG+    + LL + G G   +++L       ++ K    GT P++ R
Sbjct: 547  VPSLPRNIIL---EGVGVDRNLLLVSSGSGEFKSYVLFKNNLVFSETKH--FGTTPVSFR 601

Query: 582  TFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGEL 641
             F+    T++   +D P ++Y  N  L Y  +++ +   +C F   +  D L     G L
Sbjct: 602  RFTMNIGTYIICNNDCPHMVYGFNGALCYMPLSMPQSYDVCQFRDNSGKDFLISVSLGGL 661

Query: 642  TIGTIDDIQKLHIRSIPLGEHP-RRICHQEQSRTFAICSLKN-----QSCAEESEMHFVR 695
                ++ + +L  R + L   P + I  Q +     + +L+N     +S  E   +  V 
Sbjct: 662  KFLQLNPLPELTPRKVLLEHVPLQAIIFQNK---LLLRTLENRYEDYESYKENYHLELVD 718

Query: 696  LLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILV-- 753
              DD +F   S    +  E    I   S          VGT+ +  ++  P  GR+++  
Sbjct: 719  SYDDNSFRVFSFTENERCEKVLKINESSL--------LVGTSIIEQDKLVPVNGRLILLE 770

Query: 754  FIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHH 813
            F  E   L++++      AV  L  +N + + A  Q++ + K     +    + S     
Sbjct: 771  FEKELQSLKVVSSMVLSAAVIDLGVYNDRYIVAFGQQVAIVKL---TEERLMIDSRISLG 827

Query: 814  GHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYL 873
              +L L V+  G+ I + D +   +++ +  ++  +  R   +  N + A  + +  +Y+
Sbjct: 828  SIVLQLIVE--GNEIAIADSIGRFTIMYFDGQKFIVVARYL-FGENIVKAA-LYEGTVYI 883

Query: 874  GAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI-- 931
             A N+  L  +R N +     +R   E V  YHL + V++F++       P ++      
Sbjct: 884  IATNSGLLKLLRYNKDAKNFNDRFICESV--YHLHDKVSKFQN------FPITNTNSFLE 935

Query: 932  PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKV-IKGVGGLNHEQWRSFNNEKKTVD 990
            P ++F T  G IG I SL  ++ L LE+L   +RK+    +  +++E       E   + 
Sbjct: 936  PKMLFATEIGAIGSIVSLKDKE-LELEELTRKIRKLKFSYLSSMDYESI-----EADLIS 989

Query: 991  AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVE 1030
               F+DGDL+   +D+ R    E+ +        LC+ VE
Sbjct: 990  PVPFIDGDLV---IDVKRWASSELFR--------LCRSVE 1018


>sp|Q9EPU4|CPSF1_MOUSE Cleavage and polyadenylation specificity factor subunit 1 OS=Mus
            musculus GN=Cpsf1 PE=1 SV=1
          Length = 1441

 Score =  102 bits (253), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 112/466 (24%), Positives = 193/466 (41%), Gaps = 74/466 (15%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            +    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct: 970  IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1029

Query: 673  RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
            + +A+ +  N  C                         + E   ++L+   ++E I  + 
Sbjct: 1030 KVYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPVSWEAIPNAR 1089

Query: 708  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
              L+ +E+   + + S   +  V     Y   GT  +  EE    +GRIL+      + E
Sbjct: 1090 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1148

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
             G      K +++ EKE KG V +L   NG L++AI QKI L  W LR     EL     
Sbjct: 1149 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1203

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                +    + +  +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+ 
Sbjct: 1204 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1263

Query: 872  YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
             LG   ++ + NL       E        RL    ++H+G  VN F       R P    
Sbjct: 1264 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1317

Query: 929  GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
             + P+             F T++G IG++  +  + Y  L  LQ  L  ++    GLN  
Sbjct: 1318 AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1377

Query: 978  QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
             +R  + +++ +    +N LDG+L+  +L LS     E++K +  +
Sbjct: 1378 AFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1423



 Score = 59.3 bits (142), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 108/565 (19%), Positives = 220/565 (38%), Gaps = 151/565 (26%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
            +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++      ++ D   
Sbjct: 74  FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
           +     ++ + DPD R   + +Y     V+PF       +++G + E            +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLMGEGQRSSFLPSYIID 192

Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
           +R L+E  L ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252

Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
           P  WS  +L       + VP P+ GV+I    +++Y +                  A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312

Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
           R   T+   R+  D +        + ++    G +++L +IT     V     +    + 
Sbjct: 313 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369

Query: 250 IASTISYLDNAVVYIGSSYGDSQLIKL--NLQ---------------PDAKGSYVE---- 288
           + +++  ++   +++GS  G+S L+K    LQ               P +K   VE    
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASSVREAADKEEPPSKKKRVEPAVG 429

Query: 289 ----------------------------------VLERYVNLGPIVDFCVVD---LERQG 311
                                             V +  +N+GP  +  V +   L  + 
Sbjct: 430 WTGGKTVPQDEVDEIEVYGSEAQSGTQLATYSFEVCDSMLNIGPCANAAVGEPAFLSEEF 489

Query: 312 QG------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL------------ 353
           Q       ++V CSG  K+G+L +++  I      + EL G   MW++            
Sbjct: 490 QNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEETP 549

Query: 354 ------------RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFC 401
                       ++  D     FL++S    T IL      E+ E +  GF +Q  T+F 
Sbjct: 550 KAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVFA 607

Query: 402 HDAIYNQ-LVQVTSGSVRLVSSTSR 425
            +   N+ +VQV+   +RL+   ++
Sbjct: 608 GNIGDNRYIVQVSPLGIRLLEGVNQ 632


>sp|Q10570|CPSF1_HUMAN Cleavage and polyadenylation specificity factor subunit 1 OS=Homo
            sapiens GN=CPSF1 PE=1 SV=2
          Length = 1443

 Score =  102 bits (253), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 115/460 (25%), Positives = 195/460 (42%), Gaps = 62/460 (13%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            V    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct: 972  VDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1031

Query: 673  RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
            + +A+ +  N  CA                        + E   ++L+   ++E I  + 
Sbjct: 1032 KVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSWEAIPNAR 1091

Query: 708  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
              L  +E+   + + S   +  V     Y   GT  +  EE    +GRIL+      + E
Sbjct: 1092 IELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1150

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
             G      K +++ EKE KG V +L   NG L++AI QKI L  W LR     EL     
Sbjct: 1151 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1205

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                +    + +  +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+ 
Sbjct: 1206 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1265

Query: 872  YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR----HGSLVMRLP 924
             LG   ++ + NL       E        RL    ++H+G  VN F      G+      
Sbjct: 1266 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSK 1325

Query: 925  DSDVGQIPTVI-FGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
             S V +   +  F T++G IG++  +  + Y  L  LQ  L  ++    GLN   +R  +
Sbjct: 1326 KSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLH 1385

Query: 984  NEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
             +++T+    +N LDG+L+  +L LS     E++K +  +
Sbjct: 1386 VDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1425



 Score = 55.8 bits (133), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 107/567 (18%), Positives = 220/567 (38%), Gaps = 153/567 (26%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
            +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++      ++ D   
Sbjct: 74  FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
           +     ++ + DPD R   + +Y     V+PF       +++G + E            +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIID 192

Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
           +R L+E  L ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252

Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
           P  WS  +L       + VP P+ GV++    +++Y +                  A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312

Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
           R   T+   R+  D +        + ++    G +++L +IT     V     +    + 
Sbjct: 313 R---TQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369

Query: 250 IASTISYLDNAVVYIGSSYGDSQLI----KLNLQP--------------------DAKGS 285
           + +++  ++   +++GS  G+S L+    KL   P                    DA   
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDATAG 429

Query: 286 Y--------------------------------VEVLERYVNLGPIVDFCVVD---LERQ 310
           +                                 EV +  +N+GP  +  V +   L  +
Sbjct: 430 WSAAGKSVPQDEVDEIEVYGSEAQSGTQLATYSFEVCDSILNIGPCANAAVGEPAFLSEE 489

Query: 311 GQG------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL-----RSSTDD 359
            Q       ++V CSG  K+G+L +++  I      + EL G   MW++     +   D+
Sbjct: 490 FQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEDN 549

Query: 360 P--------------------FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTL 399
           P                       FL++S    T IL      E+ E +  GF +Q  T+
Sbjct: 550 PKGEGTEQEPSTTPEADDDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTV 607

Query: 400 FCHDAIYNQ-LVQVTSGSVRLVSSTSR 425
           F  +   N+ +VQV+   +RL+   ++
Sbjct: 608 FAGNIGDNRYIVQVSPLGIRLLEGVNQ 634


>sp|Q10569|CPSF1_BOVIN Cleavage and polyadenylation specificity factor subunit 1 OS=Bos
            taurus GN=CPSF1 PE=1 SV=1
          Length = 1444

 Score = 99.0 bits (245), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 111/466 (23%), Positives = 193/466 (41%), Gaps = 74/466 (15%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            +    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct: 973  IDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1032

Query: 673  RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
            + +A+ +  +  C                         + E   ++L+   ++E I  + 
Sbjct: 1033 KVYAVATSTSTPCTRVPRMTGEEKEFETIERDERYVHPQQEAFCIQLISPVSWEAIPNAR 1092

Query: 708  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
              L+ +E+   + + S   +  V     Y   GT  +  EE    +GRIL+      + E
Sbjct: 1093 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1151

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
             G      K +++ EKE KG V +L   NG L++AI QKI L  W LR     EL     
Sbjct: 1152 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1206

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                +    + +  +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+ 
Sbjct: 1207 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1266

Query: 872  YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
             LG   ++ + NL       E        RL    ++H+G  VN F       R P    
Sbjct: 1267 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1320

Query: 929  GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
             + P+             F T++G IG++  +  + Y  L  LQ  L  ++    GLN  
Sbjct: 1321 AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1380

Query: 978  QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
             +R  + +++ +    +N LDG+L+  +L LS     E++K +  +
Sbjct: 1381 AFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTT 1426



 Score = 46.6 bits (109), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 62/326 (19%), Positives = 141/326 (43%), Gaps = 60/326 (18%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
            +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++      ++ D   
Sbjct: 77  FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 136

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
           +     ++ + DPD R   + +Y     V+PF       +++G + E            +
Sbjct: 137 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIID 195

Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
           +R L+E  L ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   
Sbjct: 196 VRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 255

Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
           P  WS  +L       + VP P+ GV+I    +++Y +                  A P+
Sbjct: 256 PVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 315

Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
           R   T+   R+  D +        + ++    G +++L +IT     V     +    + 
Sbjct: 316 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 372

Query: 250 IASTISYLDNAVVYIGSSYGDSQLIK 275
           + +++  ++   +++GS  G+S L+K
Sbjct: 373 LTTSMVTMEPGYLFLGSRLGNSLLLK 398



 Score = 38.5 bits (88), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 40/172 (23%), Positives = 72/172 (41%), Gaps = 36/172 (20%)

Query: 288 EVLERYVNLGPIVDFCVVD---LERQGQG------QVVTCSGAYKDGSLRIVRNGIGINE 338
           EV +  +N+GP  +  + +   L  + Q       ++V CSG  K+G+L +++  I    
Sbjct: 466 EVCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQV 525

Query: 339 QASVELQGIKGMWSL------------------------RSSTDDPFDTFLVVSFISETR 374
             + EL G   MW++                         +  D     FL++S    T 
Sbjct: 526 VTTFELPGCYDMWTVIAPVRKEQEETLKGEGTEPEPGAPEAEDDGRRHGFLILSREDSTM 585

Query: 375 ILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSSTSR 425
           IL      E+ E +  GF +Q  T+F  +   N+ +VQV+   +RL+   ++
Sbjct: 586 ILQTG--QEIMELDASGFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEGVNQ 635


>sp|Q5A7S5|RSE1_CANAL Pre-mRNA-splicing factor RSE1 OS=Candida albicans (strain SC5314 /
            ATCC MYA-2876) GN=RSE1 PE=3 SV=1
          Length = 1219

 Score = 96.7 bits (239), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 234/1145 (20%), Positives = 457/1145 (39%), Gaps = 185/1145 (16%)

Query: 24   DFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQIGIIDPD--CRLIGLH 81
            D + I ++     +LQ+D ++ + I++    ++         G+   IDP+  C L+   
Sbjct: 118  DGVVITSDSGNLSILQYDNKTKKFISKIQEPMTKNGWGRNYVGENLAIDPENRCILVAAM 177

Query: 82   LYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKF----------LYGC-----AKPTIV 126
              + LF  I  ++ G  + +  +     QVL +K           L+G       K  I+
Sbjct: 178  EKNKLFYKIESNSSGSKELSSPLEAHSKQVLCLKIVALNTDHNNPLFGALELTPEKKCII 237

Query: 127  VLYQDNKDARHVKTYEVALKDKDFVEGPWSQNN--LDNGADLLIPVPPPLCGVLIIGEET 184
              Y+ ++   HV      +K K     P S N+  L N  + LIP+P  + G+++ G   
Sbjct: 238  NYYELDQGLNHV------VKKK-----PNSSNSDPLPNDVNYLIPLPGHIGGMVVCGTNW 286

Query: 185  IVYCSANA---FKAIPIRPSITKAYGRVD-----ADGSRY--LLGDHAGLLHLLVITHE- 233
              Y   +    +  +P R   T+    V+         ++  LL +  G L  L + ++ 
Sbjct: 287  CFYDKLDGPRIYLPLPRRNGQTQDSIIVNHVTHVLKKKKFFILLQNALGDLFKLTVDYDF 346

Query: 234  -KEKVTGLKIELLGETSIASTISYLDNAVVY---------------IGSSYGDSQLI--- 274
             KE +  + I        A +++   N  ++               +G    + +L+   
Sbjct: 347  DKEIIKNISITYFDTIPPALSLNIFKNGFLFANVLNNDKLLYQFEKLGDDLTEGELVINS 406

Query: 275  ----KLNLQPDAKGSY----------VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSG 320
                 LN   ++  S+          ++VLE    L PI D  ++D       ++VT S 
Sbjct: 407  SDYESLNSVRESVTSFKLKGLDNLALIDVLE---TLSPITDSKIID------SKLVTLSS 457

Query: 321  AYKDGSLRIVRNGIGINEQASVELQ-GIKGMWSLRSSTDDPFDTFLVVSFISETRILAMN 379
                  ++ + +G+         L      +++ + S +   D +LV+S    ++ L ++
Sbjct: 458  H---SYVKSITHGVPTTTLVESPLPITPTDIFTTKLSLESANDEYLVISSSLSSKTLVLS 514

Query: 380  LEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRN-EWKSPPGYS 438
            + + +E+ E   F     T+         +VQ+ S  ++ V + +   +  +W  P G +
Sbjct: 515  IGEVVEDVEDSEFVLDQPTIAVQQVGIASVVQIYSNGIKHVRTVNGNKKTTDWFPPAGIT 574

Query: 439  VNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI-SCLDINPIGENPSYSQI 497
            +  AT N  QVL+A     +VY EI D    ++   Q   EI + +    I EN S    
Sbjct: 575  ITHATTNNQQVLIALSNLSVVYFEI-DATDDQLIEYQDRLEIATTITAMAIQENISEKSP 633

Query: 498  AAVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSVLLCAFEGISYLLCALGDGHL 553
             A+   +D ++++ SL + N +  + L          + +     E  +++   + +G  
Sbjct: 634  FAIIGCSDETIQVVSLQEHNCLEIKSLQALSANSSSLKMLKSSGKE--THVHIGMENGVY 691

Query: 554  LNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYSSNKKLLYSN 612
                ++   G L++ +   +G++P++L      N    + A S  P + Y        + 
Sbjct: 692  ARIKIDTINGNLSNSRVKYIGSKPVSLSVIKFSNEIEGILAISSAPWISYLYRDSFKITP 751

Query: 613  VNLKEVSHMCPFNSAAFP-DSLAIAKEGELTIGTI-------DDIQKLHIRSIPLGEHPR 664
            +   ++++   F S     + +   K+  L I ++       D  Q L I +  L   PR
Sbjct: 752  LLEIDITNGSSFISEDIGGEGIVGIKDNNLIIFSVGKEDSVFDPSQDLTIATTKLRYTPR 811

Query: 665  RICHQEQSRTFAI-----------CSLKNQSCAEESEMHF---------------VRLLD 698
            ++     +R F             C++         E ++               ++++D
Sbjct: 812  KMI-TNGNRLFISESEYNVQGPFKCNINGDVKENVDEDYYEAFGYEWKQNSWASCIQVVD 870

Query: 699  DQTFEFISTYPLDTFEYGCSILSCSFSDDSN-----VYYCVGTAY---VLPEENEPTKGR 750
             ++ + I +  LD  E   S+ + SF+  S       +  VG      +LP  N   K  
Sbjct: 871  SKSNQVIQSLQLDGNESIVSMSAVSFNKTSTPSVPASHLVVGVCTNQTILP--NSYDKSY 928

Query: 751  ILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC 810
            +  F +    LQL+ + E       L  F  KLL A    I+LY     D G ++L  + 
Sbjct: 929  LYTFKIGKKHLQLVHKTELDHIPQVLENFQDKLLVASGNHIRLY-----DIGQKQLLKKS 983

Query: 811  ----GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIE--ERARDYNANWMSAV 864
                    +I  +  QT  + I++ D  KS S++  K +E   +    A D     ++++
Sbjct: 984  TTIIDFSTNINKIIPQT--NRIIICDSHKS-SIVFAKFDESQNQFVPFADDVMKRQITSI 1040

Query: 865  EILDDDIYLGAENNFNLFTVR----------------KNSEGATDEERGRLEVVGEYHLG 908
              LD D  +G +   N+F  R                K  +G  +    +L+ + E+H+G
Sbjct: 1041 MNLDIDTLIGGDKFGNIFVTRIDEDISKQADDDWTILKTQDGILNSCPYKLQNLIEFHIG 1100

Query: 909  EFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYL-FLEKLQTNLRKV 967
            + +  F  G L       ++    +VI+  + G IG++  L  +  +  L  LQ  +++ 
Sbjct: 1101 DIITSFNLGCL-------NLAGTESVIYTGLQGTIGLLIPLVSKSEVELLFNLQLYMQQS 1153

Query: 968  IKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCK 1027
               + G +H + RS+ N       KN +DGDL+E FL+   +   EIS+ +N SV ++ K
Sbjct: 1154 QNNLVGKDHLKLRSYYNP-----IKNVIDGDLLERFLEFDISLKIEISRKLNKSVNDIEK 1208

Query: 1028 RVEEL 1032
            ++ +L
Sbjct: 1209 KLIDL 1213


>sp|Q7XWP1|CPSF1_ORYSJ Probable cleavage and polyadenylation specificity factor subunit 1
            OS=Oryza sativa subsp. japonica GN=Os04g0252200 PE=3 SV=2
          Length = 1441

 Score = 90.5 bits (223), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 87/353 (24%), Positives = 160/353 (45%), Gaps = 33/353 (9%)

Query: 702  FEFISTYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVF--I 755
            +E  ST P+  FE   ++    L  + + ++     +GTAYVL  E+   +GR+L+F   
Sbjct: 1095 WETKSTIPMQLFENALTVRIVTLHNTTTKENETLLAIGTAYVL-GEDVAARGRVLLFSFT 1153

Query: 756  VEDGKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGH 812
              +    L+ E   KE+KGAV ++ +  G LL A   KI L KW        EL +   +
Sbjct: 1154 KSENSQNLVTEVYSKESKGAVSAVASLQGHLLIASGPKITLNKWT-----GAELTAVAFY 1208

Query: 813  HGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIY 872
               +  + +    +F++ GD+ KSI  L +K +   +   A+D+ +    A E L D   
Sbjct: 1209 DAPLHVVSLNIVKNFVLFGDIHKSIYFLSWKEQGSQLSLLAKDFGSLDCFATEFLIDGST 1268

Query: 873  LG-----AENNFNLFTVRKNSEGATDEERG-RLEVVGEYHLGEFVNRFRHGSLVMRLPDS 926
            L      ++ N  +F     +    +  +G +L    E+H+G  + +F     +  LP  
Sbjct: 1269 LSLVASDSDKNVQIFYY---APKMVESWKGQKLLSRAEFHVGAHITKFLR---LQMLPTQ 1322

Query: 927  DVGQIPT----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF 982
             +    T    ++FG ++G IG IA +    +  L+ LQ  L   +  V GLN   +R F
Sbjct: 1323 GLSSEKTNRFALLFGNLDGGIGCIAPIDELTFRRLQSLQRKLVDAVPHVCGLNPRSFRQF 1382

Query: 983  NNEKK--TVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELT 1033
            ++  K       N +D +L+ S+  LS     ++++ +  +  ++     +++
Sbjct: 1383 HSNGKGHRPGPDNIIDFELLCSYEMLSLDEQLDVAQQIGTTRSQILSNFSDIS 1435



 Score = 66.2 bits (160), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 90/393 (22%), Positives = 160/393 (40%), Gaps = 109/393 (27%)

Query: 110 QVLDIKFLYGCAKPTIVVLYQDNKD------ARH----VKTYEVALKDKDFVEGP--WSQ 157
            V D  F++G  +P +V+L++          ++H    +  + +++  K   + P  WS 
Sbjct: 250 HVKDFAFVHGYIEPVLVILHEQEPTWAGRILSKHHTCMISAFSISMTLK---QHPVIWSA 306

Query: 158 NNLDNGADLLIPVPPPLCGVLIIGEETIVY------CSA--NAFKAIPI-RPSITKAYGR 208
            NL + A  L+ VPPP+ GVL+I   +I Y      CS   N F + P   P I+K+  +
Sbjct: 307 ANLPHDAYQLLAVPPPISGVLVICANSIHYHSQSTSCSLDLNNFSSHPDGSPEISKSNFQ 366

Query: 209 VDADGSR--------YLLGDHAGLLHLLVITHEKEKVTGL-----KIELLGE--TSIAST 253
           V+ D ++         +    AG + LL + ++   V  L     K  +L    TSI ++
Sbjct: 367 VELDAAKATWLSNDIVMFSTKAGEMLLLTVVYDGRVVQRLDLMKSKASVLSSAVTSIGNS 426

Query: 254 ISYL-----DNAVVYIGSSYGDSQLIKL-------------------------------- 276
             +L     D+ +V        S L  L                                
Sbjct: 427 FFFLGSRLGDSLLVQFSYCASKSVLQDLTNERSADIEGDLPFSKRLKRIPSDVLQDVTSV 486

Query: 277 ------------NLQPDAKGSYVEVLERYVNLGPIVDFC----------VVDLERQGQGQ 314
                       +L+   K SY+ V +  +N+GP+ DF            +   +Q   +
Sbjct: 487 EELSFQNIIAPNSLESAQKISYI-VRDALINVGPLKDFSYGLRANADPNAMGNAKQSNYE 545

Query: 315 VVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL-------RSSTDDPFDTFLVV 367
           +V CSG  K+GSL +++  I  +    VEL   +G+W++       + + D+ +  +L++
Sbjct: 546 LVCCSGHGKNGSLSVLQQSIRPDLITEVELPSCRGIWTVYYKSYRGQMAEDNEYHAYLII 605

Query: 368 SFISETRILAMNLEDELEE-TEIEGFCSQTQTL 399
           S   E R + +   D+L E TE   +  Q  T+
Sbjct: 606 SL--ENRTMVLETGDDLGEVTETVDYFVQASTI 636


>sp|Q9V726|CPSF1_DROME Cleavage and polyadenylation specificity factor subunit 1
            OS=Drosophila melanogaster GN=Cpsf160 PE=1 SV=1
          Length = 1455

 Score = 88.2 bits (217), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 108/436 (24%), Positives = 198/436 (45%), Gaps = 64/436 (14%)

Query: 654  IRSIPLGEHPRRICHQEQSRTFAICSL-------------KNQSCAEESE-MHFVR---- 695
            +R +PL   PR++ +  ++R + + +              +++  +EES    F+     
Sbjct: 1026 VRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKYYRFNGEDKELSEESRGERFIYPIGS 1085

Query: 696  -----LLDDQTFEFISTYPLDTFE-----YGCSILSCSFSDDSN---VYYCVGTAYVLPE 742
                 L+  +T+E +    + TFE         I+  S+    +    Y C+GT +    
Sbjct: 1086 QFEMVLISPETWEIVPDASI-TFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNY-S 1143

Query: 743  ENEPTKGRILVF-----IVEDGK------LQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
            E+  ++G I ++     + E GK      ++ I +KE KG V +++   G L+  + QKI
Sbjct: 1144 EDITSRGNIHIYDIIEVVPEPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKI 1203

Query: 792  QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
              Y W LRD    +L        +I    + T    I + D+ KSISLL ++ E   +  
Sbjct: 1204 --YIWQLRDG---DLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSL 1258

Query: 852  RARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDEERGRLEVVGEYH 906
             +RD+N   +  +E + D+  LG     AE N  ++  +  +  +   ++  L    +YH
Sbjct: 1259 ASRDFNPLEVYGIEFMVDNSNLGFLVTDAERNIIVYMYQPEARESLGGQK--LLRKADYH 1316

Query: 907  LGEFVNR-FR----HGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQ 961
            LG+ VN  FR       L  R P     +   V++GT++G +G    LP + Y     LQ
Sbjct: 1317 LGQVVNTMFRVQCHQKGLHQRQPFLYENK-HFVVYGTLDGALGYCLPLPEKVYRRFLMLQ 1375

Query: 962  TNLRKVIKGVGGLNHEQWRSFNNEKK--TVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
              L    + + GLN +++R+  + KK     ++  +DGDLI S+  ++ +  +E++K + 
Sbjct: 1376 NVLLSYQEHLCGLNPKEYRTLKSSKKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKIG 1435

Query: 1020 VSVEELCKRVEELTRL 1035
               EE+   + E+ RL
Sbjct: 1436 TRTEEILGDLLEIERL 1451



 Score = 56.6 bits (135), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 72/332 (21%), Positives = 136/332 (40%), Gaps = 65/332 (19%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
           +YG + +L+     G  +D L I+ +  K  VLQ D ++  L T ++    +   R    
Sbjct: 76  LYGNVMSLQCVSLAGAMRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDDIRGGWT 135

Query: 66  GQIGI----IDPDCRLIGLHLYDGLFKVIPFDNKGQLKE--------------------- 100
           G+  +    +DPD R   + +Y     V+PF     L E                     
Sbjct: 136 GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195

Query: 101 ---AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKDAR-HVKTYE-------VA 144
              ++ I L +L      VLDI+FL+G  +PT+++LY+  +     +K          ++
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255

Query: 145 LKDKDFVEG-PWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY-------------CSA 190
           L  +  V    W+ N+L      + P+  P+ G L++    ++Y              SA
Sbjct: 256 LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVPPYGVSLNSSA 315

Query: 191 NAFKAIPIRP------SITKA-YGRVDADGSRYLLGDHAGLLHLLVITHEKEK-VTGLKI 242
           +   A P++P      S+  A +  +D D  + ++    G L++L +  +  + V     
Sbjct: 316 DNSTAFPLKPQDGVRISLDCANFAFIDVD--KLVISLRTGDLYVLTLCVDSMRTVRNFHF 373

Query: 243 ELLGETSIASTISYLDNAVVYIGSSYGDSQLI 274
                + + S I  L +  +++GS  G+S L+
Sbjct: 374 HKAAASVLTSCICVLHSEYIFLGSRLGNSLLL 405



 Score = 43.5 bits (101), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 44/159 (27%), Positives = 72/159 (45%), Gaps = 26/159 (16%)

Query: 288 EVLERYVNLGPIVDFCV---VDLERQG-------------QGQVVTCSGAYKDGSLRIVR 331
           EV +  +N+ PI   C    V+ E  G             + ++V  +G  K+G+L +  
Sbjct: 485 EVCDSLMNVAPINYMCAGERVEFEEDGVTLRPHAESLQDLKIELVAATGHSKNGALSVFV 544

Query: 332 NGIGINEQASVELQGIKGMWSL------RSSTDDPFDTFLVVSFISETRILAMNLEDELE 385
           N I      S EL G   +W++      +SS +D  D F+++S  + T +L      E+ 
Sbjct: 545 NCINPQIITSFELDGCLDVWTVFDDATKKSSRNDQHD-FMLLSQRNSTLVLQTG--QEIN 601

Query: 386 ETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSST 423
           E E  GF     T+F  +    + +VQVT+  VRL+  T
Sbjct: 602 EIENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGT 640


>sp|A8XPU7|CPSF1_CAEBR Probable cleavage and polyadenylation specificity factor subunit 1
            OS=Caenorhabditis briggsae GN=cpsf-1 PE=3 SV=1
          Length = 1454

 Score = 87.0 bits (214), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 84/315 (26%), Positives = 146/315 (46%), Gaps = 35/315 (11%)

Query: 739  VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
            V+PE  +PT  R         K++++ +KE KG V  L A NG LL+ + QK+  + W  
Sbjct: 1153 VVPEPGQPTSNR---------KIKVLYDKEQKGPVTGLCAINGLLLSGMGQKV--FIWQF 1201

Query: 799  RDDGTRELQSECGHHGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYN 857
            +D+    + S    H ++  L+ ++T     +  D  +S+SL+ ++ E  A+   +RD  
Sbjct: 1202 KDNDLMGI-SFLDMHYYVYQLHSIRT---IALALDARESMSLIRFQEENKAMSIASRDDR 1257

Query: 858  --ANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEF 910
              A    A E L D +++G        N  LF+    +  +   E  RL V    ++G  
Sbjct: 1258 KCAQAPMASEFLVDGMHIGFLLSDEHGNITLFSYSPEAPESNGGE--RLTVKAAINIGTN 1315

Query: 911  VNRFRHGSLVMRLPDS-------DVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTN 963
            +N F        L DS       ++ Q    IFG+++G  G I  L  + Y  L  LQT 
Sbjct: 1316 INAFLRVKGHTSLLDSSSPEERENIEQRMNTIFGSLDGSFGYIRPLTEKSYRRLHFLQTF 1375

Query: 964  LRKVIKGVGGLNHEQWRSFNNEKKTV---DAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
            +  V   + GL+ +  RS    +  V   +A+N +DGD++E +L LS     ++++ + V
Sbjct: 1376 IGSVTPQIAGLHIKGARSSKPSQPIVNGRNARNLIDGDVVEQYLHLSVYDKTDLARRLGV 1435

Query: 1021 SVEELCKRVEELTRL 1035
                +   + +L R+
Sbjct: 1436 GRYHILDDLMQLRRM 1450



 Score = 68.2 bits (165), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 90/399 (22%), Positives = 158/399 (39%), Gaps = 93/399 (23%)

Query: 65  NGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEEL-----QVLDIKFLYG 119
           N  I   DP  R     +Y     ++PF    +   ++ I L+++      V D+ FL G
Sbjct: 138 NPPIVRTDPANRCAASLVYGKHIAILPFHENSKRILSYIIPLKQIDPRLDNVADMVFLEG 197

Query: 120 CAKPTIVVLYQD----------NKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIP 169
             +PTI+ LY+             D   +    V + D+ F    W   NL    + L+ 
Sbjct: 198 YYEPTILFLYEPLQTTPGRACVRYDTMCIMGVSVNIVDRQFAVV-WQTANLPMDCNSLLS 256

Query: 170 VPPPLCGVLIIGEETIVYCSA-------------NAFKAIPIRPSITKAYGRVDADGSRY 216
           +P PL G ++ G  TIVY +              + F   P++  +      +D   S Y
Sbjct: 257 IPKPLGGAVVFGSNTIVYLNQAVPPCGIVLNSCYDGFTKFPLK-DMKHLKMTLDCSTSVY 315

Query: 217 L------LGDHAGLLHL--LVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
           +      +G   G L+L  LV +     V  L+   + +TSIA T++      +++GS  
Sbjct: 316 MEDGRIAVGSREGDLYLLRLVTSSGGATVKSLEFSKVCDTSIAFTLTVCAPGHLFVGSRL 375

Query: 269 GDSQLIKLNL----QPDAK----------------------GSYVEV------------- 289
           GDSQL++  L    +  AK                      G  +E+             
Sbjct: 376 GDSQLLEYTLLKVTKESAKKQRLEQQNPSEIELDEDDIELYGGAIEMQQNDDDEQISESL 435

Query: 290 ----LERYVNLGPIVDFC----------VVDLERQGQ-GQVVTCSGAYKDGSLRIVRNGI 334
               L+R +N+GP+   C          ++D +R+     +VT SG  K+G+L + +  +
Sbjct: 436 QFRELDRLLNVGPVKSMCFGRPNYMSNDLIDAKRKDPVFDLVTASGHGKNGALCVHQRSM 495

Query: 335 GINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISET 373
                 S  L+G + +W++    ++    +L+VS +  T
Sbjct: 496 RPEIITSSLLEGAEQLWAVGRKENESH-KYLIVSRVRST 533


>sp|Q5BDG7|CFT1_EMENI Protein cft1 OS=Emericella nidulans (strain FGSC A4 / ATCC 38163 /
            CBS 112.46 / NRRL 194 / M139) GN=cft1 PE=3 SV=1
          Length = 1339

 Score = 86.3 bits (212), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 96/438 (21%), Positives = 192/438 (43%), Gaps = 70/438 (15%)

Query: 654  IRSIPLGEHPRRICHQEQSRTFAI--CSLKNQSCAEESEMH-----------------FV 694
            +R++P+G+   ++ +   S T+ +  C        E+ E+H                  +
Sbjct: 907  MRTVPIGQQIDKLTYVSASDTYVLGTCQRCEFRLPEDDELHPEWRNEEISFLPEVNQSSL 966

Query: 695  RLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY-----YCVGTAYVLPEENEPTKG 749
            +++  +T+  I +YPL+  E+   + + S     N +       VGT+     E+ P++G
Sbjct: 967  KVVSPKTWSVIDSYPLEPAEHIMVMKTMSLEVSENTHERRDMIVVGTSLAR-GEDIPSRG 1025

Query: 750  RILVF----IVEDG-------KLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYKW 796
             I VF    +V D        +L+LI ++  KGAV +L+   G+  L+AA  QK  +   
Sbjct: 1026 CIYVFEVIEVVPDPEQPETNRRLKLIGKEPVKGAVTALSEIGGQGFLIAAQGQKSMVRG- 1084

Query: 797  MLRDDGT----RELQSECGHHGHILALYVQTRGD-FIVVGDLMKSISLLIYKHEEGAIEE 851
             L++DG+      +  +C      +++  + +G    + GD +K +    Y  E   +  
Sbjct: 1085 -LKEDGSLLPVAFMDMQC-----FVSVIKELKGTGMCIFGDAVKGLWFAGYSEEPYKMSL 1138

Query: 852  RARDYNANWMSAVEILDDD---IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLG 908
             A+D +   + A + L D      + A+++ NL+ ++ + E        +L    ++H G
Sbjct: 1139 FAKDLDYLEVLAADFLPDGNKLFIVVADSDCNLYVLQYDPEDPNSSNGDKLLNRSKFHTG 1198

Query: 909  EFVN-------------RFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYL 955
             F +             R   GS  M +   +   +  V+  + NG IG++  +P E Y 
Sbjct: 1199 NFASTVTLLPRTLVSSERAMSGSDKMDI--DNTAPLHQVLVTSHNGSIGLVTCVPEESYR 1256

Query: 956  FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEIS 1015
             L  LQ+ L   ++   GLN   +R+  ++      +  LD +L+  +LD+S+ R  EI+
Sbjct: 1257 RLSALQSQLTNTLEHPCGLNPRAYRAVESDASA--GRGMLDSNLLLQYLDMSKQRKAEIA 1314

Query: 1016 KTMNVSVEELCKRVEELT 1033
              +  +  E+   +E ++
Sbjct: 1315 GRVGATEWEIRADLEAIS 1332


>sp|A1DB13|CFT1_NEOFI Protein cft1 OS=Neosartorya fischeri (strain ATCC 1020 / DSM 3700 /
            FGSC A1164 / NRRL 181) GN=cft1 PE=3 SV=1
          Length = 1400

 Score = 84.3 bits (207), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 88/371 (23%), Positives = 160/371 (43%), Gaps = 37/371 (9%)

Query: 694  VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY-----YCVGTAYVLPEENEPTK 748
            ++++  +T+  I +Y L   EY  ++ +       N +       VGTA+    E+ P++
Sbjct: 1027 LKVVSPRTWTVIDSYSLGPAEYVMAVKNMDLEVSENTHERRNMIVVGTAFAW-GEDIPSR 1085

Query: 749  GRILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYK 795
            G I VF V            D KL+LI ++  KGAV +L+   G+  L+AA  QK  +  
Sbjct: 1086 GCIYVFEVIKVVPDPEKPETDRKLKLIGKELVKGAVTALSQIGGQGFLIAAQGQKCMVRG 1145

Query: 796  WMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
              L++DG+    +      ++  +         ++GD +K +    Y  E   +    +D
Sbjct: 1146 --LKEDGSLLPVAFMDMQCYVNVVKELKGTGMCIMGDAVKGLWFAGYSEEPYKMSLFGKD 1203

Query: 856  YNANWMSAVEILDDD---IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 912
                 + A E L D      L A+++ NL  ++ + E        RL    ++H+G F  
Sbjct: 1204 QGYLEVVAAEFLPDGDKLFILVADSDCNLHVLQYDPEDPKSSNGDRLLARSKFHMGHFAT 1263

Query: 913  RFRHGSLVM-----RLPDSDVGQIPT------VIFGTVNGVIGVIASLPHEQYLFLEKLQ 961
                    M      + D D  +I +      V+  + +G +G++ S+P E Y  L  LQ
Sbjct: 1264 TMTLLPRTMVSSEKAMADPDSMEIDSQTISQQVLITSQSGSVGIVTSVPEESYRRLSALQ 1323

Query: 962  TNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
            + L   ++   GLN   +R+   E      +  LDG+L+  +LD+ + R  EI+  +   
Sbjct: 1324 SQLTNSLEHPCGLNPRAYRAV--ESDGTAGRGMLDGNLLYQWLDMGQHRKMEIAARVGAH 1381

Query: 1022 VEELCKRVEEL 1032
              E+   +E +
Sbjct: 1382 EWEIKADLEAI 1392


>sp|Q4WCL1|CFT1_ASPFU Protein cft1 OS=Neosartorya fumigata (strain ATCC MYA-4609 / Af293 /
            CBS 101355 / FGSC A1100) GN=cft1 PE=3 SV=2
          Length = 1401

 Score = 83.6 bits (205), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 91/371 (24%), Positives = 159/371 (42%), Gaps = 37/371 (9%)

Query: 694  VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY-----YCVGTAYVLPEENEPTK 748
            ++++  +T+  I +Y L   EY  ++ +       N +       VGTA+    E+ P++
Sbjct: 1028 LKVVSPRTWTVIDSYSLGPDEYVMAVKNMDLEVSENTHERRNMIVVGTAFAR-GEDIPSR 1086

Query: 749  GRILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYK 795
            G I VF V            D KL+LI ++  KGAV +L+   G+  L+AA  QK  +  
Sbjct: 1087 GCIYVFEVIKVVPDPEKPETDRKLKLIGKELVKGAVTALSQIGGQGFLIAAQGQKCMVRG 1146

Query: 796  WMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
              L++DG+    +      ++  L         ++GD +K +    Y  E   +    +D
Sbjct: 1147 --LKEDGSLLPVAFMDMQCYVNVLKELKGTGMCIMGDAVKGLWFAGYSEEPYKMSLFGKD 1204

Query: 856  YNANWMSAVEILDDD---IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 912
                 + A E L D      L A+++ NL  ++ + E        RL    ++H+G F  
Sbjct: 1205 QGYLEVVAAEFLPDGDKLFILVADSDCNLHVLQYDPEDPKSSNGDRLLARSKFHMGHFAT 1264

Query: 913  RFR-------HGSLVMRLPDS---DVGQIPT-VIFGTVNGVIGVIASLPHEQYLFLEKLQ 961
                           M  PDS   D   I   V+  + +G +G++ S+P E Y  L  LQ
Sbjct: 1265 TMTLLPRTMVSSEKAMANPDSMEIDSQTISQQVLITSQSGSVGIVTSVPEESYRRLSALQ 1324

Query: 962  TNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
            + L   ++   GLN   +R+   E      +  LDG+L+  +LD+ + R  EI+  +   
Sbjct: 1325 SQLANSLEHPCGLNPRAYRAV--ESDGTAGRGMLDGNLLYQWLDMGQHRKMEIAARVGAH 1382

Query: 1022 VEELCKRVEEL 1032
              E+   +E +
Sbjct: 1383 EWEIKADLEAI 1393



 Score = 33.5 bits (75), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 53/214 (24%), Positives = 90/214 (42%), Gaps = 38/214 (17%)

Query: 109 LQVLDIKFLYGCAKPTIVVLYQ----------DNKDARHVKTYEVALKDKDFVEGPWSQN 158
           L  + + FLY   +PT  +LY           + KD+     + + L+ +       S  
Sbjct: 232 LHPVSLAFLYEYREPTFGILYSQIATSHALLSERKDSIFYTVFTLDLEQRASTTL-LSVP 290

Query: 159 NLDNGADLLIPVPPPLCGVLIIGEETIVYC-SANAFKAIPIR--PSITKAYGRVD-ADGS 214
            L +    ++ +PPP+ G L+IG   +V+   A    A+ +        A+  VD +D +
Sbjct: 291 KLPSDLFKVVALPPPVGGALLIGSNELVHVDQAGKTNAVGVNEFARQVSAFSMVDQSDLA 350

Query: 215 RYLLG-------DHAGLLHL------LVITH---EKEKVTGLKIELL----GET---SIA 251
             L G       D  G L L      +V+ H   +   V+G+ +  L    G T   S A
Sbjct: 351 LRLEGCVVEHISDSTGDLLLVLSSGNMVLVHFQLDGRSVSGISLRPLPTQAGGTIMKSAA 410

Query: 252 STISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGS 285
           S+ ++L +  V+ GS   DS L+  +  P+ K S
Sbjct: 411 SSSAFLGSGRVFFGSEDADSVLLSWSSMPNPKKS 444


>sp|A1C3U1|CFT1_ASPCL Protein cft1 OS=Aspergillus clavatus (strain ATCC 1007 / CBS 513.65 /
            DSM 816 / NCTC 3887 / NRRL 1) GN=cft1 PE=3 SV=1
          Length = 1401

 Score = 83.6 bits (205), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 96/413 (23%), Positives = 172/413 (41%), Gaps = 56/413 (13%)

Query: 654  IRSIPLGEHPRRICHQEQSRTFAICSLKNQ--SCAEESEMH-----------------FV 694
            ++ + +GEH   + +   S T+ + +  +      E+ E+H                  +
Sbjct: 969  LKKVAIGEHVDHLAYSISSETYVLGTSHSADFKLPEDDELHPEWRNEAISFLPELRQCCL 1028

Query: 695  RLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY-----YCVGTAYVLPEENEPTKG 749
            +++  +T+  I +Y L   E   ++ + +     N +       VGTA     E+ P +G
Sbjct: 1029 KVVHPKTWTVIDSYTLGPDEEIMAVKNMNLEVSENTHERKNMIVVGTALAR-GEDIPARG 1087

Query: 750  RILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYKW 796
             I VF V            D KL+LI ++  KGAV +L+   G+  L+AA  QK  +   
Sbjct: 1088 CIYVFEVIKVVPDPEKPETDRKLKLIGKELVKGAVTALSEIGGQGFLIAAQGQKCMVRG- 1146

Query: 797  MLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDY 856
             L++DG+    +      ++  L         +VGD  K I    Y  E   +    +D 
Sbjct: 1147 -LKEDGSLLPVAFMDVQCYVNVLKELKGTGMCIVGDAFKGIWFAGYSEEPYKMSLFGKDL 1205

Query: 857  NANWMSAVEILDDD---IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
                + A + L D      L A+++ NL  ++   E        +L V  ++H+G F + 
Sbjct: 1206 EYPEVVAADFLPDGDKLFILVADSDCNLHVLQYEPEDPMSSNGDKLLVRSKFHMGHFTST 1265

Query: 914  FR-----HGSLVMRLPDSDVGQIPT------VIFGTVNGVIGVIASLPHEQYLFLEKLQT 962
                     S  +   DSD  ++        V+  + +G IG++ S+P E Y  L  LQ+
Sbjct: 1266 LTLLPRTTASYEIPSADSDSMEVDPRITPQQVLITSQSGSIGIVTSIPEESYRRLSALQS 1325

Query: 963  NLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEIS 1015
             L   ++   GLN   +R+   E      +  LDG+L+  +L +S+ R  EI+
Sbjct: 1326 QLANTVEHPCGLNPRAYRAI--ESDGTAGRGMLDGNLLYQWLSMSKQRRMEIA 1376


>sp|Q9N4C2|CPSF1_CAEEL Probable cleavage and polyadenylation specificity factor subunit 1
            OS=Caenorhabditis elegans GN=cpsf-1 PE=3 SV=2
          Length = 1454

 Score = 83.6 bits (205), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 76/312 (24%), Positives = 146/312 (46%), Gaps = 29/312 (9%)

Query: 739  VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
            V+PE ++PT  R         K++++ +KE KG V  L A NG LL  + QK+  + W  
Sbjct: 1153 VVPEPDQPTSNR---------KIKVLFDKEQKGPVTGLCAINGLLLCGMGQKV--FIWQF 1201

Query: 799  RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYN- 857
            +D+    + S    H ++  L+  +     +  D  +S+SL+ ++ +  A+   +RD   
Sbjct: 1202 KDNDLMGI-SFLDMHYYVYQLH--SLRTIAIACDARESMSLIRFQEDNKAMSIASRDDRK 1258

Query: 858  -ANWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN- 912
             A    A +++ D  ++G   ++   N+       E        RL V    ++G  +N 
Sbjct: 1259 CAQPPMASQLVVDGAHVGFLLSDETGNITMFNYAPEAPESNGGERLTVRAAINIGTNINA 1318

Query: 913  --RFRHGSLVMRLPDSD----VGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
              R R  + +++L + D    + Q  T +F +++G  G +  L  + Y  L  LQT +  
Sbjct: 1319 FVRLRGHTSLLQLNNEDEKEAIEQRMTTVFASLDGSFGFVRPLTEKSYRRLHFLQTFIGS 1378

Query: 967  VIKGVGGLNHEQWRSFNNEKKTV---DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVE 1023
            V   + GL+ +  RS    +  V   +A+N +DGD++E +L LS     ++++ + V   
Sbjct: 1379 VTPQIAGLHIKGSRSAKPSQPIVNGRNARNLIDGDVVEQYLHLSLYDKTDLARRLGVGRY 1438

Query: 1024 ELCKRVEELTRL 1035
             +   + +L R+
Sbjct: 1439 HIIDDLMQLRRM 1450



 Score = 70.5 bits (171), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 87/395 (22%), Positives = 153/395 (38%), Gaps = 96/395 (24%)

Query: 72  DPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEEL-----QVLDIKFLYGCAKPTIV 126
           DP  R     +Y     ++PF    +   ++ I L+++      + D+ FL G  +PTI+
Sbjct: 145 DPSNRCAACLVYGKHIAILPFHENSKRIHSYVIPLKQIDPRLDNIADMVFLDGYYEPTIL 204

Query: 127 VLYQDNK----------DARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCG 176
            LY+  +          D   +    V + D+ F    W   NL      L+P+P PL G
Sbjct: 205 FLYEPIQTTPGRACVRYDTMCIMGVSVNIVDRQFAV-VWQTANLPMDCSQLLPIPKPLGG 263

Query: 177 VLIIGEETIVYCSA-------------NAFKAIPIRPSITKAYGRVDADGSRYL------ 217
            L+ G  T+VY +              + F   P++  +      +D   S Y+      
Sbjct: 264 ALVFGSNTVVYLNQAVPPCGLVLNSCYDGFTKFPLK-DLKHLKMTLDCSTSVYMEDGRIA 322

Query: 218 LGDHAGLLHL--LVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIK 275
           +G   G L L  L+ +     V  L+   + ETSIA +++      +++GS  GDSQL++
Sbjct: 323 VGSRDGDLFLLRLMTSSGGGTVKSLEFSKVYETSIAYSLTVCAPGHLFVGSRLGDSQLLE 382

Query: 276 LNL---------------QPDAKGSYVEV------------------------------- 289
             L                 D   + +E+                               
Sbjct: 383 YTLLKTTRDCAVKRLKIDNKDPAAAEIELDEDDMELYGGAIEEQQNDDDEQIDESLQFRE 442

Query: 290 LERYVNLGPIVDFCV----------VDLERQGQ-GQVVTCSGAYKDGSLRIVRNGIGINE 338
           L+R  N+GP+   CV          VD +R+     +VT SG  K+G+L + +  +    
Sbjct: 443 LDRLRNVGPVKSMCVGRPNYMSNDLVDAKRRDPVFDLVTASGHGKNGALCVHQRSLRPEI 502

Query: 339 QASVELQGIKGMWSLRSSTDDPFDTFLVVSFISET 373
             S  L+G + +W++    ++    +L+VS +  T
Sbjct: 503 ITSSLLEGAEQLWAVGRKENESH-KYLIVSRVRST 536


>sp|A2R919|CFT1_ASPNC Protein cft1 OS=Aspergillus niger (strain CBS 513.88 / FGSC A1513)
            GN=cft1 PE=3 SV=1
          Length = 1383

 Score = 83.2 bits (204), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 97/438 (22%), Positives = 179/438 (40%), Gaps = 61/438 (13%)

Query: 654  IRSIPLGEHPRRICHQEQSRTFAI--CSLKNQSCAEESEMH-----------------FV 694
            ++ + LGE    + +   S  + +  C   +    E+ E+H                 F+
Sbjct: 942  LKRVHLGEQVDHLAYSTSSGMYVLGTCHATDFKLPEDDELHPEWRNEAISFFPSARGSFI 1001

Query: 695  RLLDDQTFE---------FISTYPLDTFEYGCSILSCSFSDDSNVY-----YCVGTAYVL 740
            +L+ D   +          + ++ L   EY  +I + S     N +       VGTA+  
Sbjct: 1002 KLVWDHHLQRQDSVILIFHLHSFSLGADEYVMAIKNISLEVSENTHERKDMIVVGTAFAR 1061

Query: 741  PEENEPTKGRILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQ 789
              E+ P++G I VF V            D KL+LI ++  KGAV +L+   G+    + Q
Sbjct: 1062 -GEDIPSRGCIYVFEVVQVVPDPDHPETDRKLKLIGKEPVKGAVTALSEIGGQGFVLVAQ 1120

Query: 790  KIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAI 849
              +     L++DG+    +      ++  +         ++GD +K +    Y  E   +
Sbjct: 1121 GQKCMVRGLKEDGSLLPVAFMDMQCYVSVVKELKGTGMCILGDAVKGVWFAGYSEEPYKM 1180

Query: 850  EERARDYNANWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYH 906
               A+D +   + A E L D   L    A+++ N+  ++ + E        RL    ++H
Sbjct: 1181 SLFAKDLDYLEVCAAEFLPDGKRLFIVVADSDCNIHVLQYDPEDPKSSNGDRLLSRSKFH 1240

Query: 907  LGEFVNRFRHGSLVM----RLPDSDVG-----QIP--TVIFGTVNGVIGVIASLPHEQYL 955
            +G F +        M    ++  S  G     Q P   V+  T NG +G+I  +P E Y 
Sbjct: 1241 MGNFASTLTLLPRTMVSSEKMVSSSDGMDIDNQSPLHQVLMTTQNGSLGLITCIPEESYR 1300

Query: 956  FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEIS 1015
             L  LQ+ L   ++   GLN   +R+   E      +  LDG+L+  ++D+S+ R  EI+
Sbjct: 1301 RLSALQSQLTNTLEHPCGLNPRAFRAV--ESDGTAGRGMLDGNLLFKWIDMSKQRKTEIA 1358

Query: 1016 KTMNVSVEELCKRVEELT 1033
              +     E+   +E ++
Sbjct: 1359 GRVGAREWEIKADLEAIS 1376



 Score = 38.1 bits (87), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 73/348 (20%), Positives = 132/348 (37%), Gaps = 105/348 (30%)

Query: 109 LQVLDIKFLYGCAKPTIVVLYQ----------DNKDARHVKTYEVALKDKDFVEGPWSQN 158
           L  + + FLY   +PT  +LY           + KD      + + L+ +       S +
Sbjct: 225 LHPISLAFLYEYREPTFGILYSQVATSSALLPERKDVVFYTVFTLDLEQQASTV-LLSVS 283

Query: 159 NLDNGADLLIPVPPPLCGVLIIGEETIVYC----SANAFKAIPIRPSITKAYGRVDADGS 214
            L +    ++ +PPP+ G L+IG   +V+       NA         ++       +D +
Sbjct: 284 RLPSDLFRVVALPPPVGGALLIGSNELVHIDQAGKTNAVGVNEFSRQVSSFSMTDQSDLA 343

Query: 215 RYL-------LGDHAGLLHLLVITHEK---------EKVTGLKIELL----GETSI---A 251
             L       LGD +G + L++ T E            V+G+ + LL    G TSI   A
Sbjct: 344 LRLENCIVECLGDSSGDMLLVLTTGEMAIVKFKLDGRSVSGISVHLLPAHAGLTSIYSAA 403

Query: 252 STISYLDNAVVYIGSSYGDSQLI----------KLNLQ---------------------- 279
           +  +++ +  +++GS  GDS L+          K  LQ                      
Sbjct: 404 AASTFIGDGKIFLGSEDGDSVLLGYSYSSSSTKKHRLQAKQVIDDSADMSEEDQSDDDVY 463

Query: 280 --------PD-------------AKGSY-VEVLERYVNLGPIVDFCV-----VDLERQG- 311
                   PD             A G Y   + ++ +N+GP+ D  +      +LE+ G 
Sbjct: 464 EDDLYSTSPDTTLTGRRPSGESSAFGLYDFRIHDKLINIGPLRDITMGKRLSTNLEKTGD 523

Query: 312 -------QGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
                  + Q+V   G++K G L ++   I  +  AS+ L+ +  +W+
Sbjct: 524 RTNSTSPELQIVASQGSHKSGGLVVMAREIDPHVVASISLESVDCIWT 571


>sp|Q6BYK1|RSE1_DEBHA Pre-mRNA-splicing factor RSE1 OS=Debaryomyces hansenii (strain ATCC
            36239 / CBS 767 / JCM 1990 / NBRC 0083 / IGC 2968)
            GN=RSE1 PE=3 SV=2
          Length = 1256

 Score = 79.3 bits (194), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 83/306 (27%), Positives = 133/306 (43%), Gaps = 52/306 (16%)

Query: 761  LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTREL----QSECGHHGHI 816
            L+ + + E      ++  FNG+LL  ++  ++LY     D G R+L     S   +  +I
Sbjct: 963  LEFVHKTELDYQPTAIIPFNGRLLVGMSNFLRLY-----DLGQRQLLRKASSNIEYLKNI 1017

Query: 817  LALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAE 876
            + L  Q  G  IVVGD   S + + Y   E      A D     ++A+  LD D  +G +
Sbjct: 1018 IRLTHQG-GSRIVVGDSSMSTTFVKYDSTENQFIPFADDIMKRQITALVTLDYDTIIGGD 1076

Query: 877  NNFNLFTVR----------------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLV 920
               N+F  R                +  E   +    RL+ + E++L +    F  GSLV
Sbjct: 1077 KFGNIFVSRVPETISQQSDKDWSLLRYQESYLNGSGSRLKNICEFYLQDIPTSFTKGSLV 1136

Query: 921  MRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYL-FLEKLQTNLRKVI--------KGV 971
            M       G   ++I+  + G +G++  L  E  + FL  LQ  LRK          K  
Sbjct: 1137 M-------GGKESIIYTGIQGTLGLLLPLSTENEVKFLGDLQLLLRKYFDYNFDDFDKDK 1189

Query: 972  GGLN-----HEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELC 1026
             G N     H ++RS+ N       KN +DGDLIE F +LS++    I   +N +  E+ 
Sbjct: 1190 NGYNLLGKDHLKFRSYYNP-----VKNVMDGDLIERFYELSQSMKIRIGTELNRTPREIE 1244

Query: 1027 KRVEEL 1032
            K++ E+
Sbjct: 1245 KKISEM 1250



 Score = 63.5 bits (153), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 141/680 (20%), Positives = 264/680 (38%), Gaps = 90/680 (13%)

Query: 7   YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNG 66
           +G I  ++  R  G  +D++ I ++  K  VL++D E  + +       S    R T  G
Sbjct: 67  FGIIQGIDKIRLVGTQKDYVVITSDSGKLVVLEFDIEKLQFVPLFQEPHSKNGLRRTSPG 126

Query: 67  QIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLK-----EAFNIRLEELQV--LDIKF- 116
           +   +DP  R  LIG    + L   +  +++G+L+     E F+     LQ+  +D  F 
Sbjct: 127 EYLCVDPHNRAILIGAIEKNKLVYKVQSNDEGKLELSSPLETFSKHTLTLQICAMDTGFE 186

Query: 117 --LYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEG------PWSQNNLDNGADLLI 168
             ++   +       QDN +        + L   +  +G        S   +   +  LI
Sbjct: 187 NPMFAAIECDYNARQQDNGEEEDAGEASLLLNYYELDQGLNHVVKHKSNEKIPGSSSHLI 246

Query: 169 PVPPPLCGVLIIGEETIVYCSANAFK---AIPIRPS------ITKAYGRVDADGSRYLLG 219
           P+P  + G+L+  + TI+Y   +  K    IPIR +      +     R+  +    L+ 
Sbjct: 247 PLPDFIGGLLVCSKSTIIYAHPSKDKLYLPIPIRSNTNETLIVNHVIHRLKKNNFFILVQ 306

Query: 220 DHAGLLHLLVITHEK--EKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS---QLI 274
              G    + I H++  E +  + I       ++ +++   +  ++   +  +    Q  
Sbjct: 307 SQLGDCFKITIDHDEVNESIENINITYFDTIPLSQSLNIFKSGFLFANVATNNKLFYQFE 366

Query: 275 KL-------NLQPDAKGSYVEVLERYVN--------------------LGPIVDFCVVDL 307
           KL        LQ      Y  + E  ++                    L PI D  +++ 
Sbjct: 367 KLGDDNNNTTLQSCNFSDYNSIFELDISKRSFKVAGLENLALVDIMETLNPITDGALIET 426

Query: 308 ERQGQGQVVTCSGAYKDGS-LRIVRNGIGINEQASVELQGIK--GMWSLRSSTDDPFDTF 364
            R           A    S L+ + +GI  N   S  L  IK   + + R   +   D +
Sbjct: 427 LRPEVPDPFKQLTALSSHSYLKTLTHGISTNTVVSSPLP-IKPTAIHTTRIFAESANDEY 485

Query: 365 LVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTS 424
           LV+S    ++ L +++ + +EE     F +   T+       + +VQ+ S  +R +  T 
Sbjct: 486 LVISSTLSSQTLVLSIGEVVEEVNDSQFVTNEPTINVQQVGKSSVVQIYSNGIRHIKHTM 545

Query: 425 R-----ELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG--DGILTEVKHAQLE 477
           R     +   +W  P G S+  A+ N  QV++      + Y EI   D  L E +  +LE
Sbjct: 546 RNDTIEKKYTDWYPPAGISIIQASTNNEQVIIGLSNREICYFEIDPHDDQLVEYQE-RLE 604

Query: 478 YE--------ISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD---LNLITKEHLGG 526
                     IS   I+ +    SY   A VG  +D +++  SL     L ++T + L  
Sbjct: 605 MSGGSISALAISSSSISKLQRKSSY---AIVG-CSDETIQAISLKPHNCLEIVTLQALSA 660

Query: 527 EIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
                S+ +      + +   + +G  +   ++  TG+L+D +   LG++P+ L      
Sbjct: 661 N--SSSIAMVPHGYSTSVHIGMENGLYVRVTIDEITGKLSDTRIQFLGSKPVQLSVIGLP 718

Query: 587 NTTH--VFAASDRPTVIYSS 604
                 + A S RP + Y S
Sbjct: 719 QLQQNGLLAISSRPWIGYYS 738


>sp|Q9FGR0|CPSF1_ARATH Cleavage and polyadenylation specificity factor subunit 1
            OS=Arabidopsis thaliana GN=CPSF160 PE=1 SV=2
          Length = 1442

 Score = 78.2 bits (191), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 78/302 (25%), Positives = 137/302 (45%), Gaps = 24/302 (7%)

Query: 702  FEFISTYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE 757
            +E  +  P+ T E+  ++    L  + + ++     VGTAYV   E+   +GR+L+F   
Sbjct: 1097 WETKAKIPMQTSEHALTVRVVTLLNASTGENETLLAVGTAYV-QGEDVAARGRVLLFSFG 1155

Query: 758  ---DGKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
               D    ++ E   +E KGA+ ++ +  G LL +   KI L+KW    +GT        
Sbjct: 1156 KNGDNSQNVVTEVYSRELKGAISAVASIQGHLLISSGPKIILHKW----NGTELNGVAFF 1211

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDD-- 869
                +  + +     FI++GD+ KSI  L +K +   +   A+D+ +    A E L D  
Sbjct: 1212 DAPPLYVVSMNVVKSFILLGDVHKSIYFLSWKEQGSQLSLLAKDFESLDCFATEFLIDGS 1271

Query: 870  DIYLGAENNFNLFTVRKNSEGATDEERG-RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
             + L   +      V   +    +  +G +L    E+H+G  V++F    L +++  S  
Sbjct: 1272 TLSLAVSDEQKNIQVFYYAPKMIESWKGLKLLSRAEFHVGAHVSKF----LRLQMVSSGA 1327

Query: 929  GQIP--TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEK 986
             +I    ++FGT++G  G IA L    +  L+ LQ  L   +  V GLN   +R F +  
Sbjct: 1328 DKINRFALLFGTLDGSFGCIAPLDEVTFRRLQSLQKKLVDAVPHVAGLNPLAFRQFRSSG 1387

Query: 987  KT 988
            K 
Sbjct: 1388 KA 1389



 Score = 77.0 bits (188), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 95/403 (23%), Positives = 159/403 (39%), Gaps = 108/403 (26%)

Query: 103 NIR-LEELQVLDIKFLYGCAKPTIVVLYQDNK------DARHVKTYEVALK-DKDFVEGP 154
           N+R LE   V D  FL+G  +P IV+L ++          +H      AL  +    + P
Sbjct: 240 NLRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINSTLKQHP 299

Query: 155 --WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDAD 212
             WS  NL + A  L+ VP P+ GVL++   TI Y S +A  A+ +    + A    +  
Sbjct: 300 VIWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELP 359

Query: 213 GSRY-----------------LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTIS 255
            S +                 LL   +G L LL + ++   V  L +     + +AS I+
Sbjct: 360 ASNFSVELDAAHGTWISNDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDIT 419

Query: 256 YLDNAVVYIGSSYGDSQLI------------------------------KLNLQPD---- 281
            + N++ ++GS  GDS L+                              +L +  D    
Sbjct: 420 SVGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRLRMTSDTFQD 479

Query: 282 -------------------AKGSY-VEVLERYVNLGPIVDFC----------VVDLERQG 311
                              A+ S+   V +  VN+GP+ DF              + +Q 
Sbjct: 480 TIGNEELSLFGSTPNNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGVSKQS 539

Query: 312 QGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL--------------RSST 357
             ++V CSG  K+G+L ++R  I       VEL G KG+W++               ++ 
Sbjct: 540 NYELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADSSKMAAD 599

Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEE-TEIEGFCSQTQTL 399
           +D +  +L++S   E R + +   D L E TE   +  Q +T+
Sbjct: 600 EDEYHAYLIISL--EARTMVLETADLLTEVTESVDYYVQGRTI 640


>sp|Q2TZ19|CFT1_ASPOR Protein cft1 OS=Aspergillus oryzae (strain ATCC 42149 / RIB 40)
            GN=cft1 PE=3 SV=1
          Length = 1393

 Score = 70.5 bits (171), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 76/331 (22%), Positives = 139/331 (41%), Gaps = 40/331 (12%)

Query: 734  VGTAYVLPEENEPTKGRILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGK 782
            VGTA+   E+   ++G + VF V            D KL+L+ ++  KGAV +L+   G+
Sbjct: 1065 VGTAFARGEDIA-SRGCVYVFEVIKVVPDPKRPEMDRKLRLVGKEPVKGAVTALSEIGGQ 1123

Query: 783  LLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIY 842
                + Q  +     L++DG+    +      H+  +         ++ D +K +    Y
Sbjct: 1124 GFLIVAQGQKCIVRGLKEDGSLLPVAFMDVQCHVSVVKELKGTGMCIIADAVKGLWFAGY 1183

Query: 843  KHEEGAIEERARDYNANWMSAVEILDDD---IYLGAENNFNLFTVRKNSEGATDEERGRL 899
              E   +   A+D +   + A + L D      L A+++ NL  ++ + E        RL
Sbjct: 1184 SEEPYKMSLFAKDLDYLEVLAADFLPDGNKLFILVADSDCNLHVLQYDPEDPKSSNGDRL 1243

Query: 900  EVVGEYHLGEFVNRFRHGSLVMRLPDSDVG---------------QIP--TVIFGTVNGV 942
                ++H G F+      S +  LP + V                +IP   ++  + NG 
Sbjct: 1244 LSRSKFHTGNFI------STLTLLPRTSVSSEQMISDVDAMDVDIKIPRHQMLITSQNGS 1297

Query: 943  IGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIES 1002
            +G++  +  E Y  L  LQ+ L   I+   GLN   +R+   E      +  LDG L+  
Sbjct: 1298 VGLVTCVSEESYRRLSALQSQLTNTIEHPCGLNPRAFRAV--ESDGTAGRGMLDGKLLFQ 1355

Query: 1003 FLDLSRTRMDEISKTMNVSVEELCKRVEELT 1033
            +LD+S+ R  EI+  +  +  E+    E ++
Sbjct: 1356 WLDMSKQRKVEIASRVGANEWEIKADFEAIS 1386


>sp|Q6FLQ6|RSE1_CANGA Pre-mRNA-splicing factor RSE1 OS=Candida glabrata (strain ATCC 2001
           / CBS 138 / JCM 3761 / NBRC 0622 / NRRL Y-65) GN=RSE1
           PE=3 SV=1
          Length = 1296

 Score = 68.9 bits (167), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 80/349 (22%), Positives = 155/349 (44%), Gaps = 58/349 (16%)

Query: 249 SIASTISYLDNAVVYIGSSYGDSQLIKLNLQP-----------DAKGSYVEVLERYVNLG 297
           S +S +   +N  +YI S +     + LN +            D +   + V+ ++ N+ 
Sbjct: 340 SNSSKLHIFNNGSMYINSQFNYDH-VYLNFESIGDNDENYDKIDNENENISVISKHTNIN 398

Query: 298 PI-VDFCVVD------LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVEL-QGIKG 349
           PI ++ C+++      +  QG  +            + I+RN I + E  S  L QG+  
Sbjct: 399 PIALNLCLMENMPLTFMHFQGGNRTTDSE------KVNIIRNAIPLKEYVSSPLPQGVSN 452

Query: 350 MWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY--- 406
           ++++++     + +F+ ++ I+ T ++      ++ +  IE +   + T    D +    
Sbjct: 453 IFTIKTQYQ-SYHSFIFLTMINFTTVIL-----KIADDSIEQYIPASDTFKLKDDMTIHV 506

Query: 407 -----NQLVQVTSGSVRLVSSTSRELRN-----EWKSPPGYSVNVATANASQVLLATGGG 456
                N ++QV     R +   S++  N     +W  P G S+  A +N SQ++LA    
Sbjct: 507 ATMGDNSIIQVCKDEFRQILLDSKDEENFKMNLKWYPPAGVSILSAVSNFSQLILALSNN 566

Query: 457 HLVYLEIGDGILTEVKH-AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD 515
            +VYL++ +  L E K+  +L   I+ L +  + +N   S+I AVG  +D  V + SL  
Sbjct: 567 EIVYLQLENNTLIEYKNRPELPDVITSLAL--LNDNTKKSEILAVGT-SDNMVNVLSLE- 622

Query: 516 LNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGH-LLNFLLNMKTG 563
              I  E +  E    +V+  A + I   L  L  GH L+N  + ++ G
Sbjct: 623 ---IVDEAISFE----TVVFQALDAIPSSLLILNQGHKLVNLHIGVEDG 664


>sp|Q1E5B0|CFT1_COCIM Protein CFT1 OS=Coccidioides immitis (strain RS) GN=CFT1 PE=3 SV=1
          Length = 1387

 Score = 68.2 bits (165), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 87/372 (23%), Positives = 159/372 (42%), Gaps = 37/372 (9%)

Query: 694  VRLLDDQTFEFISTYPLDTFEYGCSILSCSF-----SDDSNVYYCVGTAYVLPEENEPTK 748
            ++LL  +T+  + +Y L   E    + + +      + +      VGTA V  E+  P +
Sbjct: 1015 IKLLSPRTWSVVDSYELGDAERVMCMKTINMEISEITHEMKDMLVVGTATVRGEDITP-R 1073

Query: 749  GRILVF-IVE----------DGKLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYK 795
            G I VF I+E          + KL++ A+ + KGAV +++   G+  L+ A  QK  +  
Sbjct: 1074 GSIYVFEIIEVAPDPDRPETNRKLKIFAKDDVKGAVTAVSGIGGQGFLIMAQGQKCMVRG 1133

Query: 796  WMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
              L++DG+    +      ++  L         ++GD +K I    Y  E   +    +D
Sbjct: 1134 --LKEDGSLLPVAFMDMQCYVKVLKELQGTGLCIMGDALKGIWFAGYSEEPYRLTLFGKD 1191

Query: 856  YNANWMSAVEILDDD--IY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 912
                 + A + L D   +Y L A+++  +  +  + E  T  +  RL     +H G F +
Sbjct: 1192 NEYLQVIAADFLPDGKRLYILVADDDCTIHVLEYDPEDPTSSKGDRLLHRSSFHTGHFTS 1251

Query: 913  RF-----RHGSLVMRLP---DSDVGQIPT---VIFGTVNGVIGVIASLPHEQYLFLEKLQ 961
                      S     P   D DV  +P    V+  +  G IGV+  L  + Y  L  LQ
Sbjct: 1252 TMTLLPEHSSSPSADDPEEDDMDVDYVPKSYQVLVTSQEGSIGVVTPLTEDSYRRLSALQ 1311

Query: 962  TNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
            + L   ++   GLN + +R+   E      +  +DG+L+  +LD+   R  EI+  +   
Sbjct: 1312 SQLVTSMEHPCGLNPKAYRAV--ESDGFGGRGIVDGNLLLRWLDMGVQRKAEIAGRVGAD 1369

Query: 1022 VEELCKRVEELT 1033
            +E +   +E ++
Sbjct: 1370 IESIRVDLETIS 1381


>sp|Q6CAH5|RSE1_YARLI Pre-mRNA-splicing factor RSE1 OS=Yarrowia lipolytica (strain CLIB 122
            / E 150) GN=RSE1 PE=3 SV=1
          Length = 1143

 Score = 64.7 bits (156), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 155/735 (21%), Positives = 283/735 (38%), Gaps = 102/735 (13%)

Query: 350  MWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQL 409
            +W++R       D ++V+S+ + T  L + + D + ET   G      TL C  ++ +  
Sbjct: 459  LWTMRDGAGS--DKYIVLSYANAT--LVLEIGDSVVETTSSGLTLDKPTLHC-GSVGSSY 513

Query: 410  VQVTSGSVRLVSSTSRELRNE------WKSPPGYSVNVATANASQVLLATGGGHLVYLEI 463
            VQV +  + ++   SRE  +E      W +P G  V  A++++ QV+L      L Y E 
Sbjct: 514  VQVMTDGMNVIP-MSREGSSESLPATKWTAPSG-QVICASSSSHQVVLGLTSS-LFYFED 570

Query: 464  GDGILTEVKHAQLEYEIS----CLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNL 518
              G  +E+      YE+S     + + P+      S   AV    D +VRI S+ P+   
Sbjct: 571  TPG--SELSAYDGAYELSSPPTAVAVAPVPAGRVRSPFVAVAT-DDETVRIVSVDPESMF 627

Query: 519  ITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDR--KKVSLGTQ 576
             T    G      S+ L +   + YL   L +G  +   L+  TGE+     K V LG  
Sbjct: 628  ETVAVQGLMATASSLALLSVGQVLYLHMGLANGVYVRVELDPLTGEIVGSWSKFVGLGRL 687

Query: 577  PITLRTFSSKNTT------------HVFAASDRPTVIYSSNKKLLYS--NVNLKEVSHMC 622
             +   T   + +             HV A SD   V    N    ++   ++ + +    
Sbjct: 688  SVVPVTCGGEESILVSSRGVKTCLGHVNATSDT-WVPTGGNSAPFFALDAISGEPLDLAH 746

Query: 623  PFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKN 682
             F++   P  +       L I T++  QK     + L    +R+  Q  + T  I    +
Sbjct: 747  SFHTQDCPHGVIGVAGSTLKIFTVNTAQKWTENEVKLEGTAKRLI-QHDATTLTITQNPD 805

Query: 683  QSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPE 742
            +  +          +D+           D      SI    F D    Y+ VG +     
Sbjct: 806  RLVS----------VDNGAVGITK----DLGGPPTSICEVMFGDGKR-YFAVGGSRDGSP 850

Query: 743  ENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDG 802
                T G I +F      L  +   E +    +L A+NG L+A I  +++LY   L+   
Sbjct: 851  GTSGTSGYISIF--SSSSLGHVHTTEVEAPPLALCAYNGLLVAGIGSQVRLYALGLKQV- 907

Query: 803  TRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGA--IEERARDYNANW 860
             R+ Q E       LA +  +  + + VGD+ +S+++ +   E+    I     D  +  
Sbjct: 908  LRKAQIELSKRVTCLAHFAGS--NRVAVGDIRQSVTVCVVLEEDSGHVIYPLVCDKISRQ 965

Query: 861  MSAVEILD-DDIYLGAE-NNFNLFTVRKNSEGATDEERGRLEVV-------GEYHLGEFV 911
            ++ +  +D + + LG     F +  +   +    DE+   + +        G  H     
Sbjct: 966  VTCLFFVDYETVALGDRFGGFTMLRIPSEASKLADEDHNAVHLRQLEPTLNGPAHF---- 1021

Query: 912  NRFRHGSLVMRLPDSDVGQIPTVI------------FGTVNGVIGVIASLPHEQYLFLEK 959
             RF H      +    +  +P  I             GTV+  + V++    +Q   L+ 
Sbjct: 1022 -RFDH------VASFHIEDVPVAIHMYNDYLVVCGLLGTVSAFVPVVSP---KQSRDLKT 1071

Query: 960  LQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
            ++  +     G+ G +H ++R +      V  K  +DGD++   L +   R +E+ +   
Sbjct: 1072 IEKFVCASDPGLMGRDHGRFRGYY-----VPVKEVVDGDMLREVLVMDEKRREEVGEKTG 1126

Query: 1020 VSVEELCKRVEELTR 1034
            + VE    RV  + +
Sbjct: 1127 LGVEGAVGRVVNVMK 1141


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.319    0.138    0.404 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 391,240,282
Number of Sequences: 539616
Number of extensions: 17458044
Number of successful extensions: 39427
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 46
Number of HSP's successfully gapped in prelim test: 20
Number of HSP's that attempted gapping in prelim test: 39003
Number of HSP's gapped (non-prelim): 166
length of query: 1036
length of database: 191,569,459
effective HSP length: 128
effective length of query: 908
effective length of database: 122,498,611
effective search space: 111228738788
effective search space used: 111228738788
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 67 (30.4 bits)